Web Scraping to Markdown: Extract and Clean Articles