Web Scraping
Scraping JavaScript-Rendered Web Pages using Puppeteer
Extracting as much information as possible from a site sometimes requires executing JavaScript. Puppeteer makes it possible to launch and control headless browser instances.
Content Extraction
Extracting clean data from blog and news articles
Several open source tools extract clean text from article HTML. We list the most popular ones below and run a benchmark to see how they stack up against the Ujeebu API.
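
To give a rough idea of what "clean text extraction" means in practice, here is a minimal sketch that uses only Python's standard library. It is not one of the tools covered in the benchmark and not how the Ujeebu API works; the class name `ParagraphTextExtractor` and the function `extract_article_text` are hypothetical, and the heuristic (keep text inside `<p>` tags, drop scripts, styles, and everything else) is a simplifying assumption that real extractors go well beyond.

```python
from html.parser import HTMLParser


class ParagraphTextExtractor(HTMLParser):
    """Collects text found inside <p> tags, ignoring <script> and <style>.

    Hypothetical helper for illustration only: real article extractors
    also score blocks, strip boilerplate, and handle malformed markup.
    """

    SKIPPED_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.paragraphs = []        # finished, whitespace-normalized paragraphs
        self._buffer = []           # text pieces of the paragraph being read
        self._in_paragraph = False
        self._skip_depth = 0        # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1
        elif tag == "p":
            self._in_paragraph = True
            self._buffer = []

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag == "p" and self._in_paragraph:
            # Collapse runs of whitespace into single spaces.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.paragraphs.append(text)
            self._in_paragraph = False

    def handle_data(self, data):
        if self._in_paragraph and self._skip_depth == 0:
            self._buffer.append(data)


def extract_article_text(html):
    """Return the paragraph text of an HTML document, one blank line apart."""
    parser = ParagraphTextExtractor()
    parser.feed(html)
    parser.close()
    return "\n\n".join(parser.paragraphs)


if __name__ == "__main__":
    sample = (
        "<html><body><nav>Home | About</nav>"
        "<script>var x = 1;</script>"
        "<p>First <b>bold</b> paragraph.</p>"
        "<p>  Second\n paragraph. </p>"
        "</body></html>"
    )
    print(extract_article_text(sample))
```

Navigation links and script contents are dropped simply because they sit outside `<p>` tags. That shortcut breaks down on pages that put article text in `<div>` elements or comments in `<p>` elements, which is the kind of case dedicated extraction tools are built to handle.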