Ujeebu
  • Products
  • Demo
  • Pricing
  • Documentation
  • Blog
  • Community
  • Contact

Content Extraction

A collection of 3 posts

A Simple Scraper using Puppeteer
Content Extraction

Web scraping is the process of extracting data from websites. One popular tool for the job is Puppeteer, a Node.js library that provides a high-level API to control headless Chrome or Chromium over the DevTools Protocol.

  • Sam
5 min read
Is Web Scraping Legal?
Content Extraction

The legality and ethics of web scraping are a massive grey area. Some are in favor of web scraping, while others do not share the same enthusiasm, which is what makes the subject so controversial.

  • Sam
6 min read
Extracting clean data from blog and news articles
Content Extraction

Several open-source tools can extract clean text from article HTML. We list the most popular ones below and run a benchmark to see how they stack up against the Ujeebu API.

  • Sam
4 min read
Ujeebu © 2023