Scrapfly Blog

Tutorials, guides, and insights on web scraping, data extraction, and automation 272 articles

// Articles

8 posts
Latest
Web Scraping With NodeJS and Javascript
http nodejs data-parsing

Web Scraping With NodeJS and Javascript

In this article we'll take a look at scraping using Javascript through NodeJS. We'll cover common web scraping libraries, frequently encountered challenges and wrap everything up by scraping etsy.com

Feb 14, 2022 23 min read
Parsing HTML with Xpath
python data-parsing parsel

Parsing HTML with Xpath

Introduction to xpath in the context of web-scraping. How to extract data from HTML documents using xpath, best practice...

Feb 07, 2022 12 min read
Parsing HTML with CSS Selectors
data-parsing css-selectors

Parsing HTML with CSS Selectors

Introduction to using CSS selectors to parse web-scraped content. Best practices, available tools and common challenges ...

Feb 07, 2022 12 min read
Web Scraping With PHP 101
http data-parsing xpath

Web Scraping With PHP 101

Introduction to web scraping with PHP. How to handle http connections, parse html files for data, best practices, tips a...

Feb 06, 2022 22 min read
Web Scraping With Scrapy: The Complete Guide in 2026
python xpath scrapeguide

Web Scraping With Scrapy: The Complete Guide in 2026

Tutorial on web scraping with scrapy and Python through a real world example project. Best practices, extension highligh...

Feb 04, 2022 17 min read
Web Scraping with Selenium and Python
python headless-browser selenium

Web Scraping with Selenium and Python

Introduction to web scraping dynamic javascript powered websites and web apps using Selenium browser automation library ...

Jan 10, 2022 15 min read
How to Parse Web Data with Python and Beautifulsoup
python data-parsing beautifulsoup

How to Parse Web Data with Python and Beautifulsoup

Beautifulsoup is one the most popular libraries in web scraping. In this tutorial, we'll take a hand-on overview of how ...

Jan 03, 2022 16 min read
How to Scrape Dynamic Websites Using Headless Web Browsers
python headless-browser playwright

How to Scrape Dynamic Websites Using Headless Web Browsers

Introduction to using web automation tools such as Puppeteer, Playwright, Selenium and ScrapFly to render dynamic websit...

Jan 02, 2022 16 min read

Ready to scale your web scraping?

Anti-bot bypass, browser rendering, and rotating proxies — all in one API.