🚀 We are hiring! See open positions

Scrapfly Blog

Tutorials, guides, and insights on web scraping, data extraction, and automation 268 articles

// Articles

12 posts
Latest
How Headers Are Used to Block Web Scrapers and How to Fix It
http blocking

How Headers Are Used to Block Web Scrapers and How to Fix It

Introduction to web scraping headers - what do they mean, how to configure them in web scrapers and how to avoid being blocked.

Apr 18, 2022 14 min read
Web Scraping with Python
http python data-parsing

Web Scraping with Python

Introduction tutorial to web scraping with Python. How to collect and parse public data. Challenges, best practices and ...

Apr 04, 2022 38 min read
Web Scraping Graphql with Python
http python graphql

Web Scraping Graphql with Python

Introduction to web scraping graphql powered websites. How to create graphql queries in python and what are some common ...

Apr 04, 2022 11 min read
Web Scraping With R Tutorial and Example Project
http data-parsing scrapeguide

Web Scraping With R Tutorial and Example Project

Introduction to web scraping with R language. How to handle http connections, parse html files, best practices, tips and...

Mar 24, 2022 10 min read
Top 4 Mobile Proxy Providers for Web Scraping
blocking proxies

Top 4 Mobile Proxy Providers for Web Scraping

Mobile proxies are really useful for avoiding web scraper blocking - so, which mobile proxy providers are the best and h...

Mar 11, 2022 9 min read
Web Scraping With Ruby
http data-parsing css-selectors

Web Scraping With Ruby

Introduction to web scraping with Ruby. How to handle http connections, parse html files for data, best practices, tips ...

Mar 07, 2022 21 min read
The Complete Guide To Using Proxies For Web Scraping
blocking proxies scrapeguide

The Complete Guide To Using Proxies For Web Scraping

Introduction to proxy usage in web scraping. What types of proxies are there? How to evaluate proxy providers and avoid ...

Mar 07, 2022 16 min read
How to Web Scrape with Puppeteer and NodeJS in 2026
nodejs headless-browser puppeteer

How to Web Scrape with Puppeteer and NodeJS in 2026

Introduction to using Puppeteer in Nodejs for web scraping dynamic web pages and web apps. Tips and tricks, best practic...

Feb 14, 2022 18 min read
Web Scraping With NodeJS and Javascript
http nodejs data-parsing

Web Scraping With NodeJS and Javascript

In this article we'll take a look at scraping using Javascript through NodeJS. We'll cover common web scraping libraries...

Feb 14, 2022 23 min read
Parsing HTML with Xpath
python data-parsing parsel

Parsing HTML with Xpath

Introduction to xpath in the context of web-scraping. How to extract data from HTML documents using xpath, best practice...

Feb 07, 2022 12 min read
Parsing HTML with CSS Selectors
data-parsing css-selectors

Parsing HTML with CSS Selectors

Introduction to using CSS selectors to parse web-scraped content. Best practices, available tools and common challenges ...

Feb 07, 2022 12 min read
Web Scraping With PHP 101
http data-parsing xpath

Web Scraping With PHP 101

Introduction to web scraping with PHP. How to handle http connections, parse html files for data, best practices, tips a...

Feb 06, 2022 22 min read

Ready to scale your web scraping?

Anti-bot bypass, browser rendering, and rotating proxies — all in one API.