Playwright Knowledgebase

Playwright is a powerful open-source automation library for web browsers, developed by Microsoft. It allows developers to control real web browsers like Chromium, Firefox, and WebKit with a single API in multiple programming languages, including Python, JavaScript, and Java.

Playwright is an evolution of the Puppeteer library, designed to provide more features and better support for modern web applications. While Playwright is intended for web application testing, it has a growing web scraping community and implements most of the features needed for headless browser scraping.

Web Scraping Dynamic Websites With Scrapy Playwright

Learn about Scrapy Playwright, a Scrapy integration that allows web scraping dynamic web pages with Scrapy. We'll explain web scraping with Scrapy Playwright through an example project and how to use it for common scraping use cases, such as clicking elements, scrolling and waiting for elements.


See below for more on Playwright in the context of web scraping and data programming 👇

How to take screenshots in NodeJS?

Learn how to take screenshots in Node.js using Playwright & Puppeteer. Includes installation, concepts, and customization tips.

#screenshots
#headless-browser
#puppeteer
#playwright
#nodejs

How to handle popup dialogs in Playwright?

To handle alert-type popups in Playwright, the "dialog" event can be captured and interacted with in both the Python and NodeJS Playwright clients.

#playwright
#python
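
As a minimal sketch of the "dialog" event approach in the Python client: the helper name `accept_dialogs` and the `main` demo are illustrative assumptions, not code from the linked answer. The handler records each dialog and accepts it so the page is not left blocked.

```python
def accept_dialogs(page, log=None):
    """Register a "dialog" handler that records and accepts every dialog."""
    seen = log if log is not None else []

    def on_dialog(dialog):
        # dialog.type is e.g. "alert", "confirm" or "prompt"
        seen.append((dialog.type, dialog.message))
        dialog.accept()

    page.on("dialog", on_dialog)
    return seen


def main():
    # Imported lazily so the helper above can be reused without a browser.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as pw:
        browser = pw.chromium.launch()
        page = browser.new_page()
        seen = accept_dialogs(page)
        page.evaluate("alert('hello')")  # triggers the handler
        print(seen)
        browser.close()
```

Calling `main()` requires Playwright and a Chromium build to be installed; `dialog.dismiss()` could be used instead of `dialog.accept()` to cancel the dialog.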

How to click on cookie popups and modal alerts in Playwright?

To click on modal popups like the infamous cookie consent alert, we can either find and click the agree button or remove the popup entirely. Here's how.

#playwright
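
Both options could be combined roughly as follows. This is a sketch under assumptions: `dismiss_cookie_popup` is a hypothetical helper, both selectors are supplied by the caller, and `popup_selector` must be a plain CSS selector because it is passed to `document.querySelectorAll`.

```python
def dismiss_cookie_popup(page, button_selector, popup_selector):
    """Click the agree button if it exists, otherwise remove the popup node."""
    button = page.locator(button_selector)
    if button.count() > 0:
        button.first.click()
        return "clicked"
    # Fallback: delete the popup element(s) from the DOM.
    removed = page.evaluate(
        """selector => {
            const nodes = document.querySelectorAll(selector);
            nodes.forEach(node => node.remove());
            return nodes.length;
        }""",
        popup_selector,
    )
    return "removed" if removed else "absent"
```

For example, `dismiss_cookie_popup(page, "#cookie-ok", "#cookie-modal")` would try the button first; both selector values here are made up for illustration.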

How to scroll to the bottom of the page with Playwright?

Learn how to scroll to the bottom of the page with Playwright using three distinct approaches for both Python and NodeJS clients.

#playwright
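
One possible approach, sketched here for the Python client, is to scroll with JavaScript until the page height stops growing. The helper name `scroll_to_bottom` and its `pause_ms` and `max_rounds` parameters are assumptions made for this example.

```python
def scroll_to_bottom(page, pause_ms=1000, max_rounds=20):
    """Scroll down repeatedly until the page height stops changing.

    Returns the number of scroll actions performed.
    """
    previous_height = -1
    scrolls = 0
    for _ in range(max_rounds):
        height = page.evaluate("document.body.scrollHeight")
        if height == previous_height:
            break  # nothing new was loaded since the last scroll
        page.evaluate("window.scrollTo(0, document.body.scrollHeight)")
        page.wait_for_timeout(pause_ms)  # give lazy-loaded content time to appear
        previous_height = height
        scrolls += 1
    return scrolls
```

The `max_rounds` cap keeps the loop from running forever on pages with endless feeds; the pause length likely needs tuning per site.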

How to check if element exists in Playwright?

To check whether an HTML element is present on the page using Playwright the page.locator() method can be used. Here's how.

#playwright
#python
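
A minimal sketch of the `page.locator()` check in Python: counting the matches does not raise when nothing is found, so it can serve as an existence test. The function name `element_exists` and the demo markup are assumptions for illustration.

```python
def element_exists(page, selector):
    """Return True if at least one element currently matches the selector."""
    return page.locator(selector).count() > 0


def main():
    # Imported lazily so the helper above can be reused without a browser.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as pw:
        browser = pw.chromium.launch()
        page = browser.new_page()
        page.set_content("<h1 id='title'>Hello</h1>")
        print(element_exists(page, "#title"))
        print(element_exists(page, "#missing"))
        browser.close()
```

Note that `count()` checks the page as it is right now and does not wait; for content that loads later, a wait such as `page.wait_for_selector()` would be needed first.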

How to run Playwright in Jupyter notebooks?

Learn why the synchronous execution of Playwright is blocked in Jupyter notebooks and how to solve it using asyncio.

#playwright
#jupyter
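
Since a notebook already runs an asyncio event loop, one workaround is Playwright's async API, sketched below. The function names `fetch_title` and `run_in_script` are hypothetical helpers for this example.

```python
import asyncio


async def fetch_title(url):
    """Open a page with the async API and return its title."""
    # Imported lazily so this module loads even without Playwright installed.
    from playwright.async_api import async_playwright

    async with async_playwright() as pw:
        browser = await pw.chromium.launch()
        page = await browser.new_page()
        await page.goto(url)
        title = await page.title()
        await browser.close()
        return title


def run_in_script(url):
    """Outside a notebook there is no running loop, so asyncio.run() is needed."""
    return asyncio.run(fetch_title(url))
```

In a notebook cell the coroutine can be awaited directly, for example `title = await fetch_title("https://example.com")`.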

How to download a file with Playwright and Python?

To download files using Playwright, we can either simulate the button click or extract the URL and download it over HTTP. Here's how.

#playwright
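
Both routes might look like this in the Python client. This is a sketch: the helper names, the selector and the target path are assumptions, and the second helper only resolves the link so that any HTTP client can fetch it afterwards.

```python
from urllib.parse import urljoin


def download_by_click(page, selector, target_path):
    """Click the element and save the download that the click triggers."""
    with page.expect_download() as download_info:
        page.click(selector)
    download = download_info.value
    download.save_as(target_path)
    return download.suggested_filename


def download_url(page, selector):
    """Return the absolute URL behind a link, or None if it has no href."""
    href = page.locator(selector).get_attribute("href")
    return urljoin(page.url, href) if href else None
```

The click route keeps the browser session (cookies, headers) intact, while the URL route is lighter but may need those details copied over to the HTTP client.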

How to load local files in Playwright?

To load local files as page URLs in Playwright we can use the file:// protocol. Here's how to do it.

#playwright
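
A small sketch of the `file://` approach in Python, assuming a hypothetical helper `file_url` that turns a local path into an absolute `file://` URL before it is handed to `page.goto()`:

```python
from pathlib import Path


def file_url(path):
    """Convert a local path into an absolute file:// URL."""
    return Path(path).resolve().as_uri()


def open_local_file(page, path):
    """Navigate the page to a local file and return the URL that was used."""
    url = file_url(path)
    page.goto(url)
    return url
```

Resolving the path first matters because `as_uri()` only works on absolute paths.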

Articles Related to Playwright

Bypass Proxy Detection with Browser Fingerprint Impersonation

Stop proxy blocks with browser fingerprint impersonation using this guide for Playwright, Selenium, curl-impersonate & Scrapfly

PROXIES
SELENIUM
PLAYWRIGHT
PUPPETEER
BLOCKING

Playwright Examples for Web Scraping and Automation

Learn Playwright with Python and JavaScript examples for automating browsers like Chromium, WebKit, and Firefox.

PLAYWRIGHT
PYTHON
NODEJS

Web Scraping with Playwright and JavaScript

Learn about Playwright - a browser automation toolkit for server-side JavaScript runtimes like NodeJS, Deno or Bun.

PLAYWRIGHT
HEADLESS-BROWSER
NODEJS

Playwright vs Selenium

Explore the key differences between Playwright vs Selenium in terms of performance, web scraping, and automation testing for modern web applications.

HEADLESS-BROWSER
PLAYWRIGHT
SELENIUM

What is a Headless Browser? Top 5 Headless Browser Tools

A quick overview of emerging browser automation tech: what exactly are these tools and how are they used in web scraping?

HEADLESS-BROWSER
PLAYWRIGHT
SELENIUM
PUPPETEER

How to Scrape With Headless Firefox

Discover how to use headless Firefox with Selenium, Playwright, and Puppeteer for web scraping, including practical examples for each library.

HEADLESS-BROWSER
PUPPETEER
SELENIUM
NODEJS
PLAYWRIGHT
PYTHON

Web Scraping Dynamic Websites With Scrapy Playwright

Learn about Scrapy Playwright, a Scrapy integration that allows web scraping dynamic web pages with Scrapy. We'll explain web scraping with Scrapy Playwright through an example project and how to use it for common scraping use cases, such as clicking elements, scrolling and waiting for elements.

PYTHON
PLAYWRIGHT
SCRAPY
HEADLESS-BROWSER

How to use Headless Chrome Extensions for Web Scraping

In this article, we'll explore different useful Chrome extensions for web scraping. We'll also explain how to install Chrome extensions with various headless browser libraries, such as Selenium, Playwright and Puppeteer.

PYTHON
NODEJS
TOOLS
PLAYWRIGHT
PUPPETEER
SELENIUM

How to Scrape Google Maps

We'll take a look at how to find businesses through the Google Maps search system and how to scrape their details using either Selenium, Playwright or ScrapFly's JavaScript rendering feature, all in Python.

SCRAPEGUIDE
PYTHON
SELENIUM
PLAYWRIGHT

How to Scrape Dynamic Websites Using Headless Web Browsers

An introduction to using web automation tools such as Puppeteer, Playwright, Selenium and ScrapFly to render dynamic websites for web scraping.

HEADLESS-BROWSER
PYTHON
SELENIUM
PUPPETEER
PLAYWRIGHT