Playwright Examples for Web Scraping and Automation
Learn Playwright with Python and JavaScript examples for automating browsers like Chromium, WebKit, and Firefox.
Playwright supports one of the most popular ways to parse HTML content in web scraping - XPath selectors. To use XPath in Playwright we can use page.locator()
method and prefix our selector with xpath=
or //
. For example:
from playwright.sync_api import sync_playwright
with sync_playwright() as pw:
browser = pw.chromium.launch(headless=False)
context = browser.new_context(viewport={"width": 1920, "height": 1080})
page = context.new_page()
page.goto("https://google.com/")
h2_element = page.locator("//h2")
# or
h2_element = page.locator("xpath=//h2")
⚠ It's possible that this command will try to find elements before the page has fully loaded if it's a dynamic javascript page. For more see How to wait for page to load in Playwright?
Also see: How to find elements by CSS selectors in Playwright?
This knowledgebase is provided by Scrapfly data APIs, check us out! 👇