How to download a file with Playwright and Python?

by scrapecrow Dec 05, 2022

# playwright

To download a file with Playwright, we can either find the download button or link using the locator function and click it, or read the file's URL from the page and download it with an HTTP client such as httpx or requests in Python:

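from pathlib import Path
from urllib.parse import urljoin

import httpx  # or import requests
from playwright.sync_api import sync_playwright


def download_file_with_playwright():
    with sync_playwright() as pw:
        browser = pw.chromium.launch(headless=False)
        context = browser.new_context(viewport={"width": 1920, "height": 1080})

        page = context.new_page()
        page.goto('https://httpbin.dev/html')

        # option 1: click the download link and capture the resulting download
        # note: this assumes the first link on the page triggers a file download
        file = page.locator('a').first
        with page.expect_download() as download_info:
            file.click()
        download = download_info.value
        download.save_as(download.suggested_filename)

        # option 2: download the file manually, which is more flexible and often faster
        # the href can be relative, so resolve it against the current page URL
        url = urljoin(page.url, file.get_attribute('href'))
        response = httpx.get(url, follow_redirects=True)
        response.raise_for_status()
        Path('file.txt').write_bytes(response.content)

        browser.close()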
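
When downloading manually, the link's href is often relative and the saved file needs a sensible name. The following is a minimal sketch, using only the standard library, of one way to handle both. The helper name `resolve_download_target`, the `default_name` fallback and the example.com URLs are hypothetical and chosen for illustration; they are not part of Playwright or httpx.

```python
from pathlib import PurePosixPath
from urllib.parse import unquote, urljoin, urlparse


def resolve_download_target(page_url: str, href: str, default_name: str = "download.bin") -> tuple[str, str]:
    """Return (absolute_url, filename) for a link found on page_url."""
    # resolve relative links such as "../files/report.pdf" against the page URL
    absolute_url = urljoin(page_url, href)
    # use the last path segment as the file name, ignoring query string and fragment
    path = unquote(urlparse(absolute_url).path)
    name = PurePosixPath(path).name
    # fall back to a default when the URL path has no file name (e.g. "/")
    return absolute_url, name or default_name


url, filename = resolve_download_target(
    "https://example.com/docs/index.html",
    "../files/report%20v2.pdf?token=abc",
)
print(url)
print(filename)
```

The returned URL can then be passed to the HTTP client, and the file name to `Path(filename).write_bytes(response.content)`. Servers may also suggest a name through the `Content-Disposition` response header, which this sketch does not inspect.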
