🚀 We are hiring! See open positions

How to get page source in Puppeteer?

When web scraping, we often want to retrieve full page source (full HTML of the web page) we can parse it for data using tools like Cheerio. Using Puppeteer, to get the page source we can use page.content() method:

javascript
const puppeteer = require('puppeteer');

async function run() {
    const browser = await puppeteer.launch();
    const page = await browser.newPage();
    await page.goto("https://httpbin.dev/html");

    let source = await page.content();
    // OR the faster method that doesn't wait for images to load:
    let source = await page.content({"waitUntil": "domcontentloaded"});

    console.log(source);
    browser.close();
}

run();

⚠ It's possible that this command will retrieve page source before the page fully loads if it's a dynamic javascript page. For more see how-to-wait-for-page-to-load-in-puppeteer

Scale Your Web Scraping
Anti-bot bypass, browser rendering, and rotating proxies — all in one API. Start with 1,000 free credits.
No credit card required 1,000 free API credits Anti-bot bypass included
Not ready? Get our newsletter instead.