Scrapfly n8n Integration
Scrapfly is a verified community node on n8n, a popular no-code automation platform. The Scrapfly n8n integration includes access to all Scrapfly APIs:
- Use the Web Scraping API to scrape data from any webpage.
- Use the Extraction API to extract data from n8n inputs.
- Use the Screenshot API to take screenshots of any webpage.
Installation
As a verified community node, Scrapfly has been reviewed and approved by the n8n team for quality and security.
To install the Scrapfly integration in your n8n instance:
- Open your n8n instance settings
- Navigate to the Community nodes section in the sidebar
- Click Install a community node
- Search for "Scrapfly" and select it from the verified nodes list
- Click Install and wait for the installation to complete
Once installed, you'll find the Scrapfly node available in your node palette with four main operations:
- Scrape web page URLs - Extract HTML source with headless browsers, proxies, and anti-bot bypass
- Extract data from documents - Parse HTML, Text, or Markdown using AI with pre-defined templates or LLM prompts
- Capture screenshots - Take screenshots of web pages as image files
- Get account info - View your subscription and usage information
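As a rough illustration of what these four operations correspond to, the sketch below builds the kind of request URL each one might send. The base URL, the endpoint paths, and the `key` and `url` query parameters are assumptions made for illustration, and `build_request_url` is a hypothetical helper, not part of the n8n node. Check the Scrapfly API documentation before relying on any of them.

```python
from urllib.parse import urlencode

# Assumed base URL and endpoint paths; confirm them against the Scrapfly API docs.
API_BASE = "https://api.scrapfly.io"
ENDPOINTS = {
    "scrape": "/scrape",          # Scrape web page URLs
    "extract": "/extraction",     # Extract data from documents
    "screenshot": "/screenshot",  # Capture screenshots
    "account": "/account",        # Get account info
}


def build_request_url(operation: str, api_key: str, **params: str) -> str:
    """Build the request URL that a node operation would roughly map to."""
    if operation not in ENDPOINTS:
        raise ValueError(f"unknown operation: {operation}")
    # urlencode percent-encodes values, so target URLs are safe to pass as parameters.
    query = urlencode({"key": api_key, **params})
    return f"{API_BASE}{ENDPOINTS[operation]}?{query}"


# "YOUR_API_KEY" is a placeholder, not a real credential.
print(build_request_url("scrape", "YOUR_API_KEY", url="https://example.com"))
```

In n8n itself none of this is written by hand: the node asks for the same inputs (API key, target URL, options) through its form fields.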
For more details, see the n8n community nodes installation guide or visit the npm package page.
Example Templates
With n8n and Scrapfly, you can build scraping-powered automations without any coding experience. Let's take a look at some example templates for common n8n scenarios.
Scraping with Google Sheets
Google Sheets is a popular cloud-based spreadsheet tool ideal for automation and web scraping. This example demonstrates how Google Sheets can be used as a web scraping database.
This example watches Google Sheets for newly added URLs and scrapes them using the Scrapfly Web Scraping API. Here's a quick breakdown:
- Watch for new rows to be added to a Google Sheet
- On new URL added:
  - Scrape the URL using the Scrapfly Web Scraping API
  - Parse the scraped HTML using the Scrapfly Extraction API
  - Save the extracted data back to Google Sheets
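For readers who think in code, the flow above can be sketched as a small function. This is only an illustration of the logic, not how the n8n template is implemented: `process_new_rows` is a hypothetical name, the `url` column name is an assumption, and the `scrape` and `extract` arguments are stand-ins for the Web Scraping API and Extraction API steps.

```python
from typing import Callable, Dict, Iterable, List, Set


def process_new_rows(
    rows: Iterable[Dict[str, str]],
    seen_urls: Set[str],
    scrape: Callable[[str], str],
    extract: Callable[[str], Dict[str, str]],
) -> List[Dict[str, str]]:
    """Scrape and extract every URL that has not been handled yet."""
    results = []
    for row in rows:
        url = row.get("url", "").strip()
        if not url or url in seen_urls:
            continue  # skip blank cells and URLs already processed
        html = scrape(url)    # stands in for the Web Scraping API step
        data = extract(html)  # stands in for the Extraction API step
        results.append({"url": url, **data})  # row to save back to the sheet
        seen_urls.add(url)
    return results


# Stub steps so the sketch runs without network access.
def fake_scrape(url: str) -> str:
    return f"<title>Page at {url}</title>"


def fake_extract(html: str) -> Dict[str, str]:
    return {"title": html[len("<title>"):-len("</title>")]}


sheet_rows = [
    {"url": "https://example.com/a"},
    {"url": ""},
    {"url": "https://example.com/a"},
]
print(process_new_rows(sheet_rows, set(), fake_scrape, fake_extract))
```

Tracking already-seen URLs is a design choice of this sketch. In the template, the Google Sheets trigger only fires for newly added rows.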
Scheduling Scrape Loops
n8n provides a scheduler that can trigger a workflow at regular intervals, which makes it possible to scrape multiple URLs on a recurring basis. This example demonstrates how to schedule a scrape loop using n8n and store results in Google Sheets.
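The idea of a scheduled scrape loop can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the n8n scheduler itself: `run_scrape_loop` is a hypothetical name, `scrape` is a stand-in for the Scrapfly scrape step, and a real schedule would run indefinitely rather than for a fixed number of rounds.

```python
import time
from typing import Callable, Dict, List, Sequence


def run_scrape_loop(
    urls: Sequence[str],
    scrape: Callable[[str], str],
    interval_seconds: float,
    rounds: int,
    sleep: Callable[[float], None] = time.sleep,
) -> List[Dict[str, object]]:
    """Scrape every URL once per round, waiting between rounds."""
    results = []
    for round_number in range(1, rounds + 1):
        for url in urls:
            results.append(
                {"round": round_number, "url": url, "content": scrape(url)}
            )
        if round_number < rounds:
            sleep(interval_seconds)  # wait until the next scheduled run
    return results


# Record the waits instead of actually sleeping, so the demo finishes instantly.
waits: List[float] = []
results = run_scrape_loop(
    ["https://example.com/a", "https://example.com/b"],
    scrape=lambda url: f"content of {url}",
    interval_seconds=3600,
    rounds=2,
    sleep=waits.append,
)
print(len(results), waits)
```

Passing `sleep` in as a parameter keeps the loop easy to test. In n8n the waiting is handled by the scheduler, and each result would be appended to the sheet rather than collected in a list.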
Screenshot to Google Drive
n8n provides a file system infrastructure for handling binary files. This template takes a screenshot of a web page using the Screenshot API and saves the results to Google Drive.
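To make the binary-file handling concrete, the sketch below names a screenshot after its source URL and writes the image bytes to disk. It is an illustration only: `screenshot_filename` and `save_screenshot` are hypothetical helpers, the PNG extension is an assumption, and a local temporary directory stands in for Google Drive.

```python
import re
import tempfile
from pathlib import Path
from urllib.parse import urlparse


def screenshot_filename(url: str, extension: str = "png") -> str:
    """Derive a filesystem-safe file name from the page URL."""
    parsed = urlparse(url)
    raw = f"{parsed.netloc}{parsed.path}".strip("/")
    # Collapse anything that is not a letter or digit into a single dash.
    slug = re.sub(r"[^A-Za-z0-9]+", "-", raw).strip("-") or "screenshot"
    return f"{slug}.{extension}"


def save_screenshot(image_bytes: bytes, url: str, directory: Path) -> Path:
    """Write the binary image data to the target directory and return its path."""
    target = directory / screenshot_filename(url)
    target.write_bytes(image_bytes)
    return target


# Placeholder bytes stand in for a real screenshot returned by the Screenshot API.
with tempfile.TemporaryDirectory() as tmp:
    saved = save_screenshot(b"\x89PNG placeholder", "https://example.com/products/1", Path(tmp))
    print(saved.name, saved.stat().st_size)
```

In the n8n template, the binary output of the Scrapfly node is passed directly to the Google Drive node, so no intermediate file needs to be written by hand.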
Errors
When a request fails, n8n shows the Scrapfly API error message in its error reporter. These messages match the standard Scrapfly API error messages. For more, see the standard Scrapfly API error documentation.
Pricing
No additional costs.