What are scrapy pipelines and how to use them?

# define our pipeline code: # pipelines.py import datetime class AddScrapedDatePipeline: def process_item(self, item, spider): current_utc_datetime = datetime.datetime.utcnow() item['scraped_date'] = current_utc_datetime.isoformat() return item # settings.py # activate pipeline in settings: ITEM_PIPELINES = { 'your_project_name.pipelines.AddScrapedDatePipeline': 300, }

Provided by Scrapfly

This knowledgebase is provided by Scrapfly data APIs, check us out! 👇

Web Scraping API - scrape without blocking, control cloud browsers, and more.

Extraction API - AI and LLM for parsing data.

Screenshot API - capture pages or elements with no blocks.

Mar 06, 2024

Web Scraping Dynamic Websites With Scrapy Playwright

Learn about Selenium Playwright. A Scrapy integration that allows web scraping dynamic web pages with Scrapy. We'll explain web scraping with Scrapy Playwright through an example project and how to use it for common scraping use cases, such as clicking elements, scrolling and waiting for elements.

Web Scraping Dynamic Web Pages With Scrapy Selenium

Mar 04, 2024

What are scrapy pipelines and how to use them?

Provided by Scrapfly

Related Questions

Related Posts

Web Scraping Dynamic Websites With Scrapy Playwright

Web Scraping Dynamic Web Pages With Scrapy Selenium

Scrapy Splash Guide: Scrape Dynamic Websites With Scrapy

Web Scraping With Scrapy: The Complete Guide in 2025