Web Scraping Without Blocking With Undetected ChromeDriver
In this tutorial we'll be taking a look at a new popular web scraping tool Undetected ChromeDriver which is a Selenium extension that allows to bypass many scraper blocking techniques.
When scraping pages protected by PerimeterX we might encounter messages like "Please verify you are Human: Press & Hold":
This message means the web scraper has been identified and is being blocked. PerimeterX is using various fingerprinting and detection techniques, like:
And other details like request pattern, HTTP version etc. To avoid being blocked one option is to fortify the web scraper against these identification methods.
For more on all technologies used for blocking web scrapers see our full introduction guide
Alternatively, a web scraping API such as ScrapFly can be used to scrape all PerimeterX pages. For that see the Anti Scraping Protection feature.