Web Scraping With Ruby
Introduction to web scraping with Ruby. How to handle http connections, parse html files for data, best practices, tips and an example project.
CSS selectors and XPath are both path languages for HTML parsing. Xpath is more powerful but CSS is more approachable - which is one is better?
Python has several options for executing XPath selectors against HTML. The most popular ones are lxml and parsel. Here's how to use them.
To select HTML elements by class name in XPath we can use the @ attribute selector and comparison function contains(). Here's how to do it.
To select elements by text using XPath contains() function can be used. Here's how to do it.
To execute XPath selectors in playwright the page.locator() method can be used. Here's how.
To parse HTML using XPath in Nodejs we can use one of two popular libraries like osmosis or xmldom. Here's how.
To find elements by XPath using Puppeteer the $x() method can be used. Here's how to use it.
To find HTML nodes by class name CSS selectors or XPath can be used. Here's how to do it.
BeautilfulSoup for Python doesn't support XPath selectors but there are popular alternatives to fill in this niche. Here are some.
BeautifulSoup is a popular HTML library for Python. It's most popular alternatives are lxml, parsel and html5lib. Here's how they differ from bs4.
Introduction to web scraping with Ruby. How to handle http connections, parse html files for data, best practices, tips and an example project.
Introduction to xpath in the context of web-scraping. How to extract data from HTML documents using xpath, best practices and available tools.
Introduction to web scraping with PHP. How to handle http connections, parse html files for data, best practices, tips and an example project.
Tutorial on web scraping with scrapy and Python through a real world example project. Best practices, extension highlights and common challenges.