How to find HTML elements by class?

by scrapecrow Oct 24, 2022

When web scraping, the most common way to navigate HTML data is to find elements by class name. For that, we can use CSS or XPath Selectors:

CSS selectors

Use the .class notation, which will find any nodes that contain full class name:

`.some-class`
will match:
`<a class="some-class"></a>`
`<a class="first some-class third"></a>`

Use the [class*="<partial>"] notation which will find any nodes that contain a given string:

`[class*="some-class"]`
will match:
`<a class="some-class"></a>`
`<a class="first some-class third"></a>`

Xpath selectors

Alternatively, XPath selectors can be used with similar functions:

//*[@class="link"] will find any element where the class is exactly equal to "link"
//*[contains(@class, "link")] will find any element where class contains the string "link"

How to Parse XML

In this article, we'll explain about XML parsing. We'll start by defining XML files, their format and how to navigate them for data extraction.

Web Scraping With Ruby

Introduction to web scraping with Ruby. How to handle http connections, parse html files for data, best practices, tips and an example project.

Ultimate XPath Cheatsheet for HTML Parsing in Web Scraping

Ultimate companion for HTML parsing using XPath selectors. This cheatsheet contains all syntax explanations with interactive examples.

XPATH

DATA-PARSING

Ultimate CSS Selector Cheatsheet for HTML Parsing

Ultimate companion for HTML parsing using CSS selectors. This cheatsheet contains all syntax explanations with interactive examples.

CSS-SELECTORS

DATA-PARSING

Ultimate CSS Selector Cheatsheet for HTML Parsing

Web Scraping With NodeJS and Javascript

In this article we'll take a look at scraping using Javascript through NodeJS. We'll cover common web scraping libraries, frequently encountered challenges and wrap everything up by scraping etsy.com