How to select elements by attribute using CSS selectors?

CSS selectors allow selecting elements by any attribute value such as class, id, href etc. This means we can extract any HTML elements based on attribute value with CSS selectors.

While class and id have special notation shortcuts - . and # respectively - any attribute can be selected using the attribute selector ([attribute]) which supports multiple operators:

The = match can be used for exact equality e.g. [attr=match] :

<div> <div title="product">select</div> <div title="sold-product">ignore</div> </div>

The ~= turns the attribute into an array of space-separated values and checks whether it contains the match (similar to .class and #id matches):

<div> <div title="product">select</div> <div title="sold product new">select</div> <div title="sold-product">ignore</div> </div>

The |= checks for exact equality except ignores hypen suffixes (e.g. -language):

<div> <div title="product">select</div> <div title="product-language-en">select</div> <div title="sold-product">ignore</div>  <div title="new product-language-en disabled">ignore</div> </div>

The ^= checks whether the attribute starts with the match:

<div> <div title="product1">select</div> <div title="product2">select</div> <div title="product3">select</div> <div title="4product">ignore</div> </div>

The $= checks whether the attribute ends with the match:

<div> <div title="1product">select</div> <div title="2product">select</div> <div title="3product">select</div> <div title="product4">ignore</div> </div>

The *= checks whether the attribute contains the match:

<div> <div title="1product-en">select</div> <div title="2product-de">select</div> <div title="prod-duct">ignore</div> </div>

Finally, to make all of these matches case insensitive add i before the closing bracket, e.g. [attr=match i]

Provided by Scrapfly

This knowledgebase is provided by Scrapfly data APIs, check us out! 👇

Web Scraping API - scrape without blocking, control cloud browsers, and more.
Extraction API - AI and LLM for parsing data.
Screenshot API - capture pages or elements with no blocks.

Try ScrapFly for FREE!

How to Parse XML

Jan 15, 2024

How to Parse XML

In this article, we'll explain about XML parsing. We'll start by defining XML files, their format and how to navigate them for data extraction.

Ultimate CSS Selector Cheatsheet for HTML Parsing

Aug 30, 2023

Ultimate CSS Selector Cheatsheet for HTML Parsing

Ultimate companion for HTML parsing using CSS selectors. This cheatsheet contains all syntax explanations with interactive examples.

Web Scraping With Ruby

Mar 07, 2022

Web Scraping With Ruby

Introduction to web scraping with Ruby. How to handle http connections, parse html files for data, best practices, tips and an example project.

Web Scraping With NodeJS and Javascript

Feb 14, 2022

Web Scraping With NodeJS and Javascript

In this article we'll take a look at scraping using Javascript through NodeJS. We'll cover common web scraping libraries, frequently encountered challenges and wrap everything up by scraping etsy.com