How to get the name of an HTML element in XPath?

To get the name of the selected element in XPath the name() function can be used. This function returns the name of the closest element in the current context.

The name() function is useful in HTML parsing in broad selection cases like selecting multiple names. See this interactive example:

<!-- select all links by selecting the @href attributes --> <article> <p>paragraph 1</p> <a>link 1</a> <p>paragraph 2</p> <script>some script data</script> <p>paragraph 3</p> </article>

Above, we're selecting only paragraph and link elements by using wildcard match (the * symbol) and a name() matching predicate.

Question tagged: XPath

Related Posts

How to Parse XML

In this article, we'll explain about XML parsing. We'll start by defining XML files, their format and how to navigate them for data extraction.

Ultimate XPath Cheatsheet for HTML Parsing in Web Scraping

Ultimate companion for HTML parsing using XPath selectors. This cheatsheet contains all syntax explanations with interactive examples.

Web Scraping With Ruby

Introduction to web scraping with Ruby. How to handle http connections, parse html files for data, best practices, tips and an example project.