# How to Scrape G2 Company Data and Reviews

 by [Mazen Ramadan](https://scrapfly.io/blog/author/mazen) Jun 23, 2026 10 min read [\#python](https://scrapfly.io/blog/tag/python) [\#scrapeguide](https://scrapfly.io/blog/tag/scrapeguide) 



[G2.com](https://www.g2.com/) is a leading website for software product and service data. It features thousands of product profiles, their reviews and alternative suggestions in various categories. However, due to the high protection level and the heavy use of CAPTCHA challenges, scraping G2.com can be challenging.

In this article, we'll explore web scraping G2. We'll explain how to scrape company data, reviews and alternatives from the website without getting blocked. We'll also use some web scraping tricks to make our scraper resilient, such as error handling and retrying logic. Let's get started!

## Key Takeaways

Learn to scrape G2.com software reviews and company data using Python with ScrapFly SDK, bypassing Datadome protection and CAPTCHA challenges for comprehensive business intelligence.

- Use ScrapFly SDK to bypass G2's Datadome anti-scraping protection and CAPTCHA challenges automatically
- Parse HTML with XPath and CSS selectors to extract software product reviews and company information
- Handle G2's heavy anti-bot measures with proper error handling and retry logic implementation
- Extract structured data including product ratings, reviews, alternative suggestions, and company profiles
- Implement asynchronous scraping with proper rate limiting to avoid triggering additional security measures
- Use robust error handling to manage blocked requests and maintain scraper stability

[**View Source Code**: github.com/scrapfly/scrapfly-scrapers/tree/main/g2-scraper](https://github.com/scrapfly/scrapfly-scrapers/tree/main/g2-scraper)



## Why Scrape G2?

G2 provides comprehensive **software product and service** details as well as **metadata, review** and **alternative** information with detailed pros/cons comparisons. So, if you are evaluating software as a prospective customer, scraping G2's company data can help with decision-making and product comparisons.

G2's reviews can also be a good resource for developing **Machine Learning** models. Companies can analyze these reviews through **sentiment analysis** to gain insights into specific companies or market niches.

Moreover, manually exploring tens of company review pages on the website can be tedious and time-consuming. Therefore, scraping G2 can save a lot of manual effort by quickly retrieving thousands of reviews.


## Project Setup

To scrape G2.com, we'll use a few Python packages:

- `scrapfly-sdk` for bypassing G2 anti-scraping challenges and blocking.
- `asyncio` for increasing our [web scraping speed](https://scrapfly.io/blog/posts/web-scraping-speed) by running our code asynchronously.

Note that `asyncio` is part of Python's standard library, so you only have to install `scrapfly-sdk` using the following pip command:

```shell
pip install scrapfly-sdk
```


## Avoid G2 Web Scraping Blocking

G2 relies heavily on [Datadome](https://scrapfly.io/blog/posts/how-to-bypass-datadome-anti-scraping) challenges to prevent scraping. For example, let's send a simple request to the website using [httpx](https://scrapfly.io/blog/posts/web-scraping-with-python-httpx). We'll use headers similar to those of real browsers to decrease the chance of getting detected and blocked:

```python
from httpx import Client

# initialize an httpx client
client = Client(
    # enable HTTP/2 (needs the optional extra: pip install "httpx[http2]")
    http2=True,
    # add basic browser-like headers to reduce the chance of being blocked
    headers={
        "user-agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/96.0.4664.110 Safari/537.36",
        "accept": "text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8",
        "accept-language": "en-US,en;q=0.9",
        "accept-encoding": "gzip, deflate, br",
    }
)

response = client.get("https://www.g2.com")
print(response)
# <Response [403 Forbidden]>
```


The above request gets detected and is asked to solve a CAPTCHA challenge:

*G2 scraper blocking*

To scrape G2 without getting blocked, we don't actually need to solve the CAPTCHA. We're just not going to get it at all! For that, we'll use [ScrapFly](https://scrapfly.io/web-scraping-api) - a web scraping API that allows for scraping at scale by providing:

- [Cloud headless browsers](https://scrapfly.io/docs/scrape-api/javascript-rendering) - for scraping dynamically loaded content without running headless browsers yourself.
- [Anti scraping protection bypass](https://scrapfly.io/docs/scrape-api/anti-scraping-protection) - for bypassing any website scraping blocking.
- [Residential proxies](https://scrapfly.io/docs/scrape-api/proxy) from 50+ countries - for avoiding IP address blocking and throttling, while also allowing for scraping from almost any geographic location.
- [And much more!](https://scrapfly.io/docs/scrape-api/getting-started)

By using Scrapfly's `asp` feature with the [ScrapFly SDK](https://scrapfly.io/docs/sdk/python), we can easily bypass G2 scraper blocking:

```python
from scrapfly import ScrapeConfig, ScrapflyClient, ScrapeApiResponse

scrapfly = ScrapflyClient(key="Your ScrapFly API key")

api_response: ScrapeApiResponse = scrapfly.scrape(
    ScrapeConfig(
        # some g2 URL
        url="https://www.g2.com",
        # cloud headless browser similar to Playwright
        render_js=True,
        # bypass anti-scraping protection
        asp=True,
        # set the geographical location to a specific country
        country="US",
    )
)
# Print the website's status code
print(api_response.upstream_status_code)
# 200
```


We'll use ScrapFly as our HTTP client for the rest of the article. So, to follow along, you'll need a [ScrapFly API key](https://scrapfly.io/register).


## How to Scrape G2 Search Pages

Let's start by scraping search pages on G2. Use the search bar to search for any keyword on the website and you will get a page similar to this:


The search pages support pagination by adding a `page` parameter to the URL:

```
https://www.g2.com/search?query=Infrastructure&page=2
```


The above parameter can be used for crawling over search pages.
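As a rough illustration of how that parameter can drive a crawl, the sketch below builds search page URLs with the standard library. The helper names `search_page_url()` and `remaining_search_urls()` are hypothetical and not part of the article's scraper; only the `query` and `page` parameters come from the URL shown above.

```python
from typing import List
from urllib.parse import urlencode

SEARCH_URL = "https://www.g2.com/search"


def search_page_url(query: str, page: int = 1) -> str:
    """Build the URL of a single search result page (hypothetical helper)."""
    # urlencode escapes the keyword, e.g. spaces become "+"
    return f"{SEARCH_URL}?{urlencode({'query': query, 'page': page})}"


def remaining_search_urls(query: str, total_pages: int) -> List[str]:
    """URLs for pages 2..total_pages; page 1 is assumed to be scraped first."""
    return [search_page_url(query, page) for page in range(2, total_pages + 1)]
```

For example, `search_page_url("Infrastructure", 2)` reproduces the URL shown above.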

We'll request the search pages using ScrapFly and use the function we created to parse the data from the HTML:


Code: [gist.github.com/scrapfly-dev/307450c738fbc28b228a54c1918121f2](https://gist.github.com/scrapfly-dev/307450c738fbc28b228a54c1918121f2)

Here, we define a `parse_search_page()` function, which parses the company data from the page HTML using XPath selectors. It also extracts the total number of search results, which tells us how many pages there are to crawl. Then, we add a `scrape_search()` function that sends a request to the first search page using the ScrapFly client and extracts its data with `parse_search_page()`.

As for pagination crawling, we add the remaining search page URLs to a scraping list and scrape them concurrently. Next, we remove the successful requests from the URL list and extend the first page data with the new ones.

Here is a sample of the scraped search results:

```json
[
    {
      "name": "Oracle Cloud Infrastructure",
      "link": "https://www.g2.com/products/oracle-oracle-cloud-infrastructure/reviews",
      "image": "https://images.g2crowd.com/uploads/product/image/large_detail/large_detail_2753ea8c7953188158425365667be750/oracle-oracle-cloud-infrastructure.png",
      "rate": 4.2,
      "reviewsNumber": 371,
      "description": null,
      "categories": [
        "Other Product Suites"
      ]
    },
    ....
]
```


Our G2 scraper can successfully scrape company data from search pages. Let's scrape company reviews next.


## How to Scrape G2 Company Reviews

In this section, we'll scrape company reviews from their pages. Before we start, let's have a look at the G2 review pages. Go to any company or product page on the website, such as the [DigitalOcean](https://www.g2.com/products/digitalocean/reviews) page, and you will find reviews that look like this:

*Review on G2 company pages*

Review pages also support pagination by adding the same `page` parameter:

```
https://www.g2.com/products/digitalocean/reviews?page=2
```


Since each review page contains 25 reviews, we'll iterate over the review cards to extract each review's data. As we did earlier with search pages, we'll request the first page and then crawl over the remaining ones:


Code: [gist.github.com/scrapfly-dev/7ab4a770b8813b387efb88ca538771d8](https://gist.github.com/scrapfly-dev/7ab4a770b8813b387efb88ca538771d8)

This code is similar to the G2 search scraping logic we wrote earlier. We start by scraping the first review page and the total number of reviews. Next, we add the remaining review pages to a scraping list and scrape them concurrently. Finally, we save the result to a JSON file.
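To make the pagination arithmetic concrete, here is a small sketch that turns a total review count into the list of remaining review page URLs. It relies on the 25-reviews-per-page figure mentioned earlier; the helper name `review_page_urls()` is hypothetical and not taken from the article's code.

```python
import math
from typing import List

REVIEWS_PER_PAGE = 25  # page size stated in this section


def review_page_urls(product: str, total_reviews: int) -> List[str]:
    """Build URLs for review pages 2..N (page 1 is scraped first)."""
    total_pages = math.ceil(total_reviews / REVIEWS_PER_PAGE)
    base_url = f"https://www.g2.com/products/{product}/reviews"
    return [f"{base_url}?page={page}" for page in range(2, total_pages + 1)]
```

With 60 reviews, for instance, the count rounds up to three pages, so the helper returns the URLs for pages 2 and 3.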

Here is a sample of the scraped reviews:

```json
[
    {
      "author": {
        "authorName": "Marlon P.",
        "authorProfile": "https://www.g2.com/users/d523e9ac-7e5b-453f-85f8-9ab05b27a556",
        "authorPosition": "Desenvolvedor de front-end",
        "authorCompanySize": []
      },
      "review": {
        "reviewTags": [
          "Validated Reviewer",
          "Verified Current User",
          "Review source: Seller invite",
          "Incentivized Review"
        ],
        "reviewData": "2023-11-14",
        "reviewRate": 4.5,
        "reviewTitle": "Good for beginners",
        "reviewLikes": "It was very simple to start playing around and be able to test the projects I'm learning about for a cool price. I use it at work and it's easy to create new machines. Initial configuration is simple with the app Free tier is so short.  is now than the company need money but, for me 90 days it was very fast to user the credits. \n\nThere is an app configuration file that I find very annoying to configure. It would be cool if there was a way to test that locally. I've had a lot of problems that doing several deployments in production to see my app's configuration is ok. leave personal projects public. And in the company, when I have to use it, I find it very simple to use the terminal via the platform ",
        "reviewDilikes": "Free tier is so short.  is now than the company need money but, for me 90 days it was very fast to user the credits. \n\nThere is an app configuration file that I find very annoying to configure. It would be cool if there was a way to test that locally. I've had a lot of problems that doing several deployments in production to see my app's configuration is ok. "
      }
    },
    ....
]
```


Our G2 scraper can successfully scrape review pages. Next, we'll scrape company competitor pages for company alternative listings.


## How to Scrape G2 Company Alternatives

G2 company competitor pages offer detailed company product comparisons. We'll focus on the company's alternative listings, but other comparison details can be scraped in the same way.

First, go to any company alternative page, like the [digitalocean alternatives page](https://www.g2.com/products/digitalocean/competitors/alternatives). The company alternatives listing should look like this:


*Company alternatives on G2*

As we can see from the image, the company listings can be narrowed down according to a specific market niche, like small business, mid-market and enterprise alternatives. While the default URL represents the top 10 alternatives filter, we can apply other filters by adding the filter name at the end of the URL:

```
https://www.g2.com/products/digitalocean/competitors/alternatives/small-business
```
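A tiny helper can assemble these URLs. This is only a sketch: `alternatives_url()` is a hypothetical name, and `small-business` is the only filter slug confirmed by the URL above, so any other slug passed in is an assumption to verify against the site.

```python
from typing import Optional


def alternatives_url(product: str, segment: Optional[str] = None) -> str:
    """Build an alternatives page URL, optionally narrowed by a filter slug."""
    url = f"https://www.g2.com/products/{product}/competitors/alternatives"
    # without a segment, the default (top 10 alternatives) URL is returned
    return f"{url}/{segment}" if segment else url
```

Calling `alternatives_url("digitalocean", "small-business")` yields the URL shown above.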


We'll use this filter to control which alternatives our G2 scraper collects:


Code: [gist.github.com/scrapfly-dev/ef74e0947ef0794b0dc893b65fb7664d](https://gist.github.com/scrapfly-dev/ef74e0947ef0794b0dc893b65fb7664d)

Here, we define a `parse_alternatives()` function. It iterates over the alternative cards in the HTML and extracts the company listing data from each card. We call it on the HTML returned after requesting the alternatives page URL.
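The card-iteration pattern can be sketched with the standard library alone. Everything about the markup below is an assumption: the tags and class names are invented for illustration and do not match G2's real HTML, and `xml.etree.ElementTree` is swapped in for the XPath selectors the article's scraper uses, so it only handles well-formed markup. The function name `parse_alternative_cards()` is likewise hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List
from urllib.parse import urljoin

# Hypothetical, simplified markup (NOT G2's real HTML structure).
SAMPLE_HTML = """
<div id="alternatives">
  <div class="alt-card">
    <span class="rank">#1</span>
    <a class="name" href="/products/example-a/reviews">Example A</a>
    <span class="rate">4.9</span>
    <span class="reviews">(438)</span>
    <p class="desc">Example hosting product.</p>
  </div>
  <div class="alt-card">
    <span class="rank">#2</span>
    <a class="name" href="/products/example-b/reviews">Example B</a>
    <span class="rate">4.5</span>
    <span class="reviews">(1,024)</span>
  </div>
</div>
"""


def parse_alternative_cards(html: str) -> List[Dict]:
    """Iterate over the cards and pull one record out of each."""
    root = ET.fromstring(html.strip())
    alternatives = []
    for card in root.findall(".//div[@class='alt-card']"):
        link = card.find("a[@class='name']")
        # keep only the digits of a count such as "(1,024)"
        reviews_text = card.findtext("span[@class='reviews']", default="")
        digits = "".join(ch for ch in reviews_text if ch.isdigit())
        alternatives.append({
            "name": link.text,
            "link": urljoin("https://www.g2.com", link.get("href")),
            "ranking": card.findtext("span[@class='rank']"),
            "numberOfReviews": int(digits) if digits else None,
            "rate": float(card.findtext("span[@class='rate']")),
            # a missing description becomes None instead of raising
            "description": card.findtext("p[@class='desc']"),
        })
    return alternatives
```

Each card is parsed independently, so a missing optional field in one card (such as the description in the second sample card) does not affect the others.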

Here is a sample of the scraped alternatives:

```json
[
  {
    "name": "Hostwinds",
    "link": "https://www.g2.com/products/hostwinds/reviews",
    "ranking": "#1",
    "numberOfReviews": 438,
    "rate": 4.9,
    "description": "Hostwinds offers website hosting for individuals and businesses of all sizes, with 24/7/365 support and nightly backups."
  },
  ....
]
```


---

With this last piece, our G2 scraper is complete! It can scrape company data from search, competitor and review pages on G2.com. There are other pages on the website that are worth scraping, such as detailed company comparison pages. These pages can be scraped by following the same steps as in our previous G2 scraping code snippets.


## FAQ

**Is it legal to scrape G2?**

Yes, all the data on G2.com is publicly available and it's legal to scrape as long as the website is not harmed in the process. However, commercializing personal data such as reviewers' emails may violate GDPR in EU countries. Refer to our previous article on [web scraping legality](https://scrapfly.io/is-web-scraping-legal) for more details.

**Is there a public API for G2.com?**

At the time of writing, there are no public APIs for G2 that we can use for web scraping. However, G2's HTML is pretty descriptive, which makes scraping it through HTML parsing viable.

**Are there alternatives for G2?**

Yes, Trustpilot.com is another popular website for company reviews. Refer to our [\#scrapeguide](https://scrapfly.io/blog/tags/scrapeguide/) blog tag for its scraping guide and for other related web scraping guides.



## Web Scraping G2 - Summary

G2.com is a global website for company reviews and comparisons, known for its high protection level.

We explained how to avoid G2 scraping blocking using ScrapFly. We also went through a step-by-step guide on how to scrape G2 using Python. We have used HTML parsing to scrape search, review and competitor pages on G2.

**Legal Disclaimer and Precautions**

This tutorial covers popular web scraping techniques for education. Interacting with public servers requires diligence and respect:

- Do not scrape at rates that could damage the website.
- Do not scrape data that's not available publicly.
- Do not store PII of EU citizens protected by GDPR.
- Do not repurpose *entire* public datasets which can be illegal in some countries.

Scrapfly does not offer legal advice but these are good general rules to follow. For more you should consult a lawyer.

