Guide to Python Requests Headers

Q: How do I set Content-Type headers for POST requests with JSON data in Python requests?

Set Content-Type: application/json and use the json parameter: requests.post(url, json=data, headers={'Content-Type': 'application/json'}) . The json parameter automatically sets the correct Content-Type and serializes the data. See our complete guide on POST requests .

Q: What are the minimum headers needed to mimic a real browser in Python requests?

Essential headers: User-Agent , Accept , Accept-Language , Accept-Encoding , Connection: keep-alive , and Sec-Fetch-* headers. Include Referer for navigation context and DNT: 1 for privacy compliance. See our guide on avoiding blocking with headers .

Q: How can I find out what headers a website expects?

Using browser developer tools , you can inspect the headers sent with each request to a website. Copying these headers into your Python requests can help your request mimic browser traffic.

by Ziad Shamndy Sep 26, 2025

#python #requests #http

When interacting with web servers, whether for web scraping or API work, Python requests headers are a powerful yet often overlooked tool. These headers communicate silently, telling the server who’s calling, why, and in what format data should be returned.

In this guide, we’ll cover everything you need to know about setting up headers with Python’s requests library, why header order matters, and how understanding headers can improve the success of your web interactions.

For those new to the library, you can get started by installing it with pip install requests to follow along with this guide.

Key Takeaways

Master Python requests headers for web scraping using proper HTTP headers, authentication, and session management to bypass anti-bot measures and improve scraping success rates.

Use requests.Session() for persistent headers and cookie management across multiple requests
Implement proper User-Agent headers to mimic real browsers and avoid bot detection
Add authentication headers including Authorization and API keys for secure endpoints
Handle different content types with Accept headers and proper encoding for international content
Bypass rate limiting and anti-scraping measures with realistic header patterns
Implement exponential backoff retry logic with 403 status code detection for rate limiting

What Are Headers in Python Requests?

In HTTP, headers are key-value pairs that accompany each request and response, guiding the server on how to process the request. Headers specify expectations, formats, and permissions, playing a critical role in server-client communication. For instance, headers can tell the server about the type of device sending the request, or whether the client expects a JSON response.

Each request initiates a dialogue between the client (like a browser or application) and server, with headers acting as instructions. The most common headers include:

Content-Type: Indicates the media type (e.g., application/json), helping the server understand content format, especially for POST requests.
Authorization: Used for sending credentials or API tokens for accessing protected resources.
User-Agent: Identifies the client application, which helps servers distinguish real users from automated bots.
Accept: Specifies the content types (e.g., JSON, XML) the client can process, enabling the server to send compatible responses.
Cookie: Transmits stored cookies for session continuity.
Cache-Control: Directs caching behavior, specifying cache duration and conditions.

Headers can be easily managed using Python’s requests library, allowing you to get headers from a response or set custom headers to tailor each request.

Example: Getting Headers with Python Requests

In Python requests to get the headers can be done with response.headers.

import requests

response = requests.get('https://httpbin.dev')
print(response.headers)
{
  "Access-Control-Allow-Credentials": "true",
  "Access-Control-Allow-Origin": "*",
  "Content-Security-Policy": "frame-ancestors 'self' *.httpbin.dev; font-src 'self' *.httpbin.dev; default-src 'self' *.httpbin.dev; img-src 'self' *.httpbin.dev https://cdn.scrapfly.io; media-src 'self' *.httpbin.dev; script-src 'self' 'unsafe-inline' 'unsafe-eval' *.httpbin.dev; style-src 'self' 'unsafe-inline' *.httpbin.dev https://unpkg.com; frame-src 'self' *.httpbin.dev; worker-src 'self' *.httpbin.dev; connect-src 'self' *.httpbin.dev",
  "Content-Type": "text/html; charset=utf-8",
  "Date": "Fri, 25 Oct 2024 14:14:02 GMT",
  "Permissions-Policy": "fullscreen=(self), autoplay=*, geolocation=(), camera=()",
  "Referrer-Policy": "strict-origin-when-cross-origin",
  "Strict-Transport-Security": "max-age=31536000; includeSubDomains; preload",
  "X-Content-Type-Options": "nosniff",
  "X-Xss-Protection": "1; mode=block",
  "Transfer-Encoding": "chunked"
}

The output shows headers the server sends back, with details like

media type Content-Type
security policies (Content-Security-Policy)
allowed origins (Access-Control-Allow-Origin).

Example: Setting Custom Headers

Custom headers, like adding a User-Agent for device emulation, can make requests appear more authentic:

headers = {'User-Agent': 'my-app/0.0.1'}
response = requests.get('https://httpbin.dev/headers', headers=headers)
print(response.json())
{
"headers": {
  "Accept": ["*/*"],
  "Accept-Encoding": ["gzip, deflate"],
  "Host": ["httpbin.dev"],
  "User-Agent": ["my-app/0.0.1"],
  "X-Forwarded-For": ["45.242.24.152"],
  "X-Forwarded-Host": ["httpbin.dev"],
  "X-Forwarded-Port": ["443"],
  "X-Forwarded-Proto": ["https"],
  "X-Forwarded-Server": ["traefik-2kvlz"],
  "X-Real-Ip": ["45.242.24.152"]
}}

This setup helps ensure each request appears browser-like, reducing the chance of triggering anti-bot measures. In Python requests, setting headers lets you precisely control interactions with the server.

Are Headers Case-Sensitive?

A frequent question when working with Python requests headers is whether header names are case-sensitive.

According to the HTTP/1.1 specification, header names are case-insensitive, meaning Content-Type, content-type, and CONTENT-TYPE are all equivalent. However, sticking to standard naming conventions like Content-Type instead of alternative casing is a good practice. Standardizing the format helps prevent confusion, especially when integrating with third-party APIs or systems that may interpret headers differently.

Why Case Sensitivity Plays a Role in Bot Detection?

When web servers evaluate requests, subtle details such as inconsistent header casing can reveal the nature of a client. Many legitimate browsers and applications follow specific casing conventions, like capitalizing Content-Type. Bots or scripts, however, may not follow these conventions uniformly. By analyzing requests with unconventional casing, servers can flag or block potential bots.

In practice, Python’s requests library automatically handles case normalization for headers when using functions like python requests set headers. This means that regardless of how you write the header name, the library converts it to a standardized format, ensuring compatibility with the server. However, note that while the header names themselves are case-insensitive, header values (such as “application/json” in Content-Type) may still be interpreted literally and should be formatted accurately.

Example of Case-Insensitive Headers

In Python’s requests library, you can set headers in any case, and the library will interpret them correctly:

import requests

# Setting 'content-type' in lowercase
headers = {'content-type': 'application/json'}  # This works just fine
response = requests.post('https://httpbin.dev/api', headers=headers)
print(response.request.headers)
{
  "Content-Type": "application/json",
  "User-Agent": "python-requests/2.28.1",
  "Accept-Encoding": "gzip, deflate",
  "Accept": "*/*",
  "Connection": "keep-alive"
}

As shown above, requests automatically converted content-type to the standard Content-Type. This demonstrates that Python’s requests library will normalize header names for you, maintaining compatibility with web servers regardless of the case used in the original code.

Does Header Order Matter?

In most standard API interactions, the order of headers sent with a Python requests headers call does not affect functionality, as the HTTP specification does not require a specific order for headers. However, when dealing with advanced anti-bot and anti-scraping systems, header order can play an unexpectedly significant role in determining whether a request is accepted or blocked.

Why Header Order Matters for Bot Detection

Anti-bot systems, such as Cloudflare, DataDome, and PerimeterX, often go beyond simple header verification and analyze the "fingerprint" of a request. This includes the order in which headers are sent. Human users (via browsers) typically send headers in a consistent order. For example, browser requests might commonly follow an order such as User-Agent, Accept, Accept-Language, Referer, and so on. In contrast, automation libraries or scrapers may send headers in a different order or add non-standard headers, which can serve as red flags for detection algorithms.

Example: Browser Headers vs. Python Requests Headers

In a browser, you might observe headers in this order:

{
  "User-Agent": "Mozilla/5.0...",
  "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9",
  "Accept-Language": "en-US,en;q=0.5",
  "Accept-Encoding": "gzip, deflate, br",
  "Referer": "https://example.com",
  "Connection": "keep-alive"
}

With Python’s requests library, headers might look slightly different:

import requests

headers = {
    "Accept": "application/json",
    "User-Agent": "my-scraper/1.0",
    "Connection": "keep-alive",
    "Referer": "https://httpbin.dev"
}
response = requests.get("https://httpbin.dev/headers", headers=headers)
print(response.json())  # This output may vary based on server handling
{
  "headers": {
    "Accept": "application/json",
    "User-Agent": "my-scraper/1.0",
    "Connection": "keep-alive",
    "Referer": "https://httpbin.dev"
  }
}

This slight difference in header ordering can hint to anti-bot systems that the request might be automated, especially if combined with other signals, such as the User-Agent format or missing headers.

By analyzing this order, advanced detection systems can identify patterns often associated with automated scripts or bots. When a request does not match the usual order, the server may assume it’s coming from a bot, potentially resulting in blocked requests or captcha challenges.

Standard Headers in Python Requests

When setting up Python requests headers to mimic browser requests, it's helpful to know which headers are standard in most web browsers. These headers inform the server about the client’s capabilities and preferences, making the request appear more legitimate.

Key Standard Headers

Standard headers mimic browser behavior, increasing the success of requests. Key headers include:

User-Agent: Identifies the browser and OS, helping the request appear like genuine browser traffic. Example: Mozilla/5.0 (Windows NT 10.0; Win64; x64) Chrome/110.0.0.0.
Accept: Declares accepted content types, e.g., text/html for web pages, application/json for APIs.
Accept-Language: Preferred languages, such as en-US, to match browser settings.
Accept-Encoding: Lists accepted compression methods (e.g., gzip, deflate) to reduce data size.
Referer: Provides the URL of the previous page, giving context to the server.
Connection: Defines connection type; typically set to keep-alive for browser-like behavior.

Verifying Browser Headers

To ensure requests mimic real browsers:

Browser Developer Tools:
- Use the Network tab in Chrome or Firefox Developer Tools to inspect headers.
- Copy headers directly from real browser requests for authenticity.
Proxy Tools:
- Tools like Fiddler allow for detailed inspection and modification of HTTP headers in real-time, ideal for refining requests.

Example: Mimicking Headers in Python

import requests

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/110.0.0.0 Safari/537.36',
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,*/*;q=0.8',
    'Accept-Language': 'en-US,en;q=0.5',
    'Accept-Encoding': 'gzip, deflate, br',
    'Referer': 'https://httpbin.dev',
    'Connection': 'keep-alive'
}
response = requests.get('https://httpbin.dev', headers=headers)
print(response.status_code)

This request uses browser-like headers to make the interaction appear more natural. By observing the headers and header order from browser tools, you can customize these in Python to make your request as close to a real browser request as possible.

Importance of the User-Agent String

The User-Agent string plays a crucial role in how servers respond to requests. It identifies the application, operating system, and device making the request, allowing servers to tailor their responses accordingly.

User-Agent strings are typically generated by the browser itself and can vary based on the version of the browser, the operating system, and even the hardware configuration.

You can learn more about How to Effectively Use User Agents for Web Scraping in our dedicated article:

How to Effectively Use User Agents for Web Scraping

In this article, we’ll take a look at the User-Agent header, what it is and how to use it in web scraping. We'll also generate and rotate user agents to avoid web scraping blocking.

Headers for POST Requests

When using Python requests headers with POST requests, headers play a vital role in how the server interprets the data sent by the client. POST requests are typically used to send data to a server to create, update, or modify resources, often requiring additional headers to clarify the data’s structure, format, and purpose.

Key Headers for POST Requests

Content-Type: Indicates the data format, such as application/json for JSON data, application/x-www-form-urlencoded for form submissions, or multipart/form-data for files. Setting this correctly ensures the server parses your data as expected.
User-Agent: Identifies the client application, which helps with API access and rate limit policies.
Authorization: Needed for secure endpoints to authenticate requests, often using tokens or credentials.
Accept: Specifies the desired response format (e.g., application/json), aiding in consistent data handling and error processing.

Example Usage of Headers for POST Requests

To send data in a JSON format, you typically set the Content-Type header to application/json and pass the data as JSON. Here’s an example with python requests post headers to send a JSON payload:

import requests

# Setting headers for a JSON POST request
headers = {
    'Content-Type': 'application/json',  # Indicates JSON payload
    'User-Agent': 'my-app/0.0.1'
}
data = '{"key": "value"}'
response = requests.post('https://httpbin.dev/api', headers=headers, json=data)

print(response.status_code)
print(response.json())  # View response if it's in JSON format

Content-Type: Setting this to application/json allows the server to recognize and parse the payload as JSON.
User-Agent: Identifies the client making the request.
data: The JSON object you wish to send to the server.

Using python requests post headers in this way ensures the server processes your data correctly and may prevent the request from being blocked.

Browser-Specific Headers

When a server expects traffic from real users, it may check for certain browser-specific headers that are typically sent only by actual web browsers. These headers help identify and differentiate browsers from automated scripts, which is particularly important when navigating anti-bot protections on certain sites. By configuring Python requests headers to mimic these browser-specific patterns, you can make your requests appear more human-like, often increasing the chances of successful requests.

Common Browser-Specific Headers

DNT (Do Not Track): Informs the server of the user’s tracking preference (1 means "do not track"), making the request more browser-like.
Sec-Fetch-Site: Shows the origin relationship, with values like same-origin, cross-site, and none, helping mimic genuine navigation context.
Sec-Fetch-Mode: Defines request purpose, such as navigate for page loads, making it useful for replicating typical browser behavior.
Sec-Fetch-Dest: Indicates content type (document, image, script), useful for mimicking specific resource requests.

Example of Browser-Specific Headers in Python Requests:
Set browser-specific headers when making requests using the requests library in Python.

import requests

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/94.0.4606.81 Safari/537.36',
    'DNT': '1',                       # Respects user preference for tracking
    'Sec-Fetch-Site': 'none',          # Represents a top-level navigation action
    'Sec-Fetch-Mode': 'navigate',      # Simulates a full-page load
    'Sec-Fetch-Dest': 'document'       # Indicates the content type expected
}
response = requests.get('https://httpbin.dev', headers=headers)
print(response.status_code)
200
print(response.headers)
{
  "Access-Control-Allow-Credentials": "true",
  "Access-Control-Allow-Origin": "*",
  "Content-Security-Policy": "frame-ancestors 'self' *.httpbin.dev; font-src 'self' *.httpbin.dev; default-src 'self' *.httpbin.dev; img-src 'self' *.httpbin.dev https://cdn.scrapfly.io; media-src 'self' *.httpbin.dev; script-src 'self' 'unsafe-inline' 'unsafe-eval' *.httpbin.dev; style-src 'self' 'unsafe-inline' *.httpbin.dev https://unpkg.com; frame-src 'self' *.httpbin.dev; worker-src 'self' *.httpbin.dev; connect-src 'self' *.httpbin.dev",
  "Content-Type": "text/html; charset=utf-8",
  "Date": "Sun, 27 Oct 2024 11:48:47 GMT",
  "Permissions-Policy": "fullscreen=(self), autoplay=*, geolocation=(), camera=()",
  "Referrer-Policy": "strict-origin-when-cross-origin",
  "Strict-Transport-Security": "max-age=31536000; includeSubDomains; preload",
  "X-Content-Type-Options": "nosniff",
  "X-Xss-Protection": "1; mode=block",
  "Transfer-Encoding": "chunked"
}

By including these headers, you can make your request appear closer to those typically sent by browsers, reducing the likelihood of being flagged as a bot or encountering access restrictions.

Why Use Browser-Specific Headers?

Anti-Bot Detection: Browser-specific headers help requests resemble regular user traffic, making it harder for anti-bot systems to flag them.
Enhanced Compatibility: Some sites offer different responses for browser-like requests, making these headers useful for sites that restrict non-browser traffic.
Request Authenticity: Mimicking browser behavior with these headers can increase request success rates by reducing the chance of blocks.

Blocking Requests with Invalid Headers

When working with Python requests headers, it’s essential to use valid, correctly formatted headers. Many servers actively monitor incoming headers to detect unusual or incomplete requests. Requests with invalid or missing headers—such as a missing User-Agent, improperly set Content-Type, or contradictory headers—are common signals of automated or suspicious traffic and can lead to immediate blocking.

For example, headers that contradict each other, like mixing Accept: text/html with Content-Type: application/json, may cause the server to reject your request, as this combination doesn’t align with typical browser behavior.

Additionally, some websites use AI-powered anti-bot tools to scrutinize headers and pinpoint bot-like inconsistencies. Testing headers for potential issues is best done on a controlled platform.

Practical Tips to Avoid Blocking

These practical tips for setting headers, Like using User-Agent, matching Content-Type, and avoiding excessive headers help reduce detection and minimize request blocking.

Include Required Headers: Always include essential headers like User-Agent to avoid server rejections.
Match Expected Content-Type: When sending data, use the correct Content-Type, such as application/json for JSON data or multipart/form-data for file uploads.
Avoid Unnecessary Headers: Adding excessive or irrelevant headers may signal automation, especially if they’re not consistent with standard browser requests.

Taking these precautions when setting headers can significantly improve the success rate of your requests and help you bypass potential blocks effectively.

Power Up with Scrapfly

While requests is a powerful HTTP client library it's not a great tool for scraping as it's hard to scale and easy to identify and block.

ScrapFly provides web scraping, screenshot, and extraction APIs for data collection at scale.

Anti-bot protection bypass - scrape web pages without blocking!
Rotating residential proxies - prevent IP address and geographic blocks.
JavaScript rendering - scrape dynamic web pages through cloud browsers.
Full browser automation - control browsers to scroll, input and click on objects.
Format conversion - scrape as HTML, JSON, Text, or Markdown.
Python and Typescript SDKs, as well as Scrapy and no-code tool integrations.

FAQs

Why does my Python requests script get blocked even with User-Agent headers set?

Anti-bot systems analyze multiple factors beyond User-Agent, including header order, missing browser-specific headers (like Sec-Fetch-*), request timing patterns, and IP reputation. Use complete browser header sets and consider proxy rotation.

How do I set Content-Type headers for POST requests with JSON data in Python requests?

Set Content-Type: application/json and use the json parameter: requests.post(url, json=data, headers={'Content-Type': 'application/json'}). The json parameter automatically sets the correct Content-Type and serializes the data. See our complete guide on POST requests.

Does header order matter in Python requests for avoiding bot detection?

Yes, advanced anti-bot systems analyze header order patterns. Browsers send headers in consistent orders (User-Agent, Accept, Accept-Language, etc.). Use browser developer tools to copy exact header sequences for better mimicry.

What are the minimum headers needed to mimic a real browser in Python requests?

Essential headers: User-Agent, Accept, Accept-Language, Accept-Encoding, Connection: keep-alive, and Sec-Fetch-* headers. Include Referer for navigation context and DNT: 1 for privacy compliance. See our guide on avoiding blocking with headers.

How can I debug which headers a website is rejecting in my Python requests?

Use browser developer tools to compare your request headers with successful browser requests. Test with tools like httpbin.dev to see what headers are being sent. Monitor response status codes and error messages for clues about rejected headers.

What role do headers play in HTTP requests?

Headers convey additional information with each request, such as the type of data expected, client information, and authorization details. They're essential for communicating preferences and ensuring that servers handle requests correctly.

Why are headers important in web scraping and API requests?

Headers can help bypass anti-bot detection, authenticate requests, and ensure the correct data format in responses. Customizing headers to resemble real browser requests is especially helpful for scraping and accessing restricted APIs.

How can I find out what headers a website expects?

Using browser developer tools, you can inspect the headers sent with each request to a website. Copying these headers into your Python requests can help your request mimic browser traffic.

Summary

Working with Python requests headers is essential for both web scraping and API interactions. Understanding how to set, get, and manipulate headers can help you create more effective and reliable requests. Whether you're dealing with GET or POST requests, mimicking browser headers, or trying to avoid detection, the way you handle headers can make or break your scraping success.

By following best practices, Such as using standard headers, setting appropriate values for POST requests, and ensuring header order, Your requests will be better equipped to navigate the complex landscape of modern web services.

Guide to Python Requests Headers

Explore this Article with AI

Key Takeaways

What Are Headers in Python Requests?

Are Headers Case-Sensitive?

Why Case Sensitivity Plays a Role in Bot Detection?

Example of Case-Insensitive Headers

Does Header Order Matter?

Why Header Order Matters for Bot Detection

Standard Headers in Python Requests

Key Standard Headers

Verifying Browser Headers

Importance of the User-Agent String

How to Effectively Use User Agents for Web Scraping

Headers for POST Requests

Key Headers for POST Requests

Browser-Specific Headers

Common Browser-Specific Headers

Why Use Browser-Specific Headers?

Blocking Requests with Invalid Headers

Practical Tips to Avoid Blocking

Power Up with Scrapfly

FAQs

Why does my Python requests script get blocked even with User-Agent headers set?

How do I set Content-Type headers for POST requests with JSON data in Python requests?

Does header order matter in Python requests for avoiding bot detection?

What are the minimum headers needed to mimic a real browser in Python requests?

How can I debug which headers a website is rejecting in my Python requests?

What role do headers play in HTTP requests?

Why are headers important in web scraping and API requests?

How can I find out what headers a website expects?

Summary

Explore this Article with AI

Related Knowledgebase

How to save and load cookies in Python requests?

What Python libraries support HTTP2?

Python httpx vs requests vs aiohttp - key differences

How to use proxies with Python httpx?

How to use cURL in Python?

How to fix Python requests MissingSchema error?

How to fix python requests ConnectTimeout error?

How to fix Python requests SSLError?

How to fix Python requests TooManyRedirects error?

How to Copy as cURL With Firefox?

How To Copy as cURL With Google Chrome?

How to Copy as cURL With Safari?

Related Articles

Guide to Python requests POST method

How to Scrape Naver.com

How to Scrape Imovelweb.com

How to Scrape AutoScout24

How to Scrape Allegro.pl