How to Find All URLs on a Domain
Learn how to efficiently find all URLs on a domain using Python and web crawling. A guide on how to crawl an entire domain and collect all website data.
To get the file type of a URL we have two options: check the URL string for a file suffix or perform a HEAD request:
import mimetypes
# the mimetypes module can analyze a string for known file extensions:
mimetypes.guess_type("http://example.com/file.pdf")
('application/pdf', None)
mimetypes.guess_type("http://example.com/song.mp3")
('audio/mpeg', None)
mimetypes.guess_type("http://example.com/file-without-extension")
(None, None)
# for URLs without an extension we can make a HEAD request, which downloads only the response headers
import httpx
httpx.head("https://httpbin.dev/html").headers['Content-Type']
'text/html; charset=utf-8'
httpx.head("https://wiki.mozilla.org/images/3/37/Mozilla_MDN_Guide.pdf").headers['Content-Type']
'application/pdf'
When web scraping and web crawling, knowing the content type before retrieving a URL's contents can save a lot of bandwidth and speed up the scraping process. For example, when crawling we usually only want to follow HTML pages and skip media files.
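Putting the two together, here's a minimal sketch of a crawl filter that prefers the free string check and falls back to a HEAD request only when needed; the helper name is_html_url is our own for illustration, not a library function, and it assumes httpx is installed:

import mimetypes
import httpx

def is_html_url(url: str) -> bool:
    """Return True if the URL likely points to an HTML page."""
    # first try to guess from the URL string itself - no network cost
    content_type, _ = mimetypes.guess_type(url)
    if content_type is not None:
        return content_type == "text/html"
    # no recognizable extension - fall back to a HEAD request (headers only)
    response = httpx.head(url, follow_redirects=True)
    return response.headers.get("Content-Type", "").startswith("text/html")

is_html_url("http://example.com/file.pdf")  # False - guessed from the .pdf suffix
is_html_url("https://httpbin.dev/html")  # True - resolved via a HEAD request

A crawler can run every discovered link through a filter like this and only queue the ones that return True, avoiding media file downloads entirely.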
This knowledgebase is provided by Scrapfly data APIs. Check us out! 👇