Data Wants to Be Structured

Using AI, computer vision, machine learning and natural language processing, Diffbot provides comprehensive tools to understand and extract from any web page.

Automatic APIs

Diffbot's Automatic APIs automatically extract content from supported page types: articles, products, discussions, images and more.

Learn about Automatic APIs »

Crawlbot and Bulk Processing

Crawlbot lets you apply Diffbot APIs to entire sites, extracting hundreds or thousands of pages into a single structured index. The Bulk API lets you process tens to a million URLs in a single job.

Learn more about Crawlbot »

Custom APIs

Extract any data from any web page using easy-to-create custom rules.

Learn more about Custom APIs »