Products to structure the world's information

Using AI, computer vision, machine learning and natural language processing, Diffbot gives software developers tools to extract and understand the objects on any web page.

Automatic APIs

Diffbot's Automatic APIs extract content from supported page types: articles, products, discussions, images and more. Diffbot uses advanced AI technology to retrieve clean, structured data without the need for manual rules or site-specific training.
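
As a rough illustration of calling an Automatic API, the sketch below builds a request URL for one page and fetches the JSON result. The endpoint pattern (`https://api.diffbot.com/v3/<api>` with `token` and `url` query parameters) is an assumption not stated on this page, and the helper names `build_extract_url` and `extract` are hypothetical; check the current API documentation before relying on either.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed base URL for the Automatic APIs (not stated on this page).
API_BASE = "https://api.diffbot.com/v3"


def build_extract_url(api: str, token: str, page_url: str) -> str:
    """Build the request URL for one Automatic API call.

    `api` is the page type to extract, e.g. "article" or "product".
    The target page URL is percent-encoded by urlencode.
    """
    query = urlencode({"token": token, "url": page_url})
    return f"{API_BASE}/{api}?{query}"


def extract(api: str, token: str, page_url: str, timeout: float = 30.0) -> dict:
    """Fetch the page through the API and return the decoded JSON response."""
    with urlopen(build_extract_url(api, token, page_url), timeout=timeout) as resp:
        return json.load(resp)
```

A call such as `extract("article", "YOUR_TOKEN", "https://example.com/story")` would return the structured result as a Python dictionary; no site-specific rules are passed, only the page type and the URL.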

Test our Automatic APIs

Crawlbot and Bulk Processing

Crawlbot lets you apply any Diffbot API to an entire site, structuring hundreds or thousands of pages into a single, searchable index. Bulk Processing extracts structured data from hundreds to millions of URLs in a single job.
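
To make the idea of "one job, many pages" concrete, the sketch below assembles the form fields for a crawl job that runs a chosen API over every page found from a set of seed URLs. The field names (`token`, `name`, `seeds`, `apiUrl`), the whitespace-separated seed list, and the endpoint pattern are assumptions not given on this page, and `build_crawl_job` is a hypothetical helper. The code only prepares the request body and does not submit it.

```python
from urllib.parse import urlencode

# Assumed base URL; the crawl endpoint itself is not named on this page.
API_BASE = "https://api.diffbot.com/v3"


def build_crawl_job(token: str, name: str, seeds: list[str], api: str = "article") -> dict:
    """Return the form fields for a crawl job (field names are assumptions).

    `seeds` are the starting URLs; `api` selects which extraction API
    is applied to each crawled page.
    """
    if not seeds:
        raise ValueError("at least one seed URL is required")
    return {
        "token": token,
        "name": name,
        "seeds": " ".join(seeds),        # assumed: whitespace-separated list
        "apiUrl": f"{API_BASE}/{api}",   # assumed: API applied to each page
    }


def encode_job(job: dict) -> bytes:
    """Encode the job fields as an application/x-www-form-urlencoded body."""
    return urlencode(job).encode("utf-8")
```

A bulk job would look similar under the same assumptions, with an explicit list of page URLs in place of the seeds.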

Custom APIs

Extract any data from any web page using easy-to-create custom rules and an instant API.