Diffbot API Documentation
Developer Token
diffbot maintains a variety of APIs for developers who wish to take advantage of our technology in their applications.
Sign up for one of our plans to get a developer token to get started using our APIs.
Personal accounts are free.
Follow API
The follow API is used to follow the changes to any webpage. Diffbot automatically determines the part of the page that the user likely wants to follow, extracts metadata such as the title, image and text summary, scores, and segments the page into structurally meaningful blocks. Responses are returned in DML (Diffbot markup language) format.
Ondemand APIs
The On-Demand API is used for one-off extractions where the URL doesn't need to be tracked over time. The Diffbot On-demand APIs are divided into page types:
Frontpage API
The Frontpage API is designed for analyzing home pages and index pages.
Article API
The article API is used to to extract clean article text from news article web pages. It's powerful when combined with the permalinks that are extracted by the above APIs.
Client Libraries
Node.js - from Mark Bao, CTO of OnSwipe
.NET - from NLP geek-trepreneur Wesley Tansey
C# - from Chris Choyce, creater of Pistashio
Clojure - from rockstar hacker Wei Hsu
Python - generously contributed by Nik Cubrilovic, updated by Adomas Paltanavicius
Another Python client - by David Torres
Ruby - by Nicolás Sanguinetti
PHP and symfony bundle by Laurent Sarrazin