Plans Technology Documentation Blog

Diffbot lets your application interpret web pages the way humans do, visually.

Diffbot provides a set of visual learning APIs that enable developers to use web data in their own applications.

Simply provide Diffbot a URL and it will visually classify the components of the webpage and return back relevant semantic metadata.

An API for the Web

We discovered that the entire web can be classified into a set of 30 structural page types. Understanding common page layouts (like headlines, bylines and articles), contextual keywords, and content changes buried deep within pages enables applications to follow websites, observe when changes occur, and display that content in a variety of media. Diffbot is the foundation of a new way to develop applications that consume web content.



Benefits

  • Save Development Time

    The Diffbot API makes it simple for your app to get web data. Just pass us a URL. We'll do the rest.
  • Low-Maintenance

    Websites redesign on average every 2-3 months. Don't get stuck maintaining a set of site-specific scrapers.
  • Future-proof

    Diffbot is constantly learning from improved algorithms and updated models are pushed every week. Benefit from the largest community of developers submitting training data to Diffbot.
company     press     support     privacy      terms