The Account API now returns a list of child or sub-tokens.
Fixed an issue in the Analyze API where products with an API-Toolkit-overridden price field would not reflect changes in the “details” field (offerPriceDetails, regularPriceDetails, etc.).
Fixed an Article API issue for certain top-level domains where articles dated in the near future (e.g., tomorrow) would incorrectly be returned with a date from the prior year.
Crawlbot will now successfully spider URLs that contain invalid UTF-8 characters.
Global Index API: search-by-tag can optionally be performed using a tag-match shorthand.
We now offer an Account API for tracking token API usage and billing history.
Global Index API: negative search queries (diffbot AND -"machine learning") now function as documented.
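A minimal sketch of composing a negative query like the one in the entry above. The helper name build_query is hypothetical and not part of any Diffbot client. The only syntax relied on is what the entry shows: AND between terms and a leading minus on a quoted phrase.

```python
def build_query(required, excluded_phrases=()):
    """Join a required term with negated, quoted phrases using AND."""
    parts = [required]
    for phrase in excluded_phrases:
        # A leading minus marks the quoted phrase as excluded.
        parts.append('-"{}"'.format(phrase))
    return " AND ".join(parts)


print(build_query("diffbot", ["machine learning"]))
# prints: diffbot AND -"machine learning"
```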
Fixed an issue where Crawlbot and Bulk API data downloads did not include a filename.
The breadcrumb element is now a default field in the Article API.
APIs no longer ignore “format characters”: invisible characters that may affect neighboring characters, such as the zero-width non-joiner (U+200C).
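To see which characters in a string fall into this group, the sketch below assumes that “format characters” corresponds to the Unicode general category Cf. That mapping is an assumption and says nothing about how the APIs handle them internally. The helper name format_characters is illustrative.

```python
import unicodedata


def format_characters(text):
    """Return (index, name) pairs for characters in Unicode category Cf."""
    found = []
    for index, char in enumerate(text):
        if unicodedata.category(char) == "Cf":
            found.append((index, unicodedata.name(char, "UNKNOWN")))
    return found


# "\u200c" is the zero-width non-joiner, invisible when rendered.
sample = "ab\u200ccd"
print(format_characters(sample))
# prints: [(2, 'ZERO WIDTH NON-JOINER')]
```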
Crawlbot and Bulk Service URL Reports now offer an option to download the last 500 URLs crawled.
Global Index API: Faceted date queries will no longer return a min value of 0.
Analyze API now supports a “fallback” API via the argument fallback. By passing an API value, any otherwise unsupported pages will be forcibly processed via that API. E.g., passing fallback=article will result in any “other” pages being processed by the Article API.
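A rough sketch of passing the fallback argument. It only builds a request URL and makes no network call. The endpoint, the token and url parameter names, and the helper build_analyze_url are assumptions for illustration. Only fallback=article comes from the entry above.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumed endpoint; confirm against the Analyze API documentation.
ANALYZE_ENDPOINT = "https://api.diffbot.com/v3/analyze"


def build_analyze_url(token, page_url, fallback=None):
    """Build an Analyze request URL, optionally naming a fallback API."""
    params = {"token": token, "url": page_url}
    if fallback is not None:
        # Pages that would otherwise be classified "other" go to this API.
        params["fallback"] = fallback
    return ANALYZE_ENDPOINT + "?" + urlencode(params)


request_url = build_analyze_url(
    "YOUR_TOKEN", "https://example.com/some-page", fallback="article"
)
print(request_url)
```

Leaving fallback unset omits the argument, so the request URL carries no fallback parameter.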
Hey, as of today we’re publishing a changelog. It’s visible… here.
Additional token support within a single account has been added. Additional tokens are available on a case-by-case basis to paying customers. Please contact firstname.lastname@example.org if you would like to discuss additional tokens.
API Toolkit now allows direct update of URL pattern / regular expression without having to create a new ruleset.
API Toolkit rule output automatically trims fields to remove leading or trailing blank spaces.
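The effect of this trimming can be pictured with the sketch below. It illustrates the behavior described and is not the API Toolkit's implementation. The function name trim_fields and the sample record are made up.

```python
def trim_fields(record):
    """Strip leading and trailing whitespace from string values only."""
    return {
        key: value.strip() if isinstance(value, str) else value
        for key, value in record.items()
    }


raw = {"title": "  Example headline \n", "price": 19.99}
print(trim_fields(raw))
# prints: {'title': 'Example headline', 'price': 19.99}
```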
The diffbotUri field is now computed based on rule-based output, if a custom rule is used to override default output.
The resolvedPageUrl is correctly returned in Custom APIs (if a submitted page is redirected).
Each tag in the tags element now returns a list of all matching rdfTypes.
Email invoices now show both dollars and cents.
Performance improvements to Article API to prevent intermittent extra-long responses.
Semantria output updated to include additional fields.
Fix for timeout parameter when sending data to Semantria.
Invoices are now visible and printable within the Developer Dashboard (under “Account > Billing History”).
Tokens are now case-insensitive across all Diffbot APIs.
Article API now returns the siteName, publisherCountry and publisherRegion of an article, if it can be determined or if already known.
If price data is overridden with a custom rule, the corresponding “details” field (offerPriceDetails) will be computed from the overridden value.
Spotify embeds are now properly preserved and returned in the Article API html field.
Multipage articles, when concatenated, will no longer return duplicate copies of images that appear on multiple pages.
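One way to picture this de-duplication is an order-preserving merge keyed on image URL. This sketch assumes each image is a dict with a url key and that two images are duplicates when their URLs match exactly. The real matching logic is not described here, and merge_images is a hypothetical name.

```python
def merge_images(pages):
    """Concatenate per-page image lists, keeping the first copy of each URL."""
    seen = set()
    merged = []
    for images in pages:
        for image in images:
            url = image["url"]
            if url not in seen:
                seen.add(url)
                merged.append(image)
    return merged


page_one = [{"url": "https://example.com/logo.png"}, {"url": "https://example.com/a.jpg"}]
page_two = [{"url": "https://example.com/logo.png"}, {"url": "https://example.com/b.jpg"}]
print([image["url"] for image in merge_images([page_one, page_two])])
```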
Article and discussion tagging now supports Spanish and Chinese language tags, in addition to English and French.