Home/Data Extraction Software/Diffbot
Updated on: October 24, 2021

What is Diffbot ?

Diffbot - Data Extraction Software

Diffbot

Extract Data in Minutes
(27 Ratings) Write Review

Diffbot software is a platform used to extract data from the web. The software uses machine learning to transform, retrieve structured data without manual rules or site-specific training. The Crawlbot uses Diffbot API to extract data from entire sites. It has an easy-to-use API tool to make integration on any platform or language. Developers, Small, Medium companies make use of the software.

Diffbot Technical details

Support 24/7 (Live rep) Business Hours Online Customer Type Large Enterprises Medium Business Small Business
API Location / Phone Number Menlo Park, CA / 1-855-885-4800
Deployment SaaS/Web/Cloud Mobile - Android Mobile - iOS Installed - Windows Installed - Mac Category Data Extraction Software

Diffbot Pricing

Pricing ModelFree Trial , Subscription
Startup
$299 /Month

Features

  • Monthly Credits : 250,000
  • Per Additional Credit : $.001
  • User License : 1
  • Extraction APIs
  • Knowledge Graph
  • 5 Calls Per Second
  • Email Support
  • Custom API
  • Standard Integrations
Plus
$899 /Month

Features

  • Monthly Credits : 1,000,000
  • Per Additional Credit : $.0009
  • User Licenses : 3
  • Includes features of Startup plan, plus
  • Crawlbot
  • 25 Calls Per Second
  • Bulk Processing
  • 30 Day Storage
  • Raw HTML
  • ProServ Available (hourly charge applies)
  • Add Source to Global Crawl
Enterprise
Custom

Features

  • Monthly Credits : 2,000,000+
  • Per Additional Credit : $.00085
  • User Licenses : 5+
  • 25+ Calls Per Second
  • Includes features of Plus plan, plus
  • Phone Support
  • Additional Storage
  • Proxy Access
  • Custom Integrations
  • Dedicated Success Manager
  • Data Refreshes
  • CRM Integration
  • Support SLA
Screenshot of the Vendor Pricing Page
View Full Screen

Disclaimer: The pricing details were last updated on 06/07/2020 from the vendor website and may be different from actual. Please confirm with the vendor website before purchasing.

Learn more about Diffbot pricing.

Diffbot FAQs

Yes, Diffbot provides API.

Ask the Community View Community

Diffbot Alternatives Diffbot Alternatives

Import.io
(36 RATINGS)
Import.io Import.io is a Web Data Integration (WDI) software used primarily by businesses, sales and... read more
Visit Website
Parseur
(53 RATINGS)
Parseur Parseur is a powerful email and document parsing software founded in 2016 by Sylvestre Dupont and... read more
Visit Website
Zyte
Zyte Zyte is a robust web scraping solution that helps companies to get access to useful information... read more
Visit Website
ScrapeHunt
ScrapeHunt ScrapeHunt is an intuitive platform helping users build SaaS bootstrap with their databases. This... read more
Visit Website
Vidado
Vidado Vidado is an AI-powered platform used to scan and process the data from paper. The platform uses... read more
Visit Website
Base64.ai
Base64.ai Bаse64.аi is an advanced document processing automation platform that аutоmаtes document... read more
Visit Website
Influencers Club
(1 RATINGS)
Influencers Club Influencers Club is an advanced data extraction software, explicitly developed to help users in... read more
Visit Website

Diffbot Reviews

OVERALL RATING
4.8
Based on 27 Rating(s)
Rating Distribution
  • 100 %
  • 0 %
  • 0 %
  • 0 %
  • 0 %
SHARE YOUR EXPERIENCE Write a Review
Sort By
Filter by Source
Nitin A
Nitin ASource : g2crowd.com
(Reviewed on 23 November 2020) "social media and news monitoring"

What do you like best?

Diffbot provides great APIs, technical resource, and overall service. Their technical resources are one of the most advanced and highly accurate. Diffbot's team keeps their APIs up to date with social media's rapid evolution. The customer support is equally helpful and very friendly. They are very willing to work with flexible scenarios, accommodate needs and low budgets for small research groups, provide demo and trial accounts to experiment. Overall, they are the best social media data provider and analysis company, in my experience of over a decade.

What do you dislike?

This is more like a suggestion. Diffbot has several excellent capabilities and they are constantly improving and adding new features. Current customers and perhaps prospective ones too would benefit from a weekly/monthly newsletter, or social media updates, about these new developments.

Recommendations to others considering the product:

I would strongly recommend Diffbot. But if you are still undecided, contact their support staff for demo/trial account. You won't regret it!

What problems are you solving with the product? What benefits have you realized?

Social media and news monitoring.

Diffbot's services have allowed us to streamline our data collection method. Previously, we wrote our own web crawlers/scrapers for blog sites which would break quite frequently. Diffbot has removed that hurdle. We are now looking forward to using the NLP/AI capabilities provided by Diffbot.

...more
Eddie C
Eddie CSource : g2crowd.com
(Reviewed on 01 September 2020) "A very good service for anyone needing content extraction and much more."

What do you like best?

Having tried a number of similar services in the past, we were very pleasantly surprised as to how good the content extraction is. The contacts we have dealt with at Diffbot have also been extremely helpful.

What do you dislike?

There's not much to dislike. It does the job very well.

What problems are you solving with the product? What benefits have you realized?

Clean spidering and content extraction of websites.

...more
User in Online Media
User in Online MediaSource : g2crowd.com
(Reviewed on 31 August 2020) "Diffbot's Knowledge Graph is truly a web-scale database you can query"

What do you like best?

The KG is amazingly comprehensive. Products, people, corporations, and more all linked together in a contextual way.

KG provides a user friendly way of feeling like you've scraped the whole web. No custom scraping rules, no need to figure out the nuances of where information is housed online. Just query and see if what you're looking for is on the public web.

Finally, export features are great. You can export to CSV or JSON. I believe there are also a host of APIs where you can extract data on different entity types.

What do you dislike?

For advanced queries you do have to learn Diffbot's query language (DQL)

Recommendations to others considering the product:

Try out the free trial. It doesn't take long to get up and running with the KG. In a matter of a few minutes you can begin to see what types of entities are returned from queries. If you want a little more hand holding reach out for a demo and their team will show you some cool queries, use cases for the Knowledge Graph, etc.

Also, Diffbot's crawling product is relatively low barrier to entry. Try it out to pull ALL SORTS of data from competing sites.

What problems are you solving with the product? What benefits have you realized?

We've used Diffbot's KG for a variety of online media operations including:

- Live news monitoring of higher education entities

- Pulling of trends for data journalism projects

- Product price fluctuations for the purposes of placing affiliate links

...more
Georg H
Georg HSource : g2crowd.com
(Reviewed on 10 June 2020) "Great service for both, quick MVPs and professional applications"

What do you like best?

The parsing quality is best of breed - and adapts well to all sorts of websites

What do you dislike?

The scheduling and organizing of crawljobs is not perfect yet - hoping to see some improvements coming up there.

What problems are you solving with the product? What benefits have you realized?

We use Diffbot to crawl a large amount of global news outlets

...more
Ian K
Ian KSource : g2crowd.com
(Reviewed on 02 June 2020) "Like an extension of our infrastructure"

What do you like best?

Working with just one engineer, we were able to get a simple integration going within a week. We used the Article API to scale up and improve something we had already been doing in-house but didn't have the necessary resources to justify doing on our own. Diffbot allowed us to outsource something that was not a core focus and use those freed up resources to scale up other aspects of our infrastructure.

What do you dislike?

Not much really. Our rep keeps reminding us we're only using a fraction of what we could be using. One of these days we'll have the time to explore some of the higher-level knowledge graph APIs, one of these days.

What problems are you solving with the product? What benefits have you realized?

Crawling and extracting information from HTML.

...more
Oleg L
Oleg LSource : g2crowd.com
(Reviewed on 01 June 2020) "Using Diffbot to analyze product pages"

What do you like best?

Diffbot is the best web crawler and analyzer on the market. You can get the structured data about any web page that you want.

What do you dislike?

We have been using Diffbot for 5 months so far, and haven't got any issues with it.

Recommendations to others considering the product:

I recommend Diffbot.

What problems are you solving with the product? What benefits have you realized?

We are using Diffbot to get structured data about e-commerce web pages. It just does the job. One API call and you get the result.

...more
User in Information Technology and Services
User in Information Technology and ServicesSource : g2crowd.com
(Reviewed on 01 June 2020) "Diffbot has been invaluable for news monitoring"

What do you like best?

We use Diffbot for news monitoring, and their article extraction capabilities are scalable, cost efficient and the right fit for our use case.

What do you dislike?

There's not much to dislike for how we use Diffbot.

What problems are you solving with the product? What benefits have you realized?

Scalable news monitoring is difficult to accomplish when your solution is completely built or managed in-house - Diffbot AI solves the technical challenges of article extraction from unstructured web pages, for us to get rich structured public data.

...more
Artur R
Artur RSource : g2crowd.com
(Reviewed on 29 May 2020) "Content extraction done right"

What do you like best?

We're a happy customer for about 6 years now, and we tend to forget Diffbot is there, since their data flows seaminglessly. Our work depends a lot on data processing, and we don't want to worry about how data sources provide their data, or when change their process along the way. With Diffbot we can really focus on processing.

What do you dislike?

Nothing worth mentioning. The few glitches we had in the past were promptly dealt by their support.

What problems are you solving with the product? What benefits have you realized?

We're using data extraction APIs for getting web data. We're evaluating the knowledge graph.

...more
User in Internet
User in InternetSource : g2crowd.com
(Reviewed on 28 May 2020) "Diffbot great for extracting data without engineering help!"

What do you like best?

Their support team is very helpful. Even without purchasing their support plan to have an SLA, they usually get back within a week and provide thorough responses. Sometimes, they'll even see your API configuration, adjust it for you, and explain how the new setting is better.

I would highly recommend Diffbot for their robust and dependable products, supportive sales and customer support staff, and transparent pricing plans. Even their base plans make it easy for any company or team of any size to test it and determine what their positive ROI looks like.

What do you dislike?

Documentation could be improved a bit. It can be hard for new users who aren't familiar with HTML and CSS how to apply specific filters and selectors. My recommendation here is to provide templates or additional documentation on best practices for scraping data from popular sources such as Wikipedia.

Another small thing they can improve on is providing better visibility into account usage statistics for accounts with multiple tokens, which are all tied into one parent account.

What problems are you solving with the product? What benefits have you realized?

Their data extraction APIs are customizable and flexible. Almost any page on the internet can be scraped. It expedites data extraction for our team as we don't need to depend on custom python scripts or software engineers to help collect data for our needs. We were able to reduce time from days to mere hours to get working APIs to extract data. For a startup that is now part of a much larger company, this type of efficiency helped us allocate our engineers to more important sprints.

...more
Eric S
Eric SSource : g2crowd.com
(Reviewed on 28 May 2020) "Diffbot is our favorite content provider by a landslide"

What do you like best?

We needed a content sourcing solution for our product, Tanjo Animated Personas, or TAPs. Tanjo Animated personas are simulated customers that learn and evolve over time. Our personas need to read a continual stream of articles, in order to evolve and function properly. Diffbot gives us an easy way to source that content.

We have been a Diffbot customer for over 5 years, and have used all of their products, including Crawlbot and Knowledge Graph. Before Diffbot, we mainly relied on RSS feeds and custom scrapers to import articles into our system. The results were often inconsistent, with misread or malformed text blocks. It was tedious and unsustainable. Diffbot provided an almost limitless set of sources with high quality data.

Implementing Diffbot has greatly improved scalability, efficiency and quality of feeding internet articles into our platform. They are always willing to work with us if we encounter any issues. They take customer feedback seriously and are willing to hear out suggestions for what features could be improved or added. We appreciate Diffbot’s flexibility to work with us for our needs.

What do you dislike?

Diffbot has always been open to hearing our suggestions for what could be improved or added to their website. I don't think it would be fair to "dislike" anything since they have taken our feedback seriously in the past and iterated on their platform. If we think things could be better, we let Diffbot know.

What problems are you solving with the product? What benefits have you realized?

We needed an automated method to extract article text and images from popular websites online that was much more reliable and required much less effort to maintain. Diffbot provides an almost limitless set of sources with high quality data.

...more
Read All Reviews

Videos on Diffbot

Diffbot's Crawlbot: A Basic Overview
Diffbot's Crawlbot: A Basic Overview

Diffbot Integration

95%
WordPress
(23698 RATINGS) Website Builder Software

Disclaimer

The research is compiled using multiple sources, let us know of any feedback on feedback@saasworthy.com