Updated on: February 13, 2025
Get access to a trillion intuitive facts with Diffbot
Rating breakdown (highest to lowest rating): 95% / 5% / 0% / 0% / 0%
Diffbot's users appreciate its accurate and reliable data extraction capabilities, which enable them to gather meaningful insights from unstructured text data. They also value its user-friendly interface and extensive documentation, which make it easy to integrate with various applications and programming languages. Additionally, users praise Diffbot's excellent customer support, which promptly addresses any queries or issues. However, some users have expressed concerns regarding the accuracy of extracted data in certain cases, particularly when dealing with complex or niche content.
AI-Generated from the text of User Reviews
1) Enrichment data
2) Ability to query data in aggregate
1) Being charged based on entities
2) Being charged as we go (I wish there was a way to limit my queries)
Lead enrichment
Lead sourcing
Customer profiling
High detection accuracy and uptime: most of the time we can send API requests and know that the responses from Diffbot will be valid.
Some old versions of Python are used (<3.0) and could be upgraded.
We have been using the Article and Analyze APIs as a core part of our pipeline. After doing a build-vs-buy comparison, we realized that it would be far preferable to leave this step to an external best-in-class solution, rather than to build (and importantly *maintain*) it in-house. Wherever the automated page structure analysis fails, our team can easily "teach" it the structure, and in the rare cases where that fails, the Diffbot team is very responsive in addressing issues.
Diffbot provides great APIs, technical resources, and overall service. Their technical resources are among the most advanced and accurate. Diffbot's team keeps their APIs up to date with social media's rapid evolution. The customer support is equally helpful and very friendly. They are very willing to work with flexible scenarios, accommodate the needs and low budgets of small research groups, and provide demo and trial accounts for experimentation. Overall, they are the best social media data provider and analysis company in my experience of over a decade.
This is more like a suggestion. Diffbot has several excellent capabilities and they are constantly improving and adding new features. Current customers and perhaps prospective ones too would benefit from a weekly/monthly newsletter, or social media updates, about these new developments.
I would strongly recommend Diffbot. But if you are still undecided, contact their support staff for a demo/trial account. You won't regret it!
Social media and news monitoring.
Diffbot's services have allowed us to streamline our data collection method. Previously, we wrote our own web crawlers/scrapers for blog sites which would break quite frequently. Diffbot has removed that hurdle. We are now looking forward to using the NLP/AI capabilities provided by Diffbot.
Having tried a number of similar services in the past, we were very pleasantly surprised by how good the content extraction is. The contacts we have dealt with at Diffbot have also been extremely helpful.
There's not much to dislike. It does the job very well.
Clean spidering and content extraction of websites.
The KG is amazingly comprehensive. Products, people, corporations, and more all linked together in a contextual way.
The KG provides a user-friendly way of feeling like you've scraped the whole web. No custom scraping rules, no need to figure out the nuances of where information is housed online. Just query and see if what you're looking for is on the public web.
Finally, export features are great. You can export to CSV or JSON. I believe there are also a host of APIs where you can extract data on different entity types.
For advanced queries you do have to learn Diffbot's query language (DQL).
Try out the free trial. It doesn't take long to get up and running with the KG. In a matter of a few minutes you can begin to see what types of entities are returned from queries. If you want a little more hand-holding, reach out for a demo and their team will show you some cool queries, use cases for the Knowledge Graph, etc.
Also, Diffbot's crawling product has a relatively low barrier to entry. Try it out to pull ALL SORTS of data from competing sites.
We've used Diffbot's KG for a variety of online media operations including:
- Live news monitoring of higher education entities
- Pulling of trends for data journalism projects
- Product price fluctuations for the purposes of placing affiliate links
The parsing quality is best of breed and adapts well to all sorts of websites.
The scheduling and organizing of crawl jobs is not perfect yet; we hope to see some improvements there.
We use Diffbot to crawl a large number of global news outlets.
Working with just one engineer, we were able to get a simple integration going within a week. We used the Article API to scale up and improve something we had already been doing in-house but didn't have the necessary resources to justify doing on our own. Diffbot allowed us to outsource something that was not a core focus and use those freed up resources to scale up other aspects of our infrastructure.
Not much really. Our rep keeps reminding us we're only using a fraction of what we could be using. One of these days we'll have the time to explore some of the higher-level knowledge graph APIs, one of these days.
Crawling and extracting information from HTML.
We use Diffbot for news monitoring, and their article extraction capabilities are scalable, cost-efficient, and the right fit for our use case.
There's not much to dislike for how we use Diffbot.
Scalable news monitoring is difficult to accomplish when your solution is completely built or managed in-house. Diffbot AI solves the technical challenges of article extraction from unstructured web pages, giving us rich, structured public data.
Diffbot makes the difficult task of managing data and extracting useful information much easier. They provide access to a seemingly infinite amount of company and contact information and are continuously improving their user interface to add even more value. I use Diffbot every chance I can!
Diffbot is very responsive and always willing to help. Their interface still needs some improvements, but I have been their client for over a year now and have seen vast improvements.
Diffbot is a better version of ZoomInfo with more capabilities beyond primary company, industry and contact info. They have additional tools which allow for data enrichment and are progressing towards in-depth market analytics. Indeed a total-package solution.