Deepdive

DeepDive is an open-source framework for constructing knowledge bases from messy, heterogeneous data sources. It is designed to help researchers and developers extract, integrate, and reason about information with uncertainty. The system lets users specify declarative rules and templates that describe how to identify entities, relationships, and attributes in the input data, how to join and reconcile competing evidence, and how to resolve ambiguities across sources. Based on these specifications, DeepDive performs probabilistic inference to generate a set of facts, each with an associated confidence score.

DeepDive operates on a relational data model and uses a pipeline approach that combines data extraction, feature

Development of DeepDive originated at Stanford University and has been released as an open-source project. It

a

a

a