Warning: imagejpeg(C:\Inetpub\vhosts\kidney.de\httpdocs\phplern\28316365
.jpg): Failed to open stream: No such file or directory in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 117 Proc+ACM+SIGMOD+Int+Conf+Manag+Data
2016 ; 2016
(ä): 847-859
Nephropedia Template TP
gab.com Text
Twit Text FOAVip
Twit Text #
English Wikipedia
Extracting Databases from Dark Data with DeepDive
#MMPMID28316365
Zhang C
; Shin J
; Ré C
; Cafarella M
; Niu F
Proc ACM SIGMOD Int Conf Manag Data
2016[Jun]; 2016
(ä): 847-859
PMID28316365
show ga
DeepDive is a system for extracting relational databases from dark data: the mass
of text, tables, and images that are widely collected and stored but which cannot
be exploited by standard relational tools. If the information in dark data -
scientific papers, Web classified ads, customer service notes, and so on - were
instead in a relational database, it would give analysts a massive and valuable
new set of "big data." DeepDive is distinctive when compared to previous
information extraction systems in its ability to obtain very high precision and
recall at reasonable engineering cost; in a number of applications, we have used
DeepDive to create databases with accuracy that meets that of human annotators.
To date we have successfully deployed DeepDive to create data-centric
applications for insurance, materials science, genomics, paleontologists, law
enforcement, and others. The data unlocked by DeepDive represents a massive
opportunity for industry, government, and scientific researchers. DeepDive is
enabled by an unusual design that combines large-scale probabilistic inference
with a novel developer interaction cycle. This design is enabled by several core
innovations around probabilistic training and inference.