Lars Juhl Jensen
The pragmatic text minerIt’s just another type of poorly standardized
data
guilt by association
biomedical literature
as smart as a dog
teach it specific tricks
named entity recognition
comprehensive lexicon
prostate specific antigen
prefixes and suffixes
flexible matching
hyphens and spaces
prostate specific antigen
prostate-specific antigen
within paragraphs
what we normally use
Medline abstracts
what we should use
full-text articles
different interfaces
different formats
different licenses
unifying text & data
curated knowledge
experimental data
computational predictions
integrated web resources
chemical networks
subcellular localization
compartments.jensenlab.org
tissue expression
tissues.jensenlab.org
disease associations
different formats
different identifiers
common identifiers
score calibration
data visualization
collaboration model
Encyclopedia of Life
Biodiversity Heritage Library
pharmacovigilance
adverse drug reactions
electronic health records
one place to get all
the format is not crucial
AcknowledgmentsProtein networks
Michael KuhnDamian Szklarczyk
Andrea Franceschini Milan SimonovicAlexander RothSune Pletscher-
FrankildJianyi Lin
Pablo MinguezChristian von Mering
Peer Bork
Localization and diseaseSune Pletscher-FrankildAlberto SantosJanos BinderKalliopi TsafouChristian StolteAlbert PallejaHeiko HornEvangelos PafilisReinhardt SchneiderSean O’ Donoghue