Back to Search
Start Over
Trialstreamer: A living, automatically updated database of clinical trial reports
- Source :
- Journal of the American Medical Informatics Association : JAMIA
- Publication Year :
- 2020
-
Abstract
- Objective Randomized controlled trials (RCTs) are the gold standard method for evaluating whether a treatment works in health care but can be difficult to find and make use of. We describe the development and evaluation of a system to automatically find and categorize all new RCT reports. Materials and Methods Trialstreamer continuously monitors PubMed and the World Health Organization International Clinical Trials Registry Platform, looking for new RCTs in humans using a validated classifier. We combine machine learning and rule-based methods to extract information from the RCT abstracts, including free-text descriptions of trial PICO (populations, interventions/comparators, and outcomes) elements and map these snippets to normalized MeSH (Medical Subject Headings) vocabulary terms. We additionally identify sample sizes, predict the risk of bias, and extract text conveying key findings. We store all extracted data in a database, which we make freely available for download, and via a search portal, which allows users to enter structured clinical queries. Results are ranked automatically to prioritize larger and higher-quality studies. Results As of early June 2020, we have indexed 673 191 publications of RCTs, of which 22 363 were published in the first 5 months of 2020 (142 per day). We additionally include 304 111 trial registrations from the International Clinical Trials Registry Platform. The median trial sample size was 66. Conclusions We present an automated system for finding and categorizing RCTs. This yields a novel resource: a database of structured information automatically extracted for all published RCTs in humans. We make daily updates of this database available on our website (https://trialstreamer.robotreviewer.net).
- Subjects :
- 0301 basic medicine
Vocabulary
Databases, Factual
AcademicSubjects/SCI01060
evidence based medicine
Computer science
media_common.quotation_subject
Psychological intervention
Health Informatics
010501 environmental sciences
computer.software_genre
Research and Applications
01 natural sciences
law.invention
03 medical and health sciences
Medical Subject Headings
0302 clinical medicine
Randomized controlled trial
Bias
law
automatic database curation
Health care
Humans
030212 general & internal medicine
Data Curation
AcademicSubjects/MED00580
0105 earth and related environmental sciences
media_common
Data Management
Randomized Controlled Trials as Topic
Evidence-Based Medicine
Database
business.industry
Evidence-based medicine
3. Good health
Clinical trial
030104 developmental biology
Sample size determination
randomized controlled trials
research synthesis
AcademicSubjects/SCI01530
business
Classifier (UML)
computer
Subjects
Details
- ISSN :
- 1527974X
- Volume :
- 27
- Issue :
- 12
- Database :
- OpenAIRE
- Journal :
- Journal of the American Medical Informatics Association : JAMIA
- Accession number :
- edsair.doi.dedup.....8af971839c10d4b0d85cd6223e898b79