Back to Search
Start Over
LitInspector: literature and signal transduction pathway mining in PubMed abstracts
- Source :
- Nucleic Acids Research
- Publication Year :
- 2009
- Publisher :
- Oxford University Press (OUP), 2009.
-
Abstract
- LitInspector is a literature search tool providing gene and signal transduction pathway mining within NCBI's PubMed database. The automatic gene recognition and color coding increases the readability of abstracts and significantly speeds up literature research. A main challenge in gene recognition is the resolution of homonyms and rejection of identical abbreviations used in a 'non-gene' context. LitInspector uses automatically generated and manually refined filtering lists for this purpose. The quality of the LitInspector results was assessed with a published dataset of 181 PubMed sentences. LitInspector achieved a precision of 96.8%, a recall of 86.6% and an F-measure of 91.4%. To further demonstrate the homonym resolution qualities, LitInspector was compared to three other literature search tools using some challenging examples. The homonym MIZ-1 (gene IDs 7709 and 9063) was correctly resolved in 87% of the abstracts by LitInspector, whereas the other tools achieved recognition rates between 35% and 67%. The LitInspector signal transduction pathway mining is based on a manually curated database of pathway names (e.g. wingless type), pathway components (e.g. WNT1, FZD1), and general pathway keywords (e.g. signaling cascade). The performance was checked for 10 randomly selected genes. Eighty-two per cent of the 38 predicted pathway associations were correct. LitInspector is freely available at http://www.litinspector.org/.
- Subjects :
- PubMed
MEDLINE
Information Storage and Retrieval
Color-coding
Context (language use)
Biology
computer.software_genre
Bioinformatics
Homonym
Mice
Text mining
Terminology as Topic
Genetics
Animals
Humans
Gene recognition
business.industry
Articles
Readability
Rats
Literature research
Artificial intelligence
business
computer
Software
Natural language processing
Signal Transduction
Subjects
Details
- ISSN :
- 13624962 and 03051048
- Volume :
- 37
- Database :
- OpenAIRE
- Journal :
- Nucleic Acids Research
- Accession number :
- edsair.doi.dedup.....4ac402589bf8bcae8ff8e9ca93dd795a
- Full Text :
- https://doi.org/10.1093/nar/gkp303