Author: "Qing Treitler Zeng" / Topic: health informatics - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Qing Treitler Zeng"' showing total 8 results

Start Over Author "Qing Treitler Zeng" Topic health informatics

8 results on '"Qing Treitler Zeng"'

1. Scaling-up NLP Pipelines to Process Large Corpora of Clinical Notes

Author: Kalpana Gupta, Matthew H. Samore, Marjorie E. Carter, Guy Divita, Barbara W. Trautner, Qing Treitler Zeng, Adi V. Gundlapalli, and Andrew Redd
Subjects: 0301 basic medicine, Hospitals, Veterans, Computer science, Process (engineering), 030106 microbiology, Big data, Datasets as Topic, Health Informatics, Sample (statistics), computer.software_genre, Risk Assessment, Machine Learning, 03 medical and health sciences, 0302 clinical medicine, Health Information Management, Utah, Controlled vocabulary, Prevalence, Data Mining, Electronic Health Records, Humans, 030212 general & internal medicine, Natural Language Processing, Advanced and Specialized Nursing, business.industry, Decision Support Systems, Clinical, Pipeline (software), Replication (computing), Vocabulary, Controlled, Analytics, Catheter-Related Infections, Urinary Tract Infections, Artificial intelligence, Urinary Catheterization, business, computer, Algorithms, Natural language processing
Abstract: SummaryIntroduction: This article is part of the Focus Theme of Methods of Information in Medicine on “Big Data and Analytics in Healthcare”.Objectives: This paper describes the scale-up efforts at the VA Salt Lake City Health Care System to address processing large corpora of clinical notes through a natural language processing (NLP) pipeline. The use case described is a current project focused on detecting the presence of an indwelling uri-nary catheter in hospitalized patients and subsequent catheter-associated urinary tract infections.Methods: An NLP algorithm using v3NLP was developed to detect the presence of an indwelling urinary catheter in hospitalized patients. The algorithm was tested on a small corpus of notes on patients for whom the presence or absence of a catheter was already known (reference standard). In planning for a scale-up, we estimated that the original algorithm would have taken 2.4 days to run on a larger corpus of notes for this project (550,000 notes), and 27 days for a corpus of 6 million records representative of a national sample of notes. We approached scaling-up NLP pipelines through three techniques: pipeline replication via multi-threading, intra-annotator threading for tasks that can be further decomposed, and remote annotator services which enable annotator scale-out.Results: The scale-up resulted in reducing the average time to process a record from 206 milliseconds to 17 milliseconds or a 12-fold increase in performance when applied to a corpus of 550,000 notes.Conclusions: Purposely simplistic in nature, these scale-up efforts are the straight forward evolution from small scale NLP processing to larger scale extraction without incurring associated complexities that are inherited by the use of the underlying UIMA framework. These efforts represent generalizable and widely applicable techniques that will aid other computationally complex NLP pipelines that are of need to be scaled out for processing and analyzing big data.
Published: 2015

2. Mining Big Data in biomedicine and health care

Author: Qing Treitler Zeng and Samah Jamal Fodeh
Subjects: 0301 basic medicine, business.industry, Computer science, Big data, MEDLINE, Health Informatics, Data science, Health informatics, Computer Science Applications, 03 medical and health sciences, 030104 developmental biology, Health care, Data Mining, Humans, business, Delivery of Health Care, Biomedicine, Medical Informatics
Published: 2016

3. Clinical data mining

Author: Qing Treitler Zeng and Samah Jamal Fodeh
Subjects: Information retrieval, Medical Records Systems, Computerized, Computer science, business.industry, Health Informatics, computer.software_genre, Data science, Computer Science Applications, Text mining, Data Mining, Humans, Data mining, business, computer
Published: 2015

4. Exploring and Developing Consumer Health Vocabularies

Author: Qing Treitler Zeng and Tony Tse
Subjects: Vocabulary, Informatics, Knowledge management, Patients, Point (typography), Information seeking, Computer science, business.industry, The Practice of Informatics, media_common.quotation_subject, Consumer health, Health Informatics, Vocabulary, Controlled, Health, Terminology as Topic, Domain knowledge, business, Psychosocial, media_common
Abstract: Laypersons (''consumers'') often have difficulty finding, understanding, and acting on health in- formation due to gaps in their domain knowledge. Ideally, consumer health vocabularies (CHVs) would reflect the different ways consumers express and think about health topics, helping to bridge this vocabulary gap. However, despite the recent research on mismatches between consumer and professional language (e.g., lexical, semantic, and explanatory), there have been few systematic efforts to develop and evaluate CHVs. This paper presents the point of view that CHV development is practical and necessary for extending research on informatics-based tools to facilitate consumer health information seeking, retrieval, and understanding. In support of the view, we briefly describe a dis- tributed, bottom-up approach for (1) exploring the relationship between common consumer health expressions and professional concepts and (2) developing an open-access, preliminary (draft) ''first-generation'' CHV. While recognizing the limitations of the approach (e.g., not addressing psychosocial and cultural factors), we suggest that such explora- tory research and development will yield insights into the nature of consumer health expressions and assist developers in creating tools and applications to support consumer health information seeking. j J Am Med Inform Assoc. 2006;13:24-29. DOI 10.1197/jamia.M1761.
Published: 2006

5. The InterMed Approach to Sharable Computer-interpretable Guidelines: A Review

Author: Samson W. Tu, Robert A. Greenes, Aziz A. Boxwala, Edward H. Shortliffe, Qing Treitler Zeng, Dongwen Wang, Omolola Ogunyemi, Vimla L. Patel, and Mor Peleg
Subjects: Decision support system, Knowledge management, Standardization, business.industry, Computer science, Process (engineering), Modeling language, The Practice of Informatics, Health Informatics, Cognition, Decision Support Systems, Clinical, Viewpoints, United States, Computer Systems, Software Design, Practice Guidelines as Topic, Humans, Software design, Programming Languages, Architecture, business
Abstract: InterMed is a collaboration among research groups from Stanford, Harvard, and Columbia Universities. The primary goal of InterMed has been to develop a sharable language that could serve as a standard for modeling computer-interpretable guidelines (CIGs). This language, called GuideLine Interchange Format (GLIF), has been developed in a collaborative manner and in an open process that has welcomed input from the larger community. The goals and experiences of the InterMed project and lessons that the authors have learned may contribute to the work of other researchers who are developing medical knowledge-based tools. The lessons described include (1) a work process for multi-institutional research and development that considers different viewpoints, (2) an evolutionary lifecycle process for developing medical knowledge representation formats, (3) the role of cognitive methodology to evaluate and assist in the evolutionary development process, (4) development of an architecture and (5) design principles for sharable medical knowledge representation formats, and (6) a process for standardization of a CIG modeling language.
Published: 2004

6. Providing Concept-oriented Views for Clinical Data Using a Knowledge-based System: An Evaluation

Author: James J. Cimino, Kelly H. Zou, and Qing Treitler Zeng
Subjects: Measure (data warehouse), Information retrieval, Medical Records Systems, Computerized, business.industry, Computer science, Medical record, Information Theory, Information Storage and Retrieval, Information quality, Health Informatics, Information overload, Knowledge-based systems, Data retrieval, Knowledge base, Artificial Intelligence, Data Interpretation, Statistical, Humans, Relevance (information retrieval), Clinical Medicine, business, Original Investigation
Abstract: Objective: Clinical information systems typically present patient data in chronologic order, organized by the source of the information (e.g., laboratory, radiology). This study evaluates the functionality and utility of a knowledge-based system that generates concept- oriented views (organized around clinical concepts such as disease or organ system) of clinical data. Design: The authors have developed a system that uses a knowledge base of interrelationships between medical concepts to infer relationships between data in electronic medical records. They use these inferences to produce summaries, or views, of the data that are relevant to a specific concept of interest. They evaluated the ability of the system to select relevant information, reduce information overload, and support physician information retrieval. Measurements: The sensitivity and specificity of the system for identifying relevant patient information were calculated. Effect on information overload was assessed by comparing the amount of information in each view with the amount of information in the entire record. Information retrieval accuracy and cost (time) were used to measure the effect of using concept- oriented views on the efficiency and effectiveness of retrievals. Results: The sensitivity and specificity of the system for identifying relevant clinical information were generally in the range of 70 to 80 percent. Concept-oriented views are effective in reducing the amount of information retrieved (over 80 percent reduction) and, compared with source- oriented views, are able to improve physician retrieval accuracy (p = 0.04). Conclusion: Computer-generated, concept-oriented views can be used to reduce clinician information overload and improve the accuracy of clinical data retrieval.
Published: 2002

7. Term Identification Methods for Consumer Health Vocabulary Development

Author: Guy Divita, Allen C. Browne, Tony Tse, Long Ngo, Qing Treitler Zeng, Alla Keselman, Jon Crowell, and Sergey Goryachev
Subjects: Vocabulary, 020205 medical informatics, Computer science, media_common.quotation_subject, Health Informatics, 02 engineering and technology, lcsh:Computer applications to medicine. Medical informatics, Logistic regression, computer.software_genre, Machine learning, Automation, 03 medical and health sciences, 0302 clinical medicine, 0202 electrical engineering, electronic engineering, information engineering, Humans, 030212 general & internal medicine, natural language processing, Cooperative Behavior, Set (psychology), Health Education, vocabulary, media_common, Original Paper, business.industry, lcsh:Public aspects of medicine, lcsh:RA1-1270, Ambiguity, Models, Theoretical, Vocabulary development, Term (time), Identification (information), Logistic Models, ROC Curve, Vocabulary, Controlled, Consumer health information, lcsh:R858-859.7, Health education, Data mining, Artificial intelligence, business, computer
Abstract: Background The development of consumer health information applications such as health education websites has motivated the research on consumer health vocabulary (CHV). Term identification is a critical task in vocabulary development. Because of the heterogeneity and ambiguity of consumer expressions, term identification for CHV is more challenging than for professional health vocabularies. Objective For the development of a CHV, we explored several term identification methods, including collaborative human review and automated term recognition methods. Methods A set of criteria was established to ensure consistency in the collaborative review, which analyzed 1893 strings. Using the results from the human review, we tested two automated methods—C-value formula and a logistic regression model. Results The study identified 753 consumer terms and found the logistic regression model to be highly effective for CHV term identification (area under the receiver operating characteristic curve = 95.5%). Conclusions The collaborative human review and logistic regression methods were effective for identifying terms for CHV development.
Published: 2007

8. Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system

Author: Ross Lazarus, Sergey Goryachev, Shawn N. Murphy, Qing Treitler Zeng, Scott T. Weiss, and Margarita Sordo
Subjects: Research evaluation, Medical Records Systems, Computerized, Health Informatics, Comorbidity, computer.software_genre, lcsh:Computer applications to medicine. Medical informatics, Health informatics, Sensitivity and Specificity, 03 medical and health sciences, Pulmonary Disease, Chronic Obstructive, 0302 clinical medicine, Text mining, International Classification of Diseases, medicine, Humans, 030212 general & internal medicine, 030304 developmental biology, Asthma, Natural Language Processing, 0303 health sciences, business.industry, Medical record, Health Policy, Smoking, medicine.disease, Patient Discharge, 3. Good health, Computer Science Applications, lcsh:R858-859.7, Co morbidity, Smoking status, Data mining, Medical emergency, Principal diagnosis, business, computer, Research Article
Abstract: Background The text descriptions in electronic medical records are a rich source of information. We have developed a Health Information Text Extraction (HITEx) tool and used it to extract key findings for a research study on airways disease. Methods The principal diagnosis, co-morbidity and smoking status extracted by HITEx from a set of 150 discharge summaries were compared to an expert-generated gold standard. Results The accuracy of HITEx was 82% for principal diagnosis, 87% for co-morbidity, and 90% for smoking status extraction, when cases labeled "Insufficient Data" by the gold standard were excluded. Conclusion We consider the results promising, given the complexity of the discharge summaries and the extraction tasks.
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

8 results on '"Qing Treitler Zeng"'

1. Scaling-up NLP Pipelines to Process Large Corpora of Clinical Notes

2. Mining Big Data in biomedicine and health care

3. Clinical data mining

4. Exploring and Developing Consumer Health Vocabularies

5. The InterMed Approach to Sharable Computer-interpretable Guidelines: A Review

6. Providing Concept-oriented Views for Clinical Data Using a Knowledge-based System: An Evaluation

7. Term Identification Methods for Consumer Health Vocabulary Development

8. Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

8 results on '"Qing Treitler Zeng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources