Author: "Karn, Sanjeev Kumar" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Karn, Sanjeev Kumar"' showing total 22 results

Start Over Author "Karn, Sanjeev Kumar"

22 results on '"Karn, Sanjeev Kumar"'

1. Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering

Author: Ha, Cuong Nhat, Asaadi, Shima, Karn, Sanjeev Kumar, Farri, Oladimeji, Heimann, Tobias, and Runkler, Thomas
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Vision-language models, while effective in general domains and showing strong performance in diverse multi-modal applications like visual question-answering (VQA), struggle to maintain the same level of effectiveness in more specialized domains, e.g., medical. We propose a medical vision-language model that integrates large vision and language models adapted for the medical domain. This model goes through three stages of parameter-efficient training using three separate biomedical and radiology multi-modal visual and text datasets. The proposed model achieves state-of-the-art performance on the SLAKE 1.0 medical VQA (MedVQA) dataset with an overall accuracy of 87.5% and demonstrates strong performance on another MedVQA dataset, VQA-RAD, achieving an overall accuracy of 73.2%., Comment: Clinical NLP @ NAACL 2024
Published: 2024

2. General-Purpose vs. Domain-Adapted Large Language Models for Extraction of Structured Data from Chest Radiology Reports

Author: Dhanaliwala, Ali H., Ghosh, Rikhiya, Karn, Sanjeev Kumar, Ullaskrishnan, Poikavila, Farri, Oladimeji, Comaniciu, Dorin, and Kahn, Charles E.
Subjects: Computer Science - Computation and Language, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Radiologists produce unstructured data that can be valuable for clinical care when consumed by information systems. However, variability in style limits usage. Study compares system using domain-adapted language model (RadLing) and general-purpose LLM (GPT-4) in extracting relevant features from chest radiology reports and standardizing them to common data elements (CDEs). Three radiologists annotated a retrospective dataset of 1399 chest XR reports (900 training, 499 test) and mapped to 44 pre-selected relevant CDEs. GPT-4 system was prompted with report, feature set, value set, and dynamic few-shots to extract values and map to CDEs. Output key:value pairs were compared to reference standard at both stages and an identical match was considered TP. F1 score for extraction was 97% for RadLing-based system and 78% for GPT-4 system. F1 score for mapping was 98% for RadLing and 94% for GPT-4; difference was statistically significant (P<.001). RadLing's domain-adapted embeddings were better in feature extraction and its light-weight mapper had better f1 score in CDE assignment. RadLing system also demonstrated higher capabilities in differentiating between absent (99% vs 64%) and unspecified (99% vs 89%). RadLing system's domain-adapted embeddings helped improve performance of GPT-4 system to 92% by giving more relevant few-shot prompts. RadLing system offers operational advantages including local deployment and reduced runtime costs.
Published: 2023

3. Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge

Author: Danu, Manuela Daniela, Marica, George, Karn, Sanjeev Kumar, Georgescu, Bogdan, Mansoor, Awais, Ghesu, Florin, Itu, Lucian Mihai, Suciu, Constantin, Grbic, Sasa, Farri, Oladimeji, and Comaniciu, Dorin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Among all the sub-sections in a typical radiology report, the Clinical Indications, Findings, and Impression often reflect important details about the health status of a patient. The information included in Impression is also often covered in Findings. While Findings and Impression can be deduced by inspecting the image, Clinical Indications often require additional context. The cognitive task of interpreting medical images remains the most critical and often time-consuming step in the radiology workflow. Instead of generating an end-to-end radiology report, in this paper, we focus on generating the Findings from automated interpretation of medical images, specifically chest X-rays (CXRs). Thus, this work focuses on reducing the workload of radiologists who spend most of their time either writing or narrating the Findings. Unlike past research, which addresses radiology report generation as a single-step image captioning task, we have further taken into consideration the complexity of interpreting CXR images and propose a two-step approach: (a) detecting the regions with abnormalities in the image, and (b) generating relevant text for regions with abnormalities by employing a generative large language model (LLM). This two-step approach introduces a layer of interpretability and aligns the framework with the systematic reasoning that radiologists use when reviewing a CXR., Comment: Information Technology and Quantitative Management (ITQM 2023)
Published: 2023

4. shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation

Author: Karn, Sanjeev Kumar, Ghosh, Rikhiya, P, Kusuma, and Farri, Oladimeji
Subjects: Computer Science - Computation and Language
Abstract: Instruction-tuned generative Large language models (LLMs) like ChatGPT and Bloomz possess excellent generalization abilities, but they face limitations in understanding radiology reports, particularly in the task of generating the IMPRESSIONS section from the FINDINGS section. They tend to generate either verbose or incomplete IMPRESSIONS, mainly due to insufficient exposure to medical text data during training. We present a system which leverages large-scale medical text data for domain-adaptive pre-training of instruction-tuned LLMs to enhance its medical knowledge and performance on specific medical tasks. We show that this system performs better in a zero-shot setting than a number of pretrain-and-finetune adaptation methods on the IMPRESSIONS generation task, and ranks 1st among participating systems in Task 1B: Radiology Report Summarization at the BioNLP 2023 workshop., Comment: 1st Place in Task 1B: Radiology Report Summarization at BioNLP 2023
Published: 2023

5. RadLing: Towards Efficient Radiology Report Understanding

Author: Ghosh, Rikhiya, Karn, Sanjeev Kumar, Danu, Manuela Daniela, Micu, Larisa, Vunikili, Ramya, and Farri, Oladimeji
Subjects: Computer Science - Computation and Language
Abstract: Most natural language tasks in the radiology domain use language models pre-trained on biomedical corpus. There are few pretrained language models trained specifically for radiology, and fewer still that have been trained in a low data setting and gone on to produce comparable results in fine-tuning tasks. We present RadLing, a continuously pretrained language model using Electra-small (Clark et al., 2020) architecture, trained using over 500K radiology reports, that can compete with state-of-the-art results for fine tuning tasks in radiology domain. Our main contribution in this paper is knowledge-aware masking which is a taxonomic knowledge-assisted pretraining task that dynamically masks tokens to inject knowledge during pretraining. In addition, we also introduce an knowledge base-aided vocabulary extension to adapt the general tokenization vocabulary to radiology domain., Comment: Association for Computational Linguistics (ACL), 2023
Published: 2023

6. Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

Author: Karn, Sanjeev Kumar, Liu, Ning, Schuetze, Hinrich, and Farri, Oladimeji
Subjects: Computer Science - Computation and Language
Abstract: The IMPRESSIONS section of a radiology report about an imaging study is a summary of the radiologist's reasoning and conclusions, and it also aids the referring physician in confirming or excluding certain diagnoses. A cascade of tasks are required to automatically generate an abstractive summary of the typical information-rich radiology report. These tasks include acquisition of salient content from the report and generation of a concise, easily consumable IMPRESSIONS section. Prior research on radiology report summarization has focused on single-step end-to-end models -- which subsume the task of salient content acquisition. To fully explore the cascade structure and explainability of radiology report summarization, we introduce two innovations. First, we design a two-step approach: extractive summarization followed by abstractive summarization. Second, we additionally break down the extractive part into two independent tasks: extraction of salient (1) sentences and (2) keywords. Experiments on English radiology reports from two clinical sites show our novel approach leads to a more precise summary compared to single-step and to two-step-with-single-extractive-process baselines with an overall improvement in F1 score Of 3-4%., Comment: Accepted at 60th Annual Meeting of the Association for Computational Linguistics 2022 Main Conference
Published: 2022

7. Few-Shot Learning of an Interleaved Text Summarization Model by Pretraining with Synthetic Data

Author: Karn, Sanjeev Kumar, Chen, Francine, Chen, Yan-Ying, Waltinger, Ulli, and Schuetze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: Interleaved texts, where posts belonging to different threads occur in a sequence, commonly occur in online chat posts, so that it can be time-consuming to quickly obtain an overview of the discussions. Existing systems first disentangle the posts by threads and then extract summaries from those threads. A major issue with such systems is error propagation from the disentanglement component. While end-to-end trainable summarization system could obviate explicit disentanglement, such systems require a large amount of labeled data. To address this, we propose to pretrain an end-to-end trainable hierarchical encoder-decoder system using synthetic interleaved texts. We show that by fine-tuning on a real-world meeting dataset (AMI), such a system out-performs a traditional two-step system by 22%. We also compare against transformer models and observed that pretraining with synthetic data both the encoder and decoder outperforms the BertSumExtAbs transformer model which pretrains only the encoder on a large dataset., Comment: Adapt-NLP: The Second Workshop on Domain Adaptation for NLP
Published: 2021

8. A Hierarchical Decoder with Three-level Hierarchical Attention to Generate Abstractive Summaries of Interleaved Texts

Author: Karn, Sanjeev Kumar, Chen, Francine, Chen, Yan-Ying, Waltinger, Ulli, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: Interleaved texts, where posts belonging to different threads occur in one sequence, are a common occurrence, e.g., online chat conversations. To quickly obtain an overview of such texts, existing systems first disentangle the posts by threads and then extract summaries from those threads. The major issues with such systems are error propagation and non-fluent summary. To address those, we propose an end-to-end trainable hierarchical encoder-decoder system. We also introduce a novel hierarchical attention mechanism which combines three levels of information from an interleaved text, i.e, posts, phrases and words, and implicitly disentangles the threads. We evaluated the proposed system on multiple interleaved text datasets, and it out-performs a SOTA two-step system by 20-40%.
Published: 2019

9. News Article Teaser Tweets and How to Generate Them

Author: Karn, Sanjeev Kumar, Buckley, Mark, Waltinger, Ulli, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: In this work, we define the task of teaser generation and provide an evaluation benchmark and baseline systems for the process of generating teasers. A teaser is a short reading suggestion for an article that is illustrative and includes curiosity-arousing elements to entice potential readers to read particular news items. Teasers are one of the main vehicles for transmitting news to social media users. We compile a novel dataset of teasers by systematically accumulating tweets and selecting those that conform to the teaser definition. We have compared a number of neural abstractive architectures on the task of teaser generation and the overall best performing system is See et al.(2017)'s seq2seq with pointer network.
Published: 2018

10. Stakeholder Analysis and Management for Promoting Small and Medium-Scale Vegetable Processing Technology in Nepal

Author: Joshi, Prateek, primary, Karn, Sanjeev Kumar, additional, and Koirala, Pramod, additional
Published: 2024
Full Text: View/download PDF

11. Strengthening Food Safety Governance in Nepal through Collaborative Capacity Development and Private Sector Engagement

Author: Joshi, Prateek, primary, Karn, Sanjeev Kumar, additional, and Koirala, Pramod, additional
Published: 2023
Full Text: View/download PDF

12. RadLing: Towards Efficient Radiology Report Understanding

Author: Ghosh, Rikhiya, primary, Farri, Oladimeji, additional, Karn, Sanjeev Kumar, additional, Danu, Manuela, additional, Vunikili, Ramya, additional, and Micu, Larisa, additional
Published: 2023
Full Text: View/download PDF

13. Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge

Author: Danu, Manuela Daniela, primary, Marica, George, additional, Karn, Sanjeev Kumar, additional, Georgescu, Bogdan, additional, Mansoor, Awais, additional, Ghesu, Florin, additional, Itu, Lucian Mihai, additional, Suciu, Constantin, additional, Grbic, Sasa, additional, Farri, Oladimeji, additional, and Comaniciu, Dorin, additional
Published: 2023
Full Text: View/download PDF

14. shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation

Author: Karn, Sanjeev Kumar, primary, Ghosh, Rikhiya, additional, P, Kusuma, additional, and Farri, Oladimeji, additional
Published: 2023
Full Text: View/download PDF

15. Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

Author: Karn, Sanjeev Kumar, primary, Liu, Ning, additional, Schuetze, Hinrich, additional, and Farri, Oladimeji, additional
Published: 2022
Full Text: View/download PDF

16. Status of Sodium Nitrite in Meat and Meat Products Available in the Market of Kathmandu, Nepal

Author: Pradhan, Hareram, primary, Poudel, Sabitri, primary, Bokkhim, Huma, primary, Rai, Krishna Prasad, primary, and Karn, Sanjeev Kumar, primary
Published: 2019
Full Text: View/download PDF

17. News Article Teaser Tweets and How to Generate Them

Author: Karn, Sanjeev Kumar, primary, Buckley, Mark, additional, Waltinger, Ulli, additional, and Schütze, Hinrich, additional
Published: 2019
Full Text: View/download PDF

18. Neural architectures for open-type relation argument extraction

Author: Roth, Benjamin, primary, Conforti, Costanza, additional, Poerner, Nina, additional, Karn, Sanjeev Kumar, additional, and Schütze, Hinrich, additional
Published: 2018
Full Text: View/download PDF

19. Brown Rice: Nutritional composition and Health Benefits

Author: Upadhyay, Atul, primary and Karn, Sanjeev Kumar, primary
Published: 2018
Full Text: View/download PDF

20. Neural architectures for open-type relation argument extraction.

Author: Roth, Benjamin, Conforti, Costanza, Poerner, Nina, Karn, Sanjeev Kumar, and Schütze, Hinrich
Subjects: ARTISTS' books, ARCHITECTURE, RANDOM fields, ARGUMENT, BOOK titles, QUESTION answering systems
Abstract: In this work, we focus on the task of open-type relation argument extraction (ORAE) : given a corpus, a query entity Q , and a knowledge base relation (e.g., " Q authored notable work with title X "), the model has to extract an argument of non-standard entity type (entities that cannot be extracted by a standard named entity tagger, for example, X : the title of a book or a work of art) from the corpus. We develop and compare a wide range of neural models for this task yielding large improvements over a strong baseline obtained with a neural question answering system. The impact of different sentence encoding architectures and answer extraction methods is systematically compared. An encoder based on gated recurrent units combined with a conditional random fields tagger yields the best results. We release a data set to train and evaluate ORAE, based on Wikidata and obtained by distant supervision. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

21. nala: text mining natural language mutation mentions.

Author: Cejuela, Juan Miguel, Bojchevski, Aleksandar, Uhlig, Carsten, Bekmukhametov, Rustem, Karn, Sanjeev Kumar, Mahmuti, Shpend, Baghudana, Ashish, Dubey, Ankit, Satagopam, Venkata P., and Rost, Burkhard
Subjects: NATURAL languages, GLUTAMIC acid, EXCITATORY amino acids, MOLECULAR recognition
Abstract: Motivation: The extraction of sequence variants from the literature remains an important task. Existing methods primarily target standard (ST) mutation mentions (e.g. 'E6V'), leaving relevant mentions natural language (NL) largely untapped (e.g. 'glutamic acid was substituted by valine at residue 6'). Results: We introduced three new corpora suggesting named-entity recognition (NER) to be more challenging than anticipated: 28-77% of all articles contained mentions only available in NL. Our new method nala captured NL and ST by combining conditional random fields with word embedding features learned unsupervised from the entire PubMed. In our hands, nala substantially outperformed the state-of-the-art. For instance, we compared all unique mentions in new discoveries correctly detected by any of three methods (SETH, tmVar, or nala). Neither SETH nor tmVar discovered anything missed by nala, while nala uniquely tagged 33% mentions. For NL mentions the corresponding value shot up to 100% nala-only. [ABSTRACT FROM AUTHOR]
Published: 2017
Full Text: View/download PDF

22. Shelf Stability, Sensory Qualities, and Bioavailability of Iron-Fortified Nepalese Curry Powder

Author: Karn, Sanjeev Kumar, primary, Chavasit, Visith, additional, Kongkachuichai, Ratchanee, additional, and Tangsuphoom, Nattapol, additional
Published: 2011
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

22 results on '"Karn, Sanjeev Kumar"'

1. Fusion of Domain-Adapted Vision and Language Models for Medical Visual Question Answering

2. General-Purpose vs. Domain-Adapted Large Language Models for Extraction of Structured Data from Chest Radiology Reports

3. Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge

4. shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation

5. RadLing: Towards Efficient Radiology Report Understanding

6. Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

7. Few-Shot Learning of an Interleaved Text Summarization Model by Pretraining with Synthetic Data

8. A Hierarchical Decoder with Three-level Hierarchical Attention to Generate Abstractive Summaries of Interleaved Texts

9. News Article Teaser Tweets and How to Generate Them

10. Stakeholder Analysis and Management for Promoting Small and Medium-Scale Vegetable Processing Technology in Nepal

11. Strengthening Food Safety Governance in Nepal through Collaborative Capacity Development and Private Sector Engagement

12. RadLing: Towards Efficient Radiology Report Understanding

13. Generation of Radiology Findings in Chest X-Ray by Leveraging Collaborative Knowledge

14. shs-nlp at RadSum23: Domain-Adaptive Pre-training of Instruction-tuned LLMs for Radiology Report Impression Generation

15. Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization

16. Status of Sodium Nitrite in Meat and Meat Products Available in the Market of Kathmandu, Nepal

17. News Article Teaser Tweets and How to Generate Them

18. Neural architectures for open-type relation argument extraction

19. Brown Rice: Nutritional composition and Health Benefits

20. Neural architectures for open-type relation argument extraction.

21. nala: text mining natural language mutation mentions.

22. Shelf Stability, Sensory Qualities, and Bioavailability of Iron-Fortified Nepalese Curry Powder

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

22 results on '"Karn, Sanjeev Kumar"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources