99 results on '"Amit Sheth"'
Search Results
2. A Semantic Web Approach to Fault Tolerant Autonomous Manufacturing
- Author
-
Fadi El Kalach, Ruwan Wickramarachchi, Ramy Harik, and Amit Sheth
- Subjects
Artificial Intelligence ,Computer Networks and Communications - Published
- 2023
- Full Text
- View/download PDF
3. iMetaverseKG: Industrial Metaverse Knowledge Graph to Promote Interoperability in Design and Engineering Applications
- Author
-
Utkarshani Jaimini, Tongtao Zhang, Georgia Olympia Brikis, and Amit Sheth
- Subjects
Computer Networks and Communications - Published
- 2022
- Full Text
- View/download PDF
4. Knowledge-Based Entity Prediction for Improved Machine Perception in Autonomous Systems
- Author
-
Ruwan Wickramarachchi, Cory Henson, and Amit Sheth
- Subjects
FOS: Computer and information sciences ,Artificial Intelligence (cs.AI) ,Computer Science - Artificial Intelligence ,Artificial Intelligence ,Computer Networks and Communications ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Knowledge-based entity prediction (KEP) is a novel task that aims to improve machine perception in autonomous systems. KEP leverages relational knowledge from heterogeneous sources in predicting potentially unrecognized entities. In this paper, we provide a formal definition of KEP as a knowledge completion task. Three potential solutions are then introduced, which employ several machine learning and data mining techniques. Finally, the applicability of KEP is demonstrated on two autonomous systems from different domains; namely, autonomous driving and smart manufacturing. We argue that in complex real-world systems, the use of KEP would significantly improve machine perception while pushing the current technology one step closer to achieving full autonomy., 6 pages, 4 figures, in IEEE Intelligent Systems, 2022
- Published
- 2022
- Full Text
- View/download PDF
5. Process Knowledge-Infused AI: Toward User-Level Explainability, Interpretability, and Safety
- Author
-
Amit Sheth, Manas Gaur, Kaushik Roy, Revathy Venkataraman, and Vedant Khandelwal
- Subjects
Computer Networks and Communications - Published
- 2022
- Full Text
- View/download PDF
6. Knowledge-Infused Learning: A Sweet Spot in Neuro-Symbolic AI
- Author
-
Manas Gaur, Kalpa Gunaratna, Shreyansh Bhatt, and Amit Sheth
- Subjects
Computer Networks and Communications - Published
- 2022
- Full Text
- View/download PDF
7. Synthesis of CdZnTeSe single crystals for room temperature radiation detector fabrication: mitigation of hole trapping effects using a convolutional neural network
- Author
-
Sandeep K. Chaudhuri, Joshua W. Kleppinger, OmerFaruk Karadavut, Ritwik Nag, Rojina Panta, Forest Agostinelli, Amit Sheth, Utpal N. Roy, Ralph B. James, and Krishna C. Mandal
- Subjects
Electrical and Electronic Engineering ,Condensed Matter Physics ,Atomic and Molecular Physics, and Optics ,Electronic, Optical and Magnetic Materials - Published
- 2022
- Full Text
- View/download PDF
8. Finding Influential Authors in Brand-Page Communities
- Author
-
Hemant Purohit, Jitendra Ajmera, Sachindra Joshi, Ashish Verma, and Amit Sheth
- Abstract
Enterprises are increasingly using social media forums to engage with their customer online- a phenomenon known as Social Customer Relation Management (Social CRM). In this context, it is important for an enterprise to identify “influential authors” and engage with them on a priority basis. We present a study towards finding influential authors on Twitter forums where an implicit network based on user interactions is created and analyzed. Furthermore, author profile features and user interaction features are combined in a decision tree classification model for finding influential authors. A novel objective evaluation criterion is used for evaluating various features and modeling techniques. We compare our methods with other approaches that use either only the formal connections or only the author profile features and show a significant improvement in the classification accuracy over these baselines as well as over using Klout score.
- Published
- 2021
- Full Text
- View/download PDF
9. Extracting Diverse Sentiment Expressions with Target-Dependent Polarity from Twitter
- Author
-
Lu Chen, Wenbo Wang, Meenakshi Nagarajan, Shaojun Wang, and Amit Sheth
- Abstract
The problem of automatic extraction of sentiment expressions from informal text, as in microblogs such as tweets is a recent area of investigation. Compared to formal text, such as in product reviews or news articles, one of the key challenges lies in the wide diversity and informal nature of sentiment expressions that cannot be trivially enumerated or captured using predefined lexical patterns. In this work, we present an optimization-based approach to automatically extract sentiment expressions for a given target (e.g., movie, or person) from a corpus of unlabeled tweets. Specifically, we make three contributions: (i) we recognize a diverse and richer set of sentiment-bearing expressions in tweets, including formal and slang words/phrases, not limited to pre-specified syntactic patterns; (ii) instead of associating sentiment with an entire tweet, we assess the target-dependent polarity of each sentiment expression. The polarity of sentiment expression is determined by the nature of its target; (iii) we provide a novel formulation of assigning polarity to a sentiment expression as a constrained optimization problem over the tweet corpus. Experiments conducted on two domains, tweets mentioning movie and person entities, show that our approach improves accuracy in comparison with several baseline methods, and that the improvement becomes more prominent with increasing corpus sizes.
- Published
- 2021
- Full Text
- View/download PDF
10. International Workshop on Knowledge Graphs
- Author
-
Ying Ding, Amit Sheth, Krzysztof W. Janowicz, Sergio Baranzini, Sharat Israni, Ilkay Altintas, Lilit Yeghiazarian, Ellie Young, and Sam Klein
- Published
- 2022
- Full Text
- View/download PDF
11. Cognitive Digital Twins for Smart Manufacturing
- Author
-
John G. Breslin, Ramy Harik, Pankesh Patel, Amit Sheth, and Muhammad Intizar Ali
- Subjects
Industry 4.0 ,Computer Networks and Communications ,Computer science ,business.industry ,Replica ,Big data ,Intelligent decision support system ,02 engineering and technology ,Virtual reality ,Object (computer science) ,Data science ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Key (cryptography) ,Mainstream ,020201 artificial intelligence & image processing ,business - Abstract
Smart manufacturing or Industry 4.0, a trend initiated a decade ago, aims to revolutionize traditional manufacturing using technology driven approaches. Modern digital technologies such as the Industrial Internet of Things (IIoT), Big Data analytics, augmented/virtual reality, and artificial intelligence (AI) are the key enablers of new smart manufacturing approaches. The digital twin is an emerging concept whereby a digital replica can be built of any physical object. Digital twins are becoming mainstream; many organizations have started to rely on digital twins to monitor, analyze, and simulate physical assets and processes.
- Published
- 2021
- Full Text
- View/download PDF
12. Knowledge-Driven Drug-Use NamedEntity Recognition with Distant Supervision
- Author
-
Goonmeet, Bajaj, Ugur, Kursuncu, Manas, Gaur, Usha, Lokala, Ayaz, Hyder, Srinivasan, Parthasarathy, and Amit, Sheth
- Subjects
Humans ,Information Storage and Retrieval ,Names ,Natural Language Processing - Abstract
As Named Entity Recognition (NER) has been essential in identifying critical elements of unstructured content, generic NER tools remain limited in recognizing entities specific to a domain, such as drug use and public health. For such high-impact areas, accurately capturing relevant entities at a more granular level is critical, as this information influences real-world processes. On the other hand, training NER models for a specific domain without handcrafted features requires an extensive amount of labeled data, which is expensive in human effort and time. In this study, we employ distant supervision utilizing a domain-specific ontology to reduce the need for human labor and train models incorporating domain-specific (e.g., drug use) external knowledge to recognize domain specific entities. We capture entities related the drug use and their trends in government epidemiology reports, with an improvement of 8% in F1-score.
- Published
- 2022
13. Knowledge-Driven Drug-Use NamedEntity Recognition with Distant Supervision
- Author
-
Goonmeet Bajaj, Ugur Kursuncu, Manas Gaur, Usha Lokala, Ayaz Hyder, Srinivasan Parthasarathy, and Amit Sheth
- Abstract
As Named Entity Recognition (NER) has been essential in identifying critical elements of unstructured content, generic NER tools remain limited in recognizing entities specific to a domain, such as drug use and public health. For such high-impact areas, accurately capturing relevant entities at a more granular level is critical, as this information influences real-world processes. On the other hand, training NER models for a specific domain without handcrafted features requires an extensive amount of labeled data, which is expensive in human effort and time. In this study, we employ distant supervision utilizing a domain-specific ontology to reduce the need for human labor and train models incorporating domain-specific (e.g., drug use) external knowledge to recognize domain specific entities. We capture entities related the drug use and their trends in government epidemiology reports, with an improvement of 8% in F1-score.
- Published
- 2022
- Full Text
- View/download PDF
14. An Ontology for Cardiothoracic Surgical Education and Clinical Data Analytics
- Author
-
Maryam Panahiazar, Yorick Chern, Ramon Riojas, Omar S. Latif, Usha Lokala, Dexter Hadley, Amit Sheth, and Ramin E. Beygui
- Abstract
The development of an ontology facilitates the organization of the variety of concepts used to describe different terms in different resources. The proposed ontology will facilitate the study of cardiothoracic surgical education and data analytics in electronic medical records (EMR) with the standard vocabulary.
- Published
- 2022
- Full Text
- View/download PDF
15. An Ontology for Cardiothoracic Surgical Education and Clinical Data Analytics
- Author
-
Maryam, Panahiazar, Yorick, Chern, Ramon, Riojas, Omar S, Latif, Usha, Lokala, Dexter, Hadley, Amit, Sheth, and Ramin E, Beygui
- Subjects
Biological Ontologies ,Data Science ,Electronic Health Records ,Vocabulary - Abstract
The development of an ontology facilitates the organization of the variety of concepts used to describe different terms in different resources. The proposed ontology will facilitate the study of cardiothoracic surgical education and data analytics in electronic medical records (EMR) with the standard vocabulary.
- Published
- 2022
16. Characterizing Trends in Synthetic Cannabinoid Receptor Agonist Use from Patient Clinical Evaluations during Medical Toxicology Consultation
- Author
-
Edward W. Boyer, Raminta Daniuaityte, Amit Sheth, Robert G. Carlson, Maryann Mazer-Amirshahi, Collin Tebo, Paul M. Wax, Sharan L. Campleman, and Jeffrey Brent
- Subjects
Cannabinoid Receptor Agonists ,Drug ,medicine.medical_specialty ,Substance-Related Disorders ,business.industry ,media_common.quotation_subject ,Qualitative interviews ,Psychoactive substance ,Medicine (miscellaneous) ,Synthetic cannabinoid receptor agonist ,Article ,Family medicine ,Acute care ,Medical toxicology ,Hallucinogens ,medicine ,Humans ,business ,Referral and Consultation ,General Psychology ,Cannabis ,media_common ,Potential toxicity - Abstract
Synthetic cannabinoid receptor agonists (SCRAs) are a new class of compounds with profound psychoactive effects and potential toxicity. This study characterizes patterns in SCRA abuse using qualitative interviews with individuals receiving medical toxicology consultation. Patients with suspected exposure to a new psychoactive substance were interviewed by medical toxicologists upon presentation for acute care. Investigators collected clinical and qualitative data including knowledge, attitudes, beliefs, and practices related to psychoactive substance use. Responses were categorized by identifying themes, and statistics were generated to describe patterns of use. Overall, 69% (86) of the 124 cases of novel psychoactive substance use entered into the registry were associated with exposure to SCRAs. Most patients (68.8%) had used SCRAs at least once before the presenting episode. 47.7% considered SCRAs to be very easy to obtain, and 44.2% reported paying for the substances while 32.6% acquired it for free. Nearly half (48.8%) of patients reported their primary reason for use was to get high; a small proportion used SCRAs to avoid testing positive on drug screening (6.9%) or as an alternative to marijuana (4.6%). Findings suggest an independent and stable culture is developing around the use of SCRAs separate from their appeal as an "undetectable" alternative to marijuana.
- Published
- 2020
- Full Text
- View/download PDF
17. Knowledge Graphs to Empower Humanity-Inspired AI Systems
- Author
-
Valerie L. Shalin, Amit P. Sheth, Hemant Purohit, and Amit Sheth
- Subjects
Adaptive behavior ,Computer Networks and Communications ,Computer science ,Perspective (graphical) ,Social environment ,020206 networking & telecommunications ,Cognition ,02 engineering and technology ,Personalization ,Human–computer interaction ,Adaptive system ,Intentionality ,Humanity ,0202 electrical engineering, electronic engineering, information engineering - Abstract
We present a theoretically motivated design perspective, challenges, and applications of next-generation artificial intelligence (AI) systems. We envision systems with greater capabilities for meaningful human interaction, including socially adaptive behavior that incorporates personalization and sensitivity to social context and intentionality. Personalized knowledge graphs combining generic, common-sense, and domain-specific knowledge with both sociocultural values and norms and individual cognitive models provide a foundation for building humanity-inspired AI systems.
- Published
- 2020
- Full Text
- View/download PDF
18. Emoji Understanding and Applications in Social Media
- Author
-
Horacio Saggion, Amit Sheth, Sanjaya Wijeratne, and Emre Kiciman
- Subjects
Emoji ,Computer science ,General Engineering ,Media studies ,Social media - Published
- 2020
- Full Text
- View/download PDF
19. Personalized Digital Phenotype Score, Healthcare Management and Intervention Strategies Using Knowledge Enabled Digital Health Framework for Pediatric Asthma
- Author
-
Utkarshani Jaimini and Amit Sheth
- Subjects
0303 health sciences ,Self-management ,business.industry ,Asthma management ,medicine.disease ,Digital health ,respiratory tract diseases ,Health administration ,Personalization ,03 medical and health sciences ,0302 clinical medicine ,Intervention (counseling) ,Self-monitoring ,medicine ,030212 general & internal medicine ,Medical emergency ,business ,Pediatric asthma ,030304 developmental biology - Abstract
Asthma is a personalized, and multi-trigger respiratory condition which requires continuous monitoring and management of symptoms and medication adherence. We developed kHealth: Knowledge-enabled Digital Healthcare Framework to monitor and manage the asthma symptoms, medication adherence, lung function, daily activity, sleep quality, indoor, and outdoor environmental triggers of pediatric asthma patients. The kHealth framework collects up to 1852 data points per patient per day. It is practically impossible for the clinicians, parents, and the patient to analyze this vast amount of multimodal data collected from the kHealth framework. In this chapter, we describe the personalized scores, clinically relevant asthma categorization using digital phenotype score, actionable insights, and potential intervention strategies for better pediatric asthma management.
- Published
- 2022
- Full Text
- View/download PDF
20. Contributors
- Author
-
Karima Boudaoud, Ozgu Can, Inaldo Capistrano Costa, Antonella Carbonaro, Gustavo de Assis Costa, Gerard Deepak, Antonio De Nicola, Moses E. Ekpenyong, R.R. Rubia Gandhi, Manas Gaur, Ayush Goyal, Amelie Gyrard, Funebi F. Ijebu, Abinaya Inbamani, M.A. Jabbar, Utkarshani Jaimini, Normunds Kante, Janis Klovins, Antonio Kung, Thirumeni Mariammal, Anastasija Nikiforova, Fernando Ortiz-Rodriguez, M. Preethi, R. Rajalakshmi, Vita Rovite, A. Siva Sakthi, Saeedeh Shekharpour, Amit Sheth, Deepak Surya S., Francesco Taglino, Krishnaprasad Thirunarayan, Sanju Tiwari, Kommomo J. Usang, Patience U. Usip, Veerapandi Veerasamy, and Rita Zgheib
- Published
- 2022
- Full Text
- View/download PDF
21. Evaluating Biomedical Word Embeddings for Vocabulary Alignment at Scale in the UMLS Metathesaurus Using Siamese Networks
- Author
-
Goonmeet, Bajaj, Vinh, Nguyen, Thilini, Wijesiriwardene, Hong Yung, Yip, Vishesh, Javangula, Srinivasan, Parthasarathy, Amit, Sheth, and Olivier, Bodenreider
- Subjects
Article - Abstract
Recent work uses a Siamese Network, initialized with BioWordVec embeddings (distributed word embeddings), for predicting synonymy among biomedical terms to automate a part of the UMLS (Unified Medical Language System) Metathesaurus construction process. We evaluate the use of contextualized word embeddings extracted from nine different biomedical BERT-based models for synonymy prediction in the UMLS by replacing BioWordVec embeddings with embeddings extracted from each biomedical BERT model using different feature extraction methods. Surprisingly, we find that Siamese Networks initialized with BioWordVec embeddings still outperform the Siamese Networks initialized with embedding extracted from biomedical BERT model.
- Published
- 2022
- Full Text
- View/download PDF
22. Reasoning over personalized healthcare knowledge graph: a case study of patients with allergies and symptoms
- Author
-
Amelie Gyrard, Utkarshani Jaimini, Manas Gaur, Saeedeh Shekharpour, Krishnaprasad Thirunarayan, and Amit Sheth
- Published
- 2022
- Full Text
- View/download PDF
23. Learning to Automate Follow-up Question Generation using Process Knowledge for Depression Triage on Reddit Posts
- Author
-
Shrey Gupta, Anmol Agarwal, Manas Gaur, Kaushik Roy, Vignesh Narayanan, Ponnurangam Kumaraguru, and Amit Sheth
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Artificial Intelligence (cs.AI) ,Computer Science - Artificial Intelligence ,Computation and Language (cs.CL) - Abstract
Conversational Agents (CAs) powered with deep language models (DLMs) have shown tremendous promise in the domain of mental health. Prominently, the CAs have been used to provide informational or therapeutic services to patients. However, the utility of CAs to assist in mental health triaging has not been explored in the existing work as it requires a controlled generation of follow-up questions (FQs), which are often initiated and guided by the mental health professionals (MHPs) in clinical settings. In the context of depression, our experiments show that DLMs coupled with process knowledge in a mental health questionnaire generate 12.54% and 9.37% better FQs based on similarity and longest common subsequence matches to questions in the PHQ-9 dataset respectively, when compared with DLMs without process knowledge support. Despite coupling with process knowledge, we find that DLMs are still prone to hallucination, i.e., generating redundant, irrelevant, and unsafe FQs. We demonstrate the challenge of using existing datasets to train a DLM for generating FQs that adhere to clinical process knowledge. To address this limitation, we prepared an extended PHQ-9 based dataset, PRIMATE, in collaboration with MHPs. PRIMATE contains annotations regarding whether a particular question in the PHQ-9 dataset has already been answered in the user's initial description of the mental health condition. We used PRIMATE to train a DLM in a supervised setting to identify which of the PHQ-9 questions can be answered directly from the user's post and which ones would require more information from the user. Using performance analysis based on MCC scores, we show that PRIMATE is appropriate for identifying questions in PHQ-9 that could guide generative DLMs towards controlled FQ generation suitable for aiding triaging. Dataset created as a part of this research: https://github.com/primate-mh/Primate2022
- Published
- 2022
- Full Text
- View/download PDF
24. Knowledge-infused Learning for Entity Prediction in Driving Scenes
- Author
-
Cory Henson, Ruwan Wickramarachchi, and Amit Sheth
- Subjects
Big Data ,autonomous driving ,Artificial Intelligence ,knowledge-infused learning ,Computer Science (miscellaneous) ,scene understanding ,Information technology ,entity prediction ,T58.5-58.64 ,neuro-symbolic computing ,knowledge graph embeddings ,Original Research ,Information Systems - Abstract
Scene understanding is a key technical challenge within the autonomous driving domain. It requires a deep semantic understanding of the entities and relations found within complex physical and social environments that is both accurate and complete. In practice, this can be accomplished by representing entities in a scene and their relations as a knowledge graph (KG). This scene knowledge graph may then be utilized for the task of entity prediction, leading to improved scene understanding. In this paper, we will define and formalize this problem as Knowledge-based Entity Prediction (KEP). KEP aims to improve scene understanding by predicting potentially unrecognized entities by leveraging heterogeneous, high-level semantic knowledge of driving scenes. An innovative neuro-symbolic solution for KEP is presented, based on knowledge-infused learning, which 1) introduces a dataset agnostic ontology to describe driving scenes, 2) uses an expressive, holistic representation of scenes with knowledge graphs, and 3) proposes an effective, non-standard mapping of the KEP problem to the problem of link prediction (LP) using knowledge-graph embeddings (KGE). Using real, complex and high-quality data from urban driving scenes, we demonstrate its effectiveness by showing that the missing entities may be predicted with high precision (0.87 Hits@1) while significantly outperforming the non-semantic/rule-based baselines.
- Published
- 2021
- Full Text
- View/download PDF
25. Personalized Health Knowledge Graph
- Author
-
Amelie, Gyrard, Manas, Gaur, Saeedeh, Shekarpour, Krishnaprasad, Thirunarayan, and Amit, Sheth
- Subjects
Article - Abstract
Our current health applications do not adequately take into account contextual and personalized knowledge about patients. In order to design “Personalized Coach for Healthcare” applications to manage chronic diseases, there is a need to create a Personalized Healthcare Knowledge Graph (PHKG) that takes into consideration a patient’s health condition (personalized knowledge) and enriches that with contextualized knowledge from environmental sensors and Web of Data (e.g., symptoms and treatments for diseases). To develop PHKG, aggregating knowledge from various heterogeneous sources such as the Internet of Things (IoT) devices, clinical notes, and Electronic Medical Records (EMRs) is necessary. In this paper, we explain the challenges of collecting, managing, analyzing, and integrating patients’ health data from various sources in order to synthesize and deduce meaningful information embodying the vision of the Data, Information, Knowledge, and Wisdom (DIKW) pyramid. Furthermore, we sketch a solution that combines: 1) IoT data analytics, and 2) explicit knowledge and illustrate it using three chronic disease use cases – asthma, obesity, and Parkinson’s.
- Published
- 2021
26. Drug Abuse Ontology to Harness Web-Based Data for Substance Use Epidemiology Research: Ontology Development Study
- Author
-
Usha Lokala, Francois Lamy, Raminta Daniulaityte, Manas Gaur, Amelie Gyrard, Krishnaprasad Thirunarayan, Ugur Kursuncu, and Amit Sheth
- Subjects
Public Health, Environmental and Occupational Health ,Health Informatics - Abstract
Background Web-based resources and social media platforms play an increasingly important role in health-related knowledge and experience sharing. There is a growing interest in the use of these novel data sources for epidemiological surveillance of substance use behaviors and trends. Objective The key aims were to describe the development and application of the drug abuse ontology (DAO) as a framework for analyzing web-based and social media data to inform public health and substance use research in the following areas: determining user knowledge, attitudes, and behaviors related to nonmedical use of buprenorphine and illicitly manufactured opioids through the analysis of web forum data Prescription Drug Abuse Online Surveillance; analyzing patterns and trends of cannabis product use in the context of evolving cannabis legalization policies in the United States through analysis of Twitter and web forum data (eDrugTrends); assessing trends in the availability of novel synthetic opioids through the analysis of cryptomarket data (eDarkTrends); and analyzing COVID-19 pandemic trends in social media data related to 13 states in the United States as per Mental Health America reports. Methods The domain and scope of the DAO were defined using competency questions from popular ontology methodology (101 ontology development). The 101 method includes determining the domain and scope of ontology, reusing existing knowledge, enumerating important terms in ontology, defining the classes, their properties and creating instances of the classes. The quality of the ontology was evaluated using a set of tools and best practices recognized by the semantic web community and the artificial intelligence community that engage in natural language processing. Results The current version of the DAO comprises 315 classes, 31 relationships, and 814 instances among the classes. The ontology is flexible and can easily accommodate new concepts. The integration of the ontology with machine learning algorithms dramatically decreased the false alarm rate by adding external knowledge to the machine learning process. The ontology is recurrently updated to capture evolving concepts in different contexts and applied to analyze data related to social media and dark web marketplaces. Conclusions The DAO provides a powerful framework and a useful resource that can be expanded and adapted to a wide range of substance use and mental health domains to help advance big data analytics of web-based data for substance use epidemiology research.
- Published
- 2022
- Full Text
- View/download PDF
27. Keynote 3: Don’t Handicap AI without Explicit Knowledge
- Author
-
Amit Sheth
- Subjects
Cognitive science ,Knowledge representation and reasoning ,Syntax (programming languages) ,Computer science ,Human intelligence ,media_common.quotation_subject ,Cognition ,Common sense ,computer.software_genre ,GeneralLiterature_MISCELLANEOUS ,Expert system ,Variety (cybernetics) ,Explicit knowledge ,computer ,media_common - Abstract
Knowledge representation as expert system rules or using frames and variety of logics, played a key role in capturing explicit knowledge during the hay days of AI in the past century. Such knowledge, aligned with planning and reasoning are part of what we refer to as Symbolic AI. The resurgent AI of this century in the form of Statistical AI has benefitted from massive data and computing. On some tasks, deep learning methods have even exceeded human performance levels. This gave the false sense that data alone is enough, and explicit knowledge is not needed. But as we start chasing machine intelligence that is comparable with human intelligence, there is an increasing realization that we cannot do without explicit knowledge. Neuroscience (role of long-term memory, strong interactions between different specialized regions of data on tasks such as multimodal sensing), cognitive science (bottom brain versus top brain, perception versus cognition), brain-inspired computing, behavioral economics (system 1 versus system 2), and other disciplines point to need for furthering AI to neuro-symbolic AI (i.e., hybrid of Statistical AI and Symbolic AI, also referred to as the third wave of AI). As we make this progress, the role of explicit knowledge becomes more evident. I will specifically look at our endeavor to support human-like intelligence, our desire for AI systems to interact with humans naturally, and our need to explain the path and reasons for AI systems’ workings. Nevertheless, the variety of knowledge needed to support understanding and intelligence is varied and complex. Using the example of progressing from NLP to NLU, I will demonstrate the dimensions of explicit knowledge, which may include, linguistic, language syntax, common sense, general (world model), specialized (e.g., geographic), and domain-specific (e.g., mental health) knowledge. I will also argue that despite this complexity, such knowledge can be scalability created and maintained (even dynamically or continually). Finally, I will describe our work on knowledge-infused learning as an example strategy for fusing statistical and symbolic AI in a variety of ways.
- Published
- 2021
- Full Text
- View/download PDF
28. A CdZnTeSe gamma spectrometer trained by deep convolutional neural network for radioisotope identification
- Author
-
Utpal N. Roy, Sandeep K. Chaudhuri, Amit Sheth, Forest Agostinelli, Rojina Panta, Ritwik Nag, Joshua W. Kleppinger, Ralph B. James, Krishna C. Mandal, and Kaushik Roy
- Subjects
Spectrometer ,Artificial neural network ,Physics::Instrumentation and Detectors ,Computer science ,Gamma ray spectrometer ,business.industry ,Preamplifier ,Astrophysics::High Energy Astrophysical Phenomena ,Gamma ray ,Convolutional neural network ,Particle detector ,Identification (information) ,Computer vision ,Artificial intelligence ,business - Abstract
We report the implementation of a deep convolutional neural network to train a high-resolution room-temperature CdZnTeSe based gamma ray spectrometer for accurate and precise determination of gamma ray energies for radioisotope identification. The prototype learned spectrometer consists of a NI PCI 5122 fast digitizer connected to a pre-amplifier to recognize spectral features in a sequence of data. We used simulated preamplifier pulses that resemble actual data for various gamma photon energies to train a CNN on the equivalent of 90 seconds worth of data and validated it on 10 seconds worth of simulated data.
- Published
- 2021
- Full Text
- View/download PDF
29. Comparing Suicide Risk Insights derived from Clinical and Social Media data
- Author
-
Rohith K, Thiruvalluru, Manas, Gaur, Krishnaprasad, Thirunarayan, Amit, Sheth, and Jyotishman, Pathak
- Subjects
Suicide ,Adolescent ,Risk Factors ,Substance-Related Disorders ,Humans ,Articles ,Social Media ,Suicidal Ideation - Abstract
Suicide is the 10(th) leading cause of death in the US and the 2(nd) leading cause of death among teenagers. Clinical and psychosocial factors contribute to suicide risk (SRFs), although documentation and self-expression of such factors in EHRs and social networks vary. This study investigates the degree of variance across EHRs and social networks. We performed subjective analysis of SRFs, such as self-harm, bullying, impulsivity, family violence/discord, using >13.8 Million clinical notes on 123,703 patients with mental health conditions. We clustered clinical notes using semantic embeddings under a set of SRFs. Likewise, we clustered 2180 suicidal users on r/SuicideWatch (~30,000 posts) and performed comparative analysis. Top-3 SRFs documented in EHRs were depressive feelings (24.3%), psychological disorders (21.1%), drug abuse (18.2%). In r/SuicideWatch, gun-ownership (17.3%), self-harm (14.6%), bullying (13.2%) were Top-3 SRFs. Mentions of Family violence, racial discrimination, and other important SRFs contributing to suicide risk were missing from both platforms.
- Published
- 2021
30. 'Who can help me?': Knowledge Infused Matching of Support Seekers and Support Providers during COVID-19 on Reddit
- Author
-
Biplav Srivastava, Manas Gaur, Kaushik Roy, Aditya Sharma, and Amit Sheth
- Subjects
FOS: Computer and information sciences ,0303 health sciences ,Matching (statistics) ,Social computing ,business.industry ,Internet privacy ,Context (language use) ,Semantics ,Mental health ,Computer Science - Information Retrieval ,03 medical and health sciences ,Knowledge-based systems ,Seekers ,0302 clinical medicine ,business ,Psychology ,Information Retrieval (cs.IR) ,030217 neurology & neurosurgery ,Natural language ,030304 developmental biology - Abstract
During the ongoing COVID-19 crisis, subreddits on Reddit, such as r/Coronavirus saw a rapid growth in user's requests for help (support seekers - SSs) including individuals with varying professions and experiences with diverse perspectives on care (support providers - SPs). Currently, knowledgeable human moderators match an SS with a user with relevant experience, i.e, an SP on these subreddits. This unscalable process defers timely care. We present a medical knowledge-infused approach to efficient matching of SS and SPs validated by experts for the users affected by anxiety and depression, in the context of with COVID-19. After matching, each SP to an SS labeled as either supportive, informative, or similar (sharing experiences) using the principles of natural language inference. Evaluation by 21 domain experts indicates the efficacy of incorporated knowledge and shows the efficacy the matching system.
- Published
- 2021
- Full Text
- View/download PDF
31. Modeling Islamist Extremist Communications on Social Media using Contextual Dimensions
- Author
-
Krishnaprasad Thirunarayan, Manas Gaur, Valerie L. Shalin, Amit Sheth, Carlos Castillo, Dilshod Achilov, I. Budak Arpinar, Ugur Kursuncu, and Amanuel Alambo
- Subjects
Social and Information Networks (cs.SI) ,FOS: Computer and information sciences ,Persuasion ,Radicalization ,Computer Science - Computation and Language ,Computer Networks and Communications ,business.industry ,media_common.quotation_subject ,Internet privacy ,Computer Science - Social and Information Networks ,Ambiguity ,Human-Computer Interaction ,Politics ,Mainstream ,Social media ,Sociology ,Ideology ,business ,Computation and Language (cs.CL) ,Social Sciences (miscellaneous) ,media_common ,Meaning (linguistics) - Abstract
Terror attacks have been linked in part to online extremist content. Although tens of thousands of Islamist extremism supporters consume such content, they are a small fraction relative to peaceful Muslims. The efforts to contain the ever-evolving extremism on social media platforms have remained inadequate and mostly ineffective. Divergent extremist and mainstream contexts challenge machine interpretation, with a particular threat to the precision of classification algorithms. Our context-aware computational approach to the analysis of extremist content on Twitter breaks down this persuasion process into building blocks that acknowledge inherent ambiguity and sparsity that likely challenge both manual and automated classification. We model this process using a combination of three contextual dimensions -- religion, ideology, and hate -- each elucidating a degree of radicalization and highlighting independent features to render them computationally accessible. We utilize domain-specific knowledge resources for each of these contextual dimensions such as Qur'an for religion, the books of extremist ideologues and preachers for political ideology and a social media hate speech corpus for hate. Our study makes three contributions to reliable analysis: (i) Development of a computational approach rooted in the contextual dimensions of religion, ideology, and hate that reflects strategies employed by online Islamist extremist groups, (ii) An in-depth analysis of relevant tweet datasets with respect to these dimensions to exclude likely mislabeled users, and (iii) A framework for understanding online radicalization as a process to assist counter-programming. Given the potentially significant social impact, we evaluate the performance of our algorithms to minimize mislabeling, where our approach outperforms a competitive baseline by 10.2% in precision., Comment: 22 pages
- Published
- 2019
- Full Text
- View/download PDF
32. Extending Patient-Chatbot Experience with Internet-of-Things and Background Knowledge: Case Studies with Healthcare Applications
- Author
-
Amit Sheth, Hong Yung Yip, and Saeedeh Shekarpour
- Subjects
Computer Networks and Communications ,business.industry ,Computer science ,media_common.quotation_subject ,Psychological intervention ,02 engineering and technology ,computer.software_genre ,Chatbot ,Article ,Smartwatch ,Artificial Intelligence ,Human–computer interaction ,Enabling ,Health care ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Quality (business) ,Dialog system ,business ,computer ,Wearable technology ,media_common - Abstract
Presents case studies in the healthcare industry that focus on the use of Chatbots to improve patient monitoring and medical services. The transition towards personalized health management requires public awareness about management strategies of self-monitoring, self-appraisal, and self-management, eventually paving a way to more timely interventions and higher quality patient–clinician interactions. A key enabler is patient generated health data, fueled in good part by the growth in wearable devices including smart watches and other Internet-of- Things (IoT) for health-tracking. These tracking devices provide “low-level” monitoring signals indicating health conditions such as sleep apnea and heart rhythm disorder. However, to make more sense of IoT data, it is imperative that we develop cognitive approaches where they mine, interlink, and abstract diverse IoT data. These cognitive approaches often needs to keep the user closely engaged to acquire more information, to obtain feedback, to collect verbal health conditions, and to provide intervention and management actions. The chatbot technology was initially introduced as an artificial conversational agent to simulate conversations with a user using voice or text interactions.
- Published
- 2019
- Full Text
- View/download PDF
33. Designing Children’s New Learning Partner: Collaborative Artificial Intelligence for Learning to Solve the Rubik’s Cube
- Author
-
Barnett Berry, Forest Agostinelli, Biplav Srivastava, Vedant Khandelwal, Hengtao Tang, Mihir Mavalankar, Amit Sheth, Matthew J. Irvin, and Dezhi Wu
- Subjects
Process (engineering) ,Computer science ,business.industry ,media_common.quotation_subject ,Deep learning ,05 social sciences ,Multitude ,Decision tree ,020207 software engineering ,Cube (algebra) ,02 engineering and technology ,Variety (cybernetics) ,0202 electrical engineering, electronic engineering, information engineering ,Curiosity ,Reinforcement learning ,0501 psychology and cognitive sciences ,Artificial intelligence ,business ,050107 human factors ,media_common - Abstract
Developing the problem solving skills of children is a challenging problem that is crucial for the future of our society. Given that artificial intelligence (AI) has been used to solve problems across a wide variety of domains, AI offers unique opportunities to develop problem solving skills using a multitude of tasks that pique the curiosity of children. To make this a reality, it is necessary to address the uninterpretable “black-box” that AI often appears to be. Towards this goal, we design a collaborative artificial intelligence algorithm that uses a human-in-the-loop approach to allow students to discover their own personalized solutions to problems. This collaborative algorithm builds on state-of-the-art AI algorithms and leverages additional interpretable structures, namely knowledge graphs and decision trees, to create a fully interpretable process that is able to explain solutions in their entirety. We describe this algorithm when applied to solving the Rubik’s cube as well as our planned user-interface and assessment methods.
- Published
- 2021
- Full Text
- View/download PDF
34. Assessing the Severity of Health States based on Social Media Posts
- Author
-
Joy Prakash Sain, Sriparna Saha, Shweta Yadav, Pushpak Bhattacharyya, Asif Ekbal, and Amit Sheth
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,020205 medical informatics ,business.industry ,Online health communities ,media_common.quotation_subject ,Applied psychology ,Natural language understanding ,02 engineering and technology ,Peer support ,computer.software_genre ,Literal and figurative language ,3. Good health ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,Personality ,020201 artificial intelligence & image processing ,The Internet ,Social media ,Artificial intelligence ,Psychology ,business ,Computation and Language (cs.CL) ,computer ,media_common - Abstract
The unprecedented growth of Internet users has resulted in an abundance of unstructured information on social media including health forums, where patients request health-related information or opinions from other users. Previous studies have shown that online peer support has limited effectiveness without expert intervention. Therefore, a system capable of assessing the severity of health state from the patients' social media posts can help health professionals (HP) in prioritizing the user's post. In this study, we inspect the efficacy of different aspects of Natural Language Understanding (NLU) to identify the severity of the user's health state in relation to two perspectives(tasks) (a) Medical Condition (i.e., Recover, Exist, Deteriorate, Other) and (b) Medication (i.e., Effective, Ineffective, Serious Adverse Effect, Other) in online health communities. We propose a multiview learning framework that models both the textual content as well as contextual-information to assess the severity of the user's health state. Specifically, our model utilizes the NLU views such as sentiment, emotions, personality, and use of figurative language to extract the contextual information. The diverse NLU views demonstrate its effectiveness on both the tasks and as well as on the individual disease to assess a user's health.
- Published
- 2021
- Full Text
- View/download PDF
35. The Duality of Data and Knowledge Across the Three Waves of AI
- Author
-
Krishnaprasad Thirunarayan and Amit Sheth
- Subjects
FOS: Computer and information sciences ,I.2.4 ,Human intelligence ,Statistical learning ,Computer science ,Computer Science - Artificial Intelligence ,Duality (mathematics) ,Intelligent decision support system ,Cognition ,02 engineering and technology ,Data science ,GeneralLiterature_MISCELLANEOUS ,Computer Science Applications ,ComputingMethodologies_PATTERNRECOGNITION ,Artificial Intelligence (cs.AI) ,Hardware and Architecture ,020204 information systems ,Enabling ,0202 electrical engineering, electronic engineering, information engineering ,Software ,Repurposing ,Ai systems - Abstract
We discuss how over the last 30 to 50 years, Artificial Intelligence (AI) systems that focused only on data have been handicapped, and how knowledge has been critical in developing smarter, intelligent, and more effective systems. In fact, the vast progress in AI can be viewed in terms of the three waves of AI as identified by DARPA. During the first wave, handcrafted knowledge has been at the center-piece, while during the second wave, the data-driven approaches supplanted knowledge. Now we see a strong role and resurgence of knowledge fueling major breakthroughs in the third wave of AI underpinning future intelligent systems as they attempt human-like decision making, and seek to become trusted assistants and companions for humans. We find a wider availability of knowledge created from diverse sources, using manual to automated means both by repurposing as well as by extraction. Using knowledge with statistical learning is becoming increasingly indispensable to help make AI systems more transparent and auditable. We will draw a parallel with the role of knowledge and experience in human intelligence based on cognitive science, and discuss emerging neuro-symbolic or hybrid AI systems in which knowledge is the critical enabler for combining capabilities of the data-intensive statistical AI systems with those of symbolic AI systems, resulting in more capable AI systems that support more human-like intelligence., Comment: A version of this will appear as (cite as): IT Professional Magazine (special section to commemorate the 75th Anniversary of IEEE Computer Society), 23 (3) April-May 2021
- Published
- 2021
- Full Text
- View/download PDF
36. Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits
- Author
-
Amit Sheth, Manas Gaur, Kaushik Roy, and Qi Zhang
- Subjects
Information retrieval ,Computer science ,business.industry ,Natural (music) ,Regret ,Context (language use) ,Use case ,Recommender system ,Space (commercial competition) ,business ,Adaptation (computer science) ,Online advertising - Abstract
Contextual Bandits find important use cases in various real-life scenarios such as online advertising, recommendation systems, healthcare, etc. However, most of the algorithms use flat feature vectors to represent context whereas, in the real world, there is a varying number of objects and relations among them to model in the context. For example, in a music recommendation system, the user context contains what music they listen to, which artists create this music, the artist albums, etc. Adding richer relational context representations also introduces a much larger context space making exploration-exploitation harder. To improve the efficiency of exploration-exploitation knowledge about the context can be infused to guide the exploration-exploitation strategy. Relational context representations allow a natural way for humans to specify knowledge owing to their descriptive nature. We propose an adaptation of Knowledge Infused Policy Gradients to the Contextual Bandit setting and a novel Knowledge Infused Policy Gradients Upper Confidence Bound algorithm and perform an experimental analysis of a simulated music recommendation dataset and various real-life datasets where expert knowledge can drastically reduce the total regret and where it cannot.
- Published
- 2021
- Full Text
- View/download PDF
37. Knowledge-intensive Language Understanding for Explainable AI
- Author
-
Manas Gaur, Keyur Faldu, Amit Sheth, and Kaushik Roy
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Networks and Communications ,Watson ,Computer science ,Space (commercial competition) ,Data science ,Transparency (behavior) ,GeneralLiterature_MISCELLANEOUS ,Harm ,ComputingMethodologies_PATTERNRECOGNITION ,Artificial Intelligence (cs.AI) ,Knowledge extraction ,Feature (machine learning) ,Task analysis ,Domain knowledge ,Computation and Language (cs.CL) - Abstract
AI systems have seen significant adoption in various domains. At the same time, further adoption in some domains is hindered by inability to fully trust an AI system that it will not harm a human. Besides the concerns for fairness, privacy, transparency, and explainability are key to developing trusts in AI systems. As stated in describing trustworthy AI "Trust comes through understanding. How AI-led decisions are made and what determining factors were included are crucial to understand." The subarea of explaining AI systems has come to be known as XAI. Multiple aspects of an AI system can be explained; these include biases that the data might have, lack of data points in a particular region of the example space, fairness of gathering the data, feature importances, etc. However, besides these, it is critical to have human-centered explanations that are directly related to decision-making similar to how a domain expert makes decisions based on "domain knowledge," that also include well-established, peer-validated explicit guidelines. To understand and validate an AI system's outcomes (such as classification, recommendations, predictions), that lead to developing trust in the AI system, it is necessary to involve explicit domain knowledge that humans understand and use., Comment: To appear in IEEE Internet Computing, September/October 2021 Issue
- Published
- 2021
- Full Text
- View/download PDF
38. Characterization of Time-variant and Time-invariant Assessment of Suicidality on Reddit using C-SSRS
- Author
-
Amanuel Alambo, Amit Sheth, Vamsi Aribandi, Jonanthan Beich, Krishnaprasad Thirunarayan, Ugur Kursuncu, Jyotishman Pathak, and Manas Gaur
- Subjects
Suicide Prevention ,FOS: Computer and information sciences ,H.4 ,Lexicography ,J.3 ,Databases, Factual ,J.4 ,Epidemiology ,Applied psychology ,Social Sciences ,Suicide, Attempted ,02 engineering and technology ,I.2 ,Sociology ,Medicine and Health Sciences ,0202 electrical engineering, electronic engineering, information engineering ,Psychology ,Computer Science - Computation and Language ,Multidisciplinary ,Depression ,Social Communication ,Computer Science - Social and Information Networks ,16. Peace & justice ,3. Good health ,Suicide ,Social Networks ,Area Under Curve ,Medicine ,Computation and Language (cs.CL) ,Network Analysis ,Research Article ,Computer and Information Sciences ,Computer Science - Artificial Intelligence ,Science ,MEDLINE ,Temporality ,Risk Assessment ,Suicidal Ideation ,Deep Learning ,020204 information systems ,Mental Health and Psychiatry ,medicine ,Humans ,Social media ,Lexicons ,Psychiatric Status Rating Scales ,Social and Information Networks (cs.SI) ,Behavior ,Suicide attempt ,Mood Disorders ,business.industry ,Deep learning ,Gold standard ,Biology and Life Sciences ,Linguistics ,020206 networking & telecommunications ,Mental illness ,medicine.disease ,Communications ,Artificial Intelligence (cs.AI) ,ROC Curve ,Medical Risk Factors ,Artificial intelligence ,business ,Columbia Suicide Severity Rating Scale ,Social Media ,Mental Health Therapies - Abstract
Suicide is the 10th leading cause of death in the U.S (1999-2019). However, predicting when someone will attempt suicide has been nearly impossible. In the modern world, many individuals suffering from mental illness seek emotional support and advice on well-known and easily-accessible social media platforms such as Reddit. While prior artificial intelligence research has demonstrated the ability to extract valuable information from social media on suicidal thoughts and behaviors, these efforts have not considered both severity and temporality of risk. The insights made possible by access to such data have enormous clinical potential - most dramatically envisioned as a trigger to employ timely and targeted interventions (i.e., voluntary and involuntary psychiatric hospitalization) to save lives. In this work, we address this knowledge gap by developing deep learning algorithms to assess suicide risk in terms of severity and temporality from Reddit data based on the Columbia Suicide Severity Rating Scale (C-SSRS). In particular, we employ two deep learning approaches: time-variant and time-invariant modeling, for user-level suicide risk assessment, and evaluate their performance against a clinician-adjudicated gold standard Reddit corpus annotated based on the C-SSRS. Our results suggest that the time-variant approach outperforms the time-invariant method in the assessment of suicide-related ideations and supportive behaviors (AUC:0.78), while the time-invariant model performed better in predicting suicide-related behaviors and suicide attempt (AUC:0.64). The proposed approach can be integrated with clinical diagnostic interviews for improving suicide risk assessments., Comment: 24 Pages, 8 Tables, 6 Figures; Accepted by PLoS One ; One of the two mentioned Datasets in the manuscript has Closed Access. We will make it public after PLoS One produces the manuscript
- Published
- 2021
- Full Text
- View/download PDF
39. Twitris v3: From Citizen Sensing to Analysis, Coordination and Action
- Author
-
Hemant Purohit and Amit Sheth
- Abstract
Citizen sensing, with a billion plus active users and billion plus tweets/week, is complemented by shared information from contextually relevant Web of Data (blogs, news, and media objects) and background knowledge. How can these enable us in informing, understanding and managing a broad variety of activities and events locally and around the world? Twitris, currently in version 3, is a scalable and interactive platform which continuously collects, aggregates, integrates, and analyzes the above forms of data and knowledge to give deeper insights, as well as facilitate research and development on coordination and targeted actions related to any event. In this demonstration, we will show Twitris’ comprehensive capabilities in spatio-temporal-thematic, people-contentnetwork, and sentiment-emotion-subjectivity analyses, with examples from business intelligence including brand tracking and advertising campaigns, social/political unrests, and disaster events such as U.S. Election 2012, Occupy Wall Street (OWS) protest, Hurricane Sandy, etc. Visitors from diverse backgrounds will be able to play with the system for analyses of archived as well as live events during the demo.
- Published
- 2021
- Full Text
- View/download PDF
40. Drug Abuse Ontology for the study of Substance Use Epidemiology on Social Media and Dark Web: Ontology development and usability study (Preprint)
- Author
-
Usha Lokala, Raminta Daniulaityte, Francois Lamy, Manas Gaur, Krishnaprasad Thirunarayan, Ugur Kursuncu, and Amit Sheth
- Abstract
BACKGROUND Web-based resources and social media platforms play an increasingly important role in health-related knowledge and experience sharing. There is a growing interest in the utilization of these novel data sources for epidemiological surveillance of substance use behaviors and trends. OBJECTIVE The key aims are to describe the development and application of the Drug Abuse Ontology as a framework for analyzing web-based data to inform public health surveillance for the following applications: 1) determining user knowledge, attitudes, and behaviors related to non-medical use of buprenorphine and other illicit opioids through analysis of web forum data; 2) understanding patterns and trends of cannabis product use in the context of evolving cannabis legalization policies in the U.S through analysis of Twitter and web forum data; and 3) gleaning trends in the availability of novel synthetic opioids through analysis of crypto market data. METHODS The domain and scope of the drug abuse ontology were defined using competency questions from two popular ontology methodologies (Neon and 101 ontology development methodology). The quality of the ontology is evaluated with a set of tools and best practices recognized by the Semantic Web community and the AI community that engage in natural language processing. The standard ontology metrics are also presented. RESULTS The current version of Drug Abuse Ontology comprises 315 classes, 31 relationships, and 814 instances among the classes. The ontology is flexible and can easily accommodate new concepts. The integration of the ontology with machine learning algorithms dramatically decreases the false alarm rate by adding external knowledge to the learning process. The ontology is being updated to capture evolving concepts and has been used for four different projects: PREDOSE, eDrugTrends, eDarkTrends, DAO applications in Mental Health and COVID scenario. CONCLUSIONS It has been found that the developed Drug Abuse Ontology (DAO) is useful to identify the most frequently used terms/slang terms on social media/dark web related to drug abuse posted by the general population .
- Published
- 2020
- Full Text
- View/download PDF
41. Knowledge-Infused Abstractive Summarization of Clinical Diagnostic Interviews: Framework Development Study (Preprint)
- Author
-
Gaur Manas, Vamsi Aribandi, Ugur Kursuncu, Amanuel Alambo, Valerie L Shalin, Krishnaprasad Thirunarayan, Jonathan Beich, Meera Narasimhan, and Amit Sheth
- Abstract
BACKGROUND In clinical diagnostic interviews, mental health professionals (MHPs) implement a care practice that involves asking open questions (eg, “What do you want from your life?” “What have you tried before to bring change in your life?”) while listening empathetically to patients. During these interviews, MHPs attempted to build a trusting human-centered relationship while collecting data necessary for professional medical and psychiatric care. Often, because of the social stigma of mental health disorders, patient discomfort in discussing their presenting problem may add additional complexities and nuances to the language they use, that is, hidden signals among noisy content. Therefore, a focused, well-formed, and elaborative summary of clinical interviews is critical to MHPs in making informed decisions by enabling a more profound exploration of a patient’s behavior, especially when it endangers life. OBJECTIVE The aim of this study is to propose an unsupervised, knowledge-infused abstractive summarization (KiAS) approach that generates summaries to enable MHPs to perform a well-informed follow-up with patients to improve the existing summarization methods built on frequency heuristics by creating more informative summaries. METHODS Our approach incorporated domain knowledge from the Patient Health Questionnaire-9 lexicon into an integer linear programming framework that optimizes linguistic quality and informativeness. We used 3 baseline approaches: extractive summarization using the SumBasic algorithm, abstractive summarization using integer linear programming without the infusion of knowledge, and abstraction over extractive summarization to evaluate the performance of KiAS. The capability of KiAS on the Distress Analysis Interview Corpus-Wizard of Oz data set was demonstrated through interpretable qualitative and quantitative evaluations. RESULTS KiAS generates summaries (7 sentences on average) that capture informative questions and responses exchanged during long (58 sentences on average), ambiguous, and sparse clinical diagnostic interviews. The summaries generated using KiAS improved upon the 3 baselines by 23.3%, 4.4%, 2.5%, and 2.2% for thematic overlap, Flesch Reading Ease, contextual similarity, and Jensen Shannon divergence, respectively. On the Recall-Oriented Understudy for Gisting Evaluation-2 and Recall-Oriented Understudy for Gisting Evaluation-L metrics, KiAS showed an improvement of 61% and 49%, respectively. We validated the quality of the generated summaries through visual inspection and substantial interrater agreement from MHPs. CONCLUSIONS Our collaborator MHPs observed the potential utility and significant impact of KiAS in leveraging valuable but voluminous communications that take place outside of normally scheduled clinical appointments. This study shows promise in generating semantically relevant summaries that will help MHPs make informed decisions about patient status.
- Published
- 2020
- Full Text
- View/download PDF
42. eDarkFind: Unsupervised Multi-view Learning for Sybil Account Detection
- Author
-
Krishnaprasad Thirunarayan, Usha Lokala, Shweta Yadav, Raminta Daniulaityte, Ramnath Kumar, Amit Sheth, and Francois R. Lamy
- Subjects
Cryptocurrency ,business.industry ,Computer science ,Darknet ,Volume (computing) ,010501 environmental sciences ,Ontology (information science) ,Encryption ,01 natural sciences ,Data science ,03 medical and health sciences ,0302 clinical medicine ,030212 general & internal medicine ,business ,0105 earth and related environmental sciences ,Anonymity - Abstract
Darknet crypto markets are online marketplaces using crypto currencies (e.g., Bitcoin, Monero) and advanced encryption techniques to offer anonymity to vendors and consumers trading for illegal goods or services. The exact volume of substances advertised and sold through these crypto markets is difficult to assess, at least partially, because vendors tend to maintain multiple accounts (or Sybil accounts) within and across different crypto markets. Linking these different accounts will allow us to accurately evaluate the volume of substances advertised across the different crypto markets by each vendor. In this paper, we present a multi-view unsupervised framework (eDarkFind) that helps modeling vendor characteristics and facilitates Sybil account detection. We employ a multi-view learning paradigm to generalize and improve the performance by exploiting the diverse views from multiple rich sources such as BERT, stylometric, and location representation. Our model is further tailored to take advantage of domain-specific knowledge such as the Drug Abuse Ontology to take into consideration the substance information. We performed extensive experiments and demonstrated that the multiple views obtained from diverse sources can be effective in linking Sybil accounts. Our proposed eDarkFind model achieves an accuracy of 98% on three real-world datasets which shows the generality of the approach.
- Published
- 2020
- Full Text
- View/download PDF
43. Knowledge Graph semantic enhancement of input data for improving AI
- Author
-
Shreyansh Bhatt, Jinjin Zhao, Valerie L. Shalin, and Amit Sheth
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Networks and Communications ,Computer science ,Group method of data handling ,02 engineering and technology ,Semantics ,Machine learning ,computer.software_genre ,Machine Learning (cs.LG) ,0202 electrical engineering, electronic engineering, information engineering ,Computer Science - Computation and Language ,business.industry ,Intelligent decision support system ,020206 networking & telecommunications ,Graph theory ,Term (time) ,I.2.1 ,Artificial Intelligence (cs.AI) ,Knowledge graph ,Task analysis ,Graph (abstract data type) ,Artificial intelligence ,business ,computer ,Computation and Language (cs.CL) - Abstract
Intelligent systems designed using machine learning algorithms require a large number of labeled data. Background knowledge provides complementary, real-world factual information that can augment the limited labeled data to train a machine learning algorithm. The term Knowledge Graph (KG) is in vogue as for many practical applications, it is convenient and useful to organize this background knowledge in the form of a graph. Recent academic research and implemented industrial intelligent systems have shown promising performance for machine learning algorithms that combine training data with a knowledge graph. In this article, we discuss the use of relevant KGs to enhance the input data for two applications that use machine learning—recommendation and community detection. The KG improves both accuracy and explainability.
- Published
- 2020
- Full Text
- View/download PDF
44. Identifying Depressive Symptoms from Tweets: Figurative Language Enabled Multitask Learning Framework
- Author
-
Krishnaprasad Thirunarayan, Amit Sheth, Jainish Chauhan, Joy Prakash Sain, Jeremiah A. Schumm, and Shweta Yadav
- Subjects
FOS: Computer and information sciences ,Decision support system ,Computer Science - Computation and Language ,Computer science ,business.industry ,Multi-task learning ,020206 networking & telecommunications ,02 engineering and technology ,computer.software_genre ,Triage ,Mental health ,Literal and figurative language ,Task (project management) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Social media ,Disorder screening ,Artificial intelligence ,business ,Computation and Language (cs.CL) ,computer ,Depressive symptoms ,Natural language processing - Abstract
Existing studies on using social media for deriving mental health status of users focus on the depression detection task. However, for case management and referral to psychiatrists, healthcare workers require practical and scalable depressive disorder screening and triage system. This study aims to design and evaluate a decision support system (DSS) to reliably determine the depressive triage level by capturing fine-grained depressive symptoms expressed in user tweets through the emulation of Patient Health Questionnaire-9 (PHQ-9) that is routinely used in clinical practice. The reliable detection of depressive symptoms from tweets is challenging because the 280-character limit on tweets incentivizes the use of creative artifacts in the utterances and figurative usage contributes to effective expression. We propose a novel BERT based robust multi-task learning framework to accurately identify the depressive symptoms using the auxiliary task of figurative usage detection. Specifically, our proposed novel task sharing mechanism, co-task aware attention, enables automatic selection of optimal information across the BERT layers and tasks by soft-sharing of parameters. Our results show that modeling figurative usage can demonstrably improve the model's robustness and reliability for distinguishing the depression symptoms., Accepted for publication in COLING 2020
- Published
- 2020
- Full Text
- View/download PDF
45. ALONE: A Dataset for Toxic Behavior Among Adolescents on Twitter
- Author
-
Amit Sheth, Manas Gaur, Ugur Kursuncu, Hale Inan, I. Budak Arpinar, Thilini Wijesiriwardene, Krishnaprasad Thirunarayan, and Valerie L. Shalin
- Subjects
business.industry ,Emoji ,Internet privacy ,Context (language use) ,02 engineering and technology ,Mental health ,Metadata ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Harassment ,020201 artificial intelligence & image processing ,Narrative ,Social media ,Internet users ,business ,Psychology - Abstract
The convenience of social media has also enabled its misuse, potentially resulting in toxic behavior. Nearly 66% of internet users have observed online harassment, and 41% claim personal experience, with 18% facing severe forms of online harassment. This toxic communication has a significant impact on the well-being of young individuals, affecting mental health and, in some cases, resulting in suicide. These communications exhibit complex linguistic and contextual characteristics, making recognition of such narratives challenging. In this paper, we provide a multimodal dataset of toxic social media interactions between confirmed high school students, called ALONE (AdoLescents ON twittEr), along with descriptive explanation. Each instance of interaction includes tweets, images, emoji and related metadata. Our observations show that individual tweets do not provide sufficient evidence for toxic behavior, and meaningful use of context in interactions can enable highlighting or exonerating tweets with purported toxicity.
- Published
- 2020
- Full Text
- View/download PDF
46. Reports of the Workshops of the 32nd AAAI Conference on Artificial Intelligence
- Author
-
Joseph C. Osborn, Nicholas Mattei, Martin Michalowski, Reuth Mirsky, Bruno Bouchard, William W. Streilein, Sarah Keren, Kokil Jaidka, Amit Sheth, David R. Martinez, Ilan Shimshoni, Arunesh Sinha, Howie Shrobe, Amelie Gyrard, K. Brent Venable, Atanu R. Sinha, Anna Zamansky, Eitan Farchi, Georgios Theocharous, Roni Khardon, Sébastien Gaboury, Noam Brown, Onn Shehory, Arash Shaban-Nejad, Kevin Bouchard, Biplav Srivastava, Neal Wagner, Parisa Kordjamshidi, Christopher W. Geib, Niyati Chhaya, and Cem Safak Sahin
- Subjects
Engineering ,business.industry ,Plan (drawing) ,Preference handling ,GeneralLiterature_MISCELLANEOUS ,Marketing science ,Knowledge extraction ,Artificial Intelligence ,Affective content analysis ,Smart environment ,Artificial intelligence ,Internet of Things ,business ,Intent recognition - Abstract
The AAAI-18 workshop program included 15 workshops covering a wide range of topics in AI. Workshops were held Sunday and Monday, February 2–7, 2018, at the Hilton New Orleans Riverside in New Orleans, Louisiana, USA. This report contains summaries of the Affective Content Analysis workshop; the Artificial Intelligence Applied to Assistive Technologies and Smart Environments; the AI and Marketing Science workshop; the Artificial Intelligence for Cyber Security workshop; the AI for Imperfect-Information Games; the Declarative Learning Based Programming workshop; the Engineering Dependable and Secure Machine Learning Systems workshop; the Health Intelligence workshop; the Knowledge Extraction from Games workshop; the Plan, Activity, and Intent Recognition workshop; the Planning and Inference workshop; the Preference Handling workshop; the Reasoning and Learning for Human-Machine Dialogues workshop; and the the AI Enhanced Internet of Things Data Processing for Intelligent Applications workshop.
- Published
- 2018
- Full Text
- View/download PDF
47. From Raw Data to Smart Manufacturing: AI and Semantic Web of Things for Industry 4.0
- Author
-
Muhammad Intizar Ali, Pankesh Patel, Amit Sheth, and Publica
- Subjects
Ubiquitous computing ,Industry 4.0 ,Computer Networks and Communications ,business.industry ,Computer science ,020206 networking & telecommunications ,Cloud computing ,02 engineering and technology ,World Wide Web ,Web of Things ,Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Use case ,business ,Raw data ,Semantic Web ,Smart manufacturing - Abstract
AI techniques combined with recent advancements in the Internet of Things, Web of Things, and Semantic Web-jointly referred to as the Semantic Web-promise to play an important role in Industry 4.0. As part of this vision, the authors present a Semantic Web of Things for Industry 4.0 (SWeTI) platform. Through realistic use case scenarios, they showcase how SweTI technologies can address Industry 4.0s challenges, facilitate cross-sector and cross-domain integration of systems, and develop intelligent and smart services for smart manufacturing.
- Published
- 2018
- Full Text
- View/download PDF
48. Next-Generation Smart Environments: From System of Systems to Data Ecosystems
- Author
-
Amit Sheth and Edward Curry
- Subjects
System of systems ,Computer Networks and Communications ,Computer science ,business.industry ,Intelligent decision support system ,Digital transformation ,020206 networking & telecommunications ,02 engineering and technology ,Artificial Intelligence ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Systems engineering ,Smart environment ,Internet of Things ,business ,Reliability (statistics) - Abstract
Digital transformation is driving a new wave of large-scale data-rich smart environments with data on every aspect of our world. The resulting data ecosystems present new challenges and opportunities in the design of intelligent systems and system of systems.
- Published
- 2018
- Full Text
- View/download PDF
49. Processing social media in real-time
- Author
-
Markus Strohmaier, Arkaitz Zubiaga, Damiano Spina, and Amit Sheth
- Subjects
Multimedia ,Computer science ,02 engineering and technology ,Library and Information Sciences ,Management Science and Operations Research ,computer.software_genre ,Computer Science Applications ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,020201 artificial intelligence & image processing ,Social media ,Online data processing ,computer ,Information Systems - Published
- 2019
- Full Text
- View/download PDF
50. 'Etazene, safer than heroin and fentanyl': Non-fentanyl novel synthetic opioid listings on one darknet market
- Author
-
Usha Lokala, Amit Sheth, Robert G. Carlson, Monica J. Barratt, Raminta Daniulaityte, and Francois R. Lamy
- Subjects
Pharmacology ,Synthetic opioid ,Substance-Related Disorders ,Darknet ,Toxicology ,Fentanyl ,Heroin ,Analgesics, Opioid ,03 medical and health sciences ,Psychiatry and Mental health ,0302 clinical medicine ,SAFER ,Environmental health ,medicine ,Humans ,Pharmacology (medical) ,030212 general & internal medicine ,Business ,Drug Overdose ,Substance use ,030217 neurology & neurosurgery ,medicine.drug - Abstract
Background Novel synthetic opioids are fueling the overdose deaths epidemic in North America.Recently, non-fentanyl novel synthetic opioids have emerged in forensic toxicological results. Cryptomarkets have become important platforms of distribution for illicit substances. This article presents the data concerning the availability trends of novel non-fentanyl synthetic opioids listed on one cryptomarket. Methods Listings from the EmpireMarket cryptomarket "Opiates" section were collected between June 2020 and August 2020. Collected data were processed using eDarkTrends Named Entity Recognition algorithm to identify novel synthetic opioids, and to analyze their availability trends in terms of frequency of listings, available average weights, average prices, quantity sold, and geographic indicators of shipment origin and destination information. Results 35,196 opioid-related listings were collected through 12 crawling sessions. 17 nonfentanyl novel synthetic opioids were identified in 2.9 % of the collected listings for an average of 9.2 kg of substance available at each data point. 587 items advertised as non-fentanyl novel synthetic opioids were sold on EmpireMarket for a total weight of between 858 g and 2.7 kg during the study period. 45.5 % of these listings were advertised as shipped from China. Conclusions Fourteen of the 17 non-fentanyl novel synthetic opioids were identified for the first time on one large cryptomarket suggesting a shift in terms of novel non-fentanyl synthetic opioids availability. This increased heterogeneity of available novel synthetic opioids could reduce the efficiency of existing overdose prevention strategies. Identification of new opioids underpins the value of cryptomarket data for early warning systems of emerging substance use trends.
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.