Author: "Jiang, Jyun‐Yu" / Search Limiters: Available in Library Collection - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Jiang, Jyun‐Yu"' showing total 23 results

1. PEFA: Parameter-Free Adapters for Large-scale Embedding-based Retrieval Models

Author: Chang, Wei-Cheng, Jiang, Jyun-Yu, Zhang, Jiong, Al-Darabsah, Mutasem, Teo, Choon Hui, Hsieh, Cho-Jui, Yu, Hsiang-Fu, and Vishwanathan, S. V. N.
Subjects: Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: Embedding-based Retrieval Models (ERMs) have emerged as a promising framework for large-scale text retrieval problems due to powerful large language models. Nevertheless, fine-tuning ERMs to reach state-of-the-art results can be expensive due to the extreme scale of data as well as the complexity of multi-stage pipelines (e.g., pre-training, fine-tuning, distillation). In this work, we propose the PEFA framework, namely ParamEter-Free Adapters, for fast tuning of ERMs without any backward pass in the optimization. At the index-building stage, PEFA equips the ERM with a non-parametric k-nearest neighbor (kNN) component. At the inference stage, PEFA performs a convex combination of two scoring functions, one from the ERM and the other from the kNN. Based on the neighborhood definition, the PEFA framework induces two realizations, namely PEFA-XL (i.e., extra large) using double ANN indices and PEFA-XS (i.e., extra small) using a single ANN index. Empirically, PEFA achieves significant improvement on two retrieval applications. For document retrieval, on the Recall@100 metric, PEFA improves not only pre-trained ERMs on Trivia-QA by an average of 13.2%, but also fine-tuned ERMs on NQ-320K by an average of 5.5%. For product search, PEFA improves the Recall@100 of the fine-tuned ERMs by an average of 5.3% and 14.5%, for PEFA-XS and PEFA-XL, respectively. Our code is available at https://github.com/amzn/pecos/tree/mainline/examples/pefa-wsdm24., Comment: Accepted by WSDM 2024
Published: 2023
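
The inference-time step described in the abstract above (a convex combination of an ERM score and a kNN score) can be sketched as follows. This is a minimal illustration only, not the PECOS implementation: the function name `combine_scores`, the dictionary representation of scores, the weight name `lam`, and the choice to treat a missing score as 0.0 are all assumptions made for this example.

```python
def combine_scores(erm_scores, knn_scores, lam=0.5):
    """Convex combination of two per-document score dictionaries.

    erm_scores / knn_scores map a document id to a score (hypothetical format).
    lam is the weight on the ERM score and must lie in [0, 1].
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must be in [0, 1] for a convex combination")
    doc_ids = set(erm_scores) | set(knn_scores)
    return {
        doc_id: lam * erm_scores.get(doc_id, 0.0)          # assumption: missing score = 0.0
        + (1.0 - lam) * knn_scores.get(doc_id, 0.0)
        for doc_id in doc_ids
    }


def top_k(scores, k):
    """Return the k document ids with the highest combined score."""
    return sorted(scores, key=scores.get, reverse=True)[:k]
```

With `lam=1.0` the ranking reduces to the ERM alone, and with `lam=0.0` to the kNN component alone; intermediate values interpolate between the two.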

2. MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering

Author: Chen, Xiusi, Jiang, Jyun-Yu, Chang, Wei-Cheng, Hsieh, Cho-Jui, Yu, Hsiang-Fu, and Wang, Wei
Subjects: Computer Science - Computation and Language
Abstract: Recent advances in few-shot question answering (QA) mostly rely on the power of pre-trained large language models (LLMs) and fine-tuning in specific settings. Although the pre-training stage has already equipped LLMs with powerful reasoning capabilities, LLMs still need to be fine-tuned to adapt to specific domains to achieve the best results. In this paper, we propose to select the most informative data for fine-tuning, thereby improving the efficiency of the fine-tuning process with comparable or even better accuracy on the open-domain QA task. We present MinPrompt, a minimal data augmentation framework for open-domain QA based on an approximate graph algorithm and unsupervised question generation. We transform the raw text into a graph structure to build connections between different factual sentences, then apply graph algorithms to identify the minimal set of sentences needed to cover the most information in the raw text. We then generate QA pairs based on the identified sentence subset and train the model on the selected sentences to obtain the final model. Empirical results on several benchmark datasets and theoretical analysis show that MinPrompt is able to achieve comparable or better results than baselines with a high degree of efficiency, bringing consistent improvements in F-1 scores., Comment: ACL 2024 main conference
Published: 2023

3. printf: Preference Modeling Based on User Reviews with Item Images and Textual Information via Graph Learning

Author: Lin, Hao-Lun, Jiang, Jyun-Yu, Juan, Ming-Hao, and Cheng, Pu-Jen
Subjects: Computer Science - Information Retrieval
Abstract: Modern recommender systems usually leverage textual and visual contents as auxiliary information to predict user preference. For textual information, review texts are one of the most popular contents to model user behaviors. Nevertheless, reviews usually lose their shine when it comes to top-N recommender systems because those that solely utilize textual reviews as features struggle to adequately capture the interaction relationships between users and items. Visual information is usually modeled with naive convolutional networks, which also struggle to capture high-order relationships between users and items. Moreover, previous works did not collaboratively use both texts and images in a proper way. In this paper, we propose printf, preference modeling based on user reviews with item images and textual information via graph learning, to address the above challenges. Specifically, the dimension-based attention mechanism directs relations between user reviews and interacted items, allowing each dimension to contribute different importance weights to derive user representations. Extensive experiments are conducted on three publicly available datasets. The experimental results demonstrate that our proposed printf consistently outperforms baseline methods with the relative improvements for NDCG@5 of 26.80%, 48.65%, and 25.74% on Amazon-Grocery, Amazon-Tools, and Amazon-Electronics datasets, respectively. The in-depth analysis also indicates that the dimensions of review representations have different topics and aspects, supporting the validity of our model design., Comment: In Proceedings of The 32nd ACM International Conference on Information and Knowledge Management (CIKM '23), ACM, 2023
Published: 2023

4. Gotta: Generative Few-shot Question Answering by Prompt-based Cloze Data Augmentation

Author: Chen, Xiusi, Zhang, Yu, Deng, Jinliang, Jiang, Jyun-Yu, and Wang, Wei
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Few-shot question answering (QA) aims at precisely discovering answers to a set of questions from context passages while only a few training samples are available. Although existing studies have made some progress and can usually achieve proper results, they suffer from understanding deep semantics for reasoning out the questions. In this paper, we develop Gotta, a Generative prOmpT-based daTa Augmentation framework to mitigate the challenge above. Inspired by the human reasoning process, we propose to integrate the cloze task to enhance few-shot QA learning. Following the recent success of prompt-tuning, we present the cloze task in the same format as the main QA task, allowing the model to learn both tasks seamlessly together to fully take advantage of the power of prompt-tuning. Extensive experiments on widely used benchmarks demonstrate that Gotta consistently outperforms competitive baselines, validating the effectiveness of our proposed prompt-tuning-based cloze task, which not only fine-tunes language models but also learns to guide reasoning in QA tasks. Further analysis shows that the prompt-based loss incorporates the auxiliary task better than the multi-task loss, highlighting the strength of prompt-tuning on the few-shot QA task.
Published: 2023

5. PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation

Author: Chien, Eli, Zhang, Jiong, Hsieh, Cho-Jui, Jiang, Jyun-Yu, Chang, Wei-Cheng, Milenkovic, Olgica, and Yu, Hsiang-Fu
Subjects: Computer Science - Machine Learning, Computer Science - Information Retrieval
Abstract: The eXtreme Multi-label Classification (XMC) problem seeks to find relevant labels from an exceptionally large label space. Most of the existing XMC learners focus on the extraction of semantic features from input query text. However, conventional XMC studies usually neglect the side information of instances and labels, which can be of use in many real-world applications such as recommendation systems and e-commerce product search. We propose Predicted Instance Neighborhood Aggregation (PINA), a data enhancement method for the general XMC problem that leverages beneficial side information. Unlike most existing XMC frameworks that treat labels and input instances as featureless indicators and independent entries, PINA extracts information from the label metadata and the correlations among training instances. Extensive experimental results demonstrate the consistent gain of PINA on various XMC tasks compared to the state-of-the-art methods: PINA offers a gain in accuracy compared to standard XR-Transformers on five public benchmark datasets. Moreover, PINA achieves a ~5% gain in accuracy on the largest dataset LF-AmazonTitles-1.3M. Our implementation is publicly available., Comment: ICML 2023
Published: 2023

6. InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks

Author: Kim, Seungbae, Jiang, Jyun-Yu, Han, Jinyoung, and Wang, Wei
Subjects: Computer Science - Social and Information Networks, Computer Science - Artificial Intelligence
Abstract: As influencers play considerable roles in social media marketing, companies increase the budget for influencer marketing. Hiring effective influencers is crucial in social influencer marketing, but it is challenging to find the right influencers among hundreds of millions of social media users. In this paper, we propose InfluencerRank that ranks influencers by their effectiveness based on their posting behaviors and social relations over time. To represent the posting behaviors and social relations, the graph convolutional neural networks are applied to model influencers with heterogeneous networks during different historical periods. By learning the network structure with the embedded node features, InfluencerRank can derive informative representations for influencers at each period. An attentive recurrent neural network finally distinguishes highly effective influencers from other influencers by capturing the knowledge of the dynamics of influencer representations over time. Extensive experiments have been conducted on an Instagram dataset that consists of 18,397 influencers with their 2,952,075 posts published within 12 months. The experimental results demonstrate that InfluencerRank outperforms existing baseline methods. An in-depth analysis further reveals that all of our proposed features and model components are beneficial to discover effective influencers., Comment: ICWSM 2023
Published: 2023

7. Uncertainty in Extreme Multi-label Classification

Author: Jiang, Jyun-Yu, Chang, Wei-Cheng, Zhong, Jiong, Hsieh, Cho-Jui, and Yu, Hsiang-Fu
Subjects: Computer Science - Machine Learning
Abstract: Uncertainty quantification is one of the most crucial tasks to obtain trustworthy and reliable machine learning models for decision making. However, most research in this domain has only focused on problems with small label spaces and ignored eXtreme Multi-label Classification (XMC), which is an essential task in the era of big data for web-scale machine learning applications. Moreover, enormous label spaces could also lead to noisy retrieval results and intractable computational challenges for uncertainty quantification. In this paper, we aim to investigate general uncertainty quantification approaches for tree-based XMC models with a probabilistic ensemble-based framework. In particular, we analyze label-level and instance-level uncertainty in XMC, and propose a general approximation framework based on beam search to efficiently estimate the uncertainty with a theoretical guarantee under long-tail XMC predictions. Empirical studies on six large-scale real-world datasets show that our framework not only outperforms single models in predictive performance, but also can serve as strong uncertainty-based baselines for label misclassification and out-of-distribution detection, with significant speedup. Besides, our framework can further yield better state-of-the-art results based on deep XMC models with uncertainty quantification., Comment: 14 pages, 1 figure, 8 tables
Published: 2022

8. MIND-S is a deep-learning prediction model for elucidating protein post-translational modifications in human diseases

Author: Yan, Yu, Jiang, Jyun-Yu, Fu, Mingzhou, Wang, Ding, Pelletier, Alexander R, Sigdel, Dibakar, Ng, Dominic CM, Wang, Wei, and Ping, Peipei
Subjects: Biological Sciences, Bioinformatics and Computational Biology, Networking and Information Technology R&D (NITRD), Machine Learning and Artificial Intelligence, Bioengineering, 1.1 Normal biological development and functioning, Generic health relevance, Humans, Deep Learning, Proteins, Protein Processing, Post-Translational, Neural Networks, Computer, Amino Acids, AI, GWAS, cardiac proteome, graph neural network, interpretability, machine learning, multi-label, protein structure
Abstract: We present a deep-learning-based platform, MIND-S, for protein post-translational modification (PTM) predictions. MIND-S employs a multi-head attention and graph neural network and assembles a 15-fold ensemble model in a multi-label strategy to enable simultaneous prediction of multiple PTMs with high performance and computation efficiency. MIND-S also features an interpretation module, which provides the relevance of each amino acid for making the predictions and is validated with known motifs. The interpretation module also captures PTM patterns without any supervision. Furthermore, MIND-S enables examination of mutation effects on PTMs. We document a workflow, its applications to 26 types of PTMs of two datasets consisting of ∼50,000 proteins, and an example of MIND-S identifying a PTM-interrupting SNP with validation from biological data. We also include use case analyses of targeted proteins. Taken together, we have demonstrated that MIND-S is accurate, interpretable, and efficient to elucidate PTM-relevant biological processes in health and diseases.
Published: 2023

9. #StayHome or #Marathon? Social Media Enhanced Pandemic Surveillance on Spatial-temporal Dynamic Graphs

Author: Zhou, Yichao, Jiang, Jyun-Yu, Chen, Xiusi, and Wang, Wei
Subjects: Computer Science - Social and Information Networks, Computer Science - Computation and Language
Abstract: COVID-19 has caused lasting damage to almost every domain in public health, society, and economy. To monitor the pandemic trend, existing studies rely on the aggregation of traditional statistical models and epidemic spread theory. In other words, historical statistics of COVID-19, as well as the population mobility data, become the essential knowledge for monitoring the pandemic trend. However, these solutions can barely provide precise prediction and satisfactory explanations on the long-term disease surveillance while the ubiquitous social media resources can be the key enabler for solving this problem. For example, serious discussions may occur on social media before and after some breaking events take place. These events, such as marathon and parade, may impact the spread of the virus. To take advantage of the social media data, we propose a novel framework, Social Media enhAnced pandemic suRveillance Technique (SMART), which is composed of two modules: (i) information extraction module to construct heterogeneous knowledge graphs based on the extracted events and relationships among them; (ii) time series prediction module to provide both short-term and long-term forecasts of the confirmed cases and fatality at the state-level in the United States and to discover risk factors for COVID-19 interventions. Extensive experiments show that our method largely outperforms the state-of-the-art baselines by 7.3% and 7.4% in confirmed case/fatality prediction, respectively., Comment: 7 figures, 6 tables
Published: 2021

10. Learning to Represent Human Motives for Goal-directed Web Browsing

Author: Jiang, Jyun-Yu, Lee, Chia-Jung, Yang, Longqi, Sarrafzadeh, Bahareh, Hecht, Brent, and Teevan, Jaime
Subjects: Computer Science - Information Retrieval
Abstract: Motives or goals are recognized in psychology literature as the most fundamental drive that explains and predicts why people do what they do, including when they browse the web. Although providing enormous value, these higher-ordered goals are often unobserved, and little is known about how to leverage such goals to assist people's browsing activities. This paper proposes to take a new approach to address this problem, which is fulfilled through a novel neural framework, Goal-directed Web Browsing (GoWeB). We adopt a psychologically-sound taxonomy of higher-ordered goals and learn to build their representations in a structure-preserving manner. Then we incorporate the resulting representations for enhancing the experiences of common activities people perform on the web. Experiments on large-scale data from Microsoft Edge web browser show that GoWeB significantly outperforms competitive baselines for in-session web page recommendation, re-visitation classification, and goal-based web page grouping. A follow-up analysis further characterizes how the variety of human motives can affect the difference observed in human behavioral patterns., Comment: Accepted by RecSys 2021
Published: 2021

11. Drug-Target Interaction Prediction with Graph Attention networks

Author: Wang, Haiyang, Zhou, Guangyu, Liu, Siqi, Jiang, Jyun-Yu, and Wang, Wei
Subjects: Quantitative Biology - Quantitative Methods, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Motivation: Predicting Drug-Target Interaction (DTI) is a well-studied topic in bioinformatics due to its relevance in the fields of proteomics and pharmaceutical research. Although many machine learning methods have been successfully applied in this task, few of them aim at leveraging the inherent heterogeneous graph structure in the DTI network to address the challenge. For better learning and interpreting the DTI topological structure and the similarity, it is desirable to have methods specifically for predicting interactions from the graph structure. Results: We present an end-to-end framework, DTI-GAT (Drug-Target Interaction prediction with Graph Attention networks) for DTI predictions. DTI-GAT incorporates a deep neural network architecture that operates on graph-structured data with the attention mechanism, which leverages both the interaction patterns and the features of drug and protein sequences. DTI-GAT facilitates the interpretation of the DTI topological structure by assigning different attention weights to each node with the self-attention mechanism. Experimental evaluations show that DTI-GAT outperforms various state-of-the-art systems on the binary DTI prediction problem. Moreover, the independent study results further demonstrate that our model can be generalized better than other conventional methods. Availability: The source code and all datasets are available at https://github.com/Haiyang-W/DTI-GRAPH
Published: 2021

12. COVID-19 Surveiller: toward a robust and effective pandemic surveillance system based on social media mining

Author: Jiang, Jyun-Yu, Zhou, Yichao, Chen, Xiusi, Jhou, Yan-Ru, Zhao, Liqi, Liu, Sabrina, Yang, Po-Chun, Ahmar, Jule, and Wang, Wei
Subjects: Data Management and Data Science, Information and Computing Sciences, Bioengineering, Prevention, Networking and Information Technology R&D (NITRD), 2.4 Surveillance and distribution, Aetiology, Good Health and Well Being, COVID-19, Data Mining, Humans, Pandemics, SARS-CoV-2, Social Media, pandemic surveillance, social media mining, knowledge graph, natural language processing, General Science & Technology
Abstract: The outbreak of the novel coronavirus, COVID-19, has become one of the most severe pandemics in human history. In this paper, we propose to leverage social media users as social sensors to simultaneously predict the pandemic trends and suggest potential risk factors for public health experts to understand spread situations and recommend proper interventions. More precisely, we develop novel deep learning models to recognize important entities and their relations over time, thereby establishing dynamic heterogeneous graphs to describe the observations of social media users. A dynamic graph neural network model can then forecast the trends (e.g. newly diagnosed cases and death rates) and identify high-risk events from social media. Based on the proposed computational method, we also develop a web-based system for domain experts without any computer science background to easily interact with. We conduct extensive experiments on large-scale datasets of COVID-19 related tweets provided by Twitter, which show that our method can precisely predict the new cases and death rates. We also demonstrate the robustness of our web-based pandemic surveillance system and its ability to retrieve essential knowledge and derive accurate predictions across a variety of circumstances. Our system is also available at http://scaiweb.cs.ucla.edu/covidsurveiller/. This article is part of the theme issue 'Data science approaches to infectious disease surveillance'.
Published: 2022

13. TahcoRoll: fast genomic signature profiling via thinned automaton and rolling hash

Author: Ju, Chelsea J-T, Jiang, Jyun-Yu, Li, Ruirui, Li, Zeyu, and Wang, Wei
Subjects: Information and Computing Sciences, Biological Sciences, Bioinformatics and Computational Biology, Genetics, Human Genome, Biotechnology, Aho–Corasick algorithm, genome sequencing, k-mers, multiple pattern matching, rolling hash
Abstract: Objectives: Genomic signatures like k-mers have become one of the most prominent approaches to describe genomic data. As a result, myriad real-world applications, such as the construction of de Bruijn graphs in genome assembly, have benefited from recognizing genomic signatures. In other words, an efficient approach to genomic signature profiling is essential for tackling high-throughput sequencing reads. However, most of the existing approaches only recognize fixed-size k-mers while many research studies have shown the importance of considering variable-length k-mers. Methods: In this paper, we present a novel genomic signature profiling approach, TahcoRoll, by extending the Aho-Corasick algorithm (AC) for the task of profiling variable-length k-mers. We first group nucleotides into two clusters and represent each cluster with a bit. The rolling hash technique is further utilized to encode signatures and read patterns for efficient matching. Results: In extensive experiments, TahcoRoll significantly outperforms state-of-the-art k-mer counters and has the capability of processing reads across different sequencing platforms on a budget desktop computer. Conclusions: The single-thread version of TahcoRoll is as efficient as the eight-thread version of the state-of-the-art, JellyFish, while the eight-thread TahcoRoll outperforms the eight-thread JellyFish by at least four times.
Published: 2021
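
The two ideas named in the abstract above (mapping nucleotides to one bit each, then using a rolling hash over a window) can be illustrated with a short sketch. It is not TahcoRoll's actual encoding: the grouping of A/G versus C/T, the function name `rolling_bit_hashes`, and the use of the raw k-bit integer as the hash are assumptions chosen for this example.

```python
# Assumption for illustration: purines (A, G) map to 0, pyrimidines (C, T) to 1.
NUCLEOTIDE_BIT = {"A": 0, "G": 0, "C": 1, "T": 1}


def rolling_bit_hashes(sequence, k):
    """Return (start, hash) for every length-k window of the sequence.

    Each window's hash is the k-bit integer formed by its nucleotide bits.
    The hash is updated in O(1) per position: shift in the new bit and
    mask off the bit that leaves the window.
    """
    if k <= 0 or len(sequence) < k:
        return []
    mask = (1 << k) - 1
    current = 0
    hashes = []
    for index, nucleotide in enumerate(sequence):
        current = ((current << 1) | NUCLEOTIDE_BIT[nucleotide]) & mask
        if index >= k - 1:
            hashes.append((index - k + 1, current))
    return hashes
```

Because two nucleotides share each bit, different k-mers can produce the same hash in this sketch (for example, "AC" and "GT"), so a match on the hash would still need to be checked against the actual sequence.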

14. Long Document Ranking with Query-Directed Sparse Transformer

Author: Jiang, Jyun-Yu, Xiong, Chenyan, Lee, Chia-Jung, and Wang, Wei
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: The computing cost of transformer self-attention often necessitates breaking long documents to fit in pretrained models in document ranking tasks. In this paper, we design Query-Directed Sparse attention that induces IR-axiomatic structures in transformer self-attention. Our model, QDS-Transformer, enforces the principal properties desired in ranking: local contextualization, hierarchical representation, and query-oriented proximity matching, while it also enjoys efficiency from sparsity. Experiments on one fully supervised and three few-shot TREC document ranking benchmarks demonstrate the consistent and robust advantage of QDS-Transformer over previous approaches, as they either retrofit long documents into BERT or use sparse attention without emphasizing IR principles. We further quantify the computing complexity and demonstrate that our sparse attention with TVM implementation is twice as efficient as the fully-connected self-attention. All source code, trained models, and predictions of this work are available at https://github.com/hallogameboy/QDS-Transformer., Comment: Accepted by EMNLP 2020, 12 pages, 5 figures
Published: 2020

15. 'The Boating Store Had Its Best Sail Ever': Pronunciation-attentive Contextualized Pun Recognition

Author: Zhou, Yichao, Jiang, Jyun-Yu, Zhao, Jieyu, Chang, Kai-Wei, and Wang, Wei
Subjects: Computer Science - Computation and Language
Abstract: Humor plays an important role in human languages and it is essential to model humor when building intelligent systems. Among different forms of humor, puns perform wordplay for humorous effects by employing words with double entendre and high phonetic similarity. However, identifying and modeling puns are challenging as puns usually involve implicit semantic or phonological tricks. In this paper, we propose Pronunciation-attentive Contextualized Pun Recognition (PCPR) to perceive human humor, detect if a sentence contains puns and locate them in the sentence. PCPR derives contextualized representation for each word in a sentence by capturing the association between the surrounding context and its corresponding phonetic symbols. Extensive experiments are conducted on two benchmark datasets. Results demonstrate that the proposed approach significantly outperforms the state-of-the-art methods in pun detection and location tasks. In-depth analyses verify the effectiveness and robustness of PCPR., Comment: 10 pages, 4 figures, 7 tables, accepted by ACL 2020
Published: 2020

16. JEDI: circular RNA prediction based on junction encoders and deep interaction among splice sites

Author: Jiang, Jyun-Yu, Ju, Chelsea J-T, Hao, Junheng, Chen, Muhao, and Wang, Wei
Subjects: Biological Sciences, Bioinformatics and Computational Biology, Genetics, Human Genome, Machine Learning and Artificial Intelligence, Networking and Information Technology R&D (NITRD), Generic health relevance, Neural Networks, Computer, RNA, RNA Splice Sites, RNA Splicing, RNA, Circular, RNA, Long Noncoding, Mathematical Sciences, Information and Computing Sciences, Bioinformatics, Biological sciences, Information and computing sciences, Mathematical sciences
Abstract: Motivation: Circular RNA (circRNA) is a novel class of long non-coding RNAs that have been broadly discovered in the eukaryotic transcriptome. The circular structure arises from a non-canonical splicing process, where the donor site is backspliced to an upstream acceptor site. These circRNA sequences are conserved across species. More importantly, rising evidence suggests their vital roles in gene regulation and association with diseases. As the fundamental effort toward elucidating their functions and mechanisms, several computational methods have been proposed to predict the circular structure from the primary sequence. Recently, advanced computational methods leverage deep learning to capture the relevant patterns from RNA sequences and model their interactions to facilitate the prediction. However, these methods fail to fully explore positional information of splice junctions and their deep interaction. Results: We present a robust end-to-end framework, Junction Encoder with Deep Interaction (JEDI), for circRNA prediction using only nucleotide sequences. JEDI first leverages the attention mechanism to encode each junction site based on deep bidirectional recurrent neural networks and then presents the novel cross-attention layer to model deep interaction among these sites for backsplicing. Finally, JEDI can not only predict circRNAs but also interpret relationships among splice sites to discover backsplicing hotspots within a gene region. Experiments demonstrate JEDI significantly outperforms state-of-the-art approaches in circRNA prediction on both isoform level and gene level. Moreover, JEDI also shows promising results on zero-shot backsplicing discovery, which none of the existing approaches can achieve. Availability and implementation: The implementation of our framework is available at https://github.com/hallogameboy/JEDI. Supplementary information: Supplementary data are available at Bioinformatics online.
Published: 2021

17. Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification

Author: Zhou, Yichao, Jiang, Jyun-Yu, Chang, Kai-Wei, and Wang, Wei
Subjects: Computer Science - Computation and Language
Abstract: Adversarial attacks against machine learning models have threatened various real-world applications such as spam filtering and sentiment analysis. In this paper, we propose a novel framework, learning to DIScriminate Perturbations (DISP), to identify and adjust malicious perturbations, thereby blocking adversarial attacks for text classification models. To identify adversarial attacks, a perturbation discriminator validates how likely a token in the text is perturbed and provides a set of potential perturbations. For each potential perturbation, an embedding estimator learns to restore the embedding of the original word based on the context and a replacement token is chosen based on approximate kNN search. DISP can block adversarial attacks for any NLP model without modifying the model structure or training procedure. Extensive experiments on two benchmark datasets demonstrate that DISP significantly outperforms baseline methods in blocking adversarial attacks for text classification. In addition, in-depth analysis shows the robustness of DISP across different situations., Comment: 10 pages, 8 tables, 4 figures
Published: 2019

18. Mutation effect estimation on protein–protein interactions using deep contextualized representation learning

Author: Zhou, Guangyu, Chen, Muhao, Ju, Chelsea JT, Wang, Zheng, Jiang, Jyun-Yu, and Wang, Wei
Subjects: Biological Sciences, Bioinformatics and Computational Biology, Machine Learning and Artificial Intelligence, Networking and Information Technology R&D (NITRD), Bioinformatics and computational biology, Genetics
Abstract: The functional impact of protein mutations is reflected in the alteration of conformation and thermodynamics of protein-protein interactions (PPIs). Quantifying the changes of two interacting proteins upon mutations is commonly carried out by computational approaches. Hence, extensive research efforts have been put into the extraction of energetic or structural features on proteins, followed by statistical learning methods to estimate the effects of mutations on PPI properties. Nonetheless, such features require extensive human labor and expert knowledge to obtain, and have limited abilities to reflect point mutations. We present an end-to-end deep learning framework, MuPIPR (Mutation Effects in Protein-protein Interaction PRediction Using Contextualized Representations), to estimate the effects of mutations on PPIs. MuPIPR incorporates a contextualized representation mechanism of amino acids to propagate the effects of a point mutation to surrounding amino acid representations, therefore amplifying the subtle change in a long protein sequence. On top of that, MuPIPR leverages a Siamese residual recurrent convolutional neural encoder to encode a wild-type protein pair and its mutation pair. Multi-layer perceptron regressors are applied to the protein pair representations to predict the quantifiable changes of PPI properties upon mutations. Experimental evaluations show that, with only sequence information, MuPIPR outperforms various state-of-the-art systems on estimating the changes of binding affinity for SKEMPI v1, and offers comparable performance on SKEMPI v2. Meanwhile, MuPIPR also demonstrates state-of-the-art performance on estimating the changes of buried surface areas. The software implementation is available at https://github.com/guangyu-zhou/MuPIPR.
Published: 2020

19. Prediction of microbial communities for urban metagenomics using neural network approach

Author: Zhou, Guangyu, Jiang, Jyun-Yu, Ju, Chelsea J-T, and Wang, Wei
Subjects: Multi-label classification, Neural network, Urban metagenomics, Algorithms, Boston, Cities, Databases, Genetic, Metagenomics, Microbiota, Models, Genetic, Neural Networks, Computer, New York, Reproducibility of Results, Genetics & Heredity
Abstract: BACKGROUND: Microbes are greatly associated with human health and disease, especially in densely populated cities. It is essential to understand the microbial ecosystem in an urban environment for cities to monitor the transmission of infectious diseases and detect potentially urgent threats. To achieve this goal, the DNA sample collection and analysis have been conducted at subway stations in major cities. However, city-scale sampling with the fine-grained geo-spatial resolution is expensive and laborious. In this paper, we introduce MetaMLAnn, a neural network based approach to infer microbial communities at unsampled locations given information reflecting different factors, including subway line networks, sampling material types, and microbial composition patterns. RESULTS: We evaluate the effectiveness of MetaMLAnn based on the public metagenomics dataset collected from multiple locations in the New York and Boston subway systems. The experimental results suggest that MetaMLAnn consistently performs better than five other conventional classifiers under different taxonomic ranks. At genus level, MetaMLAnn can achieve F1 scores of 0.63 and 0.72 on the New York and the Boston datasets, respectively. CONCLUSIONS: By exploiting heterogeneous features, MetaMLAnn captures the hidden interactions between microbial compositions and the urban environment, which enables precise predictions of microbial communities at unmeasured locations.
Published: 2019

20. Multi-scale Human Behavior Modeling with Heterogeneous Data

Author: Jiang, Jyun-Yu
Subjects: Computer science, Data Mining, Deep Learning, Human Behavior Modeling, Information Retrieval, Machine Learning, Natural Language Processing
Abstract: In this era of big data, massive data are generated from heterogeneous resources every day, which provides an unprecedented opportunity for deepening our understanding of complex human behaviors. Modeling human behaviors requires robust computational methods that can not only capture semantics and useful insights from sparse and heterogeneous data, but also unravel sophisticated human behaviors at different scales. In addition, the enormous data velocity and the unparalleled scale of deep models also pose significant challenges to efficiency. In this dissertation, we demonstrate a collection of research results that systematically improve the ecosystem of human behavior modeling based on representation learning. For heterogeneous data in various settings, we present practical representation learning methods to effectively and efficiently capture their semantics. Moreover, these representation learning methods can actually fill a niche to comfortably model different behaviors with atomic, compositional, and explainable operations, thereby modeling human behaviors at different scales. As a result, our proposed approaches not only address various real-world challenges in diverse domains, but also present the potentials to adopt valuable domain knowledge into machine learning.
Published: 2021

21. COVID-19 Surveiller: toward a robust and effective pandemic surveillance system based on social media mining

Author: Jiang, Jyun-Yu, Zhou, Yichao, Chen, Xiusi, Jhou, Yan-Ru, Zhao, Liqi, Liu, Sabrina, Yang, Po-Chun, Ahmar, Jule, and Wang, Wei
Abstract: The outbreak of the novel coronavirus, COVID-19, has become one of the most severe pandemics in human history. In this paper, we propose to leverage social media users as social sensors to simultaneously predict the pandemic trends and suggest potential risk factors for public health experts to understand spread situations and recommend proper interventions. More precisely, we develop novel deep learning models to recognize important entities and their relations over time, thereby establishing dynamic heterogeneous graphs to describe the observations of social media users. A dynamic graph neural network model can then forecast the trends (e.g. newly diagnosed cases and death rates) and identify high-risk events from social media. Based on the proposed computational method, we also develop a web-based system for domain experts without any computer science background to easily interact with. We conduct extensive experiments on large-scale datasets of COVID-19 related tweets provided by Twitter, which show that our method can precisely predict the new cases and death rates. We also demonstrate the robustness of our web-based pandemic surveillance system and its ability to retrieve essential knowledge and derive accurate predictions across a variety of circumstances. Our system is also available at http://scaiweb.cs.ucla.edu/covidsurveiller/. This article is part of the theme issue 'Data science approaches to infectious disease surveillance'.
Published: 2022

22. Supplementary Materials: 'COVID-19 Surveiller: Toward a Robust and Effective Pandemic Surveillance System based on Social Media Mining'

Author: Jiang, Jyun-Yu, Zhou, Yichao, Chen, Xiusi, Jhou, Yan-Ru, Zhao, Liqi, Liu, Sabrina, Yang, Po-Chun, Ahmar, Jule, and Wang, Wei
Abstract: The outbreak of the novel coronavirus, COVID-19, has become one of the most severe pandemics in human history. In this paper, we propose to leverage social media users as social sensors to simultaneously predict the pandemic trends and suggest potential risk factors for public health experts to understand spread situations and recommend proper interventions. More precisely, we develop novel deep learning models to recognize important entities and their relations over time, thereby establishing dynamic heterogeneous graphs to describe the observations of social media users. A dynamic graph neural network model can then forecast the trends (e.g. newly diagnosed cases and death rates) and identify high-risk events from social media. Based on the proposed computational method, we also develop a web-based system for domain experts without any computer science background to easily interact with. We conduct extensive experiments on large-scale datasets of COVID-19 related tweets provided by Twitter, which show that our method can precisely predict the new cases and death rates. We also demonstrate the robustness of our web-based pandemic surveillance system and its ability to retrieve essential knowledge and derive accurate predictions across a variety of circumstances. Our system is also available at http://scaiweb.cs.ucla.edu/covidsurveiller/. This article is part of the theme issue ‘Data science approaches to infectious disease surveillance’.
Published: 2021
Full Text: View/download PDF

23. Personalized Question Routing via Heterogeneous Network Embedding

Author: Li, Zeyu, Jiang, Jyun-Yu, Sun, Yizhou, and Wang, Wei
Published: 2019
Full Text: View/download PDF
