9 results on '"Xuchao Zhang"'
Search Results
2. Interpreting Convolutional Sequence Model by Learning Local Prototypes with Adaptation Regularization
- Author
-
Xuchao Zhang, Dongjin Song, Bo Zong, Haifeng Chen, Jingchao Ni, Zhengzhang Chen, Wei Cheng, and Yanchi Liu
- Subjects
End user ,business.industry ,Computer science ,Deep learning ,Machine learning ,computer.software_genre ,Regularization (mathematics) ,Convolutional neural network ,Variety (cybernetics) ,Problem domain ,Artificial intelligence ,business ,Adaptation (computer science) ,computer ,Interpretability - Abstract
In many high-stakes applications of machine learning models, outputting only predictions or providing statistical confidence is usually insufficient to gain trust from end users, who often prefer a transparent reasoning paradigm. Despite the recent encouraging developments on deep networks for sequential data modeling, due to the highly recursive functions, the underlying rationales of their predictions are difficult to explain. Thus, in this paper, we aim to develop a sequence modeling approach that explains its own predictions by breaking input sequences down into evidencing segments (i.e., sub-sequences) in its reasoning. To this end, we build our model upon convolutional neural networks, which, in their vanilla forms, associates local receptive fields with outputs in an obscure manner. To unveil it, we resort to case-based reasoning, and design prototype modules whose units (i.e., prototypes) resemble exemplar segments in the problem domain. Each prediction is obtained by combining the comparisons between the prototypes and the segments of an input. To enhance interpretability, we propose a training objective that delicately adapts the distribution of prototypes to the data distribution in latent spaces, and design an algorithm to map prototypes to human-understandable segments. Through extensive experiments in a variety of domains, we demonstrate that our model can achieve high interpretability generally, together with a competitive accuracy to the state-of-the-art approaches.
- Published
- 2021
3. Temporal Context-Aware Representation Learning for Question Routing
- Author
-
Haifeng Chen, Chen Yuncong, Xuchao Zhang, Jian-Wu Xu, Ding Li, Bo Zong, and Wei Cheng
- Subjects
Computer science ,business.industry ,Dynamics (music) ,Temporal context ,Artificial intelligence ,Routing (electronic design automation) ,business ,Machine learning ,computer.software_genre ,Baseline (configuration management) ,Feature learning ,Temporal information ,computer - Abstract
Question routing (QR) aims at recommending newly posted questions to the potential answerers who are most likely to answer the questions. The existing approaches that learn users' expertise from their past question-answering activities usually suffer from challenges in two aspects: 1) multi-faceted expertise and 2) temporal dynamics in the answering behavior. This paper proposes a novel temporal context-aware model in multiple granularities of temporal dynamics that concurrently address the above challenges. Specifically, the temporal context-aware attention characterizes the answerer's multi-faceted expertise in terms of the questions' semantic and temporal information simultaneously. Moreover, the design of the multi-shift and multi-resolution module enables our model to handle temporal impact on different time granularities. Extensive experiments on six datasets from different domains demonstrate that the proposed model significantly outperforms competitive baseline models.
- Published
- 2020
4. Deep Prototypical Networks for Imbalanced Time Series Classification under Data Scarcity
- Author
-
Xuchao Zhang, Suwen Lin, Xian Wu, Nitesh V. Chawla, and Chao Huang
- Subjects
Time series classification ,business.industry ,Computer science ,media_common.quotation_subject ,02 engineering and technology ,Machine learning ,computer.software_genre ,Masking (Electronic Health Record) ,Temporal database ,Activity recognition ,Scarcity ,020204 information systems ,Metric (mathematics) ,0202 electrical engineering, electronic engineering, information engineering ,Embedding ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,media_common - Abstract
With the increase of temporal data availability, time series classification has drawn a lot of attention in the literature because of its wide spectrum of applications in diverse domains (e.g., healthcare, bioinformatics and finance), ranging from human activity recognition to financial pattern identification. While significant progress has been made to solve time series classification problem, the success of such methods relies on data sufficiency, and may not well capture the quality embeddings when training triple instances are scarce and highly imbalance across classes. To address these challenges, we propose a prototype embedding framework-Deep Prototypical Networks (DPN), which leverages a main embedding space to capture the discrepancies of difference time series classes for alleviating data scarcity. In addition, we further augment DPN framework with a relationship-dependent masking module to automatically fuse relevant information with a distance metric learning process, which addresses the data imbalance issue and performs robust time series classification. Experimental results show significant and consistent improvements compared to state-of-the-art techniques.
- Published
- 2019
5. Similarity-Aware Network Embedding with Self-Paced Learning
- Author
-
Xian Wu, Baoxu Shi, Nitesh V. Chawla, Xuchao Zhang, and Chao Huang
- Subjects
Theoretical computer science ,Degree (graph theory) ,Similarity (network science) ,Computer science ,Node (networking) ,Network embedding ,Embedding ,Cluster analysis ,Self paced - Abstract
Network embedding, which aims to learn low-dimensional vector representations for nodes in a network, has shown promising performance for many real-world applications, such as node classification and clustering. While various embedding methods have been developed for network data, they are limited in their assumption that nodes are correlated with their neighboring nodes with the same similarity degree. As such, these methods can be suboptimal for embedding network data. In this paper, we propose a new method named SANE, short for Similarity-Aware Network Embedding, to learn node representations by explicitly considering different similarity degrees between connected nodes in a network. In particular, we develop a new framework based on self-paced learning by accounting for both the explicit relations (i.e., observed links) and implicit relations (i.e., unobserved node similarities) in network representation learning. To justify our proposed model, we perform experiments on two real-world network data. Experiments results show that SNAE outperforms state-of-the-art embedding models on the tasks of node classification and node clustering.
- Published
- 2019
6. Feature driven learning framework for cybersecurity event detection
- Author
-
Chang-Tien Lu, Kaiqun Fu, Taoran Ji, Xuchao Zhang, Nathan Self, and Naren Ramakrishnan
- Subjects
Event (computing) ,Computer science ,Feature vector ,Supervised learning ,Context (language use) ,02 engineering and technology ,Computer security ,computer.software_genre ,Feature (computer vision) ,020204 information systems ,Encoding (memory) ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,020201 artificial intelligence & image processing ,Social media ,computer - Abstract
Cybersecurity event detection is a crucial problem for mitigating effects on various aspects of society. Social media has become a notable source of indicators for detection of diverse events. Though previous social media based strategies for cyber-security event detection focus on mining certain event-related words, the dynamic and evolving nature of online discourse limits the performance of these approaches. Further, because these are typically unsupervised or weakly supervised learning strategies, they do not perform well in an environment of biased samples, noisy context, and informal language which is routine for online, user-generated content. This paper takes a supervised learning approach by proposing a novel multi-task learning based model. Our model can handle diverse structures in feature space by learning models for different types of potential high-profile targets simultaneously. For parameter optimization, we develop an efficient algorithm based on the alternating direction method of multipliers. Through extensive experiments on a real world Twitter dataset, we demonstrate that our approach consistently outperforms existing methods at encoding and identifying cyber-security incidents.
- Published
- 2019
7. Online Purchase Prediction via Multi-Scale Modeling of Behavior Dynamics
- Author
-
Chao Huang, Xuchao Zhang, Dawei Yin, Nitesh V. Chawla, Xian Wu, Chuxu Zhang, and Jiashu Zhao
- Subjects
Network architecture ,Artificial neural network ,business.industry ,Computer science ,02 engineering and technology ,Recommender system ,Machine learning ,computer.software_genre ,Recurrent neural network ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Graph (abstract data type) ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Categorical variable - Abstract
Online purchase forecasting is of great importance in e-commerce platforms, which is the basis of how to present personalized interesting product lists to individual customers. However, predicting online purchases is not trivial as it is influenced by many factors including: (i) the complex temporal pattern with hierarchical inter-correlations; (ii) arbitrary category dependencies. To address these factors, we develop a Graph Multi-Scale Pyramid Networks (GMP) framework to fully exploit users' latent behavioral patterns with both multi-scale temporal dynamics and arbitrary inter-dependencies among product categories. In GMP, we first design a multi-scale pyramid modulation network architecture which seamlessly preserves the underlying hierarchical temporal factors--governing users' purchase behaviors. Then, we employ convolution recurrent neural network to encode the categorical temporal pattern at each scale. After that, we develop a resolution-wise recalibration gating mechanism to automatically re-weight the importance of each scale-view representations. Finally, a context-graph neural network module is proposed to adaptively uncover complex dependencies among category-specific purchases. Extensive experiments on real-world e-commerce datasets demonstrate the superior performance of our method over state-of-the-art baselines across various settings.
- Published
- 2019
8. Spatiotemporal Event Forecasting from Incomplete Hyper-local Price Data
- Author
-
Chang-Tien Lu, Naren Ramakrishnan, Xuchao Zhang, Arnold P. Boedihardjo, and Liang Zhao
- Subjects
010104 statistics & probability ,Event forecasting ,Computer science ,0202 electrical engineering, electronic engineering, information engineering ,Domain knowledge ,020201 artificial intelligence & image processing ,02 engineering and technology ,Data mining ,0101 mathematics ,Missing data ,computer.software_genre ,01 natural sciences ,computer - Abstract
Hyper-local pricing data, e.g., about foods and commodities, exhibit subtle spatiotemporal variations that can be useful as crucial precursors of future events. Three major challenges in modeling such pricing data include: i) temporal dependencies underlying features; ii) spatiotemporal missing values; and iii) constraints underlying economic phenomena. These challenges hinder traditional event forecasting models from being applied effectively. This paper proposes a novel spatiotemporal event forecasting model that concurrently addresses the above challenges. Specifically, given continuous price data, a new soft time-lagged model is designed to select temporally dependent features. To handle missing values, we propose a data tensor completion method based on price domain knowledge. The parameters of the new model are optimized using a novel algorithm based on the Alternative Direction Methods of Multipliers (ADMM). Extensive experimental evaluations on multiple datasets demonstrate the effectiveness of our proposed approach.
- Published
- 2017
9. Automatical Storyline Generation with Help from Twitter
- Author
-
Xuchao Zhang, Wei Wang, Ting Hua, Chang-Tien Lu, and Naren Ramakrishnan
- Subjects
Topic model ,Work (electrical) ,Computer science ,Process (engineering) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,02 engineering and technology ,Data mining ,Baseline (configuration management) ,computer.software_genre ,Data science ,computer - Abstract
Storyline detection aims to connect seemly irrelevant single documents into meaningful chains, which provides opportunities for understanding how events evolve over time and what triggers such evolutions. Most previous work generated the storylines through unsupervised methods that can hardly reveal underlying factors driving the evolution process. This paper introduces a Bayesian model to generate storylines from massive documents and infer the corresponding hidden relations and topics. In addition, our model is the first attempt that utilizes Twitter data as human input to ``supervise'' the generation of storylines. Through extensive experiments, we demonstrate our proposed model can achieve significant improvement over baseline methods and can be used to discover interesting patterns for real world cases.
- Published
- 2016
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.