Author: "Xuchao Zhang" / Publisher: acm - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Xuchao Zhang"' showing total 9 results

Start Over Author "Xuchao Zhang" Publisher acm

9 results on '"Xuchao Zhang"'

1. CAT: Beyond Efficient Transformer for Content-Aware Anomaly Detection in Event Sequences

Author: Shengming Zhang, Yanchi Liu, Xuchao Zhang, Wei Cheng, Haifeng Chen, and Hui Xiong
Published: 2022

2. Interpreting Convolutional Sequence Model by Learning Local Prototypes with Adaptation Regularization

Author: Xuchao Zhang, Dongjin Song, Bo Zong, Haifeng Chen, Jingchao Ni, Zhengzhang Chen, Wei Cheng, and Yanchi Liu
Subjects: End user, business.industry, Computer science, Deep learning, Machine learning, computer.software_genre, Regularization (mathematics), Convolutional neural network, Variety (cybernetics), Problem domain, Artificial intelligence, business, Adaptation (computer science), computer, Interpretability
Abstract: In many high-stakes applications of machine learning models, outputting only predictions or providing statistical confidence is usually insufficient to gain trust from end users, who often prefer a transparent reasoning paradigm. Despite the recent encouraging developments on deep networks for sequential data modeling, due to the highly recursive functions, the underlying rationales of their predictions are difficult to explain. Thus, in this paper, we aim to develop a sequence modeling approach that explains its own predictions by breaking input sequences down into evidencing segments (i.e., sub-sequences) in its reasoning. To this end, we build our model upon convolutional neural networks, which, in their vanilla forms, associates local receptive fields with outputs in an obscure manner. To unveil it, we resort to case-based reasoning, and design prototype modules whose units (i.e., prototypes) resemble exemplar segments in the problem domain. Each prediction is obtained by combining the comparisons between the prototypes and the segments of an input. To enhance interpretability, we propose a training objective that delicately adapts the distribution of prototypes to the data distribution in latent spaces, and design an algorithm to map prototypes to human-understandable segments. Through extensive experiments in a variety of domains, we demonstrate that our model can achieve high interpretability generally, together with a competitive accuracy to the state-of-the-art approaches.
Published: 2021

3. Temporal Context-Aware Representation Learning for Question Routing

Author: Haifeng Chen, Chen Yuncong, Xuchao Zhang, Jian-Wu Xu, Ding Li, Bo Zong, and Wei Cheng
Subjects: Computer science, business.industry, Dynamics (music), Temporal context, Artificial intelligence, Routing (electronic design automation), business, Machine learning, computer.software_genre, Baseline (configuration management), Feature learning, Temporal information, computer
Abstract: Question routing (QR) aims at recommending newly posted questions to the potential answerers who are most likely to answer the questions. The existing approaches that learn users' expertise from their past question-answering activities usually suffer from challenges in two aspects: 1) multi-faceted expertise and 2) temporal dynamics in the answering behavior. This paper proposes a novel temporal context-aware model in multiple granularities of temporal dynamics that concurrently address the above challenges. Specifically, the temporal context-aware attention characterizes the answerer's multi-faceted expertise in terms of the questions' semantic and temporal information simultaneously. Moreover, the design of the multi-shift and multi-resolution module enables our model to handle temporal impact on different time granularities. Extensive experiments on six datasets from different domains demonstrate that the proposed model significantly outperforms competitive baseline models.
Published: 2020

4. Deep Prototypical Networks for Imbalanced Time Series Classification under Data Scarcity

Author: Xuchao Zhang, Suwen Lin, Xian Wu, Nitesh V. Chawla, and Chao Huang
Subjects: Time series classification, business.industry, Computer science, media_common.quotation_subject, 02 engineering and technology, Machine learning, computer.software_genre, Masking (Electronic Health Record), Temporal database, Activity recognition, Scarcity, 020204 information systems, Metric (mathematics), 0202 electrical engineering, electronic engineering, information engineering, Embedding, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, media_common
Abstract: With the increase of temporal data availability, time series classification has drawn a lot of attention in the literature because of its wide spectrum of applications in diverse domains (e.g., healthcare, bioinformatics and finance), ranging from human activity recognition to financial pattern identification. While significant progress has been made to solve time series classification problem, the success of such methods relies on data sufficiency, and may not well capture the quality embeddings when training triple instances are scarce and highly imbalance across classes. To address these challenges, we propose a prototype embedding framework-Deep Prototypical Networks (DPN), which leverages a main embedding space to capture the discrepancies of difference time series classes for alleviating data scarcity. In addition, we further augment DPN framework with a relationship-dependent masking module to automatically fuse relevant information with a distance metric learning process, which addresses the data imbalance issue and performs robust time series classification. Experimental results show significant and consistent improvements compared to state-of-the-art techniques.
Published: 2019

5. Similarity-Aware Network Embedding with Self-Paced Learning

Author: Xian Wu, Baoxu Shi, Nitesh V. Chawla, Xuchao Zhang, and Chao Huang
Subjects: Theoretical computer science, Degree (graph theory), Similarity (network science), Computer science, Node (networking), Network embedding, Embedding, Cluster analysis, Self paced
Abstract: Network embedding, which aims to learn low-dimensional vector representations for nodes in a network, has shown promising performance for many real-world applications, such as node classification and clustering. While various embedding methods have been developed for network data, they are limited in their assumption that nodes are correlated with their neighboring nodes with the same similarity degree. As such, these methods can be suboptimal for embedding network data. In this paper, we propose a new method named SANE, short for Similarity-Aware Network Embedding, to learn node representations by explicitly considering different similarity degrees between connected nodes in a network. In particular, we develop a new framework based on self-paced learning by accounting for both the explicit relations (i.e., observed links) and implicit relations (i.e., unobserved node similarities) in network representation learning. To justify our proposed model, we perform experiments on two real-world network data. Experiments results show that SNAE outperforms state-of-the-art embedding models on the tasks of node classification and node clustering.
Published: 2019

6. Feature driven learning framework for cybersecurity event detection

Author: Chang-Tien Lu, Kaiqun Fu, Taoran Ji, Xuchao Zhang, Nathan Self, and Naren Ramakrishnan
Subjects: Event (computing), Computer science, Feature vector, Supervised learning, Context (language use), 02 engineering and technology, Computer security, computer.software_genre, Feature (computer vision), 020204 information systems, Encoding (memory), 0202 electrical engineering, electronic engineering, information engineering, Task analysis, 020201 artificial intelligence & image processing, Social media, computer
Abstract: Cybersecurity event detection is a crucial problem for mitigating effects on various aspects of society. Social media has become a notable source of indicators for detection of diverse events. Though previous social media based strategies for cyber-security event detection focus on mining certain event-related words, the dynamic and evolving nature of online discourse limits the performance of these approaches. Further, because these are typically unsupervised or weakly supervised learning strategies, they do not perform well in an environment of biased samples, noisy context, and informal language which is routine for online, user-generated content. This paper takes a supervised learning approach by proposing a novel multi-task learning based model. Our model can handle diverse structures in feature space by learning models for different types of potential high-profile targets simultaneously. For parameter optimization, we develop an efficient algorithm based on the alternating direction method of multipliers. Through extensive experiments on a real world Twitter dataset, we demonstrate that our approach consistently outperforms existing methods at encoding and identifying cyber-security incidents.
Published: 2019

7. Online Purchase Prediction via Multi-Scale Modeling of Behavior Dynamics

Author: Chao Huang, Xuchao Zhang, Dawei Yin, Nitesh V. Chawla, Xian Wu, Chuxu Zhang, and Jiashu Zhao
Subjects: Network architecture, Artificial neural network, business.industry, Computer science, 02 engineering and technology, Recommender system, Machine learning, computer.software_genre, Recurrent neural network, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Graph (abstract data type), 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Categorical variable
Abstract: Online purchase forecasting is of great importance in e-commerce platforms, which is the basis of how to present personalized interesting product lists to individual customers. However, predicting online purchases is not trivial as it is influenced by many factors including: (i) the complex temporal pattern with hierarchical inter-correlations; (ii) arbitrary category dependencies. To address these factors, we develop a Graph Multi-Scale Pyramid Networks (GMP) framework to fully exploit users' latent behavioral patterns with both multi-scale temporal dynamics and arbitrary inter-dependencies among product categories. In GMP, we first design a multi-scale pyramid modulation network architecture which seamlessly preserves the underlying hierarchical temporal factors--governing users' purchase behaviors. Then, we employ convolution recurrent neural network to encode the categorical temporal pattern at each scale. After that, we develop a resolution-wise recalibration gating mechanism to automatically re-weight the importance of each scale-view representations. Finally, a context-graph neural network module is proposed to adaptively uncover complex dependencies among category-specific purchases. Extensive experiments on real-world e-commerce datasets demonstrate the superior performance of our method over state-of-the-art baselines across various settings.
Published: 2019

8. Spatiotemporal Event Forecasting from Incomplete Hyper-local Price Data

Author: Chang-Tien Lu, Naren Ramakrishnan, Xuchao Zhang, Arnold P. Boedihardjo, and Liang Zhao
Subjects: 010104 statistics & probability, Event forecasting, Computer science, 0202 electrical engineering, electronic engineering, information engineering, Domain knowledge, 020201 artificial intelligence & image processing, 02 engineering and technology, Data mining, 0101 mathematics, Missing data, computer.software_genre, 01 natural sciences, computer
Abstract: Hyper-local pricing data, e.g., about foods and commodities, exhibit subtle spatiotemporal variations that can be useful as crucial precursors of future events. Three major challenges in modeling such pricing data include: i) temporal dependencies underlying features; ii) spatiotemporal missing values; and iii) constraints underlying economic phenomena. These challenges hinder traditional event forecasting models from being applied effectively. This paper proposes a novel spatiotemporal event forecasting model that concurrently addresses the above challenges. Specifically, given continuous price data, a new soft time-lagged model is designed to select temporally dependent features. To handle missing values, we propose a data tensor completion method based on price domain knowledge. The parameters of the new model are optimized using a novel algorithm based on the Alternative Direction Methods of Multipliers (ADMM). Extensive experimental evaluations on multiple datasets demonstrate the effectiveness of our proposed approach.
Published: 2017

9. Automatical Storyline Generation with Help from Twitter

Author: Xuchao Zhang, Wei Wang, Ting Hua, Chang-Tien Lu, and Naren Ramakrishnan
Subjects: Topic model, Work (electrical), Computer science, Process (engineering), 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, 02 engineering and technology, Data mining, Baseline (configuration management), computer.software_genre, Data science, computer
Abstract: Storyline detection aims to connect seemly irrelevant single documents into meaningful chains, which provides opportunities for understanding how events evolve over time and what triggers such evolutions. Most previous work generated the storylines through unsupervised methods that can hardly reveal underlying factors driving the evolution process. This paper introduces a Bayesian model to generate storylines from massive documents and infer the corresponding hidden relations and topics. In addition, our model is the first attempt that utilizes Twitter data as human input to ``supervise'' the generation of storylines. Through extensive experiments, we demonstrate our proposed model can achieve significant improvement over baseline methods and can be used to discover interesting patterns for real world cases.
Published: 2016

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

9 results on '"Xuchao Zhang"'

1. CAT: Beyond Efficient Transformer for Content-Aware Anomaly Detection in Event Sequences

2. Interpreting Convolutional Sequence Model by Learning Local Prototypes with Adaptation Regularization

3. Temporal Context-Aware Representation Learning for Question Routing

4. Deep Prototypical Networks for Imbalanced Time Series Classification under Data Scarcity

5. Similarity-Aware Network Embedding with Self-Paced Learning

6. Feature driven learning framework for cybersecurity event detection

7. Online Purchase Prediction via Multi-Scale Modeling of Behavior Dynamics

8. Spatiotemporal Event Forecasting from Incomplete Hyper-local Price Data

9. Automatical Storyline Generation with Help from Twitter

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

9 results on '"Xuchao Zhang"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources