Author: "Xuchao Zhang" / Topic: computer.software_genre - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Xuchao Zhang"' showing total 20 results

1. Self-Paced Robust Learning for Leveraging Clean Labels in Noisy Data

Author: Xian Wu, Fanglan Chen, Chang-Tien Lu, Liang Zhao, and Xuchao Zhang
Subjects: Computer science, business.industry, Process (computing), General Medicine, Machine learning, computer.software_genre, Small set, Robust learning, Robustness (computer science), Convergence (routing), Leverage (statistics), Artificial intelligence, business, computer, Noisy data, Self paced
Abstract: The success of training accurate models strongly depends on the availability of a sufficient collection of precisely labeled data. However, real-world datasets contain erroneously labeled data samples that substantially hinder the performance of machine learning models. Meanwhile, well-labeled data is usually expensive to obtain and only a limited amount is available for training. In this paper, we consider the problem of training a robust model by using large-scale noisy data in conjunction with a small set of clean data. To leverage the information contained in the clean labels, we propose a novel self-paced robust learning algorithm (SPRL) that trains the model in a process from more reliable (clean) data instances to less reliable (noisy) ones under the supervision of well-labeled data. The self-paced learning process hedges the risk of selecting corrupted data into the training set. Moreover, theoretical analyses on the convergence of the proposed algorithm are provided under mild assumptions. Extensive experiments on synthetic and real-world datasets demonstrate that our proposed approach can achieve a considerable improvement in effectiveness and robustness over existing methods.
Published: 2020
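
The clean-to-noisy training order described in this abstract can be illustrated with a generic self-paced learning loop. The sketch below is not the SPRL algorithm itself: it assumes a toy model that predicts a single constant (the mean), a squared-error loss, and a hypothetical doubling schedule for the pace threshold `lam`. All names are illustrative.

```python
def self_paced_fit(clean, noisy, lam=0.05, growth=2.0, rounds=3):
    """Generic self-paced loop (illustrative only, not the paper's SPRL).

    Start from a model fitted on the trusted clean values, then repeatedly
    admit the noisy values whose loss under the current model is below the
    pace threshold `lam`, refit, and relax the threshold.
    """
    model = sum(clean) / len(clean)          # constant predictor: the mean
    admitted = []
    for _ in range(rounds):
        # Hard self-paced weights: a sample is used only if its loss < lam.
        admitted = [y for y in noisy if (y - model) ** 2 < lam]
        train = clean + admitted
        model = sum(train) / len(train)      # refit on clean + admitted data
        lam *= growth                        # admit harder samples next round
    return model, admitted


clean = [1.0, 1.2, 0.8]                      # small trusted set
noisy = [1.1, 0.9, 5.0, 1.3, -4.0]           # 5.0 and -4.0 act as corrupted values
model, admitted = self_paced_fit(clean, noisy)
print(model, admitted)
```

In this toy run the two outlying values never fall under the threshold, so they stay out of the training set while the plausible noisy values are admitted gradually.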

2. TapNet: Multivariate Time Series Classification with Attentional Prototypical Network

Author: Yifeng Gao, Chang-Tien Lu, Jessica Lin, and Xuchao Zhang
Subjects: Multivariate statistics, Training set, Series (mathematics), business.industry, Computer science, Deep learning, Contrast (statistics), 02 engineering and technology, General Medicine, Machine learning, computer.software_genre, Class (biology), 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), 020201 artificial intelligence & image processing, Artificial intelligence, Time series, business, computer
Abstract: With the advance of sensor technologies, the Multivariate Time Series classification (MTSC) problem, perhaps one of the most essential problems in the time series data mining domain, has continuously received a significant amount of attention in recent decades. Traditional time series classification approaches based on Bag-of-Patterns or Time Series Shapelet have difficulty dealing with the huge amounts of feature candidates generated in high-dimensional multivariate data but have promising performance even when the training set is small. In contrast, deep learning based methods can learn low-dimensional features efficiently but suffer from a shortage of labelled data. In this paper, we propose a novel MTSC model with an attentional prototype network to take the strengths of both traditional and deep learning based approaches. Specifically, we design a random group permutation method combined with multi-layer convolutional networks to learn the low-dimensional features from multivariate time series data. To handle the issue of limited training labels, we propose a novel attentional prototype network to train the feature representation based on their distance to class prototypes with inadequate data labels. In addition, we extend our model into its semi-supervised setting by utilizing the unlabeled data. Extensive experiments on 18 datasets in a public UEA Multivariate time series archive with eight state-of-the-art baseline methods exhibit the effectiveness of the proposed model.
Published: 2020
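
The idea of classifying by distance to class prototypes can be sketched in a few lines. This is a minimal illustration of a plain prototypical classifier, assuming unweighted mean prototypes and squared Euclidean distance; the attention weighting and the convolutional feature extractor mentioned in the abstract are deliberately left out, and the function names are hypothetical.

```python
import math


def class_prototypes(embeddings, labels):
    """Return the mean embedding (prototype) of each class."""
    sums, counts = {}, {}
    for vec, label in zip(embeddings, labels):
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, v in enumerate(vec):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [s / counts[c] for s in acc] for c, acc in sums.items()}


def class_probabilities(query, protos):
    """Softmax over negative squared distances from the query to each prototype."""
    scores = {
        c: -sum((q - p) ** 2 for q, p in zip(query, proto))
        for c, proto in protos.items()
    }
    top = max(scores.values())               # subtract the max for numerical stability
    exps = {c: math.exp(s - top) for c, s in scores.items()}
    total = sum(exps.values())
    return {c: e / total for c, e in exps.items()}


# Toy 2-D embeddings standing in for learned time series features.
embeddings = [[0.0, 0.0], [0.0, 2.0], [4.0, 4.0], [6.0, 4.0]]
labels = ["a", "a", "b", "b"]
protos = class_prototypes(embeddings, labels)
probs = class_probabilities([1.0, 1.0], protos)
print(protos, probs)
```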

3. Interpreting Convolutional Sequence Model by Learning Local Prototypes with Adaptation Regularization

Author: Xuchao Zhang, Dongjin Song, Bo Zong, Haifeng Chen, Jingchao Ni, Zhengzhang Chen, Wei Cheng, and Yanchi Liu
Subjects: End user, business.industry, Computer science, Deep learning, Machine learning, computer.software_genre, Regularization (mathematics), Convolutional neural network, Variety (cybernetics), Problem domain, Artificial intelligence, business, Adaptation (computer science), computer, Interpretability
Abstract: In many high-stakes applications of machine learning models, outputting only predictions or providing statistical confidence is usually insufficient to gain trust from end users, who often prefer a transparent reasoning paradigm. Despite the recent encouraging developments on deep networks for sequential data modeling, due to the highly recursive functions, the underlying rationales of their predictions are difficult to explain. Thus, in this paper, we aim to develop a sequence modeling approach that explains its own predictions by breaking input sequences down into evidencing segments (i.e., sub-sequences) in its reasoning. To this end, we build our model upon convolutional neural networks, which, in their vanilla forms, associate local receptive fields with outputs in an obscure manner. To unveil it, we resort to case-based reasoning, and design prototype modules whose units (i.e., prototypes) resemble exemplar segments in the problem domain. Each prediction is obtained by combining the comparisons between the prototypes and the segments of an input. To enhance interpretability, we propose a training objective that delicately adapts the distribution of prototypes to the data distribution in latent spaces, and design an algorithm to map prototypes to human-understandable segments. Through extensive experiments in a variety of domains, we demonstrate that our model can achieve high interpretability generally, together with a competitive accuracy to the state-of-the-art approaches.
Published: 2021

4. Few-Shot Semantic Segmentation via Prototype Augmentation with Image-Level Annotations

Author: Shuo Lei, Chang-Tien Lu, Fanglan Chen, Xuchao Zhang, and Jianfeng He
Subjects: Class (computer programming), Computer science, Process (engineering), business.industry, Pooling, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Machine learning, computer.software_genre, Task (project management), Metric space, Key (cryptography), Segmentation, Artificial intelligence, Representation (mathematics), business, computer
Abstract: Despite the great progress made by deep neural networks in the semantic segmentation task, traditional neural-network-based methods typically suffer from a shortage of large amounts of pixel-level annotations. Recent progress in few-shot semantic segmentation tackles the issue by using only a few pixel-level annotated examples. However, these few-shot approaches cannot easily be applied to multi-way or weak annotation settings. In this paper, we advance the few-shot segmentation paradigm towards a scenario where image-level annotations are available to help the training process of a few pixel-level annotations. Our key idea is to learn a better prototype representation of the class by fusing the knowledge from the image-level labeled data. Specifically, we propose a new framework, called PAIA, to learn the class prototype representation in a metric space by integrating image-level annotations. Furthermore, by considering the uncertainty of pseudo-masks, a distilled soft masked average pooling strategy is designed to handle distractions in image-level annotations. Extensive empirical results on two datasets show superior performance of PAIA.
Published: 2021
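
Masked average pooling, which the abstract builds on, reduces a feature map to one class prototype by averaging only the locations selected by a mask. The sketch below shows the basic operation with a soft mask (weights between 0 and 1). It assumes the feature map is already flattened into a list of per-pixel vectors; the distillation step from the abstract is not modeled, and the names are hypothetical.

```python
def masked_average_pooling(features, mask, eps=1e-8):
    """Weighted mean of per-pixel feature vectors.

    features: list of feature vectors, one per pixel.
    mask:     list of weights in [0, 1], one per pixel (soft mask).
    eps:      small constant that avoids division by zero for an empty mask.
    """
    dim = len(features[0])
    pooled = [0.0] * dim
    for vec, weight in zip(features, mask):
        for i, v in enumerate(vec):
            pooled[i] += weight * v
    total = sum(mask)
    return [p / (total + eps) for p in pooled]


features = [[1.0, 0.0], [3.0, 0.0], [0.0, 10.0]]
hard_mask = [1.0, 1.0, 0.0]    # third pixel is background
soft_mask = [0.9, 0.9, 0.1]    # uncertain pseudo-mask: background leaks in slightly
print(masked_average_pooling(features, hard_mask))
print(masked_average_pooling(features, soft_mask))
```

With the hard mask the background pixel contributes nothing; with the soft mask its contribution is scaled down by its low weight rather than removed outright.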

5. Unsupervised Concept Representation Learning for Length-Varying Text Similarity

Author: Xuchao Zhang, Haifeng Chen, Bo Zong, Jingchao Ni, Yanchi Liu, and Wei Cheng
Subjects: Text corpus, Vocabulary, Matching (statistics), Phrase, Computer science, business.industry, media_common.quotation_subject, Context (language use), 02 engineering and technology, 010501 environmental sciences, computer.software_genre, 01 natural sciences, Similarity (psychology), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, F1 score, business, Feature learning, computer, Natural language processing, 0105 earth and related environmental sciences, media_common
Abstract: Measuring document similarity plays an important role in natural language processing tasks. Most existing document similarity approaches suffer from the information gap caused by context and vocabulary mismatches when comparing varying-length texts. In this paper, we propose an unsupervised concept representation learning approach to address the above issues. Specifically, we propose a novel Concept Generation Network (CGNet) to learn concept representations from the perspective of the entire text corpus. Moreover, a concept-based document matching method is proposed to leverage advances in the recognition of local phrase features and corpus-level concept features. Extensive experiments on real-world data sets demonstrate that the new method can achieve a considerable improvement in comparing length-varying texts. In particular, our model achieved 6.5% better F1 Score compared to the best of the baseline models for a concept-project benchmark dataset.
Published: 2021

6. Robust Multi-target Regression for Correlated Data Corruption

Author: Yingwen Shao and Xuchao Zhang
Subjects: Corruption, Computer science, media_common.quotation_subject, 02 engineering and technology, 010501 environmental sciences, computer.software_genre, 01 natural sciences, Regression, Robust regression, Set (abstract data type), Linear regression, Scalability, 0202 electrical engineering, electronic engineering, information engineering, Data Corruption, 020201 artificial intelligence & image processing, Data mining, computer, 0105 earth and related environmental sciences, media_common
Abstract: Multi-target regression has recently drawn increasing attention in the machine learning community due to its capability of simultaneously predicting multiple continuous target variables based on a given set of input features. Jointly handling the inter-target correlations and input-output relationships is very challenging. That task becomes even more intricate in the presence of correlated data corruption. We observe that traditional robust methods can hardly deal with several emerging challenges, including 1) presence of correlated corruption among targets in the datasets, 2) difficulty in estimating the data corruption ratio, and 3) scalability to massive datasets. This paper proposes a novel approach that addresses all the above challenges by developing a distributed robust regression algorithm. Specifically, the algorithm optimizes regression coefficients of each target in parallel with a heuristically estimated corruption ratio and then consolidates the uncorrupted set in two strategies: global consensus and majority voting. Also, we prove that our algorithm benefits from strong guarantees in terms of convergence rates and coefficient recovery, and that it can be applied as a generic framework for robust regression problems with correlated corruption. Extensive experiments on synthetic and real-world datasets demonstrate that our algorithm is superior to existing methods in both effectiveness and efficiency.
Published: 2020
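
The two consolidation strategies named in the abstract can be illustrated on plain index sets. The sketch below is an interpretation, not the paper's procedure: it assumes each target has already produced a set of sample indices it considers uncorrupted, treats global consensus as the intersection of those sets, and treats majority voting as keeping indices chosen by more than half of the targets. Function names are hypothetical.

```python
from collections import Counter


def global_consensus(uncorrupted_sets):
    """Keep only the samples that every target considers uncorrupted."""
    return set.intersection(*uncorrupted_sets)


def majority_vote(uncorrupted_sets):
    """Keep the samples that more than half of the targets consider uncorrupted."""
    votes = Counter(i for s in uncorrupted_sets for i in s)
    return {i for i, count in votes.items() if count > len(uncorrupted_sets) / 2}


# Per-target estimates of the uncorrupted sample indices (toy data).
per_target = [{0, 1, 2, 3}, {0, 1, 3, 4}, {0, 2, 3, 5}]
print(global_consensus(per_target))
print(majority_vote(per_target))
```

Under these assumptions, consensus is the stricter rule (fewer samples survive), while majority voting tolerates disagreement from a minority of targets.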

7. Temporal Context-Aware Representation Learning for Question Routing

Author: Haifeng Chen, Chen Yuncong, Xuchao Zhang, Jian-Wu Xu, Ding Li, Bo Zong, and Wei Cheng
Subjects: Computer science, business.industry, Dynamics (music), Temporal context, Artificial intelligence, Routing (electronic design automation), business, Machine learning, computer.software_genre, Baseline (configuration management), Feature learning, Temporal information, computer
Abstract: Question routing (QR) aims at recommending newly posted questions to the potential answerers who are most likely to answer the questions. The existing approaches that learn users' expertise from their past question-answering activities usually suffer from challenges in two aspects: 1) multi-faceted expertise and 2) temporal dynamics in the answering behavior. This paper proposes a novel temporal context-aware model in multiple granularities of temporal dynamics that concurrently addresses the above challenges. Specifically, the temporal context-aware attention characterizes the answerer's multi-faceted expertise in terms of the questions' semantic and temporal information simultaneously. Moreover, the design of the multi-shift and multi-resolution module enables our model to handle temporal impact on different time granularities. Extensive experiments on six datasets from different domains demonstrate that the proposed model significantly outperforms competitive baseline models.
Published: 2020

8. Towards More Accurate Uncertainty Estimation In Text Classification

Author: Shuo Lei, Xuchao Zhang, Abdulaziz Alhamadani, Fanglan Chen, Zhiqian Chen, Jianfeng He, Chang-Tien Lu, and Bei Xiao
Subjects: business.industry, Computer science, 05 social sciences, 010501 environmental sciences, Machine learning, computer.software_genre, 01 natural sciences, Uncertainty estimation, 0502 economics and business, Measurement uncertainty, Artificial intelligence, 050207 economics, Human resources, business, Focus (optics), computer, 0105 earth and related environmental sciences, Overconfidence effect
Abstract: The uncertainty measurement of classified results is especially important in areas requiring limited human resources for higher accuracy. For instance, data-driven algorithms diagnosing diseases need an accurate uncertainty score to decide whether additional but limited quantity of experts are needed for rectification. However, few uncertainty models focus on improving the performance of text classification where human resources are involved. To achieve this, we aim at generating an accurate uncertainty score by improving the confidence of winning scores. Thus, a model called MSD, which includes three independent components, "mix-up", "self-ensembling", and "distinctiveness score", is proposed to improve the accuracy of the uncertainty score by reducing the effect of overconfidence of the winning score and considering the impact of different categories of uncertainty simultaneously. MSD can be applied with different Deep Neural Networks. Extensive experiments with ablation settings are conducted on four real-world datasets, on which competitive results are obtained.
Published: 2020

9. Deep Prototypical Networks for Imbalanced Time Series Classification under Data Scarcity

Author: Xuchao Zhang, Suwen Lin, Xian Wu, Nitesh V. Chawla, and Chao Huang
Subjects: Time series classification, business.industry, Computer science, media_common.quotation_subject, 02 engineering and technology, Machine learning, computer.software_genre, Masking (Electronic Health Record), Temporal database, Activity recognition, Scarcity, 020204 information systems, Metric (mathematics), 0202 electrical engineering, electronic engineering, information engineering, Embedding, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, media_common
Abstract: With the increase of temporal data availability, time series classification has drawn a lot of attention in the literature because of its wide spectrum of applications in diverse domains (e.g., healthcare, bioinformatics and finance), ranging from human activity recognition to financial pattern identification. While significant progress has been made to solve the time series classification problem, the success of such methods relies on data sufficiency, and they may not capture quality embeddings well when training triple instances are scarce and highly imbalanced across classes. To address these challenges, we propose a prototype embedding framework, Deep Prototypical Networks (DPN), which leverages a main embedding space to capture the discrepancies of different time series classes for alleviating data scarcity. In addition, we further augment the DPN framework with a relationship-dependent masking module to automatically fuse relevant information with a distance metric learning process, which addresses the data imbalance issue and performs robust time series classification. Experimental results show significant and consistent improvements compared to state-of-the-art techniques.
Published: 2019

10. Feature driven learning framework for cybersecurity event detection

Author: Chang-Tien Lu, Kaiqun Fu, Taoran Ji, Xuchao Zhang, Nathan Self, and Naren Ramakrishnan
Subjects: Event (computing), Computer science, Feature vector, Supervised learning, Context (language use), 02 engineering and technology, Computer security, computer.software_genre, Feature (computer vision), 020204 information systems, Encoding (memory), 0202 electrical engineering, electronic engineering, information engineering, Task analysis, 020201 artificial intelligence & image processing, Social media, computer
Abstract: Cybersecurity event detection is a crucial problem for mitigating effects on various aspects of society. Social media has become a notable source of indicators for detection of diverse events. Though previous social media based strategies for cyber-security event detection focus on mining certain event-related words, the dynamic and evolving nature of online discourse limits the performance of these approaches. Further, because these are typically unsupervised or weakly supervised learning strategies, they do not perform well in an environment of biased samples, noisy context, and informal language which is routine for online, user-generated content. This paper takes a supervised learning approach by proposing a novel multi-task learning based model. Our model can handle diverse structures in feature space by learning models for different types of potential high-profile targets simultaneously. For parameter optimization, we develop an efficient algorithm based on the alternating direction method of multipliers. Through extensive experiments on a real world Twitter dataset, we demonstrate that our approach consistently outperforms existing methods at encoding and identifying cyber-security incidents.
Published: 2019

11. Online Purchase Prediction via Multi-Scale Modeling of Behavior Dynamics

Author: Chao Huang, Xuchao Zhang, Dawei Yin, Nitesh V. Chawla, Xian Wu, Chuxu Zhang, and Jiashu Zhao
Subjects: Network architecture, Artificial neural network, business.industry, Computer science, 02 engineering and technology, Recommender system, Machine learning, computer.software_genre, Recurrent neural network, 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, Graph (abstract data type), 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Categorical variable
Abstract: Online purchase forecasting is of great importance in e-commerce platforms, which is the basis of how to present personalized interesting product lists to individual customers. However, predicting online purchases is not trivial as it is influenced by many factors including: (i) the complex temporal pattern with hierarchical inter-correlations; (ii) arbitrary category dependencies. To address these factors, we develop a Graph Multi-Scale Pyramid Networks (GMP) framework to fully exploit users' latent behavioral patterns with both multi-scale temporal dynamics and arbitrary inter-dependencies among product categories. In GMP, we first design a multi-scale pyramid modulation network architecture which seamlessly preserves the underlying hierarchical temporal factors governing users' purchase behaviors. Then, we employ a convolutional recurrent neural network to encode the categorical temporal pattern at each scale. After that, we develop a resolution-wise recalibration gating mechanism to automatically re-weight the importance of each scale-view representation. Finally, a context-graph neural network module is proposed to adaptively uncover complex dependencies among category-specific purchases. Extensive experiments on real-world e-commerce datasets demonstrate the superior performance of our method over state-of-the-art baselines across various settings.
Published: 2019

12. Robust Regression via Online Feature Selection under Adversarial Data Corruption

Author: Liang Zhao, Arnold P. Boedihardjo, Shuo Lei, Chang-Tien Lu, and Xuchao Zhang
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer science, Feature extraction, Feature selection, Machine Learning (stat.ML), 02 engineering and technology, 010501 environmental sciences, computer.software_genre, 01 natural sciences, Robust regression, Machine Learning (cs.LG), Data set, Set (abstract data type), Statistics - Machine Learning, Linear regression, 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), Data Corruption, 020201 artificial intelligence & image processing, Data mining, computer, 0105 earth and related environmental sciences
Abstract: The presence of data corruption in user-generated streaming data, such as social media, motivates a new fundamental problem that learns reliable regression coefficients when features are not accessible entirely at one time. Until now, several important challenges still cannot be handled concurrently: 1) corrupted data estimation when only partial features are accessible; 2) online feature selection when data contains adversarial corruption; and 3) scaling to a massive dataset. This paper proposes a novel RObust regression algorithm via Online Feature Selection (RoOFS) that concurrently addresses all the above challenges. Specifically, the algorithm iteratively updates the regression coefficients and the uncorrupted set via a robust online feature substitution method. We also prove that our algorithm has a restricted error bound compared to the optimal solution. Extensive empirical experiments in both synthetic and real-world datasets demonstrated that the effectiveness of our new method is superior to that of existing methods in the recovery of both feature selection and regression coefficients, with very competitive efficiency. Comment: 10 pages, 3 figures
Published: 2019

13. Situation-Based Interpretable Learning for Personality Prediction in Social Media

Author: Lei Zhang, Wenmo Kong, Chang-Tien Lu, Zitong Sheng, Liang Zhao, and Xuchao Zhang
Subjects: business.industry, Computer science, media_common.quotation_subject, 05 social sciences, 050109 social psychology, Context (language use), Overfitting, Lexicon, Machine learning, computer.software_genre, 01 natural sciences, Data set, 010104 statistics & probability, Feature (machine learning), Personality, 0501 psychology and cognitive sciences, Social media, Artificial intelligence, 0101 mathematics, Big Five personality traits, business, computer, media_common
Abstract: Predicting individuals' personality traits with their social media profile has proved to be feasible, but researchers recently have run into bottlenecks on further improving the prediction accuracy. One major limitation is that existing studies failed to consider context information in predicting social media users’ behaviors. In this paper, we adopted the DIAMONDS situation theory in psychology to capture the context information in Facebook posts. To solve this issue, we proposed a novel situation-based feature interaction learning model. In this study, we extracted situation features according to the DIAMONDS lexicon and computed the interaction values between these situation features and the commonly used n-gram features at the post level. Features at the post level were aggregated up to the user level using the averaging strategy. A group lasso penalty was employed to enforce strong heredity in the model, which addressed the overfitting challenge introduced by the interaction features. Empirical tests on a large-scale data set have demonstrated the effectiveness of the proposed method.
Published: 2018

14. Spatiotemporal Event Forecasting from Incomplete Hyper-local Price Data

Author: Chang-Tien Lu, Naren Ramakrishnan, Xuchao Zhang, Arnold P. Boedihardjo, and Liang Zhao
Subjects: 010104 statistics & probability, Event forecasting, Computer science, 0202 electrical engineering, electronic engineering, information engineering, Domain knowledge, 020201 artificial intelligence & image processing, 02 engineering and technology, Data mining, 0101 mathematics, Missing data, computer.software_genre, 01 natural sciences, computer
Abstract: Hyper-local pricing data, e.g., about foods and commodities, exhibit subtle spatiotemporal variations that can be useful as crucial precursors of future events. Three major challenges in modeling such pricing data include: i) temporal dependencies underlying features; ii) spatiotemporal missing values; and iii) constraints underlying economic phenomena. These challenges hinder traditional event forecasting models from being applied effectively. This paper proposes a novel spatiotemporal event forecasting model that concurrently addresses the above challenges. Specifically, given continuous price data, a new soft time-lagged model is designed to select temporally dependent features. To handle missing values, we propose a data tensor completion method based on price domain knowledge. The parameters of the new model are optimized using a novel algorithm based on the Alternating Direction Method of Multipliers (ADMM). Extensive experimental evaluations on multiple datasets demonstrate the effectiveness of our proposed approach.
Published: 2017

15. Online and Distributed Robust Regressions under Adversarial Data Corruption

Author: Xuchao Zhang, Chang-Tien Lu, Arnold P. Boedihardjo, and Liang Zhao
Subjects: FOS: Computer and information sciences, 021103 operations research, Distributed database, Computer science, Stochastic process, 0211 other engineering and technologies, 020206 networking & telecommunications, Machine Learning (stat.ML), 02 engineering and technology, computer.software_genre, Thresholding, Robust regression, Data modeling, Machine Learning (cs.LG), Computer Science - Learning, Robustness (computer science), Distributed algorithm, Statistics - Machine Learning, Linear regression, Computer Science - Data Structures and Algorithms, 0202 electrical engineering, electronic engineering, information engineering, Data Corruption, Data Structures and Algorithms (cs.DS), Data mining, computer
Abstract: In today's era of big data, robust least-squares regression becomes a more challenging problem when considering the adversarial corruption along with explosive growth of datasets. Traditional robust methods can handle the noise but suffer from several challenges when applied to huge datasets, including 1) computational infeasibility of handling an entire dataset at once, 2) existence of heterogeneously distributed corruption, and 3) difficulty in corruption estimation when data cannot be entirely loaded. This paper proposes online and distributed robust regression approaches, both of which can concurrently address all the above challenges. Specifically, the distributed algorithm optimizes the regression coefficients of each data block via heuristic hard thresholding and combines all the estimates in a distributed robust consolidation. Furthermore, an online version of the distributed algorithm is proposed to incrementally update the existing estimates with new incoming data. We also prove that our algorithms benefit from strong robustness guarantees in terms of regression coefficient recovery with a constant upper bound on the error of state-of-the-art batch methods. Extensive experiments on synthetic and real datasets demonstrate that our approaches are superior to those of existing methods in effectiveness, with competitive efficiency. Accepted by ICDM 2017.
Published: 2017

16. Robust Regression via Heuristic Hard Thresholding

Author: Liang Zhao, Xuchao Zhang, Chang-Tien Lu, and Arnold P. Boedihardjo
Subjects: business.industry, Computer science, Heuristic, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Machine learning, computer.software_genre, 01 natural sciences, Thresholding, Robust regression, 010104 statistics & probability, 0202 electrical engineering, electronic engineering, information engineering, Artificial intelligence, 0101 mathematics, business, computer
Abstract: The presence of data noise and corruptions has recently drawn increasing attention to Robust Least Squares Regression (RLSR), which addresses the fundamental problem that learns reliable regression coefficients when response variables can be arbitrarily corrupted. Until now, several important challenges still cannot be handled concurrently: 1) exact recovery guarantee of regression coefficients; 2) difficulty in estimating the corruption ratio parameter; and 3) scalability to massive datasets. This paper proposes a novel Robust Least squares regression algorithm via Heuristic Hard thresholding (RLHH) that concurrently addresses all the above challenges. Specifically, the algorithm alternately optimizes the regression coefficients and estimates the optimal uncorrupted set via heuristic hard thresholding without corruption ratio parameter until it converges. We also prove that our algorithm benefits from strong guarantees analogous to those of state-of-the-art methods in terms of convergence rates and recovery guarantees. We provide empirical evidence to demonstrate that the effectiveness of our new method is superior to that of existing methods in the recovery of both regression coefficients and uncorrupted sets, with very competitive efficiency.
Published: 2017
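
The alternation described in this abstract (fit the coefficients, then re-estimate the uncorrupted set by thresholding residuals) can be sketched for a one-feature regression. This is a simplified illustration, not RLHH: the abstract's heuristic chooses the size of the uncorrupted set without a corruption-ratio parameter, whereas this sketch swaps in a fixed, caller-supplied `keep` count. All names are hypothetical.

```python
def fit_line(points):
    """Ordinary least squares for y = a * x + b on a list of (x, y) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    a = sxy / sxx
    return a, mean_y - a * mean_x


def hard_threshold_regression(points, keep, max_iter=50):
    """Alternate between fitting and keeping the `keep` smallest-residual points.

    Assumption: `keep` is fixed by the caller; the paper estimates it heuristically.
    """
    clean = list(points)
    for _ in range(max_iter):
        a, b = fit_line(clean)
        # Hard thresholding: rank all points by absolute residual, keep the best.
        ranked = sorted(points, key=lambda p: abs(p[1] - (a * p[0] + b)))
        new_clean = ranked[:keep]
        if set(new_clean) == set(clean):     # selected set stopped changing
            break
        clean = new_clean
    a, b = fit_line(clean)                   # final fit on the selected set
    return a, b, clean


# y = 2x + 1 with two corrupted responses at x = 2 and x = 7.
data = [(float(x), 2.0 * x + 1.0) for x in range(10)]
data[2] = (2.0, 45.0)
data[7] = (7.0, -25.0)
a, b, clean = hard_threshold_regression(data, keep=8)
print(a, b)
```

On this toy data the first fit is badly skewed, yet the two corrupted points still have the largest residuals, so the second fit uses only clean points.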

17. Automatical Storyline Generation with Help from Twitter

Author: Xuchao Zhang, Wei Wang, Ting Hua, Chang-Tien Lu, and Naren Ramakrishnan
Subjects: Topic model, Work (electrical), Computer science, Process (engineering), 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, 02 engineering and technology, Data mining, Baseline (configuration management), computer.software_genre, Data science, computer
Abstract: Storyline detection aims to connect seemingly irrelevant single documents into meaningful chains, which provides opportunities for understanding how events evolve over time and what triggers such evolutions. Most previous work generated the storylines through unsupervised methods that can hardly reveal underlying factors driving the evolution process. This paper introduces a Bayesian model to generate storylines from massive documents and infer the corresponding hidden relations and topics. In addition, our model is the first attempt that utilizes Twitter data as human input to "supervise" the generation of storylines. Through extensive experiments, we demonstrate our proposed model can achieve significant improvement over baseline methods and can be used to discover interesting patterns for real world cases.
Published: 2016

18. SEAR: towards a mobile and context-sensitive speech-enabled augmented reality

Author: Nassir Navab, Xuchao Zhang, Stuart Goose, and S. Sudarsky
Subjects: Multimedia, Computer science, Interface (Java), Virtual machine, Human–computer interaction, Factory (object-oriented programming), Augmented reality, Speech synthesis, Context (language use), User interface, Dialog box, computer.software_genre, computer
Abstract: A maintenance engineer who talks to pumps and pipes may not seem like the ideal person to entrust with keeping a factory running smoothly, but we hope that our mobile speech-enabled augmented reality (SEAR) framework will enable such behavior in the future to be anything but suspicious. We describe how the SEAR framework uses our flexible and scalable vision-based localization techniques to offer the engineer a seamless multi-modal user interface. This interface juxtaposes a graphical augmented reality view with a context-sensitive speech dialog concerning the industrial equipment located in the immediate vicinity.
Published: 2003

19. E-commerce direct marketing using augmented reality

Author: Xuchao Zhang, Nassir Navab, and S.-P. Liou
Subjects: Multimedia, Computer science, business.industry, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Window (computing), E-commerce, computer.software_genre, Electronic mail, Product (business), Direct marketing, Augmented reality, business, Set (psychology), computer, Camera resectioning
Abstract: Turning Web customers from "window shoppers" into buyers demands an interactive sales model that informs them, gives them individualized attention, and helps to close the sale at the customer's request. Ideally, sales agents should have in-person meetings with all prospective customers. However, this may not be desirable or feasible. The next best thing is for sales agents to send promotional e-mails to their prospective customers. In this paper, we describe the development of a direct marketing system that uses augmented reality (AR) technology. A set of specially designed markers is used to calibrate the camera and track the motion of the markers for the augmentation of three-dimensional product models. No special hardware is required for this system except a PC camera (e.g., WebCam or ViCAM).
Published: 2002

20. Mitigating uncertainty in document classification

Author: Xuchao Zhang, Chang-Tien Lu, Fanglan Chen, and Naren Ramakrishnan
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer science, Machine Learning (stat.ML), 02 engineering and technology, 010501 environmental sciences, computer.software_genre, Machine learning, 01 natural sciences, Machine Learning (cs.LG), Task (project management), Statistics - Machine Learning, 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), Medical diagnosis, Dropout (neural networks), 0105 earth and related environmental sciences, business.industry, Document classification, Measurement uncertainty, 020201 artificial intelligence & image processing, Metric (unit), Artificial intelligence, Focus (optics), business, computer
Abstract: The uncertainty measurement of classifiers' predictions is especially important in applications such as medical diagnoses that need to ensure limited human resources can focus on the most uncertain predictions returned by machine learning models. However, few existing uncertainty models attempt to improve overall prediction accuracy where human resources are involved in the text classification task. In this paper, we propose a novel neural-network-based model that applies a new dropout-entropy method for uncertainty measurement. We also design a metric learning method on feature representations, which can boost the performance of dropout-based uncertainty methods with smaller prediction variance in accurate prediction trials. Extensive experiments on real-world data sets demonstrate that our method can achieve a considerable improvement in overall prediction accuracy compared to existing approaches. In particular, our model improved the accuracy from 0.78 to 0.92 when 30% of the most uncertain predictions were handed over to human experts in the "20NewsGroup" data. Comment: Accepted by NAACL19
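
The idea in the abstract of item 20 of handing the most uncertain predictions to human experts can be sketched as follows. The abstract does not specify the dropout-entropy method, so this sketch assumes one common formulation: average the class probabilities over several stochastic (dropout-enabled) forward passes and score each document by the entropy of that average. The function names, the `defer_fraction` parameter, and the probability values are hypothetical.

```python
import math


def predictive_entropy(prob_samples):
    """Entropy of the class distribution averaged over stochastic passes.

    `prob_samples` is a list of per-pass probability vectors for one input.
    """
    n_passes = len(prob_samples)
    n_classes = len(prob_samples[0])
    mean = [sum(p[c] for p in prob_samples) / n_passes for c in range(n_classes)]
    return -sum(q * math.log(q) for q in mean if q > 0)


def split_for_review(all_samples, defer_fraction):
    """Return (deferred, kept) index lists; the most uncertain are deferred."""
    scores = [predictive_entropy(s) for s in all_samples]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    n_defer = int(len(ranked) * defer_fraction)
    return sorted(ranked[:n_defer]), sorted(ranked[n_defer:])


# Made-up probabilities: four documents, two dropout passes each, two classes.
samples = [
    [[0.90, 0.10], [0.95, 0.05]],
    [[0.50, 0.50], [0.60, 0.40]],
    [[0.99, 0.01], [0.98, 0.02]],
    [[0.70, 0.30], [0.80, 0.20]],
]
deferred, kept = split_for_review(samples, defer_fraction=0.5)
```

Here the two documents whose averaged predictions are closest to uniform are deferred to a human reviewer, and the model's own predictions are kept for the rest.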
