Author: "Daumé III, Hal" / Topic: statistics - machine learning - Searchworks@Jio Institute Digital Library Search Results

Your search for '"Daumé III, Hal"' returned 26 results

1. Active Imitation Learning from Multiple Non-Deterministic Teachers: Formulation, Challenges, and Algorithms

Author: Nguyen, Khanh and Daumé III, Hal
Subjects: Computer Science - Machine Learning, Computer Science - Human-Computer Interaction, Statistics - Machine Learning
Abstract: We formulate the problem of learning to imitate multiple, non-deterministic teachers with minimal interaction cost. Rather than learning a specific policy as in standard imitation learning, the goal in this problem is to learn a distribution over a policy space. We first present a general framework that efficiently models and estimates such a distribution by learning continuous representations of the teacher policies. Next, we develop Active Performance-Based Imitation Learning (APIL), an active learning algorithm for reducing the learner-teacher interaction cost in this framework. By making query decisions based on predictions of future progress, our algorithm avoids the pitfalls of traditional uncertainty-based approaches in the face of teacher behavioral uncertainty. Results on both toy and photo-realistic navigation tasks show that APIL significantly reduces the numbers of interactions with teachers without compromising on performance. Moreover, it is robust to various degrees of teacher behavioral uncertainty.
Published: 2020

2. Active Imitation Learning with Noisy Guidance

Author: Brantley, Kianté, Sharaf, Amr, and Daumé III, Hal
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Statistics - Machine Learning
Abstract: Imitation learning algorithms provide state-of-the-art results on many structured prediction tasks by learning near-optimal search policies. Such algorithms assume training-time access to an expert that can provide the optimal action at any queried state; unfortunately, the number of such queries is often prohibitive, frequently rendering these approaches impractical. To combat this query complexity, we consider an active learning setting in which the learning algorithm has additional access to a much cheaper noisy heuristic that provides noisy guidance. Our algorithm, LEAQI, learns a difference classifier that predicts when the expert is likely to disagree with the heuristic, and queries the expert only when necessary. We apply LEAQI to three sequence labeling tasks, demonstrating significantly fewer queries to the expert and comparable (or better) accuracies over a passive approach., Comment: ACL 2020
Published: 2020

3. Weight of Evidence as a Basis for Human-Oriented Explanations

Author: Alvarez-Melis, David, Daumé III, Hal, Vaughan, Jennifer Wortman, and Wallach, Hanna
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Interpretability is an elusive but highly sought-after characteristic of modern machine learning methods. Recent work has focused on interpretability via $\textit{explanations}$, which justify individual model predictions. In this work, we take a step towards reconciling machine explanations with those that humans produce and prefer by taking inspiration from the study of explanation in philosophy, cognitive science, and the social sciences. We identify key aspects in which these human explanations differ from current machine explanations, distill them into a list of desiderata, and formalize them into a framework via the notion of $\textit{weight of evidence}$ from information theory. Finally, we instantiate this framework in two simple applications and show it produces intuitive and comprehensible explanations., Comment: Human-Centric Machine Learning (HCML) Workshop @ NeurIPS 2019
Published: 2019

4. Reinforcement Learning with Convex Constraints

Author: Miryoosefi, Sobhan, Brantley, Kianté, Daumé III, Hal, Dudík, Miroslav, and Schapire, Robert
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Science and Game Theory, Statistics - Machine Learning
Abstract: In standard reinforcement learning (RL), a learning agent seeks to optimize the overall reward. However, many key aspects of a desired behavior are more naturally expressed as constraints. For instance, the designer may want to limit the use of unsafe actions, increase the diversity of trajectories to enable exploration, or approximate expert trajectories when rewards are sparse. In this paper, we propose an algorithmic scheme that can handle a wide class of constraints in RL tasks: specifically, any constraints that require expected values of some vector measurements (such as the use of an action) to lie in a convex set. This captures previously studied constraints (such as safety and proximity to an expert), but also enables new classes of constraints (such as diversity). Our approach comes with rigorous theoretical guarantees and only relies on the ability to approximately solve standard RL tasks. As a result, it can be easily adapted to work with any model-free or model-based RL. In our experiments, we show that it matches previous algorithms that enforce safety via constraints, but can also enforce new properties that these algorithms do not incorporate, such as diversity.
Published: 2019

5. Non-Monotonic Sequential Text Generation

Author: Welleck, Sean, Brantley, Kianté, Daumé III, Hal, and Cho, Kyunghyun
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Standard sequential generation methods assume a pre-specified generation order, such as text generation methods which generate words from left to right. In this work, we propose a framework for training models of text generation that operate in non-monotonic orders; the model directly learns good orders, without any additional annotation. Our framework operates by generating a word at an arbitrary position, and then recursively generating words to its left and then words to its right, yielding a binary tree. Learning is framed as imitation learning, including a coaching method which moves from imitating an oracle to reinforcing the policy's own preferences. Experimental results demonstrate that using the proposed method, it is possible to learn policies which generate text without pre-specifying a generation order, while achieving competitive performance with conventional left-to-right generation., Comment: ICML 2019
Published: 2019

6. Meta-Learning for Contextual Bandit Exploration

Author: Sharaf, Amr and Daumé III, Hal
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We describe MELEE, a meta-learning algorithm for learning a good exploration policy in the interactive contextual bandit setting. Here, an algorithm must take actions based on contexts, and learn based only on a reward signal from the action taken, thereby generating an exploration/exploitation trade-off. MELEE addresses this trade-off by learning a good exploration strategy for offline tasks based on synthetic data, on which it can simulate the contextual bandit setting. Based on these simulations, MELEE uses an imitation learning strategy to learn a good exploration policy that can then be applied to true contextual bandit tasks at test time. We compare MELEE to seven strong baseline contextual bandit algorithms on a set of three hundred real-world datasets, on which it outperforms alternatives in most settings, especially when differences in rewards are large. Finally, we demonstrate the importance of having a rich feature representation for learning how to explore.
Published: 2019

7. Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

Author: Zhang, Chicheng, Agarwal, Alekh, Daumé III, Hal, Langford, John, and Negahban, Sahand N
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We investigate the feasibility of learning from a mix of both fully-labeled supervised data and contextual bandit data. We specifically consider settings in which the underlying learning signal may be different between these two data sources. Theoretically, we state and prove no-regret algorithms for learning that is robust to misaligned cost distributions between the two sources. Empirically, we evaluate some of these algorithms on a large selection of datasets, showing that our approach is both feasible and helpful in practice., Comment: 42 pages, 21 figures, ICML 2019
Published: 2019

8. Contextual Memory Trees

Author: Sun, Wen, Beygelzimer, Alina, Daumé III, Hal, Langford, John, and Mineiro, Paul
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We design and study a Contextual Memory Tree (CMT), a learning memory controller that inserts new memories into an experience store of unbounded size. It is designed to efficiently query for memories from that store, supporting logarithmic time insertion and retrieval operations. Hence CMT can be integrated into existing statistical learning algorithms as an augmented memory unit without substantially increasing training and inference computation. Furthermore CMT operates as a reduction to classification, allowing it to benefit from advances in representation or architecture. We demonstrate the efficacy of CMT by augmenting existing multi-class and multi-label classification algorithms with CMT and observe statistical improvement. We also test CMT learning on several image-captioning tasks to demonstrate that it performs computationally better than a simple nearest neighbors memory system while benefitting from reward learning., Comment: ICML 2019
Published: 2018

9. Hierarchical Imitation and Reinforcement Learning

Author: Le, Hoang M., Jiang, Nan, Agarwal, Alekh, Dudík, Miroslav, Yue, Yisong, and Daumé III, Hal
Subjects: Computer Science - Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: We study how to effectively leverage expert feedback to learn sequential decision-making policies. We focus on problems with sparse rewards and long time horizons, which typically pose significant challenges in reinforcement learning. We propose an algorithmic framework, called hierarchical guidance, that leverages the hierarchical structure of the underlying problem to integrate different modes of expert interaction. Our framework can incorporate different combinations of imitation learning (IL) and reinforcement learning (RL) at different levels, leading to dramatic reductions in both expert effort and cost of exploration. Using long-horizon benchmarks, including Montezuma's Revenge, we demonstrate that our approach can learn significantly faster than hierarchical RL, and be significantly more label-efficient than standard IL. We also theoretically analyze labeling cost for certain instantiations of our framework., Comment: Proceedings of the 35th International Conference on Machine Learning (ICML 2018)
Published: 2018

10. Active Learning for Cost-Sensitive Classification

Author: Krishnamurthy, Akshay, Agarwal, Alekh, Huang, Tzu-Kuo, Daumé III, Hal, and Langford, John
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We design an active learning algorithm for cost-sensitive multiclass classification: problems where different errors have different costs. Our algorithm, COAL, makes predictions by regressing to each label's cost and predicting the smallest. On a new example, it uses a set of regressors that perform well on past data to estimate possible costs for each label. It queries only the labels that could be the best, ignoring the sure losers. We prove COAL can be efficiently implemented for any regression family that admits squared loss optimization; it also enjoys strong guarantees with respect to predictive performance and labeling effort. We empirically compare COAL to passive learning and several active learning baselines, showing significant improvements in labeling effort and test cost on real-world datasets., Comment: Fixed typos in Appendix A
Published: 2017

11. Logarithmic Time One-Against-Some

Author: Daumé III, Hal, Karampatziakis, Nikos, Langford, John, and Mineiro, Paul
Subjects: Statistics - Machine Learning, Computer Science - Learning
Abstract: We create a new online reduction of multiclass classification to binary classification for which training and prediction time scale logarithmically with the number of classes. Compared to previous approaches, we obtain substantially better statistical performance for two reasons: First, we prove a tighter and more complete boosting theorem, and second we translate the results more directly into an algorithm. We show that several simple techniques give rise to an algorithm that can compete with one-against-all in both space and predictive power while offering exponential improvements in speed when the number of classes is large.
Published: 2016

12. Learning to Search Better Than Your Teacher

Author: Chang, Kai-Wei, Krishnamurthy, Akshay, Agarwal, Alekh, Daumé III, Hal, and Langford, John
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: Methods for learning to search for structured prediction typically imitate a reference policy, with existing theoretical guarantees demonstrating low regret compared to that reference. This is unsatisfactory in many applications where the reference policy is suboptimal and the goal of learning is to improve upon it. Can learning to search work even when the reference is poor? We provide a new learning to search algorithm, LOLS, which does well relative to the reference policy, but additionally guarantees low regret compared to deviations from the learned policy: a local-optimality guarantee. Consequently, LOLS can improve upon the reference policy, unlike previous algorithms. This enables us to develop structured contextual bandits, a partial information structured prediction setting with many potential applications., Comment: In ICML 2015
Published: 2015

13. Bayesian Multitask Learning with Latent Hierarchies

Author: Daumé III, Hal
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: We learn multiple hypotheses for related tasks under a latent hierarchical relationship between tasks. We exploit the intuition that for domain adaptation, we wish to share classifier structure, but for multitask learning, we wish to share covariance structure. Our hierarchical model is seen to subsume several previously proposed multitask learning models and performs well on three distinct real-world data sets., Comment: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)
Published: 2014

14. Flexible Modeling of Latent Task Structures in Multitask Learning

Author: Passos, Alexandre, Rai, Piyush, Wainer, Jacques, and Daumé III, Hal
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: Multitask learning algorithms are typically designed assuming some fixed, a priori known latent structure shared by all the tasks. However, it is usually unclear what type of latent task structure is the most appropriate for a given multitask learning problem. Ideally, the "right" latent task structure should be learned in a data-driven manner. We present a flexible, nonparametric Bayesian model that posits a mixture of factor analyzers structure on the tasks. The nonparametric aspect makes the model expressive enough to subsume many existing models of latent task structures (e.g., mean-regularized tasks, clustered tasks, low-rank or linear/non-linear subspace assumption on tasks, etc.). Moreover, it can also learn more general task structures, addressing the shortcomings of such models. We present a variational inference algorithm for our model. Experimental results on synthetic and real-world datasets, on both regression and classification problems, demonstrate the effectiveness of the proposed method., Comment: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)
Published: 2012

15. A Binary Classification Framework for Two-Stage Multiple Kernel Learning

Author: Kumar, Abhishek, Niculescu-Mizil, Alexandru, Kavukcuoglu, Koray, and Daumé III, Hal
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: With the advent of kernel methods, automating the task of specifying a suitable kernel has become increasingly important. In this context, the Multiple Kernel Learning (MKL) problem of finding a combination of pre-specified base kernels that is suitable for the task at hand has received significant attention from researchers. In this paper we show that Multiple Kernel Learning can be framed as a standard binary classification problem with additional constraints that ensure the positive definiteness of the learned kernel. Framing MKL in this way has the distinct advantage that it makes it easy to leverage the extensive research in binary classification to develop better performing and more scalable MKL algorithms that are conceptually simpler, and, arguably, more accessible to practitioners. Experiments on nine data sets from different domains show that, despite its simplicity, the proposed technique compares favorably with current leading MKL approaches., Comment: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)
Published: 2012

16. Learning Task Grouping and Overlap in Multi-task Learning

Author: Kumar, Abhishek and Daumé III, Hal
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: In the paradigm of multi-task learning, multiple related prediction tasks are learned jointly, sharing information across the tasks. We propose a framework for multi-task learning that enables one to selectively share the information across the tasks. We assume that each task parameter vector is a linear combination of a finite number of underlying basis tasks. The coefficients of the linear combination are sparse in nature and the overlap in the sparsity patterns of two tasks controls the amount of sharing across these. Our model is based on the assumption that task parameters within a group lie in a low dimensional subspace but allows the tasks in different groups to overlap with each other in one or more bases. Experimental results on four datasets show that our approach outperforms competing methods., Comment: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)
Published: 2012

17. Efficient Protocols for Distributed Classification and Optimization

Author: Daumé III, Hal, Phillips, Jeff M., Saha, Avishek, and Venkatasubramanian, Suresh
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: In distributed learning, the goal is to perform a learning task over data distributed across multiple nodes with minimal (expensive) communication. Prior work (Daume III et al., 2012) proposes a general model that bounds the communication required for learning classifiers while allowing for $\eps$ training error on linearly separable data adversarially distributed across nodes. In this work, we develop key improvements and extensions to this basic model. Our first result is a two-party multiplicative-weight-update based protocol that uses $O(d^2 \log{1/\eps})$ words of communication to classify distributed data in arbitrary dimension $d$, $\eps$-optimally. This readily extends to classification over $k$ nodes with $O(kd^2 \log{1/\eps})$ words of communication. Our proposed protocol is simple to implement and is considerably more efficient than baselines compared, as demonstrated by our empirical results. In addition, we illustrate general algorithm design paradigms for doing efficient learning over distributed data. We show how to solve fixed-dimensional and high dimensional linear programming efficiently in a distributed setting where constraints may be distributed across nodes. Since many learning problems can be viewed as convex optimization problems where constraints are generated by individual points, this models many typical distributed learning scenarios. Our techniques make use of a novel connection from multipass streaming, as well as adapting the multiplicative-weight-update framework more generally to a distributed setting. As a consequence, our methods extend to the wide range of problems solvable using these techniques.
Published: 2012

18. Protocols for Learning Classifiers on Distributed Data

Author: Daumé III, Hal, Phillips, Jeff M., Saha, Avishek, and Venkatasubramanian, Suresh
Subjects: Statistics - Machine Learning, Computer Science - Learning
Abstract: We consider the problem of learning classifiers for labeled data that has been distributed across several nodes. Our goal is to find a single classifier, with small approximation error, across all datasets while minimizing the communication between nodes. This setting models real-world communication bottlenecks in the processing of massive distributed datasets. We present several very general sampling-based solutions as well as some two-way protocols which have a provable exponential speed-up over any one-way protocol. We focus on core problems for noiseless data distributed across two or more nodes. The techniques we introduce are reminiscent of active learning, but rather than actively probing labels, nodes actively communicate with each other, each node simultaneously learning the important data from another node., Comment: 19 pages, 12 figures, accepted at AISTATS 2012
Published: 2012

19. The Infinite Hierarchical Factor Regression Model

Author: Rai, Piyush and Daumé III, Hal
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: We propose a nonparametric Bayesian factor regression model that accounts for uncertainty in the number of factors, and the relationship between factors. To accomplish this, we propose a sparse variant of the Indian Buffet Process and couple this with a hierarchical model over factors, based on Kingman's coalescent. We apply this model to two problems (factor analysis and factor regression) in gene-expression data analysis.
Published: 2009

20. Streamed Learning: One-Pass SVMs

Author: Rai, Piyush, Daumé III, Hal, and Venkatasubramanian, Suresh
Subjects: Computer Science - Learning, Statistics - Machine Learning
Abstract: We present a streaming model for large-scale classification (in the context of $\ell_2$-SVM) by leveraging connections between learning and computational geometry. The streaming model imposes the constraint that only a single pass over the data is allowed. The $\ell_2$-SVM is known to have an equivalent formulation in terms of the minimum enclosing ball (MEB) problem, and an efficient algorithm based on the idea of \emph{core sets} exists (Core Vector Machine, CVM). CVM learns a $(1+\varepsilon)$-approximate MEB for a set of points and yields an approximate solution to corresponding SVM instance. However CVM works in batch mode requiring multiple passes over the data. This paper presents a single-pass SVM which is based on the minimum enclosing ball of streaming data. We show that the MEB updates for the streaming case can be easily adapted to learn the SVM weight vector in a way similar to using online stochastic gradient updates. Our algorithm performs polylogarithmic computation at each example, and requires very small and constant storage. Experimental results show that, even in such restrictive settings, we can learn efficiently in just one pass and get accuracies comparable to other state-of-the-art SVM solvers (batch and online). We also give an analysis of the algorithm, and discuss some open issues and possible extensions.
Published: 2009

21. Bayesian Agglomerative Clustering with Coalescents

Author: Teh, Yee Whye, Daumé III, Hal, and Roy, Daniel
Subjects: Statistics - Machine Learning
Abstract: We introduce a new Bayesian model for hierarchical clustering based on a prior over trees called Kingman's coalescent. We develop novel greedy and sequential Monte Carlo inferences which operate in a bottom-up agglomerative fashion. We show experimentally the superiority of our algorithms over others, and demonstrate our approach in document clustering and phylolinguistics., Comment: NIPS 2008
Published: 2009

22. Adversarial Robustness for Code

Author: Bielik, Pavol, Vechev, Martin, Daumé III, Hal, and Singh, Aarti
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Software Engineering, Computer Science - Programming Languages, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG), Programming Languages (cs.PL)
Abstract: Machine learning and deep learning in particular has been recently used to successfully address many tasks in the domain of code such as finding and fixing bugs, code completion, decompilation, type inference and many others. However, the issue of adversarial robustness of models for code has gone largely unnoticed. In this work, we explore this issue by: (i) instantiating adversarial attacks for code (a domain with discrete and highly structured inputs), (ii) showing that, similar to other domains, neural models for code are vulnerable to adversarial attacks, and (iii) combining existing and novel techniques to improve robustness while preserving high accuracy., Proceedings of Machine Learning Research, 119, ISSN:2640-3498, Proceedings of the 37th International Conference on Machine Learning
Published: 2020

23. Adversarial Attacks on Probabilistic Autoregressive Forecasting Models

Author: Dang-Nhu, Raphaël, Singh, Gagandeep, Bielik, Pavol, Vechev, Martin, Daumé III, Hal, Singh, Aarti, and Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich] (ETH Zürich)
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), [INFO] Computer Science [cs], 16. Peace & justice, Machine Learning (cs.LG)
Abstract: We develop an effective generation of adversarial attacks on neural models that output a sequence of probability distributions rather than a sequence of single values. This setting includes the recently proposed deep probabilistic autoregressive forecasting models that estimate the probability distribution of a time series given its past and achieve state-of-the-art results in a diverse set of application domains. The key technical challenge we address is how to effectively differentiate through the Monte-Carlo estimation of statistics of the output sequence joint distribution. Additionally, we extend prior work on probabilistic forecasting to the Bayesian setting which allows conditioning on future observations, instead of only on past observations. We demonstrate that our approach can successfully generate attacks with small input perturbations in two challenging tasks where robust decision making is crucial – stock market trading and prediction of electricity consumption., Proceedings of Machine Learning Research, 119, ISSN:2640-3498, Proceedings of the 37th International Conference on Machine Learning (ICML 2020)
Published: 2020

24. From Sets to Multisets: Provable Variational Inference for Probabilistic Integer Submodular Models

Author: Sahin, Aytunc, Bian, Yatao, Buhmann, Joachim, Krause, Andreas, Daumé III, Hal, and Singh, Aarti
Subjects: FOS: Computer and information sciences, Computer Science::Machine Learning, Computer Science::Computer Science and Game Theory, Computer Science - Machine Learning, Statistics - Machine Learning, TheoryofComputation_GENERAL, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Proceedings of Machine Learning Research, 119, ISSN:2640-3498, Proceedings of the 37th International Conference on Machine Learning
Published: 2020

25. Set Functions for Time Series

Author: Horn, Max, Moor, Michael, Bock, Christian, Rieck, Bastian, Borgwardt, Karsten, Daumé III, Hal, and Singh, Aarti
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Machine Learning (stat.ML), Machine Learning (cs.LG)
Abstract: Despite the eminent successes of deep neural networks, many architectures are often hard to transfer to irregularly-sampled and asynchronous time series that commonly occur in real-world datasets, especially in healthcare applications. This paper proposes a novel approach for classifying irregularly-sampled time series with unaligned measurements, focusing on high scalability and data efficiency. Our method SeFT (Set Functions for Time Series) is based on recent advances in differentiable set function learning, extremely parallelizable with a beneficial memory footprint, thus scaling well to large datasets of long time series and online monitoring scenarios. Furthermore, our approach permits quantifying per-observation contributions to the classification outcome. We extensively compare our method with existing algorithms on multiple healthcare time series datasets and demonstrate that it performs competitively whilst significantly reducing runtime., Proceedings of Machine Learning Research, 119, ISSN:2640-3498, Proceedings of the 37th International Conference on Machine Learning
Published: 2020

26. Topological Autoencoders

Author: Moor, Michael, Horn, Max, Rieck, Bastian Alexander, Borgwardt, Karsten, Daumé III, Hal, and Singh, Aarti
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, FOS: Mathematics, Algebraic Topology (math.AT), Machine Learning (stat.ML), Mathematics - Algebraic Topology, Machine Learning (cs.LG)
Abstract: We propose a novel approach for preserving topological structures of the input space in latent representations of autoencoders. Using persistent homology, a technique from topological data analysis, we calculate topological signatures of both the input and latent space to derive a topological loss term. Under weak theoretical assumptions, we construct this loss in a differentiable manner, such that the encoding learns to retain multi-scale connectivity information. We show that our approach is theoretically well-founded and that it exhibits favourable latent representations on a synthetic manifold as well as on real-world image data sets, while preserving low reconstruction errors., Proceedings of Machine Learning Research, 119, ISSN:2640-3498, Proceedings of the 37th International Conference on Machine Learning
Published: 2019
