Author: "Schwarzschild, Avi" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Schwarzschild, Avi"' showing total 35 results

1. Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

Author: Ding, Mucong, Deng, Chenghao, Choo, Jocelyn, Wu, Zichu, Agrawal, Aakriti, Schwarzschild, Avi, Zhou, Tianyi, Goldstein, Tom, Langford, John, Anandkumar, Anima, and Huang, Furong
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: While generalization over tasks from easy to hard is crucial to profile large language models (LLMs), datasets with fine-grained difficulty annotations for each problem across a broad range of complexity are still lacking. Aiming to address this limitation, we present Easy2Hard-Bench, a consistently formatted collection of 6 benchmark datasets spanning various domains, such as mathematics and programming problems, chess puzzles, and reasoning questions. Each problem within these datasets is annotated with numerical difficulty scores. To systematically estimate problem difficulties, we collect abundant performance data on attempts at each problem by humans in the real world or LLMs on the prominent leaderboard. Leveraging the rich performance data, we apply well-established difficulty ranking systems, such as Item Response Theory (IRT) and Glicko-2 models, to uniformly assign numerical difficulty scores to problems. Moreover, datasets in Easy2Hard-Bench distinguish themselves from previous collections by a higher proportion of challenging problems. Through extensive experiments with six state-of-the-art LLMs, we provide a comprehensive analysis of their performance and generalization capabilities across varying levels of difficulty, with the aim of inspiring future research in LLM generalization. The datasets are available at https://huggingface.co/datasets/furonghuang-lab/Easy2Hard-Bench.
Comment: NeurIPS 2024 Datasets and Benchmarks Track
Published: 2024

2. Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers

Author: Williams, Joshua Nathaniel, Schwarzschild, Avi, and Kolter, J. Zico
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Recovering natural language prompts for image generation models solely from the generated images is a difficult discrete optimization problem. In this work, we present the first head-to-head comparison of recent discrete optimization techniques for the problem of prompt inversion. We evaluate Greedy Coordinate Gradients (GCG), PEZ, Random Search, AutoDAN, and BLIP2's image captioner across various evaluation metrics related to the quality of inverted prompts and the quality of the images generated by the inverted prompts. We find that focusing on the CLIP similarity between the inverted prompts and the ground truth image acts as a poor proxy for the similarity between the ground truth image and the image generated by the inverted prompts. While the discrete optimizers effectively minimize their objectives, simply using responses from a well-trained captioner often leads to generated images that more closely resemble those produced by the original prompts.
Comment: 9 Pages, 4 Figures
Published: 2024

3. The CLRS-Text Algorithmic Reasoning Language Benchmark

Author: Markeeva, Larisa, McLeish, Sean, Ibarz, Borja, Bounsi, Wilfried, Kozlova, Olga, Vitvitskyi, Alex, Blundell, Charles, Goldstein, Tom, Schwarzschild, Avi, and Veličković, Petar
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Data Structures and Algorithms, Statistics - Machine Learning
Abstract: Eliciting reasoning capabilities from language models (LMs) is a critical direction on the path towards building intelligent systems. Most recent studies dedicated to reasoning focus on out-of-distribution performance on procedurally-generated synthetic benchmarks, bespoke-built to evaluate specific skills only. This trend makes results hard to transfer across publications, slowing down progress. Three years ago, a similar issue was identified and rectified in the field of neural algorithmic reasoning, with the advent of the CLRS benchmark. CLRS is a dataset generator comprising graph execution traces of classical algorithms from the Introduction to Algorithms textbook. Inspired by this, we propose CLRS-Text -- a textual version of these algorithmic traces. Out of the box, CLRS-Text is capable of procedurally generating trace data for thirty diverse, challenging algorithmic tasks across any desirable input distribution, while offering a standard pipeline in which any additional algorithmic tasks may be created in the benchmark. We fine-tune and evaluate various LMs as generalist executors on this benchmark, validating prior work and revealing a novel, interesting challenge for the LM reasoning community. Our code is available at https://github.com/google-deepmind/clrs/tree/master/clrs/_src/clrs_text.
Comment: Preprint, under review. Comments welcome
Published: 2024

4. Transformers Can Do Arithmetic with the Right Embeddings

Author: McLeish, Sean, Bansal, Arpit, Stein, Alex, Jain, Neel, Kirchenbauer, John, Bartoldson, Brian R., Kailkhura, Bhavya, Bhatele, Abhinav, Geiping, Jonas, Schwarzschild, Avi, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The poor performance of transformers on arithmetic tasks seems to stem in large part from their inability to keep track of the exact position of each digit inside of a large span of digits. We mend this problem by adding an embedding to each digit that encodes its position relative to the start of the number. In addition to the boost these embeddings provide on their own, we show that this fix enables architectural modifications such as input injection and recurrent layers to improve performance even further. With positions resolved, we can study the logical extrapolation ability of transformers. Can they solve arithmetic problems that are larger and more complex than those in their training data? We find that training on only 20 digit numbers with a single GPU for one day, we can reach state-of-the-art performance, achieving up to 99% accuracy on 100 digit addition problems. Finally, we show that these gains in numeracy also unlock improvements on other multi-step reasoning tasks including sorting and multiplication.
Published: 2024
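
The abstract above describes giving each digit an embedding that encodes its position relative to the start of its number. The following is a minimal sketch of how such per-digit indices might be computed; the character-level tokenization, the 1-based offsets, the use of 0 for non-digit characters, and the function name `digit_positions` are illustrative assumptions, not details taken from the paper.

```python
def digit_positions(text):
    """Return one index per character: the 1-based offset of a digit
    within its run of consecutive digits, or 0 for a non-digit.

    Hypothetical helper: the indexing scheme is an assumption.
    """
    positions = []
    offset = 0
    for ch in text:
        if ch.isdigit():
            offset += 1      # next digit in the current number
        else:
            offset = 0       # any non-digit ends the current number
        positions.append(offset)
    return positions


# Each index could then select a learned embedding that is added to the
# usual token embedding (not shown here).
print(digit_positions("12+345="))  # [1, 2, 0, 1, 2, 3, 0]
```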

5. Rethinking LLM Memorization through the Lens of Adversarial Compression

Author: Schwarzschild, Avi, Feng, Zhili, Maini, Pratyush, Lipton, Zachary C., and Kolter, J. Zico
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Large language models (LLMs) trained on web-scale datasets raise substantial concerns regarding permissible data usage. One major question is whether these models "memorize" all their training data or whether they integrate many data sources in some way more akin to how a human would learn and synthesize information. The answer hinges, to a large degree, on how we define memorization. In this work, we propose the Adversarial Compression Ratio (ACR) as a metric for assessing memorization in LLMs. A given string from the training data is considered memorized if it can be elicited by a prompt (much) shorter than the string itself -- in other words, if these strings can be "compressed" with the model by computing adversarial prompts of fewer tokens. The ACR overcomes the limitations of existing notions of memorization by (i) offering an adversarial view of measuring memorization, especially for monitoring unlearning and compliance; and (ii) allowing for the flexibility to measure memorization for arbitrary strings at a reasonably low compute. Our definition serves as a practical tool for determining when model owners may be violating terms around data usage, providing a potential legal tool and a critical lens through which to address such scenarios.
Comment: https://locuslab.github.io/acr-memorization
Published: 2024
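
The memorization criterion in the abstract above can be illustrated with a small sketch. It assumes the ratio is the target length divided by the eliciting prompt length, both in tokens, which the abstract implies but does not spell out. The function names are hypothetical, and the search for the shortest eliciting prompt, which would need a discrete optimizer, is not shown.

```python
def adversarial_compression_ratio(target_tokens, prompt_tokens):
    """Length of the target string over the length of a prompt that
    elicits it, both in tokens (assumed definition)."""
    if not prompt_tokens:
        raise ValueError("the eliciting prompt must contain at least one token")
    return len(target_tokens) / len(prompt_tokens)


def is_memorized(target_tokens, prompt_tokens):
    """A string counts as memorized when the prompt that elicits it is
    shorter than the string, i.e. when the ratio exceeds 1."""
    return adversarial_compression_ratio(target_tokens, prompt_tokens) > 1.0


# Toy token IDs: a 10-token target elicited by a 4-token prompt.
target = list(range(10))
prompt = list(range(4))
print(adversarial_compression_ratio(target, prompt))  # 2.5
print(is_memorized(target, prompt))                   # True
```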

6. Forcing Diffuse Distributions out of Language Models

Author: Zhang, Yiming, Schwarzschild, Avi, Carlini, Nicholas, Kolter, Zico, and Ippolito, Daphne
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Despite being trained specifically to follow user instructions, today's instruction-tuned language models perform poorly when instructed to produce random outputs. For example, when prompted to pick a number uniformly between one and ten, Llama-2-13B-chat disproportionately favors the number five, and when tasked with picking a first name at random, Mistral-7B-Instruct chooses Avery 40 times more often than we would expect based on the U.S. population. When these language models are used for real-world tasks where diversity of outputs is crucial, such as language model assisted dataset construction, their inability to produce diffuse distributions over valid choices is a major hurdle. In this work, we propose a fine-tuning method that encourages language models to output distributions that are diffuse over valid outcomes. The methods we introduce generalize across a variety of tasks and distributions and make large language models practical for synthetic dataset generation with little human intervention.
Published: 2024

7. Benchmarking ChatGPT on Algorithmic Reasoning

Author: McLeish, Sean, Schwarzschild, Avi, and Goldstein, Tom
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We evaluate ChatGPT's ability to solve algorithm problems from the CLRS benchmark suite that is designed for GNNs. The benchmark requires the use of a specified classical algorithm to solve a given problem. We find that ChatGPT outperforms specialist GNN models, using Python to successfully solve these problems. This raises new points in the discussion about learning algorithms with neural networks and how we think about what out-of-distribution testing looks like with web-scale training data.
Published: 2024

8. Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

Author: Hans, Abhimanyu, Schwarzschild, Avi, Cherepanova, Valeriia, Kazemi, Hamid, Saha, Aniruddha, Goldblum, Micah, Geiping, Jonas, and Goldstein, Tom
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Detecting text generated by modern large language models is thought to be hard, as both LLMs and humans can exhibit a wide range of complex behaviors. However, we find that a score based on contrasting two closely related language models is highly accurate at separating human-generated and machine-generated text. Based on this mechanism, we propose a novel LLM detector that only requires simple calculations using a pair of pre-trained LLMs. The method, called Binoculars, achieves state-of-the-art accuracy without any training data. It is capable of spotting machine text from a range of modern LLMs without any model-specific modifications. We comprehensively evaluate Binoculars on a number of text sources and in varied situations. Over a wide range of document types, Binoculars detects over 90% of generated samples from ChatGPT (and other LLMs) at a false positive rate of 0.01%, despite not being trained on any ChatGPT data.
Comment: 20 pages, code available at https://github.com/ahans30/Binoculars
Published: 2024

9. TOFU: A Task of Fictitious Unlearning for LLMs

Author: Maini, Pratyush, Feng, Zhili, Schwarzschild, Avi, Lipton, Zachary C., and Kolter, J. Zico
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: Large language models trained on massive corpora of data from the web can memorize and reproduce sensitive or private data, raising both legal and ethical concerns. Unlearning, or tuning models to forget information present in their training data, provides us with a way to protect private data after training. Although several methods exist for such unlearning, it is unclear to what extent they result in models equivalent to those where the data to be forgotten was never learned in the first place. To address this challenge, we present TOFU, a Task of Fictitious Unlearning, as a benchmark aimed at helping deepen our understanding of unlearning. We offer a dataset of 200 diverse synthetic author profiles, each consisting of 20 question-answer pairs, and a subset of these profiles called the forget set that serves as the target for unlearning. We compile a suite of metrics that work together to provide a holistic picture of unlearning efficacy. Finally, we provide a set of baseline results from existing unlearning algorithms. Importantly, none of the baselines we consider show effective unlearning, motivating continued efforts to develop approaches for unlearning that effectively tune models so that they truly behave as if they were never trained on the forget data at all.
Comment: https://locuslab.github.io/tofu/
Published: 2024

10. Effective Backdoor Mitigation Depends on the Pre-training Objective

Author: Verma, Sahil, Bhatt, Gantavya, Schwarzschild, Avi, Singhal, Soumye, Das, Arnav Mohanty, Shah, Chirag, Dickerson, John P, and Bilmes, Jeff
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite the advanced capabilities of contemporary machine learning (ML) models, they remain vulnerable to adversarial and backdoor attacks. This vulnerability is particularly concerning in real-world deployments, where compromised models may exhibit unpredictable behavior in critical scenarios. Such risks are heightened by the prevalent practice of collecting massive, internet-sourced datasets for pre-training multimodal models, as these datasets may harbor backdoors. Various techniques have been proposed to mitigate the effects of backdooring in these models, such as CleanCLIP, which is the current state-of-the-art approach. In this work, we demonstrate that the efficacy of CleanCLIP in mitigating backdoors is highly dependent on the particular objective used during model pre-training. We observe that stronger pre-training objectives correlate with harder-to-remove backdoor behaviors. We show this by training multimodal models on two large datasets consisting of 3 million (CC3M) and 6 million (CC6M) datapoints, under various pre-training objectives, followed by poison removal using CleanCLIP. We find that CleanCLIP is ineffective when stronger pre-training objectives are used, even with extensive hyperparameter tuning. Our findings underscore critical considerations for ML practitioners who pre-train models using large-scale web-curated data and are concerned about potential backdoor threats. Notably, our results suggest that simpler pre-training objectives are more amenable to effective backdoor removal. This insight is pivotal for practitioners seeking to balance the trade-offs between using stronger pre-training objectives and security against backdoor attacks.
Comment: Accepted for oral presentation at BUGS workshop @ NeurIPS 2023 (https://neurips2023-bugs.github.io/)
Published: 2023

11. NEFTune: Noisy Embeddings Improve Instruction Finetuning

Author: Jain, Neel, Chiang, Ping-yeh, Wen, Yuxin, Kirchenbauer, John, Chu, Hong-Min, Somepalli, Gowthami, Bartoldson, Brian R., Kailkhura, Bhavya, Schwarzschild, Avi, Saha, Aniruddha, Goldblum, Micah, Geiping, Jonas, and Goldstein, Tom
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: We show that language model finetuning can be improved, sometimes dramatically, with a simple augmentation. NEFTune adds noise to the embedding vectors during training. Standard finetuning of LLaMA-2-7B using Alpaca achieves 29.79% on AlpacaEval, which rises to 64.69% using noisy embeddings. NEFTune also improves over strong baselines on modern instruction datasets. Models trained with Evol-Instruct see a 10% improvement, with ShareGPT an 8% improvement, and with OpenPlatypus an 8% improvement. Even powerful models further refined with RLHF such as LLaMA-2-Chat benefit from additional training with NEFTune.
Comment: 25 pages, Code is available on GitHub: https://github.com/neelsjain/NEFTune
Published: 2023
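
The augmentation described in the abstract above, adding noise to the embedding vectors during training, might be sketched as follows. The abstract does not state the noise distribution or its scale: the uniform noise in [-1, 1] scaled by alpha / sqrt(sequence_length * embedding_dim), the default `alpha`, and the function name are assumptions made for illustration. Pure Python lists stand in for tensors to keep the sketch self-contained.

```python
import math
import random


def add_embedding_noise(embeddings, alpha=5.0, rng=None, training=True):
    """Return a noisy copy of a (seq_len x dim) list of embedding rows.

    Noise is applied only in training mode; in evaluation mode the
    embeddings are copied unchanged. The scaling rule is an assumption.
    """
    if not training or not embeddings:
        return [row[:] for row in embeddings]
    rng = rng or random.Random()
    seq_len, dim = len(embeddings), len(embeddings[0])
    scale = alpha / math.sqrt(seq_len * dim)
    return [[x + scale * rng.uniform(-1.0, 1.0) for x in row]
            for row in embeddings]


# Toy usage: a 4-token sequence with 4-dimensional zero embeddings.
clean = [[0.0] * 4 for _ in range(4)]
noisy = add_embedding_noise(clean, alpha=5.0, rng=random.Random(0))
print(max(abs(x) for row in noisy for x in row) <= 5.0 / 4.0)  # True
```

With this scaling, longer sequences and wider embeddings receive smaller per-coordinate noise, so `alpha` is the only knob to tune.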

12. Baseline Defenses for Adversarial Attacks Against Aligned Language Models

Author: Jain, Neel, Schwarzschild, Avi, Wen, Yuxin, Somepalli, Gowthami, Kirchenbauer, John, Chiang, Ping-yeh, Goldblum, Micah, Saha, Aniruddha, Geiping, Jonas, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Science - Cryptography and Security
Abstract: As Large Language Models quickly become ubiquitous, it becomes critical to understand their security vulnerabilities. Recent work shows that text optimizers can produce jailbreaking prompts that bypass moderation and alignment. Drawing from the rich body of work on adversarial machine learning, we approach these attacks with three questions: What threat models are practically useful in this domain? How do baseline defense techniques perform in this new domain? How does LLM security differ from computer vision? We evaluate several baseline defense strategies against leading adversarial attacks on LLMs, discussing the various settings in which each is feasible and effective. Particularly, we look at three types of defenses: detection (perplexity based), input preprocessing (paraphrase and retokenization), and adversarial training. We discuss white-box and gray-box settings and discuss the robustness-performance trade-off for each of the defenses considered. We find that the weakness of existing discrete optimizers for text, combined with the relatively high costs of optimization, makes standard adaptive attacks more challenging for LLMs. Future research will be needed to uncover whether more powerful optimizers can be developed, or whether the strength of filtering and preprocessing defenses is greater in the LLMs domain than it has been in computer vision.
Comment: 12 pages
Published: 2023
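
The perplexity-based detection defense mentioned in the abstract above can be illustrated with the standard definition of perplexity as the exponential of the negative mean token log-probability. In this sketch the per-token log-probabilities are assumed to come from some language model that is not shown, and the threshold and function names are hypothetical rather than taken from the paper.

```python
import math


def perplexity(token_logprobs):
    """Perplexity of a sequence from its per-token natural-log probabilities."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def flag_prompt(token_logprobs, threshold):
    """Flag a prompt whose perplexity exceeds a chosen threshold
    (the threshold value is an assumption to be tuned on benign prompts)."""
    return perplexity(token_logprobs) > threshold


# Toy input: four tokens, each assigned probability 0.5 by the model.
logprobs = [math.log(0.5)] * 4
print(round(perplexity(logprobs), 6))       # 2.0
print(flag_prompt(logprobs, threshold=1.5)) # True
```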

13. A Cookbook of Self-Supervised Learning

Author: Balestriero, Randall, Ibrahim, Mark, Sobal, Vlad, Morcos, Ari, Shekhar, Shashank, Goldstein, Tom, Bordes, Florian, Bardes, Adrien, Mialon, Gregoire, Tian, Yuandong, Schwarzschild, Avi, Wilson, Andrew Gordon, Geiping, Jonas, Garrido, Quentin, Fernandez, Pierre, Bar, Amir, Pirsiavash, Hamed, LeCun, Yann, and Goldblum, Micah
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Self-supervised learning, dubbed the dark matter of intelligence, is a promising path to advance machine learning. Yet, much like cooking, training SSL methods is a delicate art with a high barrier to entry. While many components are familiar, successfully training a SSL method involves a dizzying set of choices from the pretext tasks to training hyper-parameters. Our goal is to lower the barrier to entry into SSL research by laying the foundations and latest SSL recipes in the style of a cookbook. We hope to empower the curious researcher to navigate the terrain of methods, understand the role of the various knobs, and gain the know-how required to explore how delicious SSL can be.
Published: 2023

14. Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective

Author: Schwarzschild, Avi, Cembalest, Max, Rao, Karthik, Hines, Keegan, and Dickerson, John
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: As neural networks increasingly make critical decisions in high-stakes settings, monitoring and explaining their behavior in an understandable and trustworthy manner is a necessity. One commonly used type of explainer is post hoc feature attribution, a family of methods for giving each feature in an input a score corresponding to its influence on a model's output. A major limitation of this family of explainers in practice is that they can disagree on which features are more important than others. Our contribution in this paper is a method of training models with this disagreement problem in mind. We do this by introducing a Post hoc Explainer Agreement Regularization (PEAR) loss term alongside the standard term corresponding to accuracy, an additional term that measures the difference in feature attribution between a pair of explainers. We observe on three datasets that we can train a model with this loss term to improve explanation consensus on unseen data, and see improved consensus between explainers other than those used in the loss term. We examine the trade-off between improved consensus and model performance. And finally, we study the influence our method has on feature attribution explanations.
Published: 2023

15. Neural Auctions Compromise Bidder Information

Author: Stein, Alex, Schwarzschild, Avi, Curry, Michael, Goldstein, Tom, and Dickerson, John
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Computer Science and Game Theory
Abstract: Single-shot auctions are commonly used as a means to sell goods, for example when selling ad space or allocating radio frequencies; however, devising mechanisms for auctions with multiple bidders and multiple items can be complicated. It has been shown that neural networks can be used to approximate optimal mechanisms while satisfying the constraints that an auction be strategyproof and individually rational. We show that despite such auctions maximizing revenue, they do so at the cost of revealing private bidder information. While randomness is often used to build in privacy, in this context it comes with complications if done without care. Specifically, it can violate rationality and feasibility constraints, fundamentally change the incentive structure of the mechanism, and/or harm top-level metrics such as revenue and social welfare. We propose a method that employs stochasticity to improve privacy while meeting the requirements for auction mechanisms with only a modest sacrifice in revenue. We analyze the cost to the auction house that comes with introducing varying degrees of privacy in common auction settings. Our results show that despite current neural auctions' ability to approximate optimal mechanisms, the resulting vulnerability that comes with relying on neural networks must be accounted for.
Published: 2023

16. Universal Guidance for Diffusion Models

Author: Bansal, Arpit, Chu, Hong-Min, Schwarzschild, Avi, Sengupta, Soumyadip, Goldblum, Micah, Geiping, Jonas, and Goldstein, Tom
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Typical diffusion models are trained to accept a particular form of conditioning, most commonly text, and cannot be conditioned on other modalities without retraining. In this work, we propose a universal guidance algorithm that enables diffusion models to be controlled by arbitrary guidance modalities without the need to retrain any use-specific components. We show that our algorithm successfully generates quality images with guidance functions including segmentation, face recognition, object detection, and classifier signals. Code is available at https://github.com/arpitbansal297/Universal-Guided-Diffusion.
Published: 2023

17. Transfer Learning with Deep Tabular Models

Author: Levin, Roman, Cherepanova, Valeriia, Schwarzschild, Avi, Bansal, Arpit, Bruss, C. Bayan, Goldstein, Tom, Wilson, Andrew Gordon, and Goldblum, Micah
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Recent work on deep learning for tabular data demonstrates the strong performance of deep tabular models, often bridging the gap between gradient boosted decision trees and neural networks. Accuracy aside, a major advantage of neural models is that they learn reusable features and are easily fine-tuned in new domains. This property is often exploited in computer vision and natural language applications, where transfer learning is indispensable when task-specific training data is scarce. In this work, we demonstrate that upstream data gives tabular neural networks a decisive advantage over widely used GBDT models. We propose a realistic medical diagnosis benchmark for tabular transfer learning, and we present a how-to guide for using upstream data to boost performance with a variety of tabular neural network architectures. Finally, we propose a pseudo-feature method for cases where the upstream and downstream feature sets differ, a tabular-specific problem widespread in real-world applications. Our code is available at https://github.com/LevinRoman/tabular-transfer-learning.
Published: 2022

18. End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking

Author: Bansal, Arpit, Schwarzschild, Avi, Borgnia, Eitan, Emam, Zeyad, Huang, Furong, Goldblum, Micah, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Machine learning systems perform well on pattern matching tasks, but their ability to perform algorithmic or logical reasoning is not well understood. One important reasoning capability is algorithmic extrapolation, in which models trained only on small/simple reasoning problems can synthesize complex strategies for large/complex problems at test time. Algorithmic extrapolation can be achieved through recurrent systems, which can be iterated many times to solve difficult reasoning problems. We observe that this approach fails to scale to highly complex problems because behavior degenerates when many iterations are applied -- an issue we refer to as "overthinking." We propose a recall architecture that keeps an explicit copy of the problem instance in memory so that it cannot be forgotten. We also employ a progressive training routine that prevents the model from learning behaviors that are specific to iteration number and instead pushes it to learn behaviors that can be repeated indefinitely. These innovations prevent the overthinking problem, and enable recurrent systems to solve extremely hard extrapolation tasks.
Published: 2022

19. Datasets for Studying Generalization from Easy to Hard Examples

Author: Schwarzschild, Avi, Borgnia, Eitan, Gupta, Arjun, Bansal, Arpit, Emam, Zeyad, Huang, Furong, Goldblum, Micah, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: We describe new datasets for studying generalization from easy to hard examples.
Published: 2021

20. MetaBalance: High-Performance Neural Networks for Class-Imbalanced Data

Author: Bansal, Arpit, Goldblum, Micah, Cherepanova, Valeriia, Schwarzschild, Avi, Bruss, C. Bayan, and Goldstein, Tom
Subjects: Computer Science - Artificial Intelligence
Abstract: Class-imbalanced data, in which some classes contain far more samples than others, is ubiquitous in real-world applications. Standard techniques for handling class-imbalance usually work by training on a re-weighted loss or on re-balanced data. Unfortunately, training overparameterized neural networks on such objectives causes rapid memorization of minority class data. To avoid this trap, we harness meta-learning, which uses both an "outer-loop" and an "inner-loop" loss, each of which may be balanced using different strategies. We evaluate our method, MetaBalance, on image classification, credit-card fraud detection, loan default prediction, and facial recognition tasks with severely imbalanced data, and we find that MetaBalance outperforms a wide array of popular re-sampling strategies.
Published: 2021

21. Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks

Author: Schwarzschild, Avi, Borgnia, Eitan, Gupta, Arjun, Huang, Furong, Vishkin, Uzi, Goldblum, Micah, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Deep neural networks are powerful machines for visual pattern recognition, but reasoning tasks that are easy for humans may still be difficult for neural models. Humans possess the ability to extrapolate reasoning strategies learned on simple problems to solve harder examples, often by thinking for longer. For example, a person who has learned to solve small mazes can easily extend the very same search techniques to solve much larger mazes by spending more time. In computers, this behavior is often achieved through the use of algorithms, which scale to arbitrarily hard problem instances at the cost of more computation. In contrast, the sequential computing budget of feed-forward neural networks is limited by their depth, and networks trained on simple problems have no way of extending their reasoning to accommodate harder problems. In this work, we show that recurrent networks trained to solve simple problems with few recurrent steps can indeed solve much more complex problems simply by performing additional recurrences during inference. We demonstrate this algorithmic behavior of recurrent networks on prefix sum computation, mazes, and chess. In all three domains, networks trained on simple problem instances are able to extend their reasoning abilities at test time simply by "thinking for longer."
Published: 2021

22. SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

Author: Somepalli, Gowthami, Goldblum, Micah, Schwarzschild, Avi, Bruss, C. Bayan, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Tabular data underpins numerous high-impact applications of machine learning from fraud detection to genomics and healthcare. Classical approaches to solving tabular problems, such as gradient boosting and random forests, are widely used by practitioners. However, recent deep learning methods have achieved a degree of performance competitive with popular techniques. We devise a hybrid deep learning approach to solving tabular data problems. Our method, SAINT, performs attention over both rows and columns, and it includes an enhanced embedding method. We also study a new contrastive self-supervised pre-training method for use when labels are scarce. SAINT consistently improves performance over previous deep learning methods, and it even outperforms gradient boosting methods, including XGBoost, CatBoost, and LightGBM, on average over a variety of benchmark tasks.
Published: 2021

23. The Uncanny Similarity of Recurrence and Depth

Author: Schwarzschild, Avi, Gupta, Arjun, Ghiasi, Amin, Goldblum, Micah, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: It is widely believed that deep neural networks contain layer specialization, wherein neural networks extract hierarchical features representing edges and patterns in shallow layers and complete objects in deeper layers. Unlike common feed-forward models that have distinct filters at each layer, recurrent networks reuse the same parameters at various depths. In this work, we observe that recurrent models exhibit the same hierarchical behaviors and the same performance benefits with depth as feed-forward networks despite reusing the same filters at every recurrence. By training models of various feed-forward and recurrent architectures on several datasets for image classification as well as maze solving, we show that recurrent networks have the ability to closely emulate the behavior of non-recurrent deep models, often doing so with far fewer parameters.
Published: 2021

24. Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses

Author: Goldblum, Micah, Tsipras, Dimitris, Xie, Chulin, Chen, Xinyun, Schwarzschild, Avi, Song, Dawn, Madry, Aleksander, Li, Bo, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security, Computer Science - Computer Vision and Pattern Recognition
Abstract: As machine learning systems grow in scale, so do their training data requirements, forcing practitioners to automate and outsource the curation of training data in order to achieve state-of-the-art performance. The absence of trustworthy human supervision over the data collection process exposes organizations to security vulnerabilities; training data can be manipulated to control and degrade the downstream behaviors of learned models. The goal of this work is to systematically categorize and discuss a wide range of dataset vulnerabilities and exploits, approaches for defending against these threats, and an array of open problems in this space. In addition to describing various poisoning and backdoor threat models and the relationships among them, we develop their unified taxonomy.
Published: 2020

25. Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks

Author: Schwarzschild, Avi, Goldblum, Micah, Gupta, Arjun, Dickerson, John P, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computers and Society, Statistics - Machine Learning
Abstract: Data poisoning and backdoor attacks manipulate training data in order to cause models to fail during inference. A recent survey of industry practitioners found that data poisoning is the number one concern among threats ranging from model stealing to adversarial attacks. However, it remains unclear exactly how dangerous poisoning methods are and which ones are more effective considering that these methods, even ones with identical objectives, have not been tested in consistent or realistic settings. We observe that data poisoning and backdoor attacks are highly sensitive to variations in the testing setup. Moreover, we find that existing methods may not generalize to realistic settings. While these existing works serve as valuable prototypes for data poisoning, we apply rigorous tests to determine the extent to which we should fear them. In order to promote fair comparison in future work, we develop standardized benchmarks for data poisoning and backdoor attacks., Comment: 19 pages, 4 figures
Published: 2020

26. Headless Horseman: Adversarial Attacks on Transfer Learning Models

Author: Abdelkader, Ahmed, Curry, Michael J., Fowl, Liam, Goldstein, Tom, Schwarzschild, Avi, Shu, Manli, Studer, Christoph, and Zhu, Chen
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Transfer learning facilitates the training of task-specific classifiers using pre-trained models as feature extractors. We present a family of transferable adversarial attacks against such classifiers, generated without access to the classification head; we call these headless attacks. We first demonstrate successful transfer attacks against a victim network using only its feature extractor. This motivates the introduction of a label-blind adversarial attack. This transfer attack method does not require any information about the class-label space of the victim. Our attack lowers the accuracy of a ResNet18 trained on CIFAR10 by over 40%., Comment: 5 pages, 2 figures. Accepted in ICASSP 2020. Code available on https://github.com/zhuchen03/headless-attack.git
Published: 2020
Full Text: View/download PDF

27. Adversarial Attacks on Machine Learning Systems for High-Frequency Trading

Author: Goldblum, Micah, Schwarzschild, Avi, Patel, Ankit B., and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Quantitative Finance - Statistical Finance
Abstract: Algorithmic trading systems are often completely automated, and deep learning is increasingly receiving attention in this domain. Nonetheless, little is known about the robustness properties of these models. We study valuation models for algorithmic trading from the perspective of adversarial machine learning. We introduce new attacks specific to this domain with size constraints that minimize attack costs. We further discuss how these attacks can be used as an analysis tool to study and evaluate the robustness properties of financial models. Finally, we investigate the feasibility of realistic adversarial attacks in which an adversarial trader fools automated trading systems into making inaccurate predictions., Comment: ACM International Conference on AI in Finance (ICAIF) 2021
Published: 2020
Full Text: View/download PDF

28. Truth or Backpropaganda? An Empirical Investigation of Deep Learning Theory

Author: Goldblum, Micah, Geiping, Jonas, Schwarzschild, Avi, Moeller, Michael, and Goldstein, Tom
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: We empirically evaluate common assumptions about neural networks that are widely held by practitioners and theorists alike. In this work, we: (1) prove the widespread existence of suboptimal local minima in the loss landscape of neural networks, and we use our theory to find examples; (2) show that small-norm parameters are not optimal for generalization; (3) demonstrate that ResNets do not conform to wide-network theories, such as the neural tangent kernel, and that the interaction between skip connections and batch normalization plays a role; (4) find that rank does not correlate with generalization or robustness in a practical setting., Comment: 18 pages, 6 figures. First two authors contributed equally. Published as a conference paper at ICLR 2020
Published: 2019

29. An Implementation of Adaptive Mesh Refinement for Shallow Water Equations

Author: Schwarzschild, Avi and Mandli, Kyle T.
Subjects: Mathematics - Numerical Analysis
Abstract: An implementation of adaptive mesh refinement algorithms is presented for use with multilayer shallow water equations. Currently, adaptive mesh refinement is implemented with a single layer shallow water model in the GeoClaw framework. This implementation, also in the GeoClaw framework, is for multilayer models, which have been implemented in GeoClaw previously. Until now, however, these models were too computationally expensive to run on large domains while resolving detail in coastal regions.
Published: 2018

30. Reckoning with the Disagreement Problem: Explanation Consensus as a Training Objective

Author: Schwarzschild, Avi, primary, Cembalest, Max, additional, Rao, Karthik, additional, Hines, Keegan, additional, and Dickerson, John, additional
Published: 2023
Full Text: View/download PDF

31. Universal Guidance for Diffusion Models

Author: Bansal, Arpit, primary, Chu, Hong-Min, additional, Schwarzschild, Avi, additional, Sengupta, Soumyadip, additional, Goldblum, Micah, additional, Geiping, Jonas, additional, and Goldstein, Tom, additional
Published: 2023
Full Text: View/download PDF

32. Deep Thinking Systems: Logical Extrapolation with Recurrent Neural Networks

Author: Schwarzschild, Avi Koplon
Abstract: Deep neural networks are powerful machines for visual pattern recognition, but reasoning tasks that are easy for humans are still difficult for neural models. Humans possess the ability to extrapolate reasoning strategies learned on simple problems to solve harder examples, often by thinking for longer. We study neural networks that have exactly this capability. By employing recurrence, we build neural networks that can expend more computation when needed. Using several datasets designed specifically for studying generalization from easy problems to harder test samples, we show that our recurrent networks can extrapolate from easy training data to much harder examples at test time, and they do so with many more iterations of a recurrent block of layers than are used during training.
Published: 2023

33. Dataset Security for Machine Learning: Data Poisoning, Backdoor Attacks, and Defenses

Author: Goldblum, Micah, primary, Tsipras, Dimitris, additional, Xie, Chulin, additional, Chen, Xinyun, additional, Schwarzschild, Avi, additional, Song, Dawn, additional, Madry, Aleksander, additional, Li, Bo, additional, and Goldstein, Tom, additional
Published: 2023
Full Text: View/download PDF

34. Adversarial attacks on machine learning systems for high-frequency trading

Author: Goldblum, Micah, primary, Schwarzschild, Avi, additional, Patel, Ankit, additional, and Goldstein, Tom, additional
Published: 2021
Full Text: View/download PDF

35. Headless Horseman: Adversarial Attacks on Transfer Learning Models

Author: Abdelkader, Ahmed, primary, Curry, Michael J., additional, Fowl, Liam, additional, Goldstein, Tom, additional, Schwarzschild, Avi, additional, Shu, Manli, additional, Studer, Christoph, additional, and Zhu, Chen, additional
Published: 2020
Full Text: View/download PDF
