Author: "ARAM, H." / Database: arXiv - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"ARAM, H."' showing total 7 results

1. Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs

Author: Cummins, Chris, Seeker, Volker, Armengol-Estapé, Jordi, Markosyan, Aram H., Synnaeve, Gabriel, and Leather, Hugh
Subjects: Computer Science - Machine Learning
Abstract: Tools for rewriting, refactoring and optimizing code should be fast and correct. Large language models (LLMs), by their nature, possess neither of these qualities. Yet, there remains tremendous opportunity in using LLMs to improve code. We explore the use of LLMs not to transform code, but to code transforms. We propose a chain-of-thought approach to synthesizing code transformations from a small number of input/output code examples that incorporates execution and feedback. Unlike the direct rewrite approach, LLM-generated transformations are easy to inspect, debug, and validate. The logic of the rewrite is explicitly coded and easy to adapt. The compute required to run code transformations is minute compared to that of LLM rewriting. We test our approach on 16 Python code transformations and find that LLM-generated transforms are perfectly precise for 7 of them and less imprecise than direct LLM rewriting on the others. We hope to encourage further research into improving the precision of LLM code rewriting.
Published: 2024

2. Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Author: Samvelyan, Mikayel, Raparthy, Sharath Chandra, Lupu, Andrei, Hambro, Eric, Markosyan, Aram H., Bhatt, Manish, Mao, Yuning, Jiang, Minqi, Parker-Holder, Jack, Foerster, Jakob, Rocktäschel, Tim, and Raileanu, Roberta
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: As large language models (LLMs) become increasingly prevalent across many real-world applications, understanding and enhancing their robustness to adversarial attacks is of paramount importance. Existing methods for identifying adversarial prompts tend to focus on specific domains, lack diversity, or require extensive human annotations. To address these limitations, we present Rainbow Teaming, a novel black-box approach for producing a diverse collection of adversarial prompts. Rainbow Teaming casts adversarial prompt generation as a quality-diversity problem, and uses open-ended search to generate prompts that are both effective and diverse. Focusing on the safety domain, we use Rainbow Teaming to target various state-of-the-art LLMs, including the Llama 2 and Llama 3 models. Our approach reveals hundreds of effective adversarial prompts, with an attack success rate exceeding 90% across all tested models. Furthermore, we demonstrate that fine-tuning models with synthetic data generated by the Rainbow Teaming method significantly enhances their safety without sacrificing general performance or helpfulness. We additionally explore the versatility of Rainbow Teaming by applying it to question answering and cybersecurity, showcasing its potential to drive robust open-ended self-improvement in a wide range of applications.
Published: 2024

3. Complexity results on locally-balanced $2$-partitions of graphs

Author: Gharibyan, Aram H. and Petrosyan, Petros A.
Subjects: Mathematics - Combinatorics
Abstract: A \emph{$2$-partition of a graph $G$} is a function $f:V(G)\rightarrow \{0,1\}$. A $2$-partition $f$ of a graph $G$ is \emph{locally-balanced with an open neighborhood} if for every $v\in V(G)$, $$\left\vert \vert \{u\in N_{G}(v)\colon\,f(u)=0\}\vert - \vert \{u\in N_{G}(v)\colon\,f(u)=1\}\vert \right\vert\leq 1.$$ A $2$-partition $f^{\prime}$ of a graph $G$ is \emph{locally-balanced with a closed neighborhood} if for every $v\in V(G)$, $$\left\vert \vert \{u\in N_{G}[v]\colon\,f^{\prime}(u)=0\}\vert - \vert \{u\in N_{G}[v]\colon\,f^{\prime}(u)=1\}\vert \right\vert\leq 1.$$ In this paper we prove that the problem of the existence of a locally-balanced $2$-partition with an open (closed) neighborhood is $NP$-complete for some restricted classes of graphs. In particular, we show that the problem of deciding if a given graph has a locally-balanced $2$-partition with an open neighborhood is $NP$-complete for biregular bipartite graphs and even bipartite graphs with maximum degree $4$, and the problem of deciding if a given graph has a locally-balanced $2$-partition with a closed neighborhood is $NP$-complete even for subcubic bipartite graphs and odd graphs with maximum degree $3$. The last results prove a conjecture of Balikyan and Kamalian.
Comment: 19 pages, 4 figures
Published: 2024
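
Note: the two definitions in the abstract above can be checked mechanically on small graphs. The following is a minimal sketch, not the authors' method: the function names `is_locally_balanced` and `find_locally_balanced`, and the adjacency-dictionary graph representation, are assumptions made for illustration. The search is brute force over all 2^n assignments, so it is only practical for very small graphs.

```python
from itertools import product


def is_locally_balanced(adj, f, closed=False):
    """Check the locally-balanced condition for a 2-partition f.

    adj    -- dict mapping each vertex to the set of its neighbours (assumed format)
    f      -- dict mapping each vertex to 0 or 1
    closed -- use the closed neighbourhood N[v] instead of the open one N(v)
    """
    for v, nbrs in adj.items():
        hood = set(nbrs) | {v} if closed else set(nbrs)
        zeros = sum(1 for u in hood if f[u] == 0)
        ones = len(hood) - zeros
        # The definition requires | #zeros - #ones | <= 1 at every vertex.
        if abs(zeros - ones) > 1:
            return False
    return True


def find_locally_balanced(adj, closed=False):
    """Try every assignment; return a valid 2-partition or None (exponential time)."""
    vertices = sorted(adj)
    for bits in product((0, 1), repeat=len(vertices)):
        f = dict(zip(vertices, bits))
        if is_locally_balanced(adj, f, closed):
            return f
    return None


# Hypothetical example graph: the 4-cycle 0-1-2-3-0.
cycle4 = {0: {1, 3}, 1: {0, 2}, 2: {1, 3}, 3: {0, 2}}
```

On the 4-cycle, colouring consecutive pairs of vertices alike satisfies both conditions, while the alternating colouring satisfies only the closed-neighborhood condition, because each open neighborhood then contains two vertices of the same colour.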

4. Using Captum to Explain Generative Language Models

Author: Miglani, Vivek, Yang, Aobo, Markosyan, Aram H., Garcia-Olano, Diego, and Kokhlikyan, Narine
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, I.2.7
Abstract: Captum is a comprehensive library for model explainability in PyTorch, offering a range of methods from the interpretability literature to enhance users' understanding of PyTorch models. In this paper, we introduce new features in Captum that are specifically designed to analyze the behavior of generative language models. We provide an overview of the available functionalities and example applications of their potential for understanding learned associations within generative language models.
Published: 2023

5. Identifying and Disentangling Spurious Features in Pretrained Image Representations

Author: Darbinyan, Rafayel, Harutyunyan, Hrayr, Markosyan, Aram H., and Khachatrian, Hrant
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Neural networks employ spurious correlations in their predictions, resulting in decreased performance when these correlations do not hold. Recent works suggest fixing pretrained representations and training a classification head that does not use spurious features. We investigate how spurious features are represented in pretrained representations and explore strategies for removing information about spurious features. Considering the Waterbirds dataset and a few pretrained representations, we find that even with full knowledge of spurious features, their removal is not straightforward due to entangled representation. To address this, we propose a linear autoencoder training method to separate the representation into core, spurious, and other features. We propose two effective spurious feature removal approaches that are applied to the encoding and significantly improve classification performance measured by worst group accuracy.
Published: 2023

6. Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation

Author: Kottur, Satwik, Moon, Seungwhan, Markosyan, Aram H., Shah, Hardik, Damavandi, Babak, and Geramifard, Alborz
Subjects: Computer Science - Computation and Language
Abstract: People capture photos and videos to relive and share memories of personal significance. Recently, media montages (stories) have become a popular mode of sharing these memories due to their intuitive and powerful storytelling capabilities. However, creating such montages usually involves a lot of manual searches, clicks, and selections that are time-consuming and cumbersome, adversely affecting user experiences. To alleviate this, we propose task-oriented dialogs for montage creation as a novel interactive tool to seamlessly search, compile, and edit montages from a media collection. To the best of our knowledge, our work is the first to leverage multi-turn conversations for such a challenging application, extending the previous literature studying simple media retrieval tasks. We collect a new dataset C3 (Conversational Content Creation), comprising 10k dialogs conditioned on media montages simulated from a large media collection. We take a simulate-and-paraphrase approach to collect these dialogs to be both cost and time efficient, while drawing from natural language distribution. Our analysis and benchmarking of state-of-the-art language models showcase the multimodal challenges present in the dataset. Lastly, we present a real-world mobile demo application that shows the feasibility of the proposed work in real-world applications. Our code and data will be made publicly available.
Comment: 8 pages, 6 figures, 2 tables
Published: 2022

7. Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models

Author: Tirumala, Kushal, Markosyan, Aram H., Zettlemoyer, Luke, and Aghajanyan, Armen
Subjects: Computer Science - Computation and Language
Abstract: Despite their wide adoption, the underlying training and memorization dynamics of very large language models are not well understood. We empirically study exact memorization in causal and masked language modeling, across model sizes and throughout the training process. We measure the effects of dataset size, learning rate, and model size on memorization, finding that larger language models memorize training data faster across all settings. Surprisingly, we show that larger models can memorize a larger portion of the data before over-fitting and tend to forget less throughout the training process. We also analyze the memorization dynamics of different parts of speech and find that models memorize nouns and numbers first; we hypothesize and provide empirical evidence that nouns and numbers act as a unique identifier for memorizing individual training examples. Together, these findings present another piece of the broader puzzle of trying to understand what actually improves as models get bigger.
Published: 2022
