Author: "Mooney, Raymond J." / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mooney, Raymond J."' showing total 111 results

1. Learning Deep Semantics for Test Completion

Author: Nie, Pengyu, Banerjee, Rahul, Li, Junyi Jessy, Mooney, Raymond J., and Gligoric, Milos
Subjects: Computer Science - Software Engineering, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Writing tests is a time-consuming yet essential task during software development. We propose to leverage recent advances in deep learning for text and code generation to assist developers in writing tests. We formalize the novel task of test completion to automatically complete the next statement in a test method based on the context of prior statements and the code under test. We develop TeCo -- a deep learning model using code semantics for test completion. The key insight underlying TeCo is that predicting the next statement in a test method requires reasoning about code execution, which is hard to do with only syntax-level data that existing code completion models use. TeCo extracts and uses six kinds of code semantics data, including the execution result of prior statements and the execution context of the test method. To provide a testbed for this new task, as well as to evaluate TeCo, we collect a corpus of 130,934 test methods from 1,270 open-source Java projects. Our results show that TeCo achieves an exact-match accuracy of 18, which is 29% higher than the best baseline using syntax-level data only. When measuring functional correctness of generated next statement, TeCo can generate runnable code in 29% of the cases compared to 18% obtained by the best baseline. Moreover, TeCo is significantly better than prior work on test oracle generation., Comment: Accepted as a conference paper in ICSE 2023
Published: 2023

2. Language-guided Task Adaptation for Imitation Learning

Author: Goyal, Prasoon, Mooney, Raymond J., and Niekum, Scott
Subjects: Computer Science - Artificial Intelligence
Abstract: We introduce a novel setting, wherein an agent needs to learn a task from a demonstration of a related task with the difference between the tasks communicated in natural language. The proposed setting allows reusing demonstrations from other tasks, by providing low effort language descriptions, and can also be used to provide feedback to correct agent errors, which are both important desiderata for building intelligent agents that assist humans in daily tasks. To enable progress in this proposed setting, we create two benchmarks -- Room Rearrangement and Room Navigation -- that cover a diverse set of task adaptations. Further, we propose a framework that uses a transformer-based model to reason about the entities in the tasks and their relationships, to learn a policy for the target task.
Published: 2023

3. Using Developer Discussions to Guide Fixing Bugs in Software

Author: Panthaplackel, Sheena, Gligoric, Milos, Li, Junyi Jessy, and Mooney, Raymond J.
Subjects: Computer Science - Software Engineering, Computer Science - Computation and Language
Abstract: Automatically fixing software bugs is a challenging task. While recent work showed that natural language context is useful in guiding bug-fixing models, the approach required prompting developers to provide this context, which was simulated through commit messages written after the bug-fixing code changes were made. We instead propose using bug report discussions, which are available before the task is performed and are also naturally occurring, avoiding the need for any additional information from developers. For this, we augment standard bug-fixing datasets with bug report discussions. Using these newly compiled datasets, we demonstrate that various forms of natural language context derived from such discussions can aid bug-fixing, even leading to improved performance over using commit messages corresponding to the oracle bug-fixing commits., Comment: Accepted in the Findings of EMNLP 2022
Published: 2022

4. Zero-shot Video Moment Retrieval With Off-the-Shelf Models

Author: Diwan, Anuj, Peng, Puyuan, and Mooney, Raymond J.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: For the majority of the machine learning community, the expensive nature of collecting high-quality human-annotated data and the inability to efficiently finetune very large state-of-the-art pretrained models on limited compute are major bottlenecks for building models for new tasks. We propose a zero-shot simple approach for one such task, Video Moment Retrieval (VMR), that does not perform any additional finetuning and simply repurposes off-the-shelf models trained on other tasks. Our three-step approach consists of moment proposal, moment-query matching and postprocessing, all using only off-the-shelf models. On the QVHighlights benchmark for VMR, we vastly improve performance of previous zero-shot approaches by at least 2.5x on all metrics and reduce the gap between zero-shot and state-of-the-art supervised by over 74%. Further, we also show that our zero-shot approach beats non-pretrained supervised models on the Recall metrics and comes very close on mAP metrics; and that it also performs better than the best pretrained supervised model on shorter moments. Finally, we ablate and analyze our results and propose interesting future directions., Comment: Accepted to the NeurIPS 2022 Workshop on Transfer Learning for NLP (TL4NLP). 12 pages, 5 figures
Published: 2022

5. Entity-Focused Dense Passage Retrieval for Outside-Knowledge Visual Question Answering

Author: Wu, Jialin and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Most Outside-Knowledge Visual Question Answering (OK-VQA) systems employ a two-stage framework that first retrieves external knowledge given the visual question and then predicts the answer based on the retrieved content. However, the retrieved knowledge is often inadequate. Retrievals are frequently too general and fail to cover specific knowledge needed to answer the question. Also, the naturally available supervision (whether the passage contains the correct answer) is weak and does not guarantee question relevancy. To address these issues, we propose an Entity-Focused Retrieval (EnFoRe) model that provides stronger supervision during training and recognizes question-relevant entities to help retrieve more specific knowledge. Experiments show that our EnFoRe model achieves superior retrieval performance on OK-VQA, the currently largest outside-knowledge VQA dataset. We also combine the retrieved knowledge with state-of-the-art VQA models, and achieve a new state-of-the-art performance on OK-VQA., Comment: EMNLP 2022
Published: 2022

6. Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks

Author: Yu, Albert and Mooney, Raymond J.
Subjects: Computer Science - Robotics, Computer Science - Computation and Language, Computer Science - Machine Learning, I.2.9, I.2.7, I.2.6
Abstract: Demonstrations and natural language instructions are two common ways to specify and teach robots novel tasks. However, for many complex tasks, a demonstration or language instruction alone contains ambiguities, preventing tasks from being specified clearly. In such cases, a combination of both a demonstration and an instruction more concisely and effectively conveys the task to the robot than either modality alone. To instantiate this problem setting, we train a single multi-task policy on a few hundred challenging robotic pick-and-place tasks and propose DeL-TaCo (Joint Demo-Language Task Conditioning), a method for conditioning a robotic policy on task embeddings comprised of two components: a visual demonstration and a language instruction. By allowing these two modalities to mutually disambiguate and clarify each other during novel task specification, DeL-TaCo (1) substantially decreases the teacher effort needed to specify a new task and (2) achieves better generalization performance on novel objects and instructions over previous task-conditioning methods. To our knowledge, this is the first work to show that simultaneously conditioning a multi-task robotic manipulation policy on both demonstration and language embeddings improves sample efficiency and generalization over conditioning on either modality alone. See additional materials at https://deltaco-robot.github.io/, Comment: 24 pages, 10 figures. Project website at https://deltaco-robot.github.io/
Published: 2022

7. Towards Automated Error Analysis: Learning to Characterize Errors

Author: Gao, Tong, Singh, Shivang, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Characterizing the patterns of errors that a system makes helps researchers focus future development on increasing its accuracy and robustness. We propose a novel form of "meta learning" that automatically learns interpretable rules that characterize the types of errors that a system makes, and demonstrate these rules' ability to help understand and improve two NLP systems. Our approach works by collecting error cases on validation data, extracting meta-features describing these samples, and finally learning rules that characterize errors using these features. We apply our approach to VilBERT, for Visual Question Answering, and RoBERTa, for Common Sense Question Answering. Our system learns interpretable rules that provide insights into systemic errors these systems make on the given tasks. Using these insights, we are also able to "close the loop" and modestly improve performance of these systems., Comment: 12 pages, 11 figures
Published: 2022

8. Learning to Describe Solutions for Bug Reports Based on Developer Discussions

Author: Panthaplackel, Sheena, Li, Junyi Jessy, Gligoric, Milos, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Software Engineering
Abstract: When a software bug is reported, developers engage in a discussion to collaboratively resolve it. While the solution is likely formulated within the discussion, it is often buried in a large amount of text, making it difficult to comprehend and delaying its implementation. To expedite bug resolution, we propose generating a concise natural language description of the solution by synthesizing relevant content within the discussion, which encompasses both natural language and source code. We build a corpus for this task using a novel technique for obtaining noisy supervision from repository changes linked to bug reports, with which we establish benchmarks. We also design two systems for generating a description during an ongoing discussion by classifying when sufficient context for performing the task emerges in real-time. With automated and human evaluation, we find this task to form an ideal testbed for complex reasoning in long, bimodal dialogue context., Comment: Accepted in Findings of ACL 2022
Published: 2021

9. Impact of Evaluation Methodologies on Code Summarization

Author: Nie, Pengyu, Zhang, Jiyang, Li, Junyi Jessy, Mooney, Raymond J., and Gligoric, Milos
Subjects: Computer Science - Software Engineering, Computer Science - Machine Learning
Abstract: There has been a growing interest in developing machine learning (ML) models for code summarization tasks, e.g., comment generation and method naming. Despite substantial increase in the effectiveness of ML models, the evaluation methodologies, i.e., the way people split datasets into training, validation, and test sets, were not well studied. Specifically, no prior work on code summarization considered the timestamps of code and comments during evaluation. This may lead to evaluations that are inconsistent with the intended use cases. In this paper, we introduce the time-segmented evaluation methodology, which is novel to the code summarization research community, and compare it with the mixed-project and cross-project methodologies that have been commonly used. Each methodology can be mapped to some use cases, and the time-segmented methodology should be adopted in the evaluation of ML models for code summarization. To assess the impact of methodologies, we collect a dataset of (code, comment) pairs with timestamps to train and evaluate several recent ML models for code summarization. Our experiments show that different methodologies lead to conflicting evaluation results. We invite the community to expand the set of methodologies used in evaluations., Comment: Accepted as a conference paper in ACL 2022
Published: 2021

10. Zero-shot Task Adaptation using Natural Language

Author: Goyal, Prasoon, Mooney, Raymond J., and Niekum, Scott
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Imitation learning and instruction-following are two common approaches to communicate a user's intent to a learning agent. However, as the complexity of tasks grows, it could be beneficial to use both demonstrations and language to communicate with an agent. In this work, we propose a novel setting where an agent is given both a demonstration and a description, and must combine information from both the modalities. Specifically, given a demonstration for a task (the source task), and a natural language description of the differences between the demonstrated task and a related but different task (the target task), our goal is to train an agent to complete the target task in a zero-shot setting, that is, without any demonstrations for the target task. To this end, we introduce Language-Aided Reward and Value Adaptation (LARVA) which, given a source demonstration and a linguistic description of how the target task differs, learns to output a reward / value function that accurately describes the target task. Our experiments show that on a diverse set of adaptations, our approach is able to complete more than 95% of target tasks when using template-based descriptions, and more than 70% when using free-form natural language.
Published: 2021

11. Learning to Generate Code Comments from Class Hierarchies

Author: Zhang, Jiyang, Panthaplackel, Sheena, Nie, Pengyu, Mooney, Raymond J., Li, Junyi Jessy, and Gligoric, Milos
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Software Engineering
Abstract: Descriptive code comments are essential for supporting code comprehension and maintenance. We propose the task of automatically generating comments for overriding methods. We formulate a novel framework which accommodates the unique contextual and linguistic reasoning that is required for performing this task. Our approach features: (1) incorporating context from the class hierarchy; (2) conditioning on learned, latent representations of specificity to generate comments that capture the more specialized behavior of the overriding method; and (3) unlikelihood training to discourage predictions which do not conform to invariant characteristics of the comment corresponding to the overridden method. Our experiments show that the proposed approach is able to generate comments for overriding methods of higher quality compared to prevailing comment generation techniques.
Published: 2021

12. Deep Just-In-Time Inconsistency Detection Between Comments and Source Code

Author: Panthaplackel, Sheena, Li, Junyi Jessy, Gligoric, Milos, and Mooney, Raymond J.
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Natural language comments convey key aspects of source code such as implementation, usage, and pre- and post-conditions. Failure to update comments accordingly when the corresponding code is modified introduces inconsistencies, which is known to lead to confusion and software bugs. In this paper, we aim to detect whether a comment becomes inconsistent as a result of changes to the corresponding body of code, in order to catch potential inconsistencies just-in-time, i.e., before they are committed to a code base. To achieve this, we develop a deep-learning approach that learns to correlate a comment with code changes. By evaluating on a large corpus of comment/code pairs spanning various comment types, we show that our model outperforms multiple baselines by significant margins. For extrinsic evaluation, we show the usefulness of our approach by combining it with a comment update model to build a more comprehensive automatic comment maintenance system which can both detect and resolve inconsistent comments based on code changes., Comment: Accepted in AAAI 2021
Published: 2020

13. Systematic Generalization on gSCAN with Language Conditioned Embedding

Author: Gao, Tong, Huang, Qi, and Mooney, Raymond J.
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Systematic Generalization refers to a learning algorithm's ability to extrapolate learned behavior to unseen situations that are distinct but semantically similar to its training data. As shown in recent work, state-of-the-art deep learning models fail dramatically even on tasks for which they are designed when the test set is systematically different from the training data. We hypothesize that explicitly modeling the relations between objects in their contexts while learning their representations will help achieve systematic generalization. Therefore, we propose a novel method that learns objects' contextualized embeddings with dynamic message passing conditioned on the input natural language and end-to-end trainable with other downstream deep learning modules. To our knowledge, this model is the first one that significantly outperforms the provided baseline and reaches state-of-the-art performance on grounded-SCAN (gSCAN), a grounded natural language navigation dataset designed to require systematic generalization in its test splits., Comment: Accepted by AACL-IJCNLP 2020. Huang and Gao share co-first authorship, authors contribute equally and are listed in alphabetical order
Published: 2020

14. PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards

Author: Goyal, Prasoon, Niekum, Scott, and Mooney, Raymond J.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Reinforcement learning (RL), particularly in sparse reward settings, often requires prohibitively large numbers of interactions with the environment, thereby limiting its applicability to complex problems. To address this, several prior approaches have used natural language to guide the agent's exploration. However, these approaches typically operate on structured representations of the environment, and/or assume some structure in the natural language commands. In this work, we propose a model that directly maps pixels to rewards, given a free-form natural language description of the task, which can then be used for policy learning. Our experiments on the Meta-World robot manipulation domain show that language-based rewards significantly improve the sample efficiency of policy learning, both in sparse and dense reward settings., Comment: Conference on Robot Learning (CoRL), 2020
Published: 2020

15. Improving VQA and its Explanations by Comparing Competing Explanations

Author: Wu, Jialin, Chen, Liyan, and Mooney, Raymond J.
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Most recent state-of-the-art Visual Question Answering (VQA) systems are opaque black boxes that are only trained to fit the answer distribution given the question and visual content. As a result, these systems frequently take shortcuts, focusing on simple visual concepts or question priors. This phenomenon becomes more problematic as the questions become more complex and require more reasoning and commonsense knowledge. To address this issue, we present a novel framework that uses explanations for competing answers to help VQA systems select the correct answer. By training on human textual explanations, our framework builds better representations for the questions and visual content, and then reweights confidences in the answer candidates using either generated or retrieved explanations from the training set. We evaluate our framework on the VQA-X dataset, which has more difficult questions with human explanations, achieving new state-of-the-art results on both VQA and its explanations.
Published: 2020

16. Dialog as a Vehicle for Lifelong Learning

Author: Padmakumar, Aishwarya and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language
Abstract: Dialog systems research has primarily been focused around two main types of applications - task-oriented dialog systems that learn to use clarification to aid in understanding a goal, and open-ended dialog systems that are expected to carry out unconstrained "chit chat" conversations. However, dialog interactions can also be used to obtain various types of knowledge that can be used to improve an underlying language understanding system, or other machine learning systems that the dialog acts over. In this position paper, we present the problem of designing dialog systems that enable lifelong learning as an important challenge problem, in particular for applications involving physically situated robots. We include examples of prior work in this direction, and discuss challenges that remain to be addressed., Comment: Position Paper Track at the SIGDIAL Special Session on Physically Situated Dialogue (RoboDial 2.0) - Camera Ready Version
Published: 2020

17. Dialog Policy Learning for Joint Clarification and Active Learning Queries

Author: Padmakumar, Aishwarya and Mooney, Raymond J.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Intelligent systems need to be able to recover from mistakes, resolve uncertainty, and adapt to novel concepts not seen during training. Dialog interaction can enable this by the use of clarifications for correction and resolving uncertainty, and active learning queries to learn new concepts encountered during operation. Prior work on dialog systems has focused exclusively either on learning how to perform clarification/information seeking, or on performing active learning. In this work, we train a hierarchical dialog policy to jointly perform both clarification and active learning in the context of an interactive language-based image retrieval task motivated by an online shopping application, and demonstrate that jointly learning dialog policies for clarification and active learning is more effective than the use of static dialog policies for one or both of these functions., Comment: AAAI 2020 Camera Ready
Published: 2020

18. Learning to Update Natural Language Comments Based on Code Changes

Author: Panthaplackel, Sheena, Nie, Pengyu, Gligoric, Milos, Li, Junyi Jessy, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Software Engineering
Abstract: We formulate the novel task of automatically updating an existing natural language comment based on changes in the body of code it accompanies. We propose an approach that learns to correlate changes across two distinct language representations, to generate a sequence of edits that are applied to the existing comment to reflect the source code modifications. We train and evaluate our model using a dataset that we collected from commit histories of open-source software projects, with each example consisting of a concurrent update to a method and its corresponding comment. We compare our approach against multiple baselines using both automatic metrics and human evaluation. Results reflect the challenge of this task and that our model outperforms baselines with respect to making edits., Comment: Accepted in Association for Computational Linguistics (ACL) 2020
Published: 2020

19. Associating Natural Language Comment and Source Code Entities

Author: Panthaplackel, Sheena, Gligoric, Milos, Mooney, Raymond J., and Li, Junyi Jessy
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Software Engineering
Abstract: Comments are an integral part of software development; they are natural language descriptions associated with source code elements. Understanding explicit associations can be useful in improving code comprehensibility and maintaining the consistency between code and comments. As an initial step towards this larger goal, we address the task of associating entities in Javadoc comments with elements in Java source code. We propose an approach for automatically extracting supervised data using revision histories of open source projects and present a manually annotated evaluation dataset for this task. We develop a binary classifier and a sequence labeling model by crafting a rich feature set which encompasses various aspects of code, comments, and the relationships between them. Experiments show that our systems outperform several baselines learning from the proposed supervision., Comment: Accepted in AAAI 2020
Published: 2019

20. Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder

Author: Wu, Jialin and Mooney, Raymond J.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Most RNN-based image captioning models receive supervision on the output words to mimic human captions. Therefore, the hidden states can only receive noisy gradient signals via layers of back-propagation through time, leading to less accurate generated captions. Consequently, we propose a novel framework, Hidden State Guidance (HSG), that matches the hidden states in the caption decoder to those in a teacher decoder trained on an easier task of autoencoding the captions conditioned on the image. During training with the REINFORCE algorithm, the conventional rewards are sentence-based evaluation metrics equally distributed to each generated word, no matter their relevance. HSG provides a word-level reward that helps the model learn better hidden representations. Experimental results demonstrate that HSG clearly outperforms various state-of-the-art caption decoders using either raw images or detected objects as inputs.
Published: 2019

21. Generating Question Relevant Captions to Aid Visual Question Answering

Author: Wu, Jialin, Hu, Zeyuan, and Mooney, Raymond J.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Visual question answering (VQA) and image captioning require a shared body of general knowledge connecting language and vision. We present a novel approach to improve VQA performance that exploits this connection by jointly generating captions that are targeted to help answer a specific visual question. The model is trained using an existing caption dataset by automatically determining question-relevant captions using an online gradient-based method. Experimental results on the VQA v2 challenge demonstrate that our approach obtains state-of-the-art VQA performance (e.g. 68.4% on the Test-standard set using a single model) by simultaneously generating question-relevant captions., Comment: ACL 2019 camera-ready
Published: 2019

22. Do Human Rationales Improve Machine Explanations?

Author: Strout, Julia, Zhang, Ye, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language
Abstract: Work on "learning with rationales" shows that humans providing explanations to a machine learning system can improve the system's predictive accuracy. However, this work has not been connected to work in "explainable AI" which concerns machines explaining their reasoning to humans. In this work, we show that learning with rationales can also improve the quality of the machine's explanations as evaluated by human judges. Specifically, we present experiments showing that, for CNN-based text classification, explanations generated using "supervised attention" are judged superior to explanations generated using normal unsupervised attention.
Published: 2019

23. Self-Critical Reasoning for Robust Visual Question Answering

Author: Wu, Jialin and Mooney, Raymond J.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Visual Question Answering (VQA) deep-learning systems tend to capture superficial statistical correlations in the training data because of strong language priors and fail to generalize to test data with a significantly different question-answer (QA) distribution. To address this issue, we introduce a self-critical training objective that ensures that visual explanations of correct answers match the most influential image regions more than other competitive answer candidates. The influential regions are either determined from human visual/textual explanations or automatically from just significant words in the question and answer. We evaluate our approach on the VQA generalization task using the VQA-CP dataset, achieving a new state of the art, i.e., 49.5% using textual explanations and 48.5% using automatically annotated regions., Comment: In NeurIPS 2019
Published: 2019

24. Using Natural Language for Reward Shaping in Reinforcement Learning

Author: Goyal, Prasoon, Niekum, Scott, and Mooney, Raymond J.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Recent reinforcement learning (RL) approaches have shown strong performance in complex domains such as Atari games, but are often highly sample inefficient. A common approach to reduce interaction time with the environment is to use reward shaping, which involves carefully designing reward functions that provide the agent intermediate rewards for progress towards the goal. However, designing appropriate shaping rewards is known to be difficult as well as time-consuming. In this work, we address this problem by using natural language instructions to perform reward shaping. We propose the LanguagE-Action Reward Network (LEARN), a framework that maps free-form natural language instructions to intermediate rewards based on actions taken by the agent. These intermediate language-based rewards can seamlessly be integrated into any standard reinforcement learning algorithm. We experiment with Montezuma's Revenge from the Atari Learning Environment, a popular benchmark in RL. Our experiments on a diverse set of 15 tasks demonstrate that, for the same number of interactions with the environment, language-based rewards lead to successful completion of the task 60% more often on average, compared to learning without language., Comment: IJCAI 2019
Published: 2019

25. Improving Grounded Natural Language Understanding through Human-Robot Dialog

Author: Thomason, Jesse, Padmakumar, Aishwarya, Sinapov, Jivko, Walker, Nick, Jiang, Yuqian, Yedidsion, Harel, Hart, Justin, Stone, Peter, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Human-Computer Interaction
Abstract: Natural language understanding for robotics can require substantial domain- and platform-specific engineering. For example, for mobile robots to pick-and-place objects in an environment to satisfy human commands, we can specify the language humans use to issue such commands, and connect concept words like "red can" to physical object properties. One way to alleviate this engineering for a new domain is to enable robots in human environments to adapt dynamically, continually learning new language constructions and perceptual concepts. In this work, we present an end-to-end pipeline for translating natural language commands to discrete robot actions, and use clarification dialogs to jointly improve language parsing and concept grounding. We train and evaluate this agent in a virtual setting on Amazon Mechanical Turk, and we transfer the learned agent to a physical robot platform to demonstrate it in the real world.
Published: 2019
Full Text: View/download PDF

26. Faithful Multimodal Explanation for Visual Question Answering

Author: Wu, Jialin and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: AI systems' ability to explain their reasoning is critical to their utility and trustworthiness. Deep neural networks have enabled significant progress on many challenging problems such as visual question answering (VQA). However, most of them are opaque black boxes with limited explanatory capability. This paper presents a novel approach to developing a high-performing VQA system that can elucidate its answers with integrated textual and visual explanations that faithfully reflect important aspects of its underlying reasoning while capturing the style of comprehensible human explanations. Extensive experimental evaluation demonstrates the advantages of this approach compared to competing methods with both automatic evaluation metrics and human evaluation metrics., Comment: In ACL 2019 BlackboxNLP workshop
Published: 2018

27. Learning a Policy for Opportunistic Active Learning

Author: Padmakumar, Aishwarya, Stone, Peter, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Active learning identifies data points to label that are expected to be the most useful in improving a supervised model. Opportunistic active learning incorporates active learning into interactive tasks that constrain possible queries during interactions. Prior work has shown that opportunistic active learning can be used to improve grounding of natural language descriptions in an interactive object retrieval task. In this work, we use reinforcement learning for such an object retrieval task, to learn a policy that effectively trades off task completion with model improvement that would benefit future tasks., Comment: EMNLP 2018 Camera Ready
Published: 2018

28. Executable Trigger-Action Comments

Author: Nie, Pengyu, Rai, Rishabh, Li, Junyi Jessy, Khurshid, Sarfraz, Mooney, Raymond J., and Gligoric, Milos
Subjects: Computer Science - Software Engineering
Abstract: Natural language elements, e.g., todo comments, are frequently used to communicate among the developers and to describe tasks that need to be performed (actions) when specific conditions hold in the code repository (triggers). As projects evolve, development processes change, and development teams reorganize, these comments, because of their informal nature, frequently become irrelevant or forgotten. We present the first technique, dubbed TrigIt, to specify trigger-action todo comments as executable statements. Thus, actions are executed automatically when triggers evaluate to true. TrigIt specifications are written in the host language (e.g., Java) and are evaluated as part of the build process. The triggers are specified as query statements over abstract syntax trees and abstract representation of build configuration scripts, and the actions are specified as code transformation steps. We implemented TrigIt for the Java programming language and migrated 20 existing trigger-action comments from 8 popular open-source projects. We evaluate the cost of using TrigIt in terms of the number of tokens in the executable comments and the time overhead introduced in the build process.
Published: 2018

29. Joint Image Captioning and Question Answering

Author: Wu, Jialin, Hu, Zeyuan, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Answering visual questions requires acquiring everyday common knowledge and modeling the semantic connections among different parts of images, which is too difficult for VQA systems to learn from images when answers are the only supervision. Meanwhile, image captioning systems that use beam search tend to generate similar captions and fail to describe images diversely. To address these issues, we present a system in which the two tasks complement each other and which is capable of jointly producing image captions and answering visual questions. In particular, we utilize question and image features to generate question-related captions and use the generated captions as additional features to provide new knowledge to the VQA system. For image captioning, our system attains more informative results in terms of the relative improvements on VQA tasks as well as competitive results using automated metrics. Applying our system to the VQA tasks, our results on the VQA v2 dataset achieve 65.8% using generated captions and 69.1% using annotated captions on the validation set and 68.4% on the test-standard set. Further, an ensemble of 10 models achieves 69.7% on the test-standard split.
Published: 2018

30. Leveraging Discourse Information Effectively for Authorship Attribution

Author: Wang, Su, Ferracane, Elisa, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language
Abstract: We explore techniques to maximize the effectiveness of discourse information in the task of authorship attribution. We present a novel method to embed discourse features in a Convolutional Neural Network text classifier, which achieves a state-of-the-art result by a substantial margin. We empirically investigate several featurization methods to understand the conditions under which discourse features contribute non-trivial performance gains, and analyze discourse embeddings., Comment: Accepted at IJCNLP 2017 as a conference paper
Published: 2017

31. Machine Learning

Author: Mooney, Raymond J. and Mitkov, Ruslan, book editor
Published: 2022
Full Text: View/download PDF

32. Stacking With Auxiliary Features

Author: Rajani, Nazneen Fatema and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Learning
Abstract: Ensembling methods are well known for improving prediction accuracy. However, they are limited in the sense that they cannot discriminate among component models effectively. In this paper, we propose stacking with auxiliary features that learns to fuse relevant information from multiple systems to improve performance. Auxiliary features enable the stacker to rely on systems that agree not just on an output but also on the provenance of the output. We demonstrate our approach on three very different and difficult problems -- the Cold Start Slot Filling, the Tri-lingual Entity Discovery and Linking and the ImageNet object detection tasks. We obtain new state-of-the-art results on the first two tasks and substantial improvements on the detection task, thus verifying the power and generality of our approach., Comment: arXiv admin note: substantial text overlap with arXiv:1604.04802
Published: 2016

33. Supervised and Unsupervised Ensembling for Knowledge Base Population

Author: Rajani, Nazneen Fatema and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language, Computer Science - Learning
Abstract: We present results on combining supervised and unsupervised methods to ensemble multiple systems for two popular Knowledge Base Population (KBP) tasks, Cold Start Slot Filling (CSSF) and Tri-lingual Entity Discovery and Linking (TEDL). We demonstrate that our combined system along with auxiliary features outperforms the best performing system for both tasks in the 2015 competition, several ensembling baselines, as well as the state-of-the-art stacking approach to ensembling KBP systems. The success of our technique on two different and challenging problems demonstrates the power and generality of our combined approach to ensembling.
Published: 2016

34. Using Sentence-Level LSTM Language Models for Script Inference

Author: Pichotta, Karl and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language
Abstract: There is a small but growing body of research on statistical scripts, models of event sequences that allow probabilistic inference of implicit events from documents. These systems operate on structured verb-argument events produced by an NLP pipeline. We compare these systems with recent Recurrent Neural Net models that directly operate on raw tokens to predict sentences, finding the latter to be roughly comparable to the former in terms of predicting missing events in documents., Comment: To appear in Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL-16)
Published: 2016

35. Representing Meaning with a Combination of Logical and Distributional Models

Author: Beltagy, I., Roller, Stephen, Cheng, Pengxiang, Erk, Katrin, and Mooney, Raymond J.
Subjects: Computer Science - Computation and Language
Abstract: NLP tasks differ in the semantic information they require, and at this time no single semantic representation fulfills all requirements. Logic-based representations characterize sentence structure, but do not capture the graded aspect of meaning. Distributional models give graded similarity ratings for words and phrases, but do not capture sentence structure in the same detail as logic-based approaches. So it has been argued that the two are complementary. We adopt a hybrid approach that combines logic-based and distributional semantics through probabilistic logic inference in Markov Logic Networks (MLNs). In this paper, we focus on the three components of a practical system integrating logical and distributional models: 1) Parsing and task representation is the logic-based part where input problems are represented in probabilistic logic. This is quite different from representing them in standard first-order logic. 2) For knowledge base construction we form weighted inference rules. We integrate and compare distributional information with other sources, notably WordNet and an existing paraphrase collection. In particular, we use our system to evaluate distributional lexical entailment approaches. We use a variant of Robinson resolution to determine the necessary inference rules. More sources can easily be added by mapping them to logical rules; our system learns a resource-specific weight that corrects for scaling differences between resources. 3) In discussing probabilistic inference, we show how to solve the inference problems efficiently. To evaluate our approach, we use the task of textual entailment (RTE), which can utilize the strengths of both logic-based and distributional representations. In particular we focus on the SICK dataset, where we achieve state-of-the-art results., Comment: Special issue of Computational Linguistics on Formal Distributional Semantics, 2016
Published: 2015

36. Optimal Use of Verbal Instructions for Multi-robot Human Navigation Guidance

Author: Yedidsion, Harel, Deans, Jacqueline, Sheehan, Connor, Chillara, Mahathi, Hart, Justin, Stone, Peter, Mooney, Raymond J., Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Salichs, Miguel A., editor, Ge, Shuzhi Sam, editor, Barakova, Emilia Ivanova, editor, Cabibihan, John-John, editor, Wagner, Alan R., editor, Castro-González, Álvaro, editor, and He, Hongsheng, editor
Published: 2019
Full Text: View/download PDF

37. Ensembling Visual Explanations

Author: Rajani, Nazneen Fatema, Mooney, Raymond J., Escalante, Hugo Jair, Series Editor, Guyon, Isabelle, Series Editor, Escalera, Sergio, Series Editor, Baró, Xavier, editor, Güçlütürk, Yağmur, editor, Güçlü, Umut, editor, and van Gerven, Marcel, editor
Published: 2018
Full Text: View/download PDF

38. Optimal Use of Verbal Instructions for Multi-robot Human Navigation Guidance

Author: Yedidsion, Harel, primary, Deans, Jacqueline, additional, Sheehan, Connor, additional, Chillara, Mahathi, additional, Hart, Justin, additional, Stone, Peter, additional, and Mooney, Raymond J., additional
Published: 2019
Full Text: View/download PDF

39. Active Multitask Learning using Supervised and Shared Latent Topics

Author: Acharya, Ayan, primary, Mooney, Raymond J., additional, and Ghosh, Joydeep, additional
Published: 2016
Full Text: View/download PDF

40. Learning to Describe Solutions for Bug Reports Based on Developer Discussions

Author: Panthaplackel, Sheena, Li, Junyi Jessy, Gligoric, Milos, and Mooney, Raymond J.
Subjects: Software Engineering (cs.SE), FOS: Computer and information sciences, Computer Science - Software Engineering, Computer Science - Computation and Language, Computation and Language (cs.CL)
Abstract: When a software bug is reported, developers engage in a discussion to collaboratively resolve it. While the solution is likely formulated within the discussion, it is often buried in a large amount of text, making it difficult to comprehend and delaying its implementation. To expedite bug resolution, we propose generating a concise natural language description of the solution by synthesizing relevant content within the discussion, which encompasses both natural language and source code. We build a corpus for this task using a novel technique for obtaining noisy supervision from repository changes linked to bug reports, with which we establish benchmarks. We also design two systems for generating a description during an ongoing discussion by classifying when sufficient context for performing the task emerges in real-time. With automated and human evaluation, we find this task to form an ideal testbed for complex reasoning in long, bimodal dialogue context., Comment: Accepted in Findings of ACL 2022
Published: 2022

41. Deep Just-In-Time Inconsistency Detection Between Comments and Source Code

Author: Panthaplackel, Sheena, primary, Li, Junyi Jessy, additional, Gligoric, Milos, additional, and Mooney, Raymond J., additional
Published: 2021
Full Text: View/download PDF

42. Dialog Policy Learning for Joint Clarification and Active Learning Queries

Author: Padmakumar, Aishwarya, primary and Mooney, Raymond J., additional
Published: 2021
Full Text: View/download PDF

43. Associating Natural Language Comment and Source Code Entities

Author: Panthaplackel, Sheena, primary, Gligoric, Milos, additional, Mooney, Raymond J., additional, and Li, Junyi Jessy, additional
Published: 2020
Full Text: View/download PDF

44. A framework for writing trigger-action todo comments in executable format

Author: Nie, Pengyu, primary, Rai, Rishabh, additional, Li, Junyi Jessy, additional, Khurshid, Sarfraz, additional, Mooney, Raymond J., additional, and Gligoric, Milos, additional
Published: 2019
Full Text: View/download PDF

45. Using Natural Language for Reward Shaping in Reinforcement Learning

Author: Goyal, Prasoon, primary, Niekum, Scott, additional, and Mooney, Raymond J., additional
Published: 2019
Full Text: View/download PDF

46. Improving Grounded Natural Language Understanding through Human-Robot Dialog

Author: Thomason, Jesse, primary, Padmakumar, Aishwarya, additional, Sinapov, Jivko, additional, Walker, Nick, additional, Jiang, Yuqian, additional, Yedidsion, Harel, additional, Hart, Justin, additional, Stone, Peter, additional, and Mooney, Raymond J., additional
Published: 2019
Full Text: View/download PDF

47. Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog

Author: Thomason, Jesse, Padmakumar, Aishwarya, Sinapov, Jivko, Walker, Nick, Jiang, Yuqian, Yedidsion, Harel, Hart, Justin, Stone, Peter, and Mooney, Raymond J.
Subjects: MOBILE robots, ROBOTICS, ARTIFICIAL intelligence, ROBOTS
Abstract: In this work, we present methods for using human-robot dialog to improve language understanding for a mobile robot agent. The agent parses natural language to underlying semantic meanings and uses robotic sensors to create multi-modal models of perceptual concepts like red and heavy. The agent can be used for showing navigation routes, delivering objects to people, and relocating objects from one location to another. We use dialog clarification questions both to understand commands and to generate additional parsing training data. The agent employs opportunistic active learning to select questions about how words relate to objects, improving its understanding of perceptual concepts. We evaluated this agent on Amazon Mechanical Turk. After training on data induced from conversations, the agent reduced the number of dialog questions it asked while receiving higher usability ratings. Additionally, we demonstrated the agent on a robotic platform, where it learned new perceptual concepts on the fly while completing a real-world task. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

48. Machine Learning

Author: Mooney, Raymond J., additional
Published: 2018
Full Text: View/download PDF

49. Multi-Modal Word Synset Induction

Author: Thomason, Jesse, primary and Mooney, Raymond J., additional
Published: 2017
Full Text: View/download PDF

50. Stacking With Auxiliary Features

Author: Rajani, Nazneen Fatema, primary and Mooney, Raymond J., additional
Published: 2017
Full Text: View/download PDF
