Author: "Rosé, Carolyn" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Rosé, Carolyn"' showing total 717 results

Start Over Author "Rosé, Carolyn"

717 results on '"Rosé, Carolyn"'

1. Improving Model Factuality with Fine-grained Critique-based Evaluator

Author: Xie, Yiqing, Zhou, Wenxuan, Prakash, Pradyot, Jin, Di, Mao, Yuning, Fettes, Quintin, Talebzadeh, Arya, Wang, Sinong, Fang, Han, Rose, Carolyn, Fried, Daniel, and Zhang, Hejia
Subjects: Computer Science - Computation and Language
Abstract: Factuality evaluation aims to detect factual errors produced by language models (LMs) and hence guide the development of more factual models. Towards this goal, we train a factuality evaluator, FenCE, that provides LM generators with claim-level factuality feedback. We conduct data augmentation on a combination of public judgment datasets to train FenCE to (1) generate textual critiques along with scores and (2) make claim-level judgment based on diverse source documents obtained by various tools. We then present a framework that leverages FenCE to improve the factuality of LM generators by constructing training data. Specifically, we generate a set of candidate responses, leverage FenCE to revise and score each response without introducing lesser-known facts, and train the generator by preferring highly scored revised responses. Experiments show that our data augmentation methods improve the evaluator's accuracy by 2.9% on LLM-AggreFact. With FenCE, we improve Llama3-8B-chat's factuality rate by 14.45% on FActScore, outperforming state-of-the-art factuality finetuning methods by 6.96%.
Published: 2024

2. CRScore: Grounding Automated Evaluation of Code Review Comments in Code Claims and Smells

Author: Naik, Atharva, Alenius, Marcus, Fried, Daniel, and Rose, Carolyn
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: The task of automated code review has recently gained a lot of attention from the machine learning community. However, current review comment evaluation metrics rely on comparisons with a human-written reference for a given code change (also called a diff), even though code review is a one-to-many problem like generation and summarization with many "valid reviews" for a diff. To tackle these issues we develop a CRScore - a reference-free metric to measure dimensions of review quality like conciseness, comprehensiveness, and relevance. We design CRScore to evaluate reviews in a way that is grounded in claims and potential issues detected in the code by LLMs and static analyzers. We demonstrate that CRScore can produce valid, fine-grained scores of review quality that have the greatest alignment with human judgment (0.54 Spearman correlation) and are more sensitive than reference-based metrics. We also release a corpus of 2.6k human-annotated review quality scores for machine-generated and GitHub review comments to support the development of automated metrics.
Published: 2024

3. Estimating Agreement by Chance for Sequence Annotation

Author: Li, Diya, Rosé, Carolyn, Yuan, Ao, and Zhou, Chunxiao
Subjects: Computer Science - Computation and Language
Abstract: In the field of natural language processing, correction of performance assessment for chance agreement plays a crucial role in evaluating the reliability of annotations. However, there is a notable dearth of research focusing on chance correction for assessing the reliability of sequence annotation tasks, despite their widespread prevalence in the field. To address this gap, this paper introduces a novel model for generating random annotations, which serves as the foundation for estimating chance agreement in sequence annotation tasks. Utilizing the proposed randomization model and a related comparison approach, we successfully derive the analytical form of the distribution, enabling the computation of the probable location of each annotated text segment and subsequent chance agreement estimation. Through a combination simulation and corpus-based evaluation, we successfully assess its applicability and validate its accuracy and efficacy., Comment: ACL 2024
Published: 2024

4. Leveraging Machine-Generated Rationales to Facilitate Social Meaning Detection in Conversations

Author: Dutt, Ritam, Wu, Zhen, Shi, Kelly, Sheth, Divyanshu, Gupta, Prakhar, and Rose, Carolyn Penstein
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We present a generalizable classification approach that leverages Large Language Models (LLMs) to facilitate the detection of implicitly encoded social meaning in conversations. We design a multi-faceted prompt to extract a textual explanation of the reasoning that connects visible cues to underlying social meanings. These extracted explanations or rationales serve as augmentations to the conversational text to facilitate dialogue understanding and transfer. Our empirical results over 2,340 experimental settings demonstrate the significant positive impact of adding these rationales. Our findings hold true for in-domain classification, zero-shot, and few-shot domain transfer for two different social meaning detection tasks, each spanning two different corpora., Comment: To appear at The Proceedings of the Association for Computational Linguistics, 2024
Published: 2024

5. Generating Situated Reflection Triggers about Alternative Solution Paths: A Case Study of Generative AI for Computer-Supported Collaborative Learning

Author: Naik, Atharva, Yin, Jessica Ruhan, Kamath, Anusha, Ma, Qianou, Wu, Sherry Tongshuang, Murray, Charles, Bogart, Christopher, Sakr, Majd, and Rose, Carolyn P.
Subjects: Computer Science - Artificial Intelligence
Abstract: An advantage of Large Language Models (LLMs) is their contextualization capability - providing different responses based on student inputs like solution strategy or prior discussion, to potentially better engage students than standard feedback. We present a design and evaluation of a proof-of-concept LLM application to offer students dynamic and contextualized feedback. Specifically, we augment an Online Programming Exercise bot for a college-level Cloud Computing course with ChatGPT, which offers students contextualized reflection triggers during a collaborative query optimization task in database design. We demonstrate that LLMs can be used to generate highly situated reflection triggers that incorporate details of the collaborative discussion happening in context. We discuss in depth the exploration of the design space of the triggers and their correspondence with the learning objectives as well as the impact on student learning in a pilot study with 34 students.
Published: 2024

6. CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks

Author: Xie, Yiqing, Xie, Alex, Sheth, Divyanshu, Liu, Pengfei, Fried, Daniel, and Rose, Carolyn
Subjects: Computer Science - Software Engineering, Computer Science - Computation and Language
Abstract: To adequately test modern code generation systems, evaluation benchmarks must execute and test the code generated by the system. However, these execution and testing requirements have largely limited benchmarks to settings where code is easily executable or has human-written tests. To facilitate evaluation of code generation systems across diverse scenarios, we present CodeBenchGen, a framework to create scalable execution-based benchmarks from naturally occurring code sources. Specifically, we leverage a large language model (LLM) to sandbox arbitrary pieces of code into evaluation examples, including test cases for execution-based evaluation. We illustrate the usefulness of our framework by creating a dataset, Exec-CSN, which includes 1,931 examples involving 293 libraries converted from code in 367 GitHub repositories taken from the Code- SearchNet dataset. To demonstrate the solvability of examples in Exec-CSN, we present a human study demonstrating that 81.3% of the examples can be solved by humans and 61% are rated as "requires effort to solve". We conduct code generation experiments on open-source and proprietary models and analyze the performance of both humans and models. We provide code and data at: https://github.com/yiqingxyq/CodeBenchGen.
Published: 2024

7. DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation

Author: Xie, Yiqing, Zhang, Sheng, Cheng, Hao, Liu, Pengfei, Gero, Zelalem, Wong, Cliff, Naumann, Tristan, Poon, Hoifung, and Rose, Carolyn
Subjects: Computer Science - Computation and Language
Abstract: Medical text generation aims to assist with administrative work and highlight salient information to support decision-making. To reflect the specific requirements of medical text, in this paper, we propose a set of metrics to evaluate the completeness, conciseness, and attribution of the generated text at a fine-grained level. The metrics can be computed by various types of evaluators including instruction-following (both proprietary and open-source) and supervised entailment models. We demonstrate the effectiveness of the resulting framework, DocLens, with three evaluators on three tasks: clinical note generation, radiology report summarization, and patient question summarization. A comprehensive human study shows that DocLens exhibits substantially higher agreement with the judgments of medical experts than existing metrics. The results also highlight the need to improve open-source evaluators and suggest potential directions., Comment: ACL Camera Ready Version
Published: 2023

8. Data Augmentation for Code Translation with Comparable Corpora and Multiple References

Author: Xie, Yiqing, Naik, Atharva, Fried, Daniel, and Rose, Carolyn
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Software Engineering
Abstract: One major challenge of translating code between programming languages is that parallel training data is often limited. To overcome this challenge, we present two data augmentation techniques, one that builds comparable corpora (i.e., code pairs with similar functionality), and another that augments existing parallel data with multiple reference translations. Specifically, we build and analyze multiple types of comparable corpora, including programs generated from natural language documentation using a code generation model. Furthermore, to reduce overfitting to a single reference translation, we automatically generate additional translation references for available parallel data and filter the translations by unit tests, which increases variation in target translations. Experiments show that our data augmentation techniques significantly improve CodeT5 for translation between Java, Python, and C++ by an average of 7.5% Computational Accuracy (CA@1), which verifies the correctness of translations by execution. The code is available at https://github.com/Veronicium/CMTrans., Comment: EMNLP 2023 Findings (with minor updates on the flowcharts)
Published: 2023

9. Linguistic representations for fewer-shot relation extraction across domains

Author: Gururaja, Sireesh, Dutt, Ritam, Liao, Tinglong, and Rose, Carolyn
Subjects: Computer Science - Computation and Language
Abstract: Recent work has demonstrated the positive impact of incorporating linguistic representations as additional context and scaffolding on the in-domain performance of several NLP tasks. We extend this work by exploring the impact of linguistic representations on cross-domain performance in a few-shot transfer setting. An important question is whether linguistic representations enhance generalizability by providing features that function as cross-domain pivots. We focus on the task of relation extraction on three datasets of procedural text in two domains, cooking and materials science. Our approach augments a popular transformer-based architecture by alternately incorporating syntactic and semantic graphs constructed by freely available off-the-shelf tools. We examine their utility for enhancing generalization, and investigate whether earlier findings, e.g. that semantic representations can be more helpful than syntactic ones, extend to relation extraction in multiple domains. We find that while the inclusion of these graphs results in significantly higher performance in few-shot transfer, both types of graph exhibit roughly equivalent utility., Comment: ACL 2023
Published: 2023

10. Exploring Teachers’ Views and Confidence in the Integration of an Artificial Intelligence Curriculum into Their Classrooms: a Case Study of Curricular Co-Design Program

Author: Tatar, Cansu, Jiang, Shiyan, Rosé, Carolyn P., and Chao, Jie
Published: 2024
Full Text: View/download PDF

11. Making Sense of Machine Learning: Integrating Youth's Conceptual, Creative, and Critical Understandings of AI

Author: Morales-Navarro, Luis, Kafai, Yasmin B., Castro, Francisco, Payne, William, DesPortes, Kayla, DiPaola, Daniella, Williams, Randi, Ali, Safinah, Breazeal, Cynthia, Lee, Clifford, Soep, Elisabeth, Long, Duri, Magerko, Brian, Solyst, Jaemarie, Ogan, Amy, Tatar, Cansu, Jiang, Shiyan, Chao, Jie, Rosé, Carolyn P., and Vakil, Sepehr
Subjects: Computer Science - Computers and Society, K.3.2, H.5.3
Abstract: Understanding how youth make sense of machine learning and how learning about machine learning can be supported in and out of school is more relevant than ever before as young people interact with machine learning powered applications everyday; while connecting with friends, listening to music, playing games, or attending school. In this symposium, we present different perspectives on understanding how learners make sense of machine learning in their everyday lives, how sensemaking of machine learning can be supported in and out of school through the construction of applications, and how youth critically evaluate machine learning powered systems. We discuss how sensemaking of machine learning applications involves the development and integration of conceptual, creative, and critical understandings that are increasingly important to prepare youth to participate in the world.
Published: 2023

12. Distilling Multi-Scale Knowledge for Event Temporal Relation Extraction

Author: Yao, Hao-Ren, Breitfeller, Luke, Naik, Aakanksha, Zhou, Chunxiao, and Rose, Carolyn
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Event Temporal Relation Extraction (ETRE) is paramount but challenging. Within a discourse, event pairs are situated at different distances or the so-called proximity bands. The temporal ordering communicated about event pairs where at more remote (i.e., ``long'') or less remote (i.e., ``short'') proximity bands are encoded differently. SOTA models have tended to perform well on events situated at either short or long proximity bands, but not both. Nonetheless, real-world, natural texts contain all types of temporal event-pairs. In this paper, we present MulCo: Distilling Multi-Scale Knowledge via Contrastive Learning, a knowledge co-distillation approach that shares knowledge across multiple event pair proximity bands to improve performance on all types of temporal datasets. Our experimental results show that MulCo successfully integrates linguistic cues pertaining to temporal reasoning across both short and long proximity bands and achieves new state-of-the-art results on several ETRE benchmark datasets., Comment: Accepted to CIKM 2024 Full Research Track, camera ready version
Published: 2022
Full Text: View/download PDF

13. Reflecting on what counts as collaboration: Reaching forward without losing what is behind

Author: Järvelä, Sanna and Rosé, Carolyn P.
Published: 2023
Full Text: View/download PDF

14. Examining Computational Thinking Processes in Modeling Unstructured Data

Author: Jiang, Shiyan, Qian, Yingxiao, Tang, Hengtao, Yalcinkaya, Rabia, Rosé, Carolyn P., Chao, Jie, and Finzer, William
Abstract: As artificial intelligence (AI) technologies are increasingly pervasive in our daily lives, the need for students to understand the working mechanisms of AI technologies has become more urgent. Data modeling is an activity that has been proposed to engage students in reasoning about the working mechanism of AI technologies. While Computational thinking (CT) has been conceptualized as critical processes that students engage in during data modeling, much remains unexplored regarding how students created features from unstructured data to develop machine learning models. In this study, we examined high school students' patterns of iterative model development and themes of CT processes in iterative model development. Twenty-eight students from a journalism class engaged in refining machine learning models iteratively for classifying negative and positive reviews of ice cream stores. This study draws on a theoretical framework of CT processes to examine students' model development processes. The results showed that students (1) demonstrated three patterns of iterative model development, including incremental, filter-based, and radical feature creation; (2) engaged in complex reasoning about language use in diverse contexts in trial and error; and (3) leveraged multiple data representations when applying mathematical and computational techniques. The results provide implications for designing accessible AI learning experiences for students to understand the role and responsibility of modelers in creating AI technologies and studying AI learning experiences from the angle of CT processes.
Published: 2023
Full Text: View/download PDF

15. Enhancing student learning and achievement through orchestration of group processes and group composition

Author: Rosé, Carolyn P. and Järvelä, Sanna
Published: 2023
Full Text: View/download PDF

16. High School Students' Data Modeling Practices and Processes: From Modeling Unstructured Data to Evaluating Automated Decisions

Author: Jiang, Shiyan, Tang, Hengtao, Tatar, Cansu, Rosé, Carolyn P., and Chao, Jie
Abstract: It's critical to foster artificial intelligence (AI) literacy for high school students, the first generation to grow up surrounded by AI, to understand working mechanism of data-driven AI technologies and critically evaluate automated decisions from predictive models. While efforts have been made to engage youth in understanding AI through developing machine learning models, few provided in-depth insights into the nuanced learning processes. In this study, we examined high school students' data modeling practices and processes. Twenty-eight students developed machine learning models with text data for classifying negative and positive reviews of ice cream stores. We identified nine data modeling practices that describe students' processes of model exploration, development, and testing and two themes about evaluating automated decisions from data technologies. The results provide implications for designing accessible data modeling experiences for students to understand data justice as well as the role and responsibility of data modelers in creating AI technologies.
Published: 2023
Full Text: View/download PDF

17. Examining Socially Shared Regulation and Shared Physiological Arousal Events with Multimodal Learning Analytics

Author: Nguyen, Andy, Järvelä, Sanna, Rosé, Carolyn, Järvenoja, Hanna, and Malmberg, Jonna
Abstract: Socially shared regulation contributes to the success of collaborative learning. However, the assessment of socially shared regulation of learning (SSRL) faces several challenges in the effort to increase the understanding of collaborative learning and support outcomes due to the unobservability of the related cognitive and emotional processes. The recent development of trace-based assessment has enabled innovative opportunities to overcome the problem. Despite the potential of a trace-based approach to study SSRL, there remains a paucity of evidence on how trace-based evidence could be captured and utilised to assess and promote SSRL. This study aims to investigate the assessment of electrodermal activities (EDA) data to understand and support SSRL in collaborative learning, hence enhancing learning outcomes. The data collection involves secondary school students (N = 94) working collaboratively in groups through five science lessons. A multimodal data set of EDA and video data were examined to assess the relationship among shared arousals and interactions for SSRL. The results of this study inform the patterns among students' physiological activities and their SSRL interactions to provide trace-based evidence for an adaptive and maladaptive pattern of collaborative learning. Furthermore, our findings provide evidence about how trace-based data could be utilised to predict learning outcomes in collaborative learning.
Published: 2023
Full Text: View/download PDF

18. Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks

Author: Naik, Aakanksha, Lehman, Jill, and Rose, Carolyn
Subjects: Computer Science - Computation and Language
Abstract: Natural language understanding (NLU) has made massive progress driven by large benchmarks, but benchmarks often leave a long tail of infrequent phenomena underrepresented. We reflect on the question: have transfer learning methods sufficiently addressed the poor performance of benchmark-trained models on the long tail? We conceptualize the long tail using macro-level dimensions (e.g., underrepresented genres, topics, etc.), and perform a qualitative meta-analysis of 100 representative papers on transfer learning research for NLU. Our analysis asks three questions: (i) Which long tail dimensions do transfer learning studies target? (ii) Which properties of adaptation methods help improve performance on the long tail? (iii) Which methodological gaps have greatest negative impact on long tail performance? Our answers highlight major avenues for future research in transfer learning for the long tail. Lastly, using our meta-analysis framework, we perform a case study comparing the performance of various adaptation methods on clinical narratives, which provides interesting insights that may enable us to make progress along these future avenues., Comment: To appear in TACL 2022. This is a pre-MIT Press publication version
Published: 2021

19. Localize, Group, and Select: Boosting Text-VQA by Scene Text Modeling

Author: Lu, Xiaopeng, Fan, Zhen, Wang, Yansen, Oh, Jean, and Rose, Carolyn P.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: As an important task in multimodal context understanding, Text-VQA (Visual Question Answering) aims at question answering through reading text information in images. It differentiates from the original VQA task as Text-VQA requires large amounts of scene-text relationship understanding, in addition to the cross-modal grounding capability. In this paper, we propose Localize, Group, and Select (LOGOS), a novel model which attempts to tackle this problem from multiple aspects. LOGOS leverages two grounding tasks to better localize the key information of the image, utilizes scene text clustering to group individual OCR tokens, and learns to select the best answer from different sources of OCR (Optical Character Recognition) texts. Experiments show that LOGOS outperforms previous state-of-the-art methods on two Text-VQA benchmarks without using additional OCR annotation data. Ablation studies and analysis demonstrate the capability of LOGOS to bridge different modalities and better understand scene text., Comment: 9 pages
Published: 2021

20. Comparing Example-Based Collaborative Reflection to Problem Solving Practice for Learning during Team-Based Software Engineering Projects

Author: Sankaranarayanan, Sreecharan, Kandimalla, Siddharth Reddy, Bogart, Christopher, Murray, R. Charles, An, Haokang, Hilton, Michael, Sakr, Majd, and Rosé, Carolyn
Subjects: Computer Science - Software Engineering
Abstract: Contributing to the literature on aptitude-treatment interactions between worked examples and problem-solving, this paper addresses differential learning from the two approaches when students are positioned as domain experts learning new concepts. Our evaluation is situated in a team project that is part of an advanced software engineering course. In this course, students who possess foundational domain knowledge but are learning new concepts engage alternatively in programming followed by worked example-based reflection. They are either allowed to finish programming or are curtailed after a pre-specified time to participate in a longer worked example-based reflection. We find significant pre- to post-test learning gains in both conditions. Then, we not only find significantly more learning when students participated in longer worked example-based reflections but also a significant performance improvement on a problem-solving transfer task. These findings suggest that domain experts learning new concepts benefit more from worked example-based reflections than from problem-solving., Comment: 4 pages, 1 image, 1 table, 14th Computer Supported Collaborative Learning (CSCL) Proceedings at the Annual Meeting of the International Society of the Learning Sciences (ISLS)
Published: 2021

21. Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network

Author: Lovelace, Justin, Newman-Griffis, Denis, Vashishth, Shikhar, Lehman, Jill Fain, and Rosé, Carolyn Penstein
Subjects: Computer Science - Machine Learning
Abstract: Knowledge Graph (KG) completion research usually focuses on densely connected benchmark datasets that are not representative of real KGs. We curate two KG datasets that include biomedical and encyclopedic knowledge and use an existing commonsense KG dataset to explore KG completion in the more realistic setting where dense connectivity is not guaranteed. We develop a deep convolutional network that utilizes textual entity representations and demonstrate that our model outperforms recent KG completion methods in this challenging setting. We find that our model's performance improvements stem primarily from its robustness to sparsity. We then distill the knowledge from the convolutional network into a student network that re-ranks promising candidate entities. This re-ranking stage leads to further improvements in performance and demonstrates the effectiveness of entity re-ranking for KG completion., Comment: The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)
Published: 2021

22. Data-Related Ethics Issues in Technologies for Informal Professional Learning

Author: Pammer-Schindler, Viktoria and Rosé, Carolyn
Abstract: Professional and lifelong learning are a necessity for workers. This is true both for re-skilling from disappearing jobs, as well as for staying current within a professional domain. AI-enabled scaffolding and just-in-time and situated learning in the workplace offer a new frontier for future impact of AIED. The hallmark of this community's work has been i) data-driven design of learning technology and ii) machine-learning enabled personalized interventions. In both cases, data are the foundation of AIED research and data-related ethics are thus central to AIED research. In this paper we formulate a vision how AIED research could address data-related ethics issues in informal and situated professional learning. The foundation of our vision is a secondary analysis of five research cases that offer insights related to data-driven adaptive technologies for informal professional learning. We describe the encountered data-related ethics issues. In our interpretation, we have developed three themes: Firstly, in informal and situated professional learning, relevant data about professional learning -- to be used as a basis for learning analytics and reflection or as a basis for adaptive systems - is not only about learners. Instead, due to the situatedness of learning, relevant data is also about others (colleagues, customers, clients) and other objects from the learner's context. Such data may be private, proprietary, or both. Secondly, manual tracking comes with high learner control over data. Thirdly, learning is not necessarily a shared goal in informal professional learning settings. From an ethics perspective, this is particularly problematic as much data that would be relevant for use within learning technologies hasn't been collected for the purposes of learning. These three themes translate into challenges for AIED research that need to be addressed in order to successfully investigate and develop AIED technology for informal and situated professional learning. As an outlook of this paper, we connect these challenges to ongoing research directions within AIED -- natural language processing, socio-technical design, and scenario-based data collection - that might be leveraged and aimed towards addressing data-related ethics challenges.
Published: 2022
Full Text: View/download PDF

23. An Empirical Analysis of High School Students' Practices of Modelling with Unstructured Data

Author: Jiang, Shiyan, Nocera, Amato, Tatar, Cansu, Yoder, Michael Miller, Chao, Jie, Wiedemann, Kenia, Finzer, William, and Rosé, Carolyn P.
Abstract: To date, many AI initiatives (eg, AI4K12, CS for All) developed standards and frameworks as guidance for educators to create accessible and engaging Artificial Intelligence (AI) learning experiences for K-12 students. These efforts revealed a significant need to prepare youth to gain a fundamental understanding of how intelligence is created, applied, and its potential to perpetuate bias and unfairness. This study contributes to the growing interest in K-12 AI education by examining student learning of modelling real-world text data. Four students from an Advanced Placement computer science classroom at a public high school participated in this study. Our qualitative analysis reveals that the students developed nuanced and in-depth understandings of how text classification models--a type of AI application--are trained. Specifically, we found that in modelling texts, students: (1) drew on their social experiences and cultural knowledge to create predictive features, (2) engineered predictive features to address model errors, (3) described model learning patterns from training data and (4) reasoned about noisy features when comparing models. This study contributes to an initial understanding of student learning of modelling unstructured data and offers implications for scaffolding in-depth reasoning about model decision making.
Published: 2022
Full Text: View/download PDF

24. STAGE: Tool for Automated Extraction of Semantic Time Cues to Enrich Neural Temporal Ordering Models

Author: Breitfeller, Luke, Naik, Aakanksha, and Rose, Carolyn
Subjects: Computer Science - Computation and Language
Abstract: Despite achieving state-of-the-art accuracy on temporal ordering of events, neural models showcase significant gaps in performance. Our work seeks to fill one of these gaps by leveraging an under-explored dimension of textual semantics: rich semantic information provided by explicit textual time cues. We develop STAGE, a system that consists of a novel temporal framework and a parser that can automatically extract time cues and convert them into representations suitable for integration with neural models. We demonstrate the utility of extracted cues by integrating them with an event ordering model using a joint BiLSTM and ILP constraint architecture. We outline the functionality of the 3-part STAGE processing approach, and show two methods of integrating its representations with the BiLSTM-ILP model: (i) incorporating semantic cues as additional features, and (ii) generating new constraints from semantic cues to be enforced in the ILP. We demonstrate promising results on two event ordering datasets, and highlight important issues in semantic cue representation and integration for future research.
Published: 2021

25. Evaluating the Impact of a Hierarchical Discourse Representation on Entity Coreference Resolution Performance

Author: Khosla, Sopan, Fiacco, James, and Rose, Carolyn
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Recent work on entity coreference resolution (CR) follows current trends in Deep Learning applied to embeddings and relatively simple task-related features. SOTA models do not make use of hierarchical representations of discourse structure. In this work, we leverage automatically constructed discourse parse trees within a neural approach and demonstrate a significant improvement on two benchmark entity coreference-resolution datasets. We explore how the impact varies depending upon the type of mention., Comment: Also contains the Appendix. Accepted to NAACL 2021 as a short paper
Published: 2021

26. Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research

Author: Newman-Griffis, Denis, Lehman, Jill Fain, Rosé, Carolyn, and Hochheiser, Harry
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Natural language processing (NLP) research combines the study of universal principles, through basic science, with applied science targeting specific use cases and settings. However, the process of exchange between basic NLP and applications is often assumed to emerge naturally, resulting in many innovations going unapplied and many important questions left unstudied. We describe a new paradigm of Translational NLP, which aims to structure and facilitate the processes by which basic and applied NLP research inform one another. Translational NLP thus presents a third research paradigm, focused on understanding the challenges posed by application needs and how these challenges can drive innovation in basic science and technology design. We show that many significant advances in NLP research have emerged from the intersection of basic principles with application needs, and present a conceptual framework outlining the stakeholders and key questions in translational research. Our framework provides a roadmap for developing Translational NLP as a dedicated research area, and identifies general translational principles to facilitate exchange between basic and applied research., Comment: Accepted to NAACL-HLT 2021
Published: 2021

27. RESPER: Computationally Modelling Resisting Strategies in Persuasive Conversations

Author: Dutt, Ritam, Sinha, Sayan, Joshi, Rishabh, Chakraborty, Surya Shekhar, Riggs, Meredith, Yan, Xinru, Bao, Haogang, and Rosé, Carolyn Penstein
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Modelling persuasion strategies as predictors of task outcome has several real-world applications and has received considerable attention from the computational linguistics community. However, previous research has failed to account for the resisting strategies employed by an individual to foil such persuasion attempts. Grounded in prior literature in cognitive and social psychology, we propose a generalised framework for identifying resisting strategies in persuasive conversations. We instantiate our framework on two distinct datasets comprising persuasion and negotiation conversations. We also leverage a hierarchical sequence-labelling neural architecture to infer the aforementioned resisting strategies automatically. Our experiments reveal the asymmetry of power roles in non-collaborative goal-directed conversations and the benefits accrued from incorporating resisting strategies on the final conversation outcome. We also investigate the role of different resisting strategies on the conversation outcome and glean insights that corroborate with past findings. We also make the code and the dataset of this work publicly available at https://github.com/americast/resper., Comment: Accepted as a long paper at the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)
Published: 2021

28. Time-Series Insights into the Process of Passing or Failing Online University Courses Using Neural-Induced Interpretable Student States

Author: Jeon, Byungsoo, Shafran, Eyal, Breitfeller, Luke, Levin, Jason, and Rosé, Carolyn P.
Abstract: This paper addresses a key challenge in Educational Data Mining, namely to model student behavioral trajectories in order to provide a means for identifying students most at risk, with the goal of providing supportive interventions. While many forms of data including clickstream data or data from sensors have been used extensively in time series models for such purposes, in this paper we explore the use of textual data, which is sometimes available in the records of students at large, online universities. We propose a time series model that constructs an evolving student state representation using both clickstream data and a signal extracted from the textual notes recorded by human mentors assigned to each student. We explore how the addition of this textual data improves both the predictive power of student states for the purpose of identifying students at risk for course failure as well as for providing interpretable insights about student course engagement processes. [For the full proceedings, see ED599096.]
Published: 2019

29. Editorial: Nine elements for robust collaborative learning analytics: A constructive collaborative critique

Author: Wise, Alyssa Friend, Rosé, Carolyn, and Järvelä, Sanna
Published: 2023
Full Text: View/download PDF

30. Using Type Information to Improve Entity Coreference Resolution

Author: Khosla, Sopan and Rose, Carolyn
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Coreference resolution (CR) is an essential part of discourse analysis. Most recently, neural approaches have been proposed to improve over SOTA models from earlier paradigms. So far none of the published neural models leverage external semantic knowledge such as type information. This paper offers the first such model and evaluation, demonstrating modest gains in accuracy by introducing either gold standard or predicted types. In the proposed approach, type information serves both to (1) improve mention representation and (2) create a soft type consistency check between coreference candidate mentions. Our evaluation covers two different grain sizes of types over four different benchmark corpora., Comment: Accepted as Long Paper at CODI workshop EMNLP 2020
Published: 2020

31. MedFilter: Improving Extraction of Task-relevant Utterances from Doctor-Patient Conversations through Integration of Discourse Structure and Ontological Knowledge

Author: Khosla, Sopan, Vashishth, Shikhar, Lehman, Jill Fain, and Rose, Carolyn
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Information extraction from conversational data is particularly challenging because the task-centric nature of conversation allows for effective communication of implicit information by humans, but is challenging for machines. The challenges may differ between utterances depending on the role of the speaker within the conversation, especially when relevant expertise is distributed asymmetrically across roles. Further, the challenges may also increase over the conversation as more shared context is built up through information communicated implicitly earlier in the dialogue. In this paper, we propose the novel modeling approach MedFilter, which addresses these insights in order to increase performance at identifying and categorizing task-relevant utterances, and in so doing, positively impacts performance at a downstream information extraction task. We evaluate this approach on a corpus of nearly 7,000 doctor-patient conversations where MedFilter is used to identify medically relevant contributions to the discussion (achieving a 10% improvement over SOTA baselines in terms of area under the PR curve). Identifying task-relevant utterances benefits downstream medical processing, achieving improvements of 15%, 105%, and 23% respectively for the extraction of symptoms, medications, and complaints., Comment: Accepted as Long Paper to EMNLP 2020
Published: 2020

32. Keeping Up Appearances: Computational Modeling of Face Acts in Persuasion Oriented Discussions

Author: Dutt, Ritam, Joshi, Rishabh, and Rose, Carolyn Penstein
Subjects: Computer Science - Computation and Language
Abstract: The notion of face refers to the public self-image of an individual that emerges both from the individual's own actions as well as from the interaction with others. Modeling face and understanding its state changes throughout a conversation is critical to the study of maintenance of basic human needs in and through interaction. Grounded in the politeness theory of Brown and Levinson (1978), we propose a generalized framework for modeling face acts in persuasion conversations, resulting in a reliable coding manual, an annotated corpus, and computational models. The framework reveals insights about differences in face act utilization between asymmetric roles in persuasion conversations. Using computational models, we are able to successfully identify face acts as well as predict a key conversational outcome (e.g. donation success). Finally, we model a latent representation of the conversational state to analyze the impact of predicted face acts on the probability of a positive conversational outcome and observe several correlations that corroborate previous findings., Comment: To appear at Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP, 2020) as a full paper
Published: 2020

33. Adapting Event Extractors to Medical Data: Bridging the Covariate Shift

Author: Naik, Aakanksha, Lehman, Jill, and Rose, Carolyn
Subjects: Computer Science - Computation and Language
Abstract: We tackle the task of adapting event extractors to new domains without labeled data, by aligning the marginal distributions of source and target domains. As a testbed, we create two new event extraction datasets using English texts from two medical domains: (i) clinical notes, and (ii) doctor-patient conversations. We test the efficacy of three marginal alignment techniques: (i) adversarial domain adaptation (ADA), (ii) domain adaptive fine-tuning (DAFT), and (iii) a novel instance weighting technique based on language model likelihood scores (LIW). LIW and DAFT improve over a no-transfer BERT baseline on both domains, but ADA only improves on clinical notes. Deeper analysis of performance under different types of shifts (e.g., lexical shift, semantic shift) reveals interesting variations among models. Our best-performing models reach F1 scores of 70.0 and 72.9 on notes and conversations respectively, using no labeled data from target domains.
Published: 2020

34. Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation

Author: Naik, Aakanksha and Rosé, Carolyn
Subjects: Computer Science - Computation and Language
Abstract: We tackle the task of building supervised event trigger identification models which can generalize better across domains. Our work leverages the adversarial domain adaptation (ADA) framework to introduce domain-invariance. ADA uses adversarial training to construct representations that are predictive for trigger identification, but not predictive of the example's domain. It requires no labeled data from the target domain, making it completely unsupervised. Experiments with two domains (English literature and news) show that ADA leads to an average F1 score improvement of 3.9 on out-of-domain data. Our best performing model (BERT-A) reaches 44-49 F1 across both domains, using no labeled target data. Preliminary experiments reveal that finetuning on 1% labeled data, followed by self-training leads to substantial improvement, reaching 51.5 and 67.2 F1 on literature and news respectively., Comment: To appear at ACL 2020
Published: 2020

35. Improving Broad-Coverage Medical Entity Linking with Semantic Type Prediction and Large-Scale Datasets

Author: Vashishth, Shikhar, Newman-Griffis, Denis, Joshi, Rishabh, Dutt, Ritam, and Rose, Carolyn
Subjects: Computer Science - Computation and Language
Abstract: Medical entity linking is the task of identifying and standardizing medical concepts referred to in an unstructured text. Most of the existing methods adopt a three-step approach of (1) detecting mentions, (2) generating a list of candidate concepts, and finally (3) picking the best concept among them. In this paper, we probe into alleviating the problem of overgeneration of candidate concepts in the candidate generation module, the most under-studied component of medical entity linking. For this, we present MedType, a fully modular system that prunes out irrelevant candidate concepts based on the predicted semantic type of an entity mention. We incorporate MedType into five off-the-shelf toolkits for medical entity linking and demonstrate that it consistently improves entity linking performance across several benchmark datasets. To address the dearth of annotated training data for medical entity linking, we present WikiMed and PubMedDS, two large-scale medical entity linking datasets, and demonstrate that pre-training MedType on these datasets further improves entity linking performance. We make our source code and datasets publicly available for medical entity linking research., Comment: 44 pages
Published: 2020
Full Text: View/download PDF

36. A Machine Learning Framework for Authorship Identification From Texts

Author: Iyer, Rahul Radhakrishnan and Rose, Carolyn Penstein
Subjects: Computer Science - Computation and Language, Computer Science - Information Retrieval, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Authorship identification is a process in which the author of a text is identified. Most known literary texts can easily be attributed to a certain author because they are, for example, signed. Yet sometimes we find unfinished pieces of work or a whole bunch of manuscripts with a wide variety of possible authors. In order to assess the importance of such a manuscript, it is vital to know who wrote it. In this work, we aim to develop a machine learning framework to effectively determine authorship. We formulate the task as a single-label multi-class text categorization problem and propose a supervised machine learning framework incorporating stylometric features. This task is highly interdisciplinary in that it takes advantage of machine learning, information retrieval, and natural language processing. We present an approach and a model which learns the differences in writing style between $50$ different authors and is able to predict the author of a new text with high accuracy. The accuracy is seen to increase significantly after introducing certain linguistic stylometric features along with text features., Comment: 8 pages, 2 figures
Published: 2019

37. Social analytics to support engagement with learning communities

Author: Rosé, Carolyn, primary, Riggs, Meredith, additional, and Barbaro, Nicole, additional
Published: 2023
Full Text: View/download PDF

38. EQUATE: A Benchmark Evaluation Framework for Quantitative Reasoning in Natural Language Inference

Author: Ravichander, Abhilasha, Naik, Aakanksha, Rose, Carolyn, and Hovy, Eduard
Subjects: Computer Science - Computation and Language
Abstract: Quantitative reasoning is a higher-order reasoning skill that any intelligent natural language understanding system can reasonably be expected to handle. We present EQUATE (Evaluating Quantitative Understanding Aptitude in Textual Entailment), a new framework for quantitative reasoning in textual entailment. We benchmark the performance of 9 published NLI models on EQUATE, and find that on average, state-of-the-art methods do not achieve an absolute improvement over a majority-class baseline, suggesting that they do not implicitly learn to reason with quantities. We establish a new baseline Q-REAS that manipulates quantities symbolically. In comparison to the best performing NLI model, it achieves success on numerical reasoning tests (+24.2%), but has limited verbal reasoning capabilities (-8.1%). We hope our evaluation framework will support the development of models of quantitative reasoning in language understanding., Comment: To appear at CoNLL 2019
Published: 2019

39. Supporting perspective taking across chasms of thinking: Do real-time analytics hold the key?

Author: Rosé, Carolyn and Järvelä, Sanna
Published: 2022
Full Text: View/download PDF

40. Learning analytics

Author: Fiacco, James, primary, Jiang, Shiyan, additional, Adamson, David, additional, and Rosé, Carolyn P., additional
Published: 2023
Full Text: View/download PDF

41. Quantitative Approaches to Language in CSCL

Author: Borge, Marcela, Rosé, Carolyn, Hoadley, Christopher, Series Editor, van Aalst, Jan, Associate Editor, Jahnke, Isa, Associate Editor, Cress, Ulrike, editor, Rosé, Carolyn, editor, Wise, Alyssa Friend, editor, and Oshima, Jun, editor
Published: 2021
Full Text: View/download PDF

42. Tools and Resources for Setting Up Collaborative Spaces

Author: Rosé, Carolyn, Dimitriadis, Yannis, Hoadley, Christopher, Series Editor, van Aalst, Jan, Associate Editor, Jahnke, Isa, Associate Editor, Cress, Ulrike, editor, Rosé, Carolyn, editor, Wise, Alyssa Friend, editor, and Oshima, Jun, editor
Published: 2021
Full Text: View/download PDF

43. Collaborative Learning at Scale

Author: Chen, Bodong, Håklev, Stian, Rosé, Carolyn Penstein, Hoadley, Christopher, Series Editor, van Aalst, Jan, Associate Editor, Jahnke, Isa, Associate Editor, Cress, Ulrike, editor, Rosé, Carolyn, editor, Wise, Alyssa Friend, editor, and Oshima, Jun, editor
Published: 2021
Full Text: View/download PDF

44. Foundations, Processes, Technologies, and Methods: An Overview of CSCL Through Its Handbook

Author: Cress, Ulrike, Oshima, Jun, Rosé, Carolyn, Wise, Alyssa Friend, Hoadley, Christopher, Series Editor, van Aalst, Jan, Associate Editor, Jahnke, Isa, Associate Editor, Cress, Ulrike, editor, Rosé, Carolyn, editor, Wise, Alyssa Friend, editor, and Oshima, Jun, editor
Published: 2021
Full Text: View/download PDF

45. Navigating Cognitive Engagement in AI-Enhanced Education: Lexical Diversity and Open-Ended Inquiry in Journalism Learning

Author: McClure, Jeanne, primary, Bickel, Franziska, additional, Jiang, Shiyan, additional, Chao, Jie, additional, and Rosé, Carolyn P., additional
Published: 2024
Full Text: View/download PDF

46. Modeling with Primary Sources: An Approach to Teach Data Bias for Artificial Intelligence and Machine Learning Education

Author: McClure, Jeanne, primary, Zheng, Juan, additional, Bickel, Franziska, additional, Jiang, Shiyan, additional, Rosé, Carolyn P., additional, and Chao, Jie, additional
Published: 2024
Full Text: View/download PDF

47. What Does it Mean to be Literate in the Time of AI? Different Perspectives on Learning and Teaching AI Literacies in K-12 Education

Author: Kafai, Yasmin B., primary, Proctor, Chris, additional, Cai, Shuang, additional, Castro, Francisco, additional, Delaney, Victoria, additional, DesPortes, Kayla, additional, Hoadley, Christopher, additional, Lee, Victor R., additional, Long, Duri, additional, Magerko, Brian, additional, Roberts, Jessica, additional, Shapiro, Benjamin R., additional, Tseng, Tiffany, additional, Zhong, Vera, additional, and Rosé, Carolyn P., additional
Published: 2024
Full Text: View/download PDF

48. Leveraging Student Choice and Interest to Design an Engaging Lesson about Artificial Intelligence

Author: Ellis, Rebecca, primary, Chao, Jie, additional, Jiang, Shiyan, additional, Rosé, Carolyn P., additional, and Wiedemann, Kenia, additional
Published: 2024
Full Text: View/download PDF

49. Using Biterm Topic Modeling to Explore Gender Differences in Secondary Students’ Wonderings about AI Concepts

Author: Mushi, Doreen, primary, Chao, Jie, additional, Jiang, Shiyan, additional, Ellis, Rebecca, additional, Rosé, Carolyn P., additional, and Wiedemann, Kenia, additional
Published: 2024
Full Text: View/download PDF

50. Relations Matter – CSCL Research Informing and Developing CL Competencies

Author: Järvelä, Sanna and Rosé, Carolyn P.
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

717 results on '"Rosé, Carolyn"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources