Author: "Circi, Ruhan" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Circi, Ruhan"' showing total 19 results

Start Over Author "Circi, Ruhan"

19 results on '"Circi, Ruhan"'

1. The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches

Author: Abeysinghe, Bhashithe and Circi, Ruhan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Chatbots have been an interesting application of natural language generation since its inception. With novel transformer based Generative AI methods, building chatbots have become trivial. Chatbots which are targeted at specific domains for example medicine and psychology are implemented rapidly. This however, should not distract from the need to evaluate the chatbot responses. Especially because the natural language generation community does not entirely agree upon how to effectively evaluate such applications. With this work we discuss the issue further with the increasingly popular LLM based evaluations and how they correlate with human evaluations. Additionally, we introduce a comprehensive factored evaluation mechanism that can be utilized in conjunction with both human and LLM-based evaluations. We present the results of an experimental evaluation conducted using this scheme in one of our chatbot implementations which consumed educational reports, and subsequently compare automated, traditional human evaluation, factored human evaluation, and factored LLM evaluation. Results show that factor based evaluation produces better insights on which aspects need to be improved in LLM applications and further strengthens the argument to use human evaluation in critical spaces where main functionality is not direct retrieval., Comment: Accepted in The First Workshop on Large Language Models for Evaluation in Information Retrieval
Published: 2024

2. Speed-Accuracy Trade-Off? Not so Fast: Marginal Changes in Speed Have Inconsistent Relationships with Accuracy in Real-World Settings

Author: Domingue, Benjamin W., Kanopka, Klint, Stenhaug, Ben, Sulik, Michael J., Beverly, Tanesia, Brinkhuis, Matthieu, Circi, Ruhan, Faul, Jessica, Liao, Dandan, McCandliss, Bruce, Obradovic, Jelena, Piech, Chris, Porter, Tenelle, Soland, James, Weeks, Jon, Wise, Steven L., and Yeatman, Jason
Abstract: The speed-accuracy trade-off (SAT) suggests that time constraints reduce response accuracy. Its relevance in observational settings--where response time (RT) may not be constrained but respondent speed may still vary--is unclear. Using 29 data sets containing data from cognitive tasks, we use a flexible method for identification of the SAT (which we test in extensive simulation studies) to probe whether the SAT holds. We find inconsistent relationships between time and accuracy; marginal increases in time use for an individual do not necessarily predict increases in accuracy. Additionally, the speed-accuracy relationship may depend on the underlying difficulty of the interaction. We also consider the analysis of items and individuals; of particular interest is the observation that respondents who exhibit more within-person variation in response speed are typically of lower ability. We further find that RT is typically a weak predictor of response accuracy. Our findings document a range of empirical phenomena that should inform future modeling of RTs collected in observational settings. [This work was co-written by the Project iLEAD Consortium.]
Published: 2022
Full Text: View/download PDF

3. Examining the Relationship between STEM Coursetaking in High School and Grade 12 NAEP Mathematics Performance. AIR-NAEP Working Paper 2021-05

Author: American Institutes for Research (AIR), Education Statistics Services Institute Network (ESSIN), Yee, Darrick, Ogut, Burhan, Bohrnstedt, George, Broer, Markus, and Circi, Ruhan
Abstract: This study linked ninth-grade student background data and school-reported high school transcript data from the national High School Longitudinal Study of 2009 (HSLS:09) to student item responses on the 2013 National Assessment of Educational Progress (NAEP) mathematics assessment to examine the relationships between high school coursework and end-of-high school mathematics achievement. In a series of marginal maximum likelihood regression analyses, we find that STEM course GPAs, credits earned in AP/IB math and science courses, higher levels of math course content, and course-taking in chemistry and physics are all positively associated with NAEP math achievement. These relationships persist even when gender, race/ethnicity, early grade 9 mathematics achievement, and socioeconomic status are included as covariates. Cluster analysis of students with high estimated achievement suggest multiple paths to high mathematics achievement for students with both high- and low-socioeconomic status backgrounds.
Published: 2021

4. Why Does High School Coursework Matter? The Case for Increasing Exposure to Advanced Courses

Author: American Institutes for Research (AIR), Ogut, Burhan, Circi, Ruhan, and Yee, Darrick
Abstract: Increasing the rigor of courses taken in high school is a crucial part of education policy. However, existing knowledge about high school coursework is outdated. Using data from a recent nationally representative data set, this brief reports results that expand our knowledge base on the relationship between rigorous coursework and postsecondary outcomes. Findings show that (1) Timing of the course-taking matters; (2) Advanced coursework is important; (3) Students who take diverse courses are likely to have better postsecondary outcomes; and (4) Both the quality and quantity of the coursework matters. The results of this study have implications for students, parents, educators, policymakers, and researchers. Students and their parents who would like to increase students' chances of becoming successful after high school should ensure students do not fall behind in taking the specific courses outlined in this study and enroll in advanced courses after completing the prerequisites for those courses.
Published: 2021

5. Diving into Students' Transcripts: High School Course-Taking Sequences and Postsecondary Enrollment

Author: Ogut, Burhan and Circi, Ruhan
Abstract: The purpose of this study was to explore high school course-taking sequences and their relationship to college enrollment. Specifically, we implemented sequence analysis to discover common course-taking trajectories in math, science, and English language arts using high school transcript data from a recent nationally representative survey. Through sequence clustering, we reduced the complexity of the sequences and examined representative course-taking sequences. Classification tree, random forests, and multinomial logistic regression analyses were used to explore the relationship between the course sequences students complete and their postsecondary outcomes. Results showed that distinct representative course-taking sequences can be identified for all students as well as student subgroups. More advanced and complex course-taking sequences were associated with postsecondary enrollment.
Published: 2023
Full Text: View/download PDF

6. Does It Matter How the Rigor of High School Coursework Is Measured? Gaps in Coursework among Students and across Grades

Author: Ogut, Burhan, Yee, Darrick, Circi, Ruhan, and Dizdari, Nevin
Abstract: Research shows that the intensity of high school course-taking is related to postsecondary outcomes. However, there are various approaches to measuring the intensity of students' course-taking. This study presents new measures of coursework intensity that rely on differing levels of quantity and quality of coursework. We used these new indices to provide a current description of variations in high school course-taking across grades and student subgroups using a nationally representative dataset, the High School Longitudinal Study of 2009. Results showed that for measures emphasizing the quality of coursework the gaps in coursework among underserved students were larger and there was less upward movement in rigor across grades.
Published: 2023
Full Text: View/download PDF

7. Exploring Alignment among Learning Progressions, Teacher-Designed Formative Assessment Tasks, and Student Growth: Results of a Four-Year Study

Author: Furtak, Erin Marie, Circi, Ruhan, and Heredia, Sara C.
Abstract: This article describes a 4-year study of experienced high school biology teachers' participation in a five-step professional development experience in which they iteratively studied student ideas with the support of a set of learning progressions, designed formative assessment activities, practiced using those activities with their students, enacted the activities, and then reflected on next steps to guide their instruction. Drawing on classroom artifacts and student responses to a pre-post assessment, we examined the alignment of teacher-created formative assessment tasks with the learning progressions, as well as student learning relative to the progressions. A partial-credit Model revealed that the majority of students' learning reflected learning from lower- to upper-anchors on multiple learning progressions. This finding suggests that, by participating in the professional learning experience, teachers were able to successfully support student learning of the content as represented in the majority of the learning progressions. Results are interpreted in light of learning progressions being used as scaffolds for formative assessment design and practice.
Published: 2018
Full Text: View/download PDF

8. Challenges to the Use of Artificial Neural Networks for Diagnostic Classifications with Student Test Data

Author: Briggs, Derek C. and Circi, Ruhan
Abstract: Artificial Neural Networks (ANNs) have been proposed as a promising approach for the classification of students into different levels of a psychological attribute hierarchy. Unfortunately, because such classifications typically rely upon internally produced item response patterns that have not been externally validated, the instability of ANN estimates of attribute probabilities may not be widely appreciated. The present study illustrates the problem with both empirical and simulated data. In particular, it is shown that when an ANN is "trained" multiple times using the same data, attribute probability estimates can vary, sometimes dramatically. Researchers hoping to apply ANNs in the context of diagnostic classification models with student test data need to be very deliberate in checking the sensitivity of their findings.
Published: 2017
Full Text: View/download PDF

9. Teachers' Formative Assessment Abilities and Their Relationship to Student Learning: Findings from a Four-Year Intervention Study

Author: Furtak, Erin Marie, Kiemer, Katharina, and Circi, Ruhan Kizil
Abstract: The teaching practices of recognizing and responding to students' ideas during instruction are often called formative assessment, and can be conceptualized by four abilities: designing formative assessment tasks, asking questions to elicit student thinking, interpreting student ideas, and providing feedback that moves student thinking forward. While these practices have been linked to positive learning outcomes for students, designing and enacting formative assessment tasks in science classrooms presents instructional challenges for teachers. This paper reports on the results of a long-term study of high school biology teachers who participated in a 3 year professional development program, called the Formative Assessment Design Cycle (FADC), which guided them to iteratively design, enact, and reflect upon formative assessments for natural selection in school-based teacher learning communities. Nine teachers participated for three academic years; sources of data included teachers' interpreting of student ideas in line with a learning progression, the formative assessment tasks they designed each year of the study, videotaped classroom enactment of those tasks, and pre-post test student achievement from the Baseline and final year of the study. Results indicate that, on average, teachers increased on all abilities during the study and changes were statistically significant for interpreting students ideas, eliciting questions, and feedback. HLM models showed that while only the quality of feedback was a significant predictor at Baseline, it was teachers' task design and interpretation of ideas in Year 3. These results suggest the efficacy of the FADC in supporting teachers' formative assessment abilities. Findings are interpreted in light of professional development and formative assessment literatures.
Published: 2016
Full Text: View/download PDF

10. Automatic item generation: foundations and machine learning-based approaches for assessments

Author: Circi, Ruhan, primary, Hicks, Juanita, additional, and Sikali, Emmanuel, additional
Published: 2023
Full Text: View/download PDF

11. Student Learning

Author: Morrison, Deb, primary and Circi, Ruhan, additional
Published: 2017
Full Text: View/download PDF

12. Speed–Accuracy Trade-Off? Not So Fast: Marginal Changes in Speed Have Inconsistent Relationships With Accuracy in Real-World Settings

Author: Sub Softw.Techn. for Learning and Teach., Software Technology for Learning and Teaching, Domingue, Benjamin W., Kanopka, Klint, Stenhaug, Ben, Sulik, Michael J., Beverly, Tanesia, Brinkhuis, Matthieu, Circi, Ruhan, Faul, Jessica, Liao, Dandan, Mccandliss, Bruce, Obradović, Jelena, Piech, Chris, Porter, Tenelle, Consortium, Project Ilead, Soland, James, Weeks, Jon, Wise, Steven L., Yeatman, Jason, Sub Softw.Techn. for Learning and Teach., Software Technology for Learning and Teaching, Domingue, Benjamin W., Kanopka, Klint, Stenhaug, Ben, Sulik, Michael J., Beverly, Tanesia, Brinkhuis, Matthieu, Circi, Ruhan, Faul, Jessica, Liao, Dandan, Mccandliss, Bruce, Obradović, Jelena, Piech, Chris, Porter, Tenelle, Consortium, Project Ilead, Soland, James, Weeks, Jon, Wise, Steven L., and Yeatman, Jason
Published: 2022

13. Item revision to improve construct validity: A study on released science items in Turkish PISA 2006

Author: Baykal, Ali and Circi, Ruhan
Published: 2010
Full Text: View/download PDF

14. Speed accuracy tradeoff? Not so fast: Marginal changes in speed have inconsistent relationships with accuracy in real-world settings

Author: Domingue, Benjamin, Kanopka, Klint, Stenhaug, Ben, Sulik, Michael, Beverly, Tanesia, Brinkhuis, Matthieu J. S., Circi, Ruhan, Faul, Jessica, Liao, Dandan, McCandliss, Bruce, Obradovic, Jelena, Piech, Chris, Porter, Tenelle, Soland, Jim, Weeks, Jon, Wise, Steve, Yeatman, Jason D, Domingue, Benjamin, Kanopka, Klint, Stenhaug, Ben, Sulik, Michael, Beverly, Tanesia, Brinkhuis, Matthieu J. S., Circi, Ruhan, Faul, Jessica, Liao, Dandan, McCandliss, Bruce, Obradovic, Jelena, Piech, Chris, Porter, Tenelle, Soland, Jim, Weeks, Jon, Wise, Steve, and Yeatman, Jason D
Abstract: The speed-accuracy tradeoff suggests that responses generated under time constraints will be less accurate. While it has undergone extensive experimental verification, it is less clear whether it applies in settings where time pressures are not being experimentally manipulated (but where respondents still vary in their utilization of time). Using a large corpus of 29 response time datasets containing data from cognitive tasks without experimental manipulation of time pressure, we probe whether the speed-accuracy tradeoff holds across a variety of tasks using idiosyncratic within-person variation in speed. We find inconsistent relationships between marginal increases in time spent responding and accuracy; in many cases, marginal increases in time do not predict increases in accuracy. However, we do observe time pressures (in the form of time limits) to consistently reduce accuracy and for rapid responses to typically show the anticipated relationship (i.e., they are more accurate if they are slower). We also consider analysis of items and individuals. We find substantial variation in the item-level associations between speed and accuracy. On the person side, respondents who exhibit more within-person variation in response speed are typically of lower ability. Finally, we consider the predictive power of a person's response time in predicting out-of-sample responses; it is generally a weak predictor. Collectively, our findings suggest the speed-accuracy tradeoff may be limited as a conceptual model in its application in non-experimental settings and, more generally, offer empirical results and an analytic approach that will be useful as more response time data is collected.
Published: 2020

15. Speed accuracy tradeoff? Not so fast: Marginal changes in speed have inconsistent relationships with accuracy in real-world settings

Author: Sub Softw.Techn. for Learning and Teach., Software Technology for Learning and Teaching, Domingue, Benjamin, Kanopka, Klint, Stenhaug, Ben, Sulik, Michael, Beverly, Tanesia, Brinkhuis, Matthieu J. S., Circi, Ruhan, Faul, Jessica, Liao, Dandan, McCandliss, Bruce, Obradovic, Jelena, Piech, Chris, Porter, Tenelle, Soland, Jim, Weeks, Jon, Wise, Steve, Yeatman, Jason D, Sub Softw.Techn. for Learning and Teach., Software Technology for Learning and Teaching, Domingue, Benjamin, Kanopka, Klint, Stenhaug, Ben, Sulik, Michael, Beverly, Tanesia, Brinkhuis, Matthieu J. S., Circi, Ruhan, Faul, Jessica, Liao, Dandan, McCandliss, Bruce, Obradovic, Jelena, Piech, Chris, Porter, Tenelle, Soland, Jim, Weeks, Jon, Wise, Steve, and Yeatman, Jason D
Published: 2020

16. Speed accuracy tradeoff? Not so fast: Marginal changes in speed have inconsistent relationships with accuracy in real-world settings

Author: Domingue, Benjamin, primary, Kanopka, Klint, additional, Stenhaug, Ben, additional, Sulik, Michael, additional, Beverly, Tanesia, additional, Brinkhuis, Matthieu J. S., additional, Circi, Ruhan, additional, Faul, Jessica, additional, Liao, Dandan, additional, McCandliss, Bruce, additional, Obradovic, Jelena, additional, Piech, Chris, additional, Porter, Tenelle, additional, Soland, Jim, additional, Weeks, Jon, additional, Wise, Steve, additional, and Yeatman, Jason D, additional
Published: 2020
Full Text: View/download PDF

17. Learning Progressions, Formative Assessment, and Professional Development: Results of a Longitudinal Study

Author: Furtak, Erin Marie, Kiemer, Katharina, Swanson, Rebecca, Leon, Vanessa De, and Circi, Ruhan
Published: 2015
Full Text: View/download PDF

18. Challenges when Using an Artificial Neural Network to Make Diagnostic Classifications from Ordered Multiple Choice Items

Author: Briggs, Derek, Circi, Ruhan, and McClarty, Katie
Published: 2014
Full Text: View/download PDF

19. Exploring Mathematical Problem-Solving Through Process Mining: Insights from Large-Scale Assessment Log Data.

Author: Ogut, Burhan, Webb, Blue, Hicks, Juanita, Circi, Ruhan, and Yin, Michelle
Abstract: AbstractIn this study, we explore the application of process mining techniques on assessment log data to explore problem-solving strategies in Algebra. By analyzing sequences of student activities, we demonstrate the significant potential of process mining in identifying problem-solving strategies that lead to successful and unsuccessful outcomes. Our findings reveal that students who successfully solve the problem tend to follow one of three structured strategies, displaying a systematic process in filling the boxes of a Pascal’s triangle. Conversely, those who falter often start with a correct strategy but deviate by inserting incorrect values, especially in central boxes. Further analysis provides insights into the strategies and potential misconceptions among students from various disability groups. Notably, autistic students exhibit unique patterns, such as initiating the solution from the triangle’s right side, contrary to the common left-to-right strategy, and consistently applying this approach even when errors occur. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

19 results on '"Circi, Ruhan"'

1. The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches

2. Speed-Accuracy Trade-Off? Not so Fast: Marginal Changes in Speed Have Inconsistent Relationships with Accuracy in Real-World Settings

3. Examining the Relationship between STEM Coursetaking in High School and Grade 12 NAEP Mathematics Performance. AIR-NAEP Working Paper 2021-05

4. Why Does High School Coursework Matter? The Case for Increasing Exposure to Advanced Courses

5. Diving into Students' Transcripts: High School Course-Taking Sequences and Postsecondary Enrollment

6. Does It Matter How the Rigor of High School Coursework Is Measured? Gaps in Coursework among Students and across Grades

7. Exploring Alignment among Learning Progressions, Teacher-Designed Formative Assessment Tasks, and Student Growth: Results of a Four-Year Study

8. Challenges to the Use of Artificial Neural Networks for Diagnostic Classifications with Student Test Data

9. Teachers' Formative Assessment Abilities and Their Relationship to Student Learning: Findings from a Four-Year Intervention Study

10. Automatic item generation: foundations and machine learning-based approaches for assessments

11. Student Learning

12. Speed–Accuracy Trade-Off? Not So Fast: Marginal Changes in Speed Have Inconsistent Relationships With Accuracy in Real-World Settings

13. Item revision to improve construct validity: A study on released science items in Turkish PISA 2006

14. Speed accuracy tradeoff? Not so fast: Marginal changes in speed have inconsistent relationships with accuracy in real-world settings

15. Speed accuracy tradeoff? Not so fast: Marginal changes in speed have inconsistent relationships with accuracy in real-world settings

16. Speed accuracy tradeoff? Not so fast: Marginal changes in speed have inconsistent relationships with accuracy in real-world settings

17. Learning Progressions, Formative Assessment, and Professional Development: Results of a Longitudinal Study

18. Challenges when Using an Artificial Neural Network to Make Diagnostic Classifications from Ordered Multiple Choice Items

19. Exploring Mathematical Problem-Solving Through Process Mining: Insights from Large-Scale Assessment Log Data.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

19 results on '"Circi, Ruhan"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources