Author: "Suresh, Siddharth" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Suresh, Siddharth"' showing total 26 results

Start Over Author "Suresh, Siddharth"

26 results on '"Suresh, Siddharth"'

1. Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

Author: Zhang, Jifan, Jain, Lalit, Guo, Yang, Chen, Jiayi, Zhou, Kuan Lok, Suresh, Siddharth, Wagenmaker, Andrew, Sievert, Scott, Rogers, Timothy, Jamieson, Kevin, Mankoff, Robert, and Nowak, Robert
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: We present a novel multimodal preference dataset for creative tasks, consisting of over 250 million human ratings on more than 2.2 million captions, collected through crowdsourcing rating data for The New Yorker's weekly cartoon caption contest over the past eight years. This unique dataset supports the development and evaluation of multimodal large language models and preference-based fine-tuning algorithms for humorous caption generation. We propose novel benchmarks for judging the quality of model-generated captions, utilizing both GPT4 and human judgments to establish ranking-based evaluation strategies. Our experimental results highlight the limitations of current fine-tuning methods, such as RLHF and DPO, when applied to creative tasks. Furthermore, we demonstrate that even state-of-the-art models like GPT4 and Claude currently underperform top human contestants in generating humorous captions. As we conclude this extensive data collection effort, we release the entire preference dataset to the research community, fostering further advancements in AI humor generation and evaluation.
Published: 2024

2. The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

Author: Chuang, Yun-Shiuan, Suresh, Siddharth, Harlalka, Nikunj, Goyal, Agam, Hawkins, Robert, Yang, Sijia, Shah, Dhavan, Hu, Junjie, and Rogers, Timothy T.
Subjects: Computer Science - Computation and Language
Abstract: Human groups are able to converge on more accurate beliefs through deliberation, even in the presence of polarization and partisan bias -- a phenomenon known as the "wisdom of partisan crowds." Generated agents powered by Large Language Models (LLMs) are increasingly used to simulate human collective behavior, yet few benchmarks exist for evaluating their dynamics against the behavior of human groups. In this paper, we examine the extent to which the wisdom of partisan crowds emerges in groups of LLM-based agents that are prompted to role-play as partisan personas (e.g., Democrat or Republican). We find that they not only display human-like partisan biases, but also converge to more accurate beliefs through deliberation as humans do. We then identify several factors that interfere with convergence, including the use of chain-of-thought prompt and lack of details in personas. Conversely, fine-tuning on human data appears to enhance convergence. These findings show the potential and limitations of LLM-based agents as a model of human collective intelligence.
Published: 2023

3. Simulating Opinion Dynamics with Networks of LLM-based Agents

Author: Chuang, Yun-Shiuan, Goyal, Agam, Harlalka, Nikunj, Suresh, Siddharth, Hawkins, Robert, Yang, Sijia, Shah, Dhavan, Hu, Junjie, and Rogers, Timothy T.
Subjects: Physics - Physics and Society, Computer Science - Computation and Language
Abstract: Accurately simulating human opinion dynamics is crucial for understanding a variety of societal phenomena, including polarization and the spread of misinformation. However, the agent-based models (ABMs) commonly used for such simulations often over-simplify human behavior. We propose a new approach to simulating opinion dynamics based on populations of Large Language Models (LLMs). Our findings reveal a strong inherent bias in LLM agents towards producing accurate information, leading simulated agents to consensus in line with scientific reality. This bias limits their utility for understanding resistance to consensus views on issues like climate change. After inducing confirmation bias through prompt engineering, however, we observed opinion fragmentation in line with existing agent-based modeling and opinion dynamics research. These insights highlight the promise and limitations of LLM agents in this domain and suggest a path forward: refining LLMs with real-world discourse to better simulate the evolution of human beliefs.
Published: 2023

4. Learning interactions to boost human creativity with bandits and GPT-4

Author: Vartanian, Ara, Sun, Xiaoxi, Chuang, Yun-Shiuan, Suresh, Siddharth, Zhu, Xiaojin, and Rogers, Timothy T.
Subjects: Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction, Computer Science - Machine Learning
Abstract: This paper considers how interactions with AI algorithms can boost human creative thought. We employ a psychological task that demonstrates limits on human creativity, namely semantic feature generation: given a concept name, respondents must list as many of its features as possible. Human participants typically produce only a fraction of the features they know before getting "stuck." In experiments with humans and with a language AI (GPT-4) we contrast behavior in the standard task versus a variant in which participants can ask for algorithmically-generated hints. Algorithm choice is administered by a multi-armed bandit whose reward indicates whether the hint helped generating more features. Humans and the AI show similar benefits from hints, and remarkably, bandits learning from AI responses prefer the same prompting strategy as those learning from human behavior. The results suggest that strategies for boosting human creativity via computer interactions can be learned by bandits run on groups of simulated participants.
Published: 2023

5. Can deep convolutional networks explain the semantic structure that humans see in photographs?

Author: Suresh, Siddharth, Huang, Wei-Chun, Mukherjee, Kushin, and Rogers, Timothy T
Subjects: Artificial Intelligence, Cognitive Neuroscience, Representation, Computational Modeling, Neural Networks
Abstract: In visual cognitive neuroscience, there are two main theories about the function of the ventral visual system. One suggests that it serves to classify objects (H1); the other suggests that it generates intermediate representations from which people can generate verbal descriptions, actions, and other kinds of information (H2). To adjudicate these, we trained two deep convolutional AlexNet models on 330,000 images belonging to 86 classes, representing the intersection of Ecoset images and the semantic norms collected by the Leuven group. One model was trained to produce category labels (H1) , the other to generate all of an item's semantic features (H2). The two models learned very different representational geometries throughout the network. The representations acquired by the feature-generating model aligned better with human-perceived similarities amongst images, and better predicted human judgments in a triadic comparison task. The results thus support H2.
Published: 2024

6. Learning interactions to boost human creativity with bandits and GPT-4

Author: Vartanian, Ara, Sun, Xiaoxi, Chuang, Yun-Shiuan, Suresh, Siddharth, Zhu, Jerry, and Rogers, Timothy
Subjects: Computer Science, Psychology, Creativity, Interactive behavior, Large Language Models
Abstract: This paper considers how interactions with AI algorithms can boost human creative thought. We employ a psychological task that demonstrates limits on human creativity, namely semantic feature generation: given a concept name, respondents must list as many of its features as possible. Human participants typically produce only a fraction of the features they know before getting ‚Äústuck.‚Äù In experiments with humans and with a large language model (GPT-4), we contrast behavior in the standard task versus a variant in which participants can ask for algorithmically-generated hints. Algorithm choice is administered by a multi-armed bandit whose reward indicates whether the hint helped generating more features. Humans and the AI show similar benefits from hints, and remarkably, bandits learning from AI responses prefer the same prompting strategy as those learning from human behavior. The results suggest that strategies for boosting human creativity via computer interactions can be learned by bandits run on groups of simulated participants.
Published: 2024

7. The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

Author: Chuang, Yun-Shiuan, Harlalka, Nikunj, Suresh, Siddharth, Goyal, Agam, Hawkins, Robert, Yang, Sijia, Shah, Dhavan, Hu, Junjie, and Rogers, Timothy T
Subjects: Artificial Intelligence, Psychology, Group Behaviour, Social cognition, Large Language Models
Abstract: Human groups are able to converge to more accurate beliefs through deliberation, even in the presence of polarization and partisan bias --- a phenomenon known as the ``wisdom of partisan crowds.'' Large Language Models (LLMs) are increasingly being used to simulate human collective behavior, yet few benchmarks exist for evaluating their dynamics against the behavior of human groups. In this paper, we examine the extent to which the wisdom of partisan crowds emerges in groups of LLM-based agents that are prompted to role-play as partisan personas (e.g., Democrat or Republican). We find that they not only display human-like partisan biases, but also converge to more accurate beliefs through deliberation, as humans do. We then identify several factors that interfere with convergence, including the use of chain-of-thought prompting and lack of details in personas. Conversely, fine-tuning on human data appears to enhance convergence. These findings show the potential and limitations of LLM-based agents as a model of human collective intelligence.
Published: 2024

8. Simulating Opinion Dynamics with Networks of LLM-based Agents

Author: Chuang, Yun-Shiuan, Goyal, Agam, Harlalka, Nikunj, Suresh, Siddharth, Hawkins, Robert, Yang, Sijia, Shah, Dhavan, Hu, Junjie, and Rogers, Timothy T
Subjects: Computer Science, Psychology, Natural Language Processing, Agent-based Modeling, Large Language Models
Abstract: Accurately simulating human opinion dynamics is crucial for understanding a variety of societal phenomena, including polarization and the spread of misinformation. However, the agent-based models (ABMs) commonly used for such simulations often over-simplify human behavior. We propose a new approach to simulating opinion dynamics based on populations of Large Language Models (LLMs). Our findings reveal a strong inherent bias in LLM agents towards producing accurate information, leading simulated agents to consensus in line with scientific reality. This bias limits their utility for understanding resistance to consensus views on issues like climate change. After inducing confirmation bias through prompt engineering, however, we observed opinion fragmentation in line with existing agent-based modeling and opinion dynamics research. These insights highlight the promise and limitations of LLM agents in this domain and suggest a path forward: refining LLMs with real-world discourse to better simulate the evolution of human beliefs.
Published: 2024

9. Semantic Feature Verification in FLAN-T5

Author: Suresh, Siddharth, Mukherjee, Kushin, and Rogers, Timothy T.
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: This study evaluates the potential of a large language model for aiding in generation of semantic feature norms - a critical tool for evaluating conceptual structure in cognitive science. Building from an existing human-generated dataset, we show that machine-verified norms capture aspects of conceptual structure beyond what is expressed in human norms alone, and better explain human judgments of semantic similarity amongst items that are distally related. The results suggest that LLMs can greatly enhance traditional methods of semantic feature norm verification, with implications for our understanding of conceptual representation in humans and machines., Comment: To appear as a Tiny Paper at ICLR 2023
Published: 2023

10. Human-machine cooperation for semantic feature listing

Author: Mukherjee, Kushin, Suresh, Siddharth, and Rogers, Timothy T.
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Semantic feature norms, lists of features that concepts do and do not possess, have played a central role in characterizing human conceptual knowledge, but require extensive human labor. Large language models (LLMs) offer a novel avenue for the automatic generation of such feature lists, but are prone to significant error. Here, we present a new method for combining a learned model of human lexical-semantics from limited data with LLM-generated data to efficiently generate high-quality feature norms., Comment: To be published in the ICLR TinyPaper track
Published: 2023

11. Conceptual structure coheres in human cognition but not in large language models

Author: Suresh, Siddharth, Mukherjee, Kushin, Yu, Xizheng, Huang, Wei-Chun, Padua, Lisa, and Rogers, Timothy T
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Neural network models of language have long been used as a tool for developing hypotheses about conceptual representation in the mind and brain. For many years, such use involved extracting vector-space representations of words and using distances among these to predict or understand human behavior in various semantic tasks. Contemporary large language models (LLMs), however, make it possible to interrogate the latent structure of conceptual representations using experimental methods nearly identical to those commonly used with human participants. The current work utilizes three common techniques borrowed from cognitive psychology to estimate and compare the structure of concepts in humans and a suite of LLMs. In humans, we show that conceptual structure is robust to differences in culture, language, and method of estimation. Structures estimated from LLM behavior, while individually fairly consistent with those estimated from human behavior, vary much more depending upon the particular task used to generate responses--across tasks, estimates of conceptual structure from the very same model cohere less with one another than do human structure estimates. These results highlight an important difference between contemporary LLMs and human cognition, with implications for understanding some fundamental limitations of contemporary machine language.
Published: 2023

12. Behavioral estimates of conceptual structure are robust across tasks in humans but not large language models

Author: Suresh, Siddharth, Mukherjee, Kushin, Padua, Lisa, and Rogers, Timothy T
Subjects: Artificial Intelligence, Natural Language Processing, Semantic memory, Knowledge representation, Neural Networks
Abstract: Neural network models of language have long been used asa tool for developing hypotheses about conceptual representationin the mind and brain. For many years, such use involvedextracting vector-space representations of words andusing distances among these to predict or understand humanbehavior in various semantic tasks. In contemporary languageAIs, however, it is possible to interrogate the latent structureof conceptual representations using methods nearly identicalto those commonly used with human participants. The currentwork uses two common techniques borrowed from cognitivepsychology to estimate and compare lexical-semantic structurein both humans and a well-known AI, the DaVinci variant ofGPT-3. In humans, we show that conceptual structure is robustto differences in culture, language, and method of estimation.Structures estimated from AI behavior, while individuallyfairly consistent with those estimated from human behavior,depend much more upon the particular task used to generatebehavior responses–responses generated by the very samemodel in the two tasks yield estimates of conceptual structurethat cohere less with one another than do human structure estimates.The results suggest one important way that knowledgeinhering in contemporary AIs can differ from human cognition.
Published: 2023

13. Impact of Severe Winter Weather on Operations of a Radiation Oncology Department

Author: Fekrmandi, Fatemeh, primary, Gill, Jasmin, additional, Suresh, Siddharth, additional, Hewson, Sarah, additional, and Chowdhry, Varun K., additional
Published: 2024
Full Text: View/download PDF

14. Can deep convolutional networks explain the semantic structure that humans see in photographs?

Author: Suresh, Siddharth, primary, Mukherjee, Kushin, additional, and T. Rogers, Timothy, additional
Published: 2023
Full Text: View/download PDF

15. Author Correction: Clinical validation of smartphone-based activity tracking in peripheral artery disease patients

Author: Ata, Raheel, Gandhi, Neil, Rasmussen, Hannah, El-Gabalawy, Osama, Gutierrez, Santiago, Ahmad, Alizeh, Suresh, Siddharth, Ravi, Roshini, Rothenberg, Kara, and Aalami, Oliver
Published: 2020
Full Text: View/download PDF

16. Conceptual structure coheres in human cognition but not in large language models

Author: Suresh, Siddharth, primary, Mukherjee, Kushin, additional, Yu, Xizheng, additional, Huang, Wei-Chun, additional, Padua, Lisa, additional, and Rogers, Timothy, additional
Published: 2023
Full Text: View/download PDF

17. Visual memory for causal and coincidental events

Author: Suresh, Siddharth, primary and Ward, Emily J., additional
Published: 2022
Full Text: View/download PDF

18. Clinical validation of smartphone-based activity tracking in peripheral artery disease patients

Author: Ata, Raheel, Gandhi, Neil, Rasmussen, Hannah, El-Gabalawy, Osama, Gutierrez, Santiago, Ahmad, Alizeh, Suresh, Siddharth, Ravi, Roshini, Rothenberg, Kara, and Aalami, Oliver
Published: 2018
Full Text: View/download PDF

19. Visual ensemble representations in Deep Neural Networks trained for natural object recognition

Author: Suresh, Siddharth, primary and Ward, Emily J, additional
Published: 2021
Full Text: View/download PDF

20. MACHINE LEARNING FOR PREDICTION OF PACEMAKER AFTER TAVR IN PATIENTS WITH LOW STROKE VOLUME INDEX

Author: Pandey, Amitabh C., primary, Nichani, Arjun, additional, Pelter, Megan, additional, Ng, Daniel, additional, Jaravata, Ashley, additional, Duncan, Zabrina, additional, Suresh, Siddharth, additional, Mehta, Sandeep, additional, Stinis, Curtiss, additional, Bhavnani, Sanjeev, additional, and Teirstein, Paul, additional
Published: 2021
Full Text: View/download PDF

21. Evaluating and Improving Attrition Models for the Retail Banking Industry

Author: Suresh, Siddharth, primary, Visvalingam, Devan, additional, Lu, Adonis, additional, and Wright, Brian, additional
Published: 2020
Full Text: View/download PDF

22. GrCluster

Author: Ranganathan, Varun, primary, Suresh, Siddharth, additional, Mathur, Yash, additional, Subramanyam, Natarajan, additional, and Barbosa, Denilson, additional
Published: 2020
Full Text: View/download PDF

23. CHANGES IN EJECTION FRACTION AND GLOBAL LONGITUDINAL STRAIN ASSESSMENT IN PATIENTS WITH HEART FAILURE WITH REDUCED EJECTION FRACTION AFTER THERAPY WITH SACUBITRIL/VALSARTAN

Author: Pandey, Amitabh C., primary, Pelter, Megan, additional, Montgomery, Paul, additional, Kuo, Ruth, additional, Shen, Christine, additional, Sidhu, Rajbir, additional, Lerner, David, additional, Billick, Kristin, additional, Hay, Brooke, additional, Loveday, Alyssa, additional, Duckett, Ashley, additional, Suresh, Siddharth, additional, Srivastava, Ajay, additional, Rubenson, David, additional, Heywood, J. Thomas, additional, and Mohan, Rajeev, additional
Published: 2019
Full Text: View/download PDF

24. Brainstem auditory responses in type-2 diabetes mellitus

Author: Suresh, Siddharth, primary, Ramlan, Sharwak, additional, Somayaji, Gangadhara, additional, and Sequeira, Nimalka, additional
Published: 2018
Full Text: View/download PDF

25. Composite Mesh Electrodes with Immobilized Bacteria for Bio-Batteries

Author: Suresh, Siddharth, primary, Evitts, Richard W., additional, and Kennell, Glyn F., additional
Published: 2016
Full Text: View/download PDF

26. Erratum: Author Correction: Clinical validation of smartphone-based activity tracking in peripheral artery disease patients.

Author: Ata R, Gandhi N, Rasmussen H, El-Gabalawy O, Gutierrez S, Ahmad A, Suresh S, Ravi R, Rothenberg K, and Aalami O
Abstract: [This corrects the article DOI: 10.1038/s41746-018-0073-x.]., (© This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2020.)
Published: 2020
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

26 results on '"Suresh, Siddharth"'

1. Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

2. The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

3. Simulating Opinion Dynamics with Networks of LLM-based Agents

4. Learning interactions to boost human creativity with bandits and GPT-4

5. Can deep convolutional networks explain the semantic structure that humans see in photographs?

6. Learning interactions to boost human creativity with bandits and GPT-4

7. The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents

8. Simulating Opinion Dynamics with Networks of LLM-based Agents

9. Semantic Feature Verification in FLAN-T5

10. Human-machine cooperation for semantic feature listing

11. Conceptual structure coheres in human cognition but not in large language models

12. Behavioral estimates of conceptual structure are robust across tasks in humans but not large language models

13. Impact of Severe Winter Weather on Operations of a Radiation Oncology Department

14. Can deep convolutional networks explain the semantic structure that humans see in photographs?

15. Author Correction: Clinical validation of smartphone-based activity tracking in peripheral artery disease patients

16. Conceptual structure coheres in human cognition but not in large language models

17. Visual memory for causal and coincidental events

18. Clinical validation of smartphone-based activity tracking in peripheral artery disease patients

19. Visual ensemble representations in Deep Neural Networks trained for natural object recognition

20. MACHINE LEARNING FOR PREDICTION OF PACEMAKER AFTER TAVR IN PATIENTS WITH LOW STROKE VOLUME INDEX

21. Evaluating and Improving Attrition Models for the Retail Banking Industry

22. GrCluster

23. CHANGES IN EJECTION FRACTION AND GLOBAL LONGITUDINAL STRAIN ASSESSMENT IN PATIENTS WITH HEART FAILURE WITH REDUCED EJECTION FRACTION AFTER THERAPY WITH SACUBITRIL/VALSARTAN

24. Brainstem auditory responses in type-2 diabetes mellitus

25. Composite Mesh Electrodes with Immobilized Bacteria for Bio-Batteries

26. Erratum: Author Correction: Clinical validation of smartphone-based activity tracking in peripheral artery disease patients.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

26 results on '"Suresh, Siddharth"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources