76,871 results on '"Parikh AN"'
Search Results
2. TapeAgents: a Holistic Framework for Agent Development and Optimization
- Author
-
Bahdanau, Dzmitry, Gontier, Nicolas, Huang, Gabriel, Kamalloo, Ehsan, Pardinas, Rafael, Piché, Alex, Scholak, Torsten, Shliazhko, Oleh, Tremblay, Jordan Prince, Ghanem, Karam, Parikh, Soham, Tiwari, Mitul, and Vohra, Quaizar
- Subjects
Computer Science - Artificial Intelligence - Abstract
We present TapeAgents, an agent framework built around a granular, structured log tape of the agent session that also plays the role of the session's resumable state. In TapeAgents we leverage tapes to facilitate all stages of the LLM Agent development lifecycle. The agent reasons by processing the tape and the LLM output to produce new thought and action steps and append them to the tape. The environment then reacts to the agent's actions by likewise appending observation steps to the tape. By virtue of this tape-centred design, TapeAgents can provide AI practitioners with holistic end-to-end support. At the development stage, tapes facilitate session persistence, agent auditing, and step-by-step debugging. Post-deployment, one can reuse tapes for evaluation, fine-tuning, and prompt-tuning; crucially, one can adapt tapes from other agents or use revised historical tapes. In this report, we explain the TapeAgents design in detail. We demonstrate possible applications of TapeAgents with several concrete examples of building monolithic agents and multi-agent teams, of optimizing agent prompts and finetuning the agent's LLM. We present tooling prototypes and report a case study where we use TapeAgents to finetune a Llama-3.1-8B form-filling assistant to perform as well as GPT-4o while being orders of magnitude cheaper. Lastly, our comparative analysis shows that TapeAgents's advantages over prior frameworks stem from our novel design of the LLM agent as a resumable, modular state machine with a structured configuration, that generates granular, structured logs and that can transform these logs into training text -- a unique combination of features absent in previous work.
- Published
- 2024
3. Who's Gaming the System? A Causally-Motivated Approach for Detecting Strategic Adaptation
- Author
-
Chang, Trenton, Warrenburg, Lindsay, Park, Sae-Hwan, Parikh, Ravi B., Makar, Maggie, and Wiens, Jenna
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In many settings, machine learning models may be used to inform decisions that impact individuals or entities who interact with the model. Such entities, or agents, may game model decisions by manipulating their inputs to the model to obtain better outcomes and maximize some utility. We consider a multi-agent setting where the goal is to identify the "worst offenders:" agents that are gaming most aggressively. However, identifying such agents is difficult without knowledge of their utility function. Thus, we introduce a framework in which each agent's tendency to game is parameterized via a scalar. We show that this gaming parameter is only partially identifiable. By recasting the problem as a causal effect estimation problem where different agents represent different "treatments," we prove that a ranking of all agents by their gaming parameters is identifiable. We present empirical results in a synthetic data study validating the usage of causal effect estimation for gaming detection and show in a case study of diagnosis coding behavior in the U.S. that our approach highlights features associated with gaming., Comment: 38 pages, 31 figures. NeurIPS 2024
- Published
- 2024
4. RE-Bench: Evaluating frontier AI R&D capabilities of language model agents against human experts
- Author
-
Wijk, Hjalmar, Lin, Tao, Becker, Joel, Jawhar, Sami, Parikh, Neev, Broadley, Thomas, Chan, Lawrence, Chen, Michael, Clymer, Josh, Dhyani, Jai, Ericheva, Elena, Garcia, Katharyn, Goodrich, Brian, Jurkovic, Nikola, Kinniment, Megan, Lajko, Aron, Nix, Seraphina, Sato, Lucas, Saunders, William, Taran, Maksym, West, Ben, and Barnes, Elizabeth
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Frontier AI safety policies highlight automation of AI research and development (R&D) by AI agents as an important capability to anticipate. However, there exist few evaluations for AI R&D capabilities, and none that are highly realistic and have a direct comparison to human performance. We introduce RE-Bench (Research Engineering Benchmark, v1), which consists of 7 challenging, open-ended ML research engineering environments and data from 71 8-hour attempts by 61 distinct human experts. We confirm that our experts make progress in the environments given 8 hours, with 82% of expert attempts achieving a non-zero score and 24% matching or exceeding our strong reference solutions. We compare humans to several public frontier models through best-of-k with varying time budgets and agent designs, and find that the best AI agents achieve a score 4x higher than human experts when both are given a total time budget of 2 hours per environment. However, humans currently display better returns to increasing time budgets, narrowly exceeding the top AI agent scores given an 8-hour budget, and achieving 2x the score of the top AI agent when both are given 32 total hours (across different attempts). Qualitatively, we find that modern AI agents possess significant expertise in many ML topics -- e.g. an agent wrote a faster custom Triton kernel than any of our human experts' -- and can generate and test solutions over ten times faster than humans, at much lower cost. We open-source the evaluation environments, human expert data, analysis code and agent trajectories to facilitate future research.
- Published
- 2024
5. CoNFiLD-inlet: Synthetic Turbulence Inflow Using Generative Latent Diffusion Models with Neural Fields
- Author
-
Liu, Xin-Yang, Parikh, Meet Hemant, Fan, Xiantao, Du, Pan, Wang, Qing, Chen, Yi-Fan, and Wang, Jian-Xun
- Subjects
Physics - Fluid Dynamics ,Computer Science - Machine Learning - Abstract
Eddy-resolving turbulence simulations require stochastic inflow conditions that accurately replicate the complex, multi-scale structures of turbulence. Traditional recycling-based methods rely on computationally expensive precursor simulations, while existing synthetic inflow generators often fail to reproduce realistic coherent structures of turbulence. Recent advances in deep learning (DL) have opened new possibilities for inflow turbulence generation, yet many DL-based methods rely on deterministic, autoregressive frameworks prone to error accumulation, resulting in poor robustness for long-term predictions. In this work, we present CoNFiLD-inlet, a novel DL-based inflow turbulence generator that integrates diffusion models with a conditional neural field (CNF)-encoded latent space to produce realistic, stochastic inflow turbulence. By parameterizing inflow conditions using Reynolds numbers, CoNFiLD-inlet generalizes effectively across a wide range of Reynolds numbers ($Re_\tau$ between $10^3$ and $10^4$) without requiring retraining or parameter tuning. Comprehensive validation through a priori and a posteriori tests in Direct Numerical Simulation (DNS) and Wall-Modeled Large Eddy Simulation (WMLES) demonstrates its high fidelity, robustness, and scalability, positioning it as an efficient and versatile solution for inflow turbulence synthesis., Comment: 27 pages, 10 figures
- Published
- 2024
6. Competing Bandits in Decentralized Large Contextual Matching Markets
- Author
-
Parikh, Satush, Basu, Soumya, Ghosh, Avishek, and Sankararaman, Abishek
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Sequential learning in a multi-agent resource constrained matching market has received significant interest in the past few years. We study decentralized learning in two-sided matching markets where the demand side (aka players or agents) competes for a `large' supply side (aka arms) with potentially time-varying preferences, to obtain a stable match. Despite a long line of work in the recent past, existing learning algorithms such as Explore-Then-Commit or Upper-Confidence-Bound remain inefficient for this problem. In particular, the per-agent regret achieved by these algorithms scales linearly with the number of arms, $K$. Motivated by the linear contextual bandit framework, we assume that for each agent an arm-mean can be represented by a linear function of a known feature vector and an unknown (agent-specific) parameter. Moreover, our setup captures the essence of a dynamic (non-stationary) matching market where the preferences over arms change over time. Our proposed algorithms achieve instance-dependent logarithmic regret, scaling independently of the number of arms, $K$.
- Published
- 2024
7. CRAFT@Large: Building Community Through Co-Making
- Author
-
Zhao, Yiran, Alinea-Bravo, Maria, and Parikh, Niti
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Computers and Society ,K.4.3 - Abstract
CRAFT@Large (C@L) is an initiative launched by the MakerLAB at Cornell Tech to create an inclusive environment for the intercultural and intergenerational exchange of ideas through making. With our approach, we challenge the traditional definition of community outreach performed by academic makerspaces. Existing academic makerspaces often perform community engagement by only offering hourly, one-time workshops or by having community members provide a problem that is then used by students as a project assignment. These approaches position community members as occasional visitors and non-equal contributors, which not only conflict with the core values of co-creation but also limit the makerspaces' impact on connecting the universities and the communities. C@L explored an alternative approach in which we invited community members as long-term and equal co-makers into the academic makerspaces. In this article, we showcase two sets of collaborations that illustrate the continuity of people through co-making. We present how academic makerspaces can function as a hub that connects community members and partner organizations with the campus community in a long-term relationship.
- Published
- 2024
8. Informed Deep Abstaining Classifier: Investigating noise-robust training for diagnostic decision support systems
- Author
-
Schneider, Helen, Nowak, Sebastian, Parikh, Aditya, Layer, Yannik C., Theis, Maike, Block, Wolfgang, Sprinkart, Alois M., Attenberger, Ulrike, and Sifa, Rafet
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Image-based diagnostic decision support systems (DDSS) utilizing deep learning have the potential to optimize clinical workflows. However, developing DDSS requires extensive datasets with expert annotations and is therefore costly. Leveraging report contents from radiological data bases with Natural Language Processing to annotate the corresponding image data promises to replace labor-intensive manual annotation. As mining "real world" databases can introduce label noise, noise-robust training losses are of great interest. However, current noise-robust losses do not consider noise estimations that can for example be derived based on the performance of the automatic label generator used. In this study, we expand the noise-robust Deep Abstaining Classifier (DAC) loss to an Informed Deep Abstaining Classifier (IDAC) loss by incorporating noise level estimations during training. Our findings demonstrate that IDAC enhances the noise robustness compared to DAC and several state-of-the-art loss functions. The results are obtained on various simulated noise levels using a public chest X-ray data set. These findings are reproduced on an in-house noisy data set, where labels were extracted from the clinical systems of the University Hospital Bonn by a text-based transformer. The IDAC can therefore be a valuable tool for researchers, companies or clinics aiming to develop accurate and reliable DDSS from routine clinical data., Comment: This preprint has no post-submission improvements or corrections. The Version of Record of this contribution is published in the Neural Information Processing, ICONIP 2024 Proceedings
- Published
- 2024
9. Regulating Sommerfeld resonances for multi-state systems and higher partial waves
- Author
-
Parikh, Aditya, Sato, Ryosuke, and Slatyer, Tracy R.
- Subjects
High Energy Physics - Phenomenology ,Astrophysics - Cosmology and Nongalactic Astrophysics ,Astrophysics - Astrophysics of Galaxies ,High Energy Physics - Theory - Abstract
Long-range attractive interactions between dark matter particles can significantly enhance their annihilation, particularly at low velocities. This ``Sommerfeld enhancement'' is typically computed by evaluating the deformation of the two-particle wavefunction due to the long-range potential, while ignoring the physics associated with the annihilation, and then scaling the appropriate annihilation matrix elements by factors that depend on the wavefunction in the limit where the particles approach zero relative separation. It has long been recognized that this approach is a valid approximation only in the limit where the annihilation rate is small, and breaks down in the regime where the enhanced annihilation rate approaches the unitarity bound, in which case ignoring the impact of the annihilation physics on the two-particle wavefunction cannot be justified and leads to apparent violations of unitarity. In the case where the physics relevant to annihilation occurs at a parametrically shorter distance scale (higher energy scale) compared with the long-range potential, we provide a simple prescription for correcting the Sommerfeld enhancement for the effects of the short-range physics, valid for all partial waves and for systems where multiple states are coupled by the long-range potential., Comment: 70 pages, 17 figures
- Published
- 2024
10. Driving Privacy Forward: Mitigating Information Leakage within Smart Vehicles through Synthetic Data Generation
- Author
-
Parikh, Krish
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Machine Learning - Abstract
Smart vehicles produce large amounts of data, much of which is sensitive and at risk of privacy breaches. As attackers increasingly exploit anonymised metadata within these datasets to profile drivers, it's important to find solutions that mitigate this information leakage without hindering innovation and ongoing research. Synthetic data has emerged as a promising tool to address these privacy concerns, as it allows for the replication of real-world data relationships while minimising the risk of revealing sensitive information. In this paper, we examine the use of synthetic data to tackle these challenges. We start by proposing a comprehensive taxonomy of 14 in-vehicle sensors, identifying potential attacks and categorising their vulnerability. We then focus on the most vulnerable signals, using the Passive Vehicular Sensor (PVS) dataset to generate synthetic data with a Tabular Variational Autoencoder (TVAE) model, which included over 1 million data points. Finally, we evaluate this against 3 core metrics: fidelity, utility, and privacy. Our results show that we achieved 90.1% statistical similarity and 78% classification accuracy when tested on its original intent while also preventing the profiling of the driver. The code can be found at https://github.com/krish-parikh/Synthetic-Data-Generation
- Published
- 2024
11. A Rapid Trajectory Optimization and Control Framework for Resource-Constrained Applications
- Author
-
Parikh, Deep, Ahrens, Thomas L., and Majji, Manoranjan
- Subjects
Computer Science - Robotics ,Electrical Engineering and Systems Science - Systems and Control - Abstract
This paper presents a computationally efficient model predictive control formulation that uses an integral Chebyshev collocation method to enable rapid operations of autonomous agents. By posing the finite-horizon optimal control problem and recursive re-evaluation of the optimal trajectories, minimization of the L2 norms of the state and control errors are transcribed into a quadratic program. Control and state variable constraints are parameterized using Chebyshev polynomials and are accommodated in the optimal trajectory generation programs to incorporate the actuator limits and keepout constraints. Differentiable collision detection of polytopes is leveraged for optimal collision avoidance. Results obtained from the collocation methods are benchmarked against the existing approaches on an edge computer to outline the performance improvements. Finally, collaborative control scenarios involving multi-agent space systems are considered to demonstrate the technical merits of the proposed work., Comment: This work has been submitted to the IEEE ACC 2025 for possible publication
- Published
- 2024
12. Barr-Zee Diagrams at a High-Energy Muon Collider
- Author
-
Homiller, Samuel, Lodman, Jackie, Parikh, Aditya, and Reece, Matthew
- Subjects
High Energy Physics - Phenomenology - Abstract
The sensitivity of electron EDM experiments has been increasing at a rapid pace, and could yield indications of new physics in the coming decade. An intriguing possibility is that an EDM signal could be generated by new, electroweak-charged particles at the TeV scale that couple to the Higgs and contribute to the electron EDM at two-loop order via Barr-Zee diagrams. A high-energy muon collider could decisively search for new physics at this scale. In this work, we explore this complementarity between colliders and EDM experiments, and note that Barr-Zee diagrams from the aforementioned particles are closely related to vector-boson scattering processes at a muon collider. These loop corrections lead to kinematic features in the differential cross sections of these processes, dictated by the optical theorem. We demonstrate this connection in the context of the singlet-doublet and doublet-triplet extensions to the SM, explore the detectability of these features at a muon collider experiment, and discuss how these measurements can be used to ascertain the underlying model parameters., Comment: 22 pages + appendices, 9 figures
- Published
- 2024
13. Towards Democratization of Subspeciality Medical Expertise
- Author
-
O'Sullivan, Jack W., Palepu, Anil, Saab, Khaled, Weng, Wei-Hung, Cheng, Yong, Chu, Emily, Desai, Yaanik, Elezaby, Aly, Kim, Daniel Seung, Lan, Roy, Tang, Wilson, Tapaskar, Natalie, Parikh, Victoria, Jain, Sneha S., Kulkarni, Kavita, Mansfield, Philip, Webster, Dale, Gottweis, Juraj, Barral, Joelle, Schaekermann, Mike, Tanno, Ryutaro, Mahdavi, S. Sara, Natarajan, Vivek, Karthikesalingam, Alan, Ashley, Euan, and Tu, Tao
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Artificial Intelligence - Abstract
The scarcity of subspecialist medical expertise, particularly in rare, complex and life-threatening diseases, poses a significant challenge for healthcare delivery. This issue is particularly acute in cardiology where timely, accurate management determines outcomes. We explored the potential of AMIE (Articulate Medical Intelligence Explorer), a large language model (LLM)-based experimental AI system optimized for diagnostic dialogue, to potentially augment and support clinical decision-making in this challenging context. We curated a real-world dataset of 204 complex cases from a subspecialist cardiology practice, including results for electrocardiograms, echocardiograms, cardiac MRI, genetic tests, and cardiopulmonary stress tests. We developed a ten-domain evaluation rubric used by subspecialists to evaluate the quality of diagnosis and clinical management plans produced by general cardiologists or AMIE, the latter enhanced with web-search and self-critique capabilities. AMIE was rated superior to general cardiologists for 5 of the 10 domains (with preference ranging from 9% to 20%), and equivalent for the rest. Access to AMIE's response improved cardiologists' overall response quality in 63.7% of cases while lowering quality in just 3.4%. Cardiologists' responses with access to AMIE were superior to cardiologist responses without access to AMIE for all 10 domains. Qualitative examinations suggest AMIE and general cardiologist could complement each other, with AMIE thorough and sensitive, while general cardiologist concise and specific. Overall, our results suggest that specialized medical LLMs have the potential to augment general cardiologists' capabilities by bridging gaps in subspecialty expertise, though further research and validation are essential for wide clinical utility.
- Published
- 2024
14. Cellular Uptake of Phase-Separating Peptide Coacervates.
- Author
-
Shebanova, Anastasia, Perrin, Quentin, Zhu, Kexin, Gudlur, Sushanth, Chen, Zilin, Sun, Yue, Huang, Congxi, Lim, Zhi, Mondarte, Evan, Sun, Ruoxuan, Lim, Sierin, Yu, Jing, Miao, Yansong, Parikh, Atul, Ludwig, Alexander, and Miserez, Ali
- Subjects
cell uptake ,drug delivery ,macropinocytosis ,peptide coacervates ,phagocytosis ,Peptides ,Pinocytosis ,Humans ,Animals ,Mice ,Phagocytosis - Abstract
Peptide coacervates self-assembling via liquid-liquid phase separation are appealing intracellular delivery vehicles of macromolecular therapeutics (proteins, DNA, mRNA) owing to their non-cytotoxicity, high encapsulation capacity, and efficient cellular uptake. However, the mechanisms by which these viscoelastic droplets cross the cellular membranes remain unknown. Here, using multimodal imaging, data analytics, and biochemical inhibition assays, we identify the key steps by which droplets enter the cell. We find that the uptake follows a non-canonical pathway and instead integrates essential features of macropinocytosis and phagocytosis, namely active remodeling of the actin cytoskeleton and appearance of filopodia-like protrusions. Experiments using giant unilamellar vesicles show that the coacervates attach to the bounding membrane in a charge- and cholesterol-dependent manner but do not breach the lipid bilayer barrier. Cell uptake in the presence of small molecule inhibitors - interfering with actin and tubulin polymerization - confirm the active role of cytoskeleton remodeling, most prominently evident in electron microscopy imaging. These findings suggest a peculiar internalization mechanism for viscoelastic, glassy coacervate droplets combining features of non-specific uptake of fluids by macropinocytosis and particulate uptake of phagocytosis. The broad implications of this study will enable to enhance the efficacy and utility of coacervate-based strategies for intracellular delivery of macromolecular therapeutics.
- Published
- 2024
15. Patent Foramen Ovale and Coronary Artery Spasm A New Patent Foramen Ovale-associated Condition that May Explain the Mechanism of Vasospastic Angina
- Author
-
Ravi, Deepak, Parikh, Rushi V, Aboulhosn, Jamil A, and Tobis, Jonathan M
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Heart Disease - Coronary Heart Disease ,Neurosciences ,Headaches ,Cardiovascular ,Brain Disorders ,Migraines ,Atherosclerosis ,Heart Disease ,Pain Research ,2.1 Biological and endogenous factors ,Humans ,Coronary Vasospasm ,Foramen Ovale ,Patent ,Takotsubo Cardiomyopathy ,Angina Pectoris ,Migraine Disorders ,Patent foramen ovale ,Vasospastic angina ,Angina with nonobstructive coronary arteries ,Microvascular dysfunction ,Migraine with aura ,Takotsubo cardiomyopathy ,Migraine ,Vasoactive substances ,Cardiovascular System & Hematology ,Cardiovascular medicine and haematology - Abstract
Patent foramen ovale (PFO) may be an underlying factor in the pathogenesis of migraine, vasospastic angina, and Takotsubo cardiomyopathy. This article reviews the role that PFO may play in each of these clinical entities and discusses potential interventions. It also proposes a novel clinical syndrome wherein PFO may be the unifying link among migraine, coronary vasospasm, and Takotsubo cardiomyopathy in predisposed individuals.
- Published
- 2024
16. The Relationship of Duffy Gene Polymorphism with High-Sensitivity C-Reactive Protein, Mortality, and Cardiovascular Outcomes in Black Individuals
- Author
-
Ha, Edward T, Haessler, Jeffery, Taylor, Kent D, Tuftin, Bjoernar, Briggs, Matt, Parikh, Manish A, Peterson, Stephen J, Gerszten, Robert E, Wilson, James G, Kelsey, Karl, Tahir, Usman A, Seeman, Teresa, Rich, Stephen S, Carson, April P, Post, Wendy S, Kooperberg, Charles, Rotter, Jerome I, Raffield, Laura M, Auer, Paul, and Reiner, Alex P
- Subjects
Biological Sciences ,Genetics ,Prevention ,Atherosclerosis ,Heart Disease ,Cardiovascular ,Aging ,2.1 Biological and endogenous factors ,2.4 Surveillance and distribution ,Good Health and Well Being ,Humans ,Duffy Blood-Group System ,Female ,C-Reactive Protein ,Male ,Polymorphism ,Single Nucleotide ,Cardiovascular Diseases ,Aged ,Middle Aged ,Receptors ,Cell Surface ,Black or African American ,White ,Duffy gene polymorphism ,Duffy receptor for chemokines ,atypical chemokine receptor 1 ,healthcare disparity ,hs-CRP - Abstract
Background: Black adults have higher incidence of all-cause mortality and worse cardiovascular disease (CVD) outcomes when compared to other U.S. populations. The Duffy chemokine receptor is not expressed on erythrocytes in a large majority of Black adults, but the clinical implications of this are unclear. Methods: Here, we investigated the relationship of Duffy receptor status, high-sensitivity C-reactive protein (hs-CRP), and mortality and incident CVD events (coronary heart disease, stroke, and heart failure) in self-identified Black members of three contemporary, longitudinal cohort studies (the Women's Health Initiative, Jackson Heart Study, and Multi-Ethnic Study of Atherosclerosis). Data on 14,358 Black participants (9023 Duffy-null and 5335 Duffy-receptor-positive, as defined using single-nucleotide polymorphism (SNP) rs2814778) were included in this analysis. Results: Duffy null was strongly associated with higher hs-CRP (meta-analysis p = 2.62 × 10-9), but the association was largely attenuated, though still marginally significant (p = 0.005), after conditioning on known CRP locus alleles in linkage disequilibrium with the Duffy gene. In our discovery cohorts, Duffy-null status appeared to be associated with a higher risk of all-cause mortality and incident stroke, though these associations were attenuated and non-significant following adjustment for traditional risk factors including hs-CRP. Moreover, the association of Duffy-null status with mortality could not be replicated in an independent sample of Black adults from the UK Biobank. Conclusions: These findings suggest that the higher levels of hs-CRP found in Duffy-null individuals may be in part independent of CRP alleles known to influence circulating levels of hs-CRP. During the follow-up of this community-based sample of Black participants, Duffy-null status was not associated with mortality or incident CVD events independently of traditional risk factors including hs-CRP.
- Published
- 2024
17. Robust Proximity Operations using Probabilistic Markov Models
- Author
-
Parikh, Deep, Khowaja, Ali Hasnain, and Majji, Manoranjan
- Subjects
Computer Science - Robotics ,Electrical Engineering and Systems Science - Systems and Control - Abstract
A Markov decision process-based state switching is devised, implemented, and analyzed for proximity operations of various autonomous vehicles. The framework contains a pose estimator along with a multi-state guidance algorithm. The unified pose estimator leverages the extended Kalman filter for the fusion of measurements from rate gyroscopes, monocular vision, and ultra-wideband radar sensors. It is also equipped with Mahalonobis distance-based outlier rejection and under-weighting of measurements for robust performance. The use of probabilistic Markov models to transition between various guidance modes is proposed to enable robust and efficient proximity operations. Finally, the framework is validated through an experimental analysis of the docking of two small satellites and the precision landing of an aerial vehicle., Comment: This work has been submitted to the IEEE ICRA 2025 for possible publication. Accompanying video : https://youtu.be/8-fetyf_SrM. arXiv admin note: text overlap with arXiv:2409.09665
- Published
- 2024
18. 'Since Lawyers are Males..': Examining Implicit Gender Bias in Hindi Language Generation by LLMs
- Author
-
Joshi, Ishika, Gupta, Ishita, Dey, Adrita, and Parikh, Tapan
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Human-Computer Interaction - Abstract
Large Language Models (LLMs) are increasingly being used to generate text across various languages, for tasks such as translation, customer support, and education. Despite these advancements, LLMs show notable gender biases in English, which become even more pronounced when generating content in relatively underrepresented languages like Hindi. This study explores implicit gender biases in Hindi text generation and compares them to those in English. We developed Hindi datasets inspired by WinoBias to examine stereotypical patterns in responses from models like GPT-4o and Claude-3 sonnet. Our results reveal a significant gender bias of 87.8% in Hindi, compared to 33.4% in English GPT-4o generation, with Hindi responses frequently relying on gender stereotypes related to occupations, power hierarchies, and social class. This research underscores the variation in gender biases across languages and provides considerations for navigating these biases in generative AI systems.
- Published
- 2024
19. Pose estimation of CubeSats via sensor fusion and Error-State Extended Kalman Filter
- Author
-
Parikh, Deep and Majji, Manoranjan
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
A pose estimation technique based on error-state extended Kalman that fuses angular rates, accelerations, and relative range measurements is presented in this paper. An unconstrained dynamic model with kinematic coupling for a thrust-capable satellite is considered for the state propagation, and a pragmatic measurement model of the rate gyroscope, accelerometer, and an ultra-wideband radio are leveraged for the measurement update. The error-state extended Kalman filter framework is formulated for pose estimation, and its performance has been analyzed via several simulation scenarios. An application of the pose estimator for proximity operations and scaffolding formation of CubeSat deputies relative to their mother-ship is outlined. Finally, the performance of the error-state extended Kalman filter is demonstrated using experimental analysis consisting of a 3-DOF thrust cable satellite mock-up, rate gyroscope, accelerometer, and ultra-wideband radar modules., Comment: Presented at 46th Rocky Mountain AAS Guidance, Navigation and Control (GN&C) Conference, Breckenridge, CO
- Published
- 2024
20. The VIRUS-dE Survey I: Stars in dwarf elliptical galaxies - 3D dynamics and radially resolved stellar initial mass functions
- Author
-
Lipka, Mathias, Thomas, Jens, Saglia, Roberto, Bender, Ralf, Fabricius, Maximilian, Hill, Gary J., Kluge, Matthias, Landriau, Martin, Mazzalay, Ximena, Noyola, Eva, Parikh, Taniya, and Snigula, Jan
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We analyse the stellar structure of a sample of dwarf ellipticals (dE) inhabiting various environments within the Virgo cluster. Integral-field observations with a high spectral resolution allow us to robustly determine their low velocity dispersions ($\sim25$ km s$^{-1}$) and higher-order kinematic moments out to the half-light radius. We find the dEs exhibit a diversity in ages with the younger dEs being less enhanced than the older, suggesting a complex star formation history for those dEs that recently entered Virgo while others have been quenched shortly after reionization. Orbit-superposition modeling allowed us to recover viewing angles, stellar mass-to-light ratios (with gradients), as well as the intrinsic orbit structure. We find that the angular momentum of the dEs is strongly suppressed compared to ordinary early-type galaxies and correlates with the environment. Flattened dEs are so because of a suppressed kinetic energy perpendicular to their equatorial plane. Combining population and dynamical modeling results, we find an age-dependent stellar initial mass function (IMF) or, alternatively, evidence for a more extended star formation history for those galaxies that have had higher initial mass and/or inhabited lower density environments. dEs appear to have a spatially homogeneous stellar structure but the state they were `frozen' in as they stopped forming stars varies dramatically according to their initial conditions., Comment: Accepted for publication in ApJ, 56 pages, 27 figures, 3 tables
- Published
- 2024
21. Proximity operations of CubeSats via sensor fusion of ultra-wideband range measurements with rate gyroscopes, accelerometers and monocular vision
- Author
-
Parikh, Deep, Khowaja, Hasnain, Thakur, Ravi Kumar, and Majji, Manoranjan
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
A robust pose estimation algorithm based on an extended Kalman filter using measurements from accelerometers, rate gyroscopes, monocular vision and ultra-wideband radar is presented. The sensor fusion and pose estimation algorithm incorporates Mahalonobis distance-based outlier rejection and under-weighting of measurements for robust filter performance in the case of sudden range measurements led by the absence of measurements due to range limitations of radar transceivers. The estimator is further validated through an experimental analysis using low-cost radar, IMU and camera sensors. The pose estimate is utilized to perform proximity operations and docking of Transforming Proximity Operations and Docking Service (TPODS) satellite modules with a fixed target.
- Published
- 2024
22. Estimation of inertial properties of a rigid structure maneuvered by satellite modules
- Author
-
Parikh, Deep and Majji, Manoranjan
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
The LASR Laboratory is investigating the use of free-flying spacecraft modules in several on-orbit, servicing and manufacturing (OSAM) activities. Previous work consists of the system development and testing of the aforementioned thrust-capable modules. This study makes advancements to devise, implement and validate an algorithm for the estimation of inertial parameters of a rigid structure, to be maneuvered with the help of Transforming Proximity Operations and Docking Service (TPODS) satellite modules. The primary contribution of this activity is observability analysis to infer a conducive input sequence for estimating the inertial parameters. For the experimental validation of proposed estimation algorithm, real-time pose measurements are logged through the VICON motion capture system and the recorded data is utilized to assess the performance of the estimation algorithm to predict mass and moment of inertia of an isolated TPODS module., Comment: Presented at 2023 AAS/AIAA Astrodynamics Specialist Conference, Big Sky, MT
- Published
- 2024
23. A Scalable Tabletop Satellite Automation Testbed:Design And Experiments
- Author
-
Parikh, Deep, Khowaja, Ali Hasnain, Long, Nathan, Down, Ian, McElreath, James, Bire, Aniket, and Majji, Manoranjan
- Subjects
Computer Science - Robotics ,Electrical Engineering and Systems Science - Systems and Control - Abstract
This paper presents a detailed system design and component selection for the Transforming Proximity Operations and Docking Service (TPODS) module, designed to gain custody of uncontrolled resident space objects (RSOs) via rendezvous and proximity operation (RPO). In addition to serving as a free-flying robotic manipulator to work with cooperative and uncooperative RSOs, the TPODS modules are engineered to have the ability to cooperate with one another to build scaffolding for more complex satellite servicing activities. The structural design of the prototype module is inspired by Tensegrity principles, minimizing the structural mass of the modules frame. The prototype TPODS module is fabricated using lightweight polycarbonate with an aluminum or carbon fiber frame. The inner shell that houses various electronic and pneumatic components is 3-D printed using ABS material. Four OpenMV H7 R1 cameras are used for the pose estimation of resident space objects (RSOs), including other TPODS modules. Compressed air supplied by an external source is used for the initial testing and can be replaced by module-mounted nitrogen pressure vessels for full on-board propulsion later. A Teensy 4.1 single-board computer is used as a central command unit that receives data from the four OpenMV cameras, and commands its thrusters based on the control logic., Comment: Presented at 45th Rocky Mountain AAS Guidance, Navigation and Control (GN&C) Conference, Breckenridge, CO
- Published
- 2024
24. Benchmarking the Performance of Large Language Models on the Cerebras Wafer Scale Engine
- Author
-
Zhang, Zuoning, Parikh, Dhruv, Zhang, Youning, and Prasanna, Viktor
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Transformer based Large Language Models (LLMs) have recently reached state of the art performance in Natural Language Processing (NLP) and Computer Vision (CV) domains. LLMs use the Multi-Headed Self-Attention (MHSA) mechanism to capture long-range global attention relationships among input words or image patches, drastically improving its performance over prior deep learning approaches. In this paper, we evaluate the performance of LLMs on the Cerebras Wafer Scale Engine (WSE). Cerebras WSE is a high performance computing system with 2.6 trillion transistors, 850,000 cores and 40 GB on-chip memory. Cerebras WSE's Sparse Linear Algebra Compute (SLAC) cores eliminates multiply-by-zeros operations and its 40 GB of on-chip memory is uniformly distributed among SLAC cores, enabling fast local access to model parameters. Moreover, Cerebras software configures routing between cores at runtime, optimizing communication overhead among cores. As LLMs are becoming more commonly used, new hardware architectures are needed to accelerate LLMs training and inference. We benchmark the effectiveness of this hardware architecture at accelerating LLMs training and inference. Additionally, we analyze if Cerebras WSE can scale the memory-wall associated with traditionally memory-bound compute tasks using its 20 PB/s high bandwidth memory. Furthermore, we examine the performance scalability of Cerebras WSE through a roofline model. By plotting performance metrics against computational intensity, we aim to assess their effectiveness at handling high compute-intensive LLMs training and inference tasks., Comment: IEEE HPEC 2024
- Published
- 2024
25. Localization and the Floer homology of strongly invertible knots
- Author
-
Parikh, Aakash
- Subjects
Mathematics - Geometric Topology ,57K18, 53D40 - Abstract
We establish two spectral sequences in knot Floer homology associated to a directed strongly invertible knot K: one from the knot Floer homology of K to a two dimensional vector space, and one from the singular knot Floer homology of a singular knot associated to K to the knot Floer homology quotient knot of K. The first of these spectral sequences is used to define a numerical invariant of strongly invertible knots., Comment: 50 pages, 26 figures. Comments welcome!
- Published
- 2024
26. The Llama 3 Herd of Models
- Author
-
Grattafiori, Aaron, Dubey, Abhimanyu, Jauhri, Abhinav, Pandey, Abhinav, Kadian, Abhishek, Al-Dahle, Ahmad, Letman, Aiesha, Mathur, Akhil, Schelten, Alan, Vaughan, Alex, Yang, Amy, Fan, Angela, Goyal, Anirudh, Hartshorn, Anthony, Yang, Aobo, Mitra, Archi, Sravankumar, Archie, Korenev, Artem, Hinsvark, Arthur, Rao, Arun, Zhang, Aston, Rodriguez, Aurelien, Gregerson, Austen, Spataru, Ava, Roziere, Baptiste, Biron, Bethany, Tang, Binh, Chern, Bobbie, Caucheteux, Charlotte, Nayak, Chaya, Bi, Chloe, Marra, Chris, McConnell, Chris, Keller, Christian, Touret, Christophe, Wu, Chunyang, Wong, Corinne, Ferrer, Cristian Canton, Nikolaidis, Cyrus, Allonsius, Damien, Song, Daniel, Pintz, Danielle, Livshits, Danny, Wyatt, Danny, Esiobu, David, Choudhary, Dhruv, Mahajan, Dhruv, Garcia-Olano, Diego, Perino, Diego, Hupkes, Dieuwke, Lakomkin, Egor, AlBadawy, Ehab, Lobanova, Elina, Dinan, Emily, Smith, Eric Michael, Radenovic, Filip, Guzmán, Francisco, Zhang, Frank, Synnaeve, Gabriel, Lee, Gabrielle, Anderson, Georgia Lewis, Thattai, Govind, Nail, Graeme, Mialon, Gregoire, Pang, Guan, Cucurell, Guillem, Nguyen, Hailey, Korevaar, Hannah, Xu, Hu, Touvron, Hugo, Zarov, Iliyan, Ibarra, Imanol Arrieta, Kloumann, Isabel, Misra, Ishan, Evtimov, Ivan, Zhang, Jack, Copet, Jade, Lee, Jaewon, Geffert, Jan, Vranes, Jana, Park, Jason, Mahadeokar, Jay, Shah, Jeet, van der Linde, Jelmer, Billock, Jennifer, Hong, Jenny, Lee, Jenya, Fu, Jeremy, Chi, Jianfeng, Huang, Jianyu, Liu, Jiawen, Wang, Jie, Yu, Jiecao, Bitton, Joanna, Spisak, Joe, Park, Jongsoo, Rocca, Joseph, Johnstun, Joshua, Saxe, Joshua, Jia, Junteng, Alwala, Kalyan Vasuden, Prasad, Karthik, Upasani, Kartikeya, Plawiak, Kate, Li, Ke, Heafield, Kenneth, Stone, Kevin, El-Arini, Khalid, Iyer, Krithika, Malik, Kshitiz, Chiu, Kuenley, Bhalla, Kunal, Lakhotia, Kushal, Rantala-Yeary, Lauren, van der Maaten, Laurens, Chen, Lawrence, Tan, Liang, Jenkins, Liz, Martin, Louis, Madaan, Lovish, Malo, Lubo, Blecher, Lukas, Landzaat, Lukas, de Oliveira, Luke, Muzzi, Madeline, Pasupuleti, Mahesh, Singh, Mannat, Paluri, Manohar, Kardas, Marcin, Tsimpoukelli, Maria, Oldham, Mathew, Rita, Mathieu, Pavlova, Maya, Kambadur, Melanie, Lewis, Mike, Si, Min, Singh, Mitesh Kumar, Hassan, Mona, Goyal, Naman, Torabi, Narjes, Bashlykov, Nikolay, Bogoychev, Nikolay, Chatterji, Niladri, Zhang, Ning, Duchenne, Olivier, Çelebi, Onur, Alrassy, Patrick, Zhang, Pengchuan, Li, Pengwei, Vasic, Petar, Weng, Peter, Bhargava, Prajjwal, Dubal, Pratik, Krishnan, Praveen, Koura, Punit Singh, Xu, Puxin, He, Qing, Dong, Qingxiao, Srinivasan, Ragavan, Ganapathy, Raj, Calderer, Ramon, Cabral, Ricardo Silveira, Stojnic, Robert, Raileanu, Roberta, Maheswari, Rohan, Girdhar, Rohit, Patel, Rohit, Sauvestre, Romain, Polidoro, Ronnie, Sumbaly, Roshan, Taylor, Ross, Silva, Ruan, Hou, Rui, Wang, Rui, Hosseini, Saghar, Chennabasappa, Sahana, Singh, Sanjay, Bell, Sean, Kim, Seohyun Sonia, Edunov, Sergey, Nie, Shaoliang, Narang, Sharan, Raparthy, Sharath, Shen, Sheng, Wan, Shengye, Bhosale, Shruti, Zhang, Shun, Vandenhende, Simon, Batra, Soumya, Whitman, Spencer, Sootla, Sten, Collot, Stephane, Gururangan, Suchin, Borodinsky, Sydney, Herman, Tamar, Fowler, Tara, Sheasha, Tarek, Georgiou, Thomas, Scialom, Thomas, Speckbacher, Tobias, Mihaylov, Todor, Xiao, Tong, Karn, Ujjwal, Goswami, Vedanuj, Gupta, Vibhor, Ramanathan, Vignesh, Kerkez, Viktor, Gonguet, Vincent, Do, Virginie, Vogeti, Vish, Albiero, Vítor, Petrovic, Vladan, Chu, Weiwei, Xiong, Wenhan, Fu, Wenyin, Meers, Whitney, Martinet, Xavier, Wang, Xiaodong, Wang, Xiaofang, Tan, Xiaoqing Ellen, Xia, Xide, Xie, Xinfeng, Jia, Xuchao, Wang, Xuewei, Goldschlag, Yaelle, Gaur, Yashesh, Babaei, Yasmine, Wen, Yi, Song, Yiwen, Zhang, Yuchen, Li, Yue, Mao, Yuning, Coudert, Zacharie Delpierre, Yan, Zheng, Chen, Zhengxing, Papakipos, Zoe, Singh, Aaditya, Srivastava, Aayushi, Jain, Abha, Kelsey, Adam, Shajnfeld, Adam, Gangidi, Adithya, Victoria, Adolfo, Goldstand, Ahuva, Menon, Ajay, Sharma, Ajay, Boesenberg, Alex, Baevski, Alexei, Feinstein, Allie, Kallet, Amanda, Sangani, Amit, Teo, Amos, Yunus, Anam, Lupu, Andrei, Alvarado, Andres, Caples, Andrew, Gu, Andrew, Ho, Andrew, Poulton, Andrew, Ryan, Andrew, Ramchandani, Ankit, Dong, Annie, Franco, Annie, Goyal, Anuj, Saraf, Aparajita, Chowdhury, Arkabandhu, Gabriel, Ashley, Bharambe, Ashwin, Eisenman, Assaf, Yazdan, Azadeh, James, Beau, Maurer, Ben, Leonhardi, Benjamin, Huang, Bernie, Loyd, Beth, De Paola, Beto, Paranjape, Bhargavi, Liu, Bing, Wu, Bo, Ni, Boyu, Hancock, Braden, Wasti, Bram, Spence, Brandon, Stojkovic, Brani, Gamido, Brian, Montalvo, Britt, Parker, Carl, Burton, Carly, Mejia, Catalina, Liu, Ce, Wang, Changhan, Kim, Changkyu, Zhou, Chao, Hu, Chester, Chu, Ching-Hsiang, Cai, Chris, Tindal, Chris, Feichtenhofer, Christoph, Gao, Cynthia, Civin, Damon, Beaty, Dana, Kreymer, Daniel, Li, Daniel, Adkins, David, Xu, David, Testuggine, Davide, David, Delia, Parikh, Devi, Liskovich, Diana, Foss, Didem, Wang, Dingkang, Le, Duc, Holland, Dustin, Dowling, Edward, Jamil, Eissa, Montgomery, Elaine, Presani, Eleonora, Hahn, Emily, Wood, Emily, Le, Eric-Tuan, Brinkman, Erik, Arcaute, Esteban, Dunbar, Evan, Smothers, Evan, Sun, Fei, Kreuk, Felix, Tian, Feng, Kokkinos, Filippos, Ozgenel, Firat, Caggioni, Francesco, Kanayet, Frank, Seide, Frank, Florez, Gabriela Medina, Schwarz, Gabriella, Badeer, Gada, Swee, Georgia, Halpern, Gil, Herman, Grant, Sizov, Grigory, Guangyi, Zhang, Lakshminarayanan, Guna, Inan, Hakan, Shojanazeri, Hamid, Zou, Han, Wang, Hannah, Zha, Hanwen, Habeeb, Haroun, Rudolph, Harrison, Suk, Helen, Aspegren, Henry, Goldman, Hunter, Zhan, Hongyuan, Damlaj, Ibrahim, Molybog, Igor, Tufanov, Igor, Leontiadis, Ilias, Veliche, Irina-Elena, Gat, Itai, Weissman, Jake, Geboski, James, Kohli, James, Lam, Janice, Asher, Japhet, Gaya, Jean-Baptiste, Marcus, Jeff, Tang, Jeff, Chan, Jennifer, Zhen, Jenny, Reizenstein, Jeremy, Teboul, Jeremy, Zhong, Jessica, Jin, Jian, Yang, Jingyi, Cummings, Joe, Carvill, Jon, Shepard, Jon, McPhie, Jonathan, Torres, Jonathan, Ginsburg, Josh, Wang, Junjie, Wu, Kai, U, Kam Hou, Saxena, Karan, Khandelwal, Kartikay, Zand, Katayoun, Matosich, Kathy, Veeraraghavan, Kaushik, Michelena, Kelly, Li, Keqian, Jagadeesh, Kiran, Huang, Kun, Chawla, Kunal, Huang, Kyle, Chen, Lailin, Garg, Lakshya, A, Lavender, Silva, Leandro, Bell, Lee, Zhang, Lei, Guo, Liangpeng, Yu, Licheng, Moshkovich, Liron, Wehrstedt, Luca, Khabsa, Madian, Avalani, Manav, Bhatt, Manish, Mankus, Martynas, Hasson, Matan, Lennie, Matthew, Reso, Matthias, Groshev, Maxim, Naumov, Maxim, Lathi, Maya, Keneally, Meghan, Liu, Miao, Seltzer, Michael L., Valko, Michal, Restrepo, Michelle, Patel, Mihir, Vyatskov, Mik, Samvelyan, Mikayel, Clark, Mike, Macey, Mike, Wang, Mike, Hermoso, Miquel Jubert, Metanat, Mo, Rastegari, Mohammad, Bansal, Munish, Santhanam, Nandhini, Parks, Natascha, White, Natasha, Bawa, Navyata, Singhal, Nayan, Egebo, Nick, Usunier, Nicolas, Mehta, Nikhil, Laptev, Nikolay Pavlovich, Dong, Ning, Cheng, Norman, Chernoguz, Oleg, Hart, Olivia, Salpekar, Omkar, Kalinli, Ozlem, Kent, Parkin, Parekh, Parth, Saab, Paul, Balaji, Pavan, Rittner, Pedro, Bontrager, Philip, Roux, Pierre, Dollar, Piotr, Zvyagina, Polina, Ratanchandani, Prashant, Yuvraj, Pritish, Liang, Qian, Alao, Rachad, Rodriguez, Rachel, Ayub, Rafi, Murthy, Raghotham, Nayani, Raghu, Mitra, Rahul, Parthasarathy, Rangaprabhu, Li, Raymond, Hogan, Rebekkah, Battey, Robin, Wang, Rocky, Howes, Russ, Rinott, Ruty, Mehta, Sachin, Siby, Sachin, Bondu, Sai Jayesh, Datta, Samyak, Chugh, Sara, Hunt, Sara, Dhillon, Sargun, Sidorov, Sasha, Pan, Satadru, Mahajan, Saurabh, Verma, Saurabh, Yamamoto, Seiji, Ramaswamy, Sharadh, Lindsay, Shaun, Feng, Sheng, Lin, Shenghao, Zha, Shengxin Cindy, Patil, Shishir, Shankar, Shiva, Zhang, Shuqiang, Wang, Sinong, Agarwal, Sneha, Sajuyigbe, Soji, Chintala, Soumith, Max, Stephanie, Chen, Stephen, Kehoe, Steve, Satterfield, Steve, Govindaprasad, Sudarshan, Gupta, Sumit, Deng, Summer, Cho, Sungmin, Virk, Sunny, Subramanian, Suraj, Choudhury, Sy, Goldman, Sydney, Remez, Tal, Glaser, Tamar, Best, Tamara, Koehler, Thilo, Robinson, Thomas, Li, Tianhe, Zhang, Tianjun, Matthews, Tim, Chou, Timothy, Shaked, Tzook, Vontimitta, Varun, Ajayi, Victoria, Montanez, Victoria, Mohan, Vijai, Kumar, Vinay Satish, Mangla, Vishal, Ionescu, Vlad, Poenaru, Vlad, Mihailescu, Vlad Tiberiu, Ivanov, Vladimir, Li, Wei, Wang, Wenchen, Jiang, Wenwen, Bouaziz, Wes, Constable, Will, Tang, Xiaocheng, Wu, Xiaojian, Wang, Xiaolan, Wu, Xilun, Gao, Xinbo, Kleinman, Yaniv, Chen, Yanjun, Hu, Ye, Jia, Ye, Qi, Ye, Li, Yenda, Zhang, Yilin, Zhang, Ying, Adi, Yossi, Nam, Youngjin, Yu, Wang, Zhao, Yu, Hao, Yuchen, Qian, Yundi, Li, Yunlu, He, Yuzi, Rait, Zach, DeVito, Zachary, Rosnbrick, Zef, Wen, Zhaoduo, Yang, Zhenyu, Zhao, Zhiwei, and Ma, Zhiyu
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical evaluation of Llama 3. We find that Llama 3 delivers comparable quality to leading language models such as GPT-4 on a plethora of tasks. We publicly release Llama 3, including pre-trained and post-trained versions of the 405B parameter language model and our Llama Guard 3 model for input and output safety. The paper also presents the results of experiments in which we integrate image, video, and speech capabilities into Llama 3 via a compositional approach. We observe this approach performs competitively with the state-of-the-art on image, video, and speech recognition tasks. The resulting models are not yet being broadly released as they are still under development.
- Published
- 2024
27. Exploring Facial Biomarkers for Depression through Temporal Analysis of Action Units
- Author
-
Parikh, Aditya, Sadeghi, Misha, and Eskofier, Bjorn
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Depression is characterized by persistent sadness and loss of interest, significantly impairing daily functioning and now a widespread mental disorder. Traditional diagnostic methods rely on subjective assessments, necessitating objective approaches for accurate diagnosis. Our study investigates the use of facial action units (AUs) and emotions as biomarkers for depression. We analyzed facial expressions from video data of participants classified with or without depression. Our methodology involved detailed feature extraction, mean intensity comparisons of key AUs, and the application of time series classification models. Furthermore, we employed Principal Component Analysis (PCA) and various clustering algorithms to explore the variability in emotional expression patterns. Results indicate significant differences in the intensities of AUs associated with sadness and happiness between the groups, highlighting the potential of facial analysis in depression assessment.
- Published
- 2024
28. ClaimCompare: A Data Pipeline for Evaluation of Novelty Destroying Patent Pairs
- Author
-
Parikh, Arav and Dori-Hacohen, Shiri
- Subjects
Computer Science - Information Retrieval - Abstract
A fundamental step in the patent application process is the determination of whether there exist prior patents that are novelty destroying. This step is routinely performed by both applicants and examiners, in order to assess the novelty of proposed inventions among the millions of applications filed annually. However, conducting this search is time and labor-intensive, as searchers must navigate complex legal and technical jargon while covering a large amount of legal claims. Automated approaches using information retrieval and machine learning approaches to detect novelty destroying patents present a promising avenue to streamline this process, yet research focusing on this space remains limited. In this paper, we introduce a novel data pipeline, ClaimCompare, designed to generate labeled patent claim datasets suitable for training IR and ML models to address this challenge of novelty destruction assessment. To the best of our knowledge, ClaimCompare is the first pipeline that can generate multiple novelty destroying patent datasets. To illustrate the practical relevance of this pipeline, we utilize it to construct a sample dataset comprising of over 27K patents in the electrochemical domain: 1,045 base patents from USPTO, each associated with 25 related patents labeled according to their novelty destruction towards the base patent. Subsequently, we conduct preliminary experiments showcasing the efficacy of this dataset in fine-tuning transformer models to identify novelty destroying patents, demonstrating 29.2% and 32.7% absolute improvement in MRR and P@1, respectively.
- Published
- 2024
29. National Costs for Cardiovascular-Related Hospitalizations and Inpatient Procedures in the United States, 2016 to 2021
- Author
-
Haidar, Amier, Gajjar, Aryan, Parikh, Rushi V, Benharash, Peyman, Fonarow, Gregg C, Watson, Karol, Needleman, Jack, and Ziaeian, Boback
- Subjects
Biomedical and Clinical Sciences ,Cardiovascular Medicine and Haematology ,Clinical Research ,Bioengineering ,Cardiovascular ,Heart Disease ,Health Services ,Heart Disease - Coronary Heart Disease ,Good Health and Well Being ,Cardiovascular hospitalizations ,health care costs ,National Inpatient Sample ,Cardiorespiratory Medicine and Haematology ,Cardiovascular System & Hematology ,Cardiovascular medicine and haematology - Abstract
The current economic burden of cardiovascular (CV)-related hospitalizations grouped by diagnoses and procedures in the United States has not been well characterized. The objective was to identify current trends in CV-related hospitalizations, procedural utilization, and health care costs using the most recent 6 years of hospitalization data. A retrospective analysis of discharge data from the National Inpatient Sample database was conducted to determine trends in CV-related hospitalizations, costs, and procedures for each year from 2016 to the most recent available dataset, 2021. Total CV-related costs were adjusted to and reported in 2023 dollars. In 2021, there were 4,687,370 CV-related hospitalizations at a cost of $108 billion. Heart failure hospitalizations accounted for the highest costs at $18.5 billion, followed by non-ST-elevation myocardial infarction at $11.2 billion and stroke at $10.9 billion. Significant upward trends in costs from 2016 to 2021 were observed for heart failure, stroke, atrial fibrillation, ST-elevation myocardial infarction, chest pain, hypertensive emergency, ventricular tachycardia, aortic dissection, sudden cardiac death, pericarditis, supraventricular tachycardia, and pulmonary heart disease. Over the 6 observational years, total costs increased by over $10 billion, representing a 10% increase. However, the increases were not linear, as there was a significant increase of 6.5% from 2018 to 2019, then a decrease of over 7% from 2019 to 2020, followed by an increase of approximately 6% from 2020 to 2021. By 2030, total CV-related costs are projected to reach $131.3 billion. For all years, coronary procedures were the most performed, followed by extracorporeal membrane oxygenation, non-bypass peripheral vascular surgery, pacemaker placement, and coronary artery bypass graft surgery. Both transcatheter aortic valve replacement and MitraClip procedures demonstrated significant upward trends from 2016 to 2021. Overall, from the years 2016 to 2021, CV-related hospitalizations, costs, and procedures demonstrated upward trends. In conclusion, CV disease remains a high burden in the hospital setting with tremendous health care costs.
- Published
- 2025
30. Looking beyond College: STEM College Seniors on Entering the Workforce and the Impact of Race and Gender
- Author
-
Julie J. Park, Jia Zheng, Kristyn Lue, Cinthya Salazar, Arman M. Liwanag, Roshan M. Parikh, and Julia L. Anderson
- Abstract
While a strong literature base exists around undergraduate experiences in Science, Technology, Engineering, and Math (STEM), few studies examine how students approach the question of "what's next" after graduation. This study examines the impact of social ties on STEM college seniors' plans to enter the STEM workforce, and how race/ethnicity and gender impact postgraduation planning in STEM. We interviewed a racially diverse sample of 39 STEM college seniors at a predominantly White research institution. Analysis showed that students relied on weak and strong social ties in obtaining job leads and valued diversity in the workplace. Some students of color and women experienced negative social ties (via racism and sexism) during internship experiences, which shaped their thinking around postgraduate opportunities. We discuss implications for equity, as well as recommendations for research and practice.
- Published
- 2024
- Full Text
- View/download PDF
31. ACR-ARS Practice Parameter for the Performance of Proton Beam Therapy
- Author
-
Frank, Steven J, Das, Indra J, Simone, Charles B, Davis, Brian J, Deville, Curtiland, Liao, Zhongxing, Lo, Simon S, McGovern, Susan L, Parikh, Rahul R, Reilly, Michael, Small, William, and Schechter, Naomi R
- Subjects
Medical and Biological Physics ,Physical Sciences ,Biomedical and Clinical Sciences ,Oncology and Carcinogenesis ,Radiation Oncology ,Cancer ,7.3 Management and decision making ,Quality Education ,Proton therapy ,ACR ,ARS ,Practice parameter ,Radiation oncology - Abstract
PURPOSE: This practice parameter for the performance of proton beam radiation therapy was revised collaboratively by the American College of Radiology (ACR) and the American Radium Society (ARS). This practice parameter was developed to serve as a tool in the appropriate application of proton therapy in the care of cancer patients or other patients with conditions in which radiation therapy is indicated. It addresses clinical implementation of proton radiation therapy, including personnel qualifications, quality assurance (QA) standards, indications, and suggested documentation. MATERIALS AND METHODS: This practice parameter for the performance of proton beam radiation therapy was developed according to the process described under the heading The Process for Developing ACR Practice Parameters and Technical Standards on the ACR website (https://www.acr.org/Clinical-Resources/Practice-Parameters-and-Technical-Standards) by the Committee on Practice Parameters - Radiation Oncology of the ACR Commission on Radiation Oncology in collaboration with the ARS. RESULTS: The qualifications and responsibilities of personnel, such as the proton center Chief Medical Officer or Medical Director, Radiation Oncologist, Radiation Physicist, Dosimetrist and Therapist, are outlined, including the necessity for continuing medical education. Proton therapy standard clinical indications and methodologies of treatment management are outlined by disease site and treatment group (e.g. pediatrics) including documentation and the process of proton therapy workflow and equipment specifications. Additionally, this proton therapy practice parameter updates policies and procedures related to a quality assurance and performance improvement program (QAPI), patient education, infection control, and safety. CONCLUSION: As proton therapy becomes more accessible to cancer patients, policies and procedures as outlined in this practice parameter will help ensure quality and safety programs are effectively implemented to optimize clinical care.
- Published
- 2024
32. Guideline-directed medical therapy prescribing patterns and in-hospital outcomes among heart failure patients during COVID-19.
- Author
-
Srivastava, Pratyaksh, Klomhaus, Alexandra, Rafique, Asim, Desai, Pooja, Daniels, Lori, Yancy, Clyde, Yang, Eric, Fonarow, Gregg, and Parikh, Rushi
- Subjects
COVID-19 ,Guideline-directed medical therapy ,Heart failure with reduced ejection fraction - Abstract
STUDY OBJECTIVE: The association of prior to admission guideline-directed medical therapy (GDMT) use in patients hospitalized with Heart Failure with Reduced Ejection Fraction (HFrEF, ejection fraction ≤40 %) and Coronavirus Disease 2019 (COVID-19) with in-hospital outcomes has not been well studied. DESIGN/SETTING/PARTICIPANTS/INTERVENTIONS/OUTCOME MEASURES: Using the American Heart Associations Get With The Guidelines Heart Failure Registry, we identified HFrEF patients presenting with acute decompensated heart failure (ADHF) and compared rates of GDMT prescription between those presenting prior to and during the pandemic. In a subgroup of patients with a concomitant COVID-19 diagnosis, we evaluated the association of prior to admission GDMT use with in-hospital mortality and severe COVID-19. RESULTS: 23,899 patients were admitted with HFrEF during the pandemic (2/16/20-3/24/21) and 26,459 patients were admitted in the year prior (2/16/19-2/15/20). In this overall cohort, prior to admission ACEI/ARB/ARNI (45.6 % vs 48.1 %, p
- Published
- 2024
33. First Case of HIV Seroconversion With Integrase Resistance Mutations on Long-Acting Cabotegravir for Prevention in Routine Care
- Author
-
Koss, Catherine A, Gandhi, Monica, Halvas, Elias K, Okochi, Hideaki, Chu, Carolyn, Glidden, David V, Gomez, Lisa Georgetti, Heaps, Amy L, Conroy, Amy A, Tran, Michael, Shetler, Cory, Hoeth, Dianna, Kuncze, Karen, Louie, Alexander, Garza, Hana Rivera, Mugoma, Erick Wafula, Penrose, Kerri J, Chohan, Bhavna H, Ayieko, James O, Mills, Anthony, Patel, Rupa R, Mellors, John W, and Parikh, Urvi M
- Subjects
Medical Microbiology ,Biomedical and Clinical Sciences ,Clinical Sciences ,Infectious Diseases ,Women's Health ,Sexually Transmitted Infections ,Genetics ,HIV/AIDS ,Prevention ,Infection ,Good Health and Well Being ,breakthrough infection ,HIV prevention ,pharmacokinetics ,pre-exposure prophylaxis ,resistance ,Clinical sciences ,Medical microbiology - Abstract
BackgroundLong-acting cabotegravir (CAB-LA) is highly effective for HIV prevention, but delayed HIV diagnoses and integrase strand transfer inhibitor (INSTI) resistance were observed in trials. We report the first case in routine clinical care of HIV infection on CAB-LA with INSTI resistance.MethodsThe SeroPrEP study enrolls individuals in the United States who acquire HIV on pre-exposure prophylaxis modalities to assess diagnostics, antiretroviral (ARV) drug levels, resistance, and treatment outcomes. Resistance mutations in full-length HIV-1 integrase were identified by single-genome sequencing (SGS). Cabotegravir concentrations in plasma and hair segments were measured by liquid chromatography-tandem mass spectrometry.ResultsA 23-year-old gender-nonbinary person, male at birth, restarted CAB-LA 6 months after discontinuation due to losing insurance. Prior to restart, HIV-1 RNA was not detected, but 20 days elapsed before CAB-LA injection. After the second CAB-LA injection, HIV antigen/antibody returned reactive (HIV-1 RNA 451 copies/mL). SGS of plasma HIV-1 RNA identified INSTI mutation Q148R in 2/24 sequences 2 days postdiagnosis; commercial genotype failed amplification. Cabotegravir hair concentration was 0.190 ng/mg 2 weeks prediagnosis; plasma cabotegravir was high (3.37 μg/mL; ∼20× PA-IC90) 14 days postdiagnosis. Viral suppression was maintained for 6 months on darunavir/cobicistat/emtricitabine/tenofovir alafenamide, then switched to doravirine + emtricitabine/tenofovir alafenamide due to nausea.ConclusionsIn this first case of HIV infection on CAB-LA with INSTI resistance in routine care, cabotegravir resistance was detected only with a sensitive research assay. Accelerated pathways to minimize time between HIV testing and CAB-LA initiation are needed to optimize acute HIV detection and mitigate resistance risk. Sustained product access regardless of insurance is imperative to reduce HIV infections on CAB-LA.
- Published
- 2024
34. Application of Differential Subsampling with Cartesian Ordering in Evaluating Left Ovarian Venous Reflux for Pretreatment Planning for Pelvic Venous Disorders.
- Author
-
Ebrahimi, Sheida, Siddiqui, Nawal, Besser, Alexandra, Rodriguez-Soto, Ana, Yu, Hon, Boone, Christine, Hsiao, Albert, Roberts, Anne, Parikh, Rupal, and Rakow-Penner, Rebecca
- Subjects
diagnostic imaging ,dynamic contrast-enhanced MRI ,pelvic pain ,pelvic venous disorder - Abstract
The diagnosis of a common cause of chronic pelvic pain can be made by visualizing reflux in the ovarian veins. Fluoroscopic venography is the gold standard for diagnosing ovarian vein reflux, but it is an invasive technique that exposes patients to ionizing radiation. MRI, with its lack of ionizing radiation and capability of high-temporal and spatial-resolution vascular imaging, has the potential to provide similar diagnostic information. This retrospective report describes and assesses the utility of a dynamic contrast-enhanced MRI technique based on Differential Subsampling with Cartesian Ordering (DISCO)-MRI in 30 patients with chronic pelvic pain. Among the 14 patients who underwent both DISCO-MRI and fluoroscopic venograms, 11 (78.6%) exhibited concordant results, while 3 patients (21.4%) had discordant findings. These results suggest the potential of multiphasic contrast-enhanced DISCO-MRI as a non-invasive diagnostic tool for evaluating chronic pelvic pain.
- Published
- 2024
35. Validation of Biomechanical Computed Tomography for Fracture Risk Classification in Metastatic Hormone-sensitive Prostate Cancer.
- Author
-
Lin, John, Hearn, Caleb, Getzen, Emily, Long, Qi, Lee, David, Keaveny, Tony, Jayadevappa, Ravishankar, Robinson, Kyle, Wong, Yu-Ning, Maxwell, Kara, Narayan, Vivek, Haas, Naomi, Takvorian, Samuel, Bikle, Daniel, Chiang, Janet, Khan, Amna, Rajapakse, Chamith, Morgans, Alicia, and Parikh, Ravi
- Subjects
Androgen deprivation therapy ,Antiresorptive therapy ,Biomechanical computed tomography ,Dual x-ray absorptiometry ,Fracture ,Prostate cancer ,Humans ,Male ,Prostatic Neoplasms ,Aged ,Retrospective Studies ,Bone Density ,Tomography ,X-Ray Computed ,Risk Assessment ,Fractures ,Bone ,Middle Aged ,Androgen Antagonists ,Absorptiometry ,Photon ,Cohort Studies ,Biomechanical Phenomena - Abstract
BACKGROUND: Guidelines recommend dual-energy x-ray absorptiometry (DXA) screening to assess fracture risk and benefit from antiresorptive therapy in men with metastatic hormone-sensitive prostate cancer (mHSPC) on androgen deprivation therapy (ADT). However,
- Published
- 2024
36. Minimal Residual Disease using a Plasma-Only Circulating Tumor DNA Assay to Predict Recurrence of Metastatic Colorectal Cancer Following Curative Intent Treatment.
- Author
-
Parikh, Aparna, Chee, Bryant, Tsai, Jill, Rich, Thereasa, Price, Kristin, Patel, Sonia, Zhang, Li, Ibrahim, Faaiz, Esquivel, Mikaela, Van Seventer, Emily, Jarnagin, Joy, Raymond, Victoria, Corvera, Carlos, Hirose, Kenzo, Nakakura, Eric, Corcoran, Ryan, Van Loon, Katherine, and Atreya, Chloe
- Subjects
Humans ,Colorectal Neoplasms ,Circulating Tumor DNA ,Neoplasm ,Residual ,Female ,Male ,Neoplasm Recurrence ,Local ,Middle Aged ,Aged ,Biomarkers ,Tumor ,Prognosis ,Adult ,Neoplasm Metastasis ,Aged ,80 and over - Abstract
PURPOSE: Minimal residual disease (MRD) detection can identify the recurrence in patients with colorectal cancer (CRC) following definitive treatment. We evaluated a plasma-only MRD assay to predict recurrence and survival in patients with metastatic CRC who underwent curative intent procedures (surgery and/or radiotherapy), with or without (neo)adjuvant chemotherapy. The primary objective of this study was to assess the correlation of postprocedure tumor cell-free DNA detection status with radiographic disease recurrence. EXPERIMENTAL DESIGN: Preprocedure and postprocedure longitudinal samples were collected from 53 patients and analyzed with a multiomic MRD assay detecting circulating tumor DNA (ctDNA) from genomic and epigenomic signals. Preprocedure and postprocedure ctDNA detection correlated with recurrence-free and overall survival (OS). RESULTS: From 52 patients, 230/233 samples were successfully analyzed. At the time of data cutoff, 36 (69.2%) patients recurred with median follow-up of 31 months. Detectable ctDNA was observed in 19/42 patients (45.2%) with ctDNA analyzed 3 weeks postprocedure. ctDNA detection 3 weeks postprocedure was associated with shorter median recurrence-free survival (RFS; HR, 5.27; 95% CI, 2.31-12.0; P < 0.0001) and OS (HR, 12.83; 95% CI, 3.6-45.9; P < 0.0001). Preprocedure ctDNA detection status was not associated with RFS but was associated with improved OS (HR, 4.65; 95% CI, 1.4-15.2; P = 0.0111). Undetectable ctDNA preprocedure had notable long-term OS, >90% 3 years postprocedure. CONCLUSIONS: In this cohort of oligometastatic CRC, detection of ctDNA preprocedure or postprocedure was associated with inferior outcomes even after accounting for known prognostic clinicopathologic variables. This suggests ctDNA may enhance current risk stratification methods helping the evaluation of novel treatments and surveillance strategies toward improving patient outcomes.
- Published
- 2024
37. A Phase II Trial of the WEE1 Inhibitor Adavosertib in SETD2-Altered Advanced Solid Tumor Malignancies (NCI 10170).
- Author
-
Maldonado, Edward, Rathmell, W, Shapiro, Geoffrey, Takebe, Naoko, Rodon, Jordi, Mahalingam, Devalingam, Trikalinos, Nikolaos, Kalebasty, Arash, Parikh, Mamta, Boerner, Scott, Balido, Celene, Krings, Gregor, Burns, Timothy, Bergsland, Emily, Munster, Pamela, Ashworth, Alan, LoRusso, Patricia, and Aggarwal, Rahul
- Subjects
Humans ,Middle Aged ,Histone-Lysine N-Methyltransferase ,Male ,Aged ,Female ,Protein-Tyrosine Kinases ,Cell Cycle Proteins ,Pyrazoles ,Neoplasms ,Carcinoma ,Renal Cell ,Kidney Neoplasms ,Pyrimidinones ,Mutation - Abstract
UNLABELLED: We sought to evaluate the efficacy of WEE1 inhibitor adavosertib in patients with solid tumor malignancies (cohort A) and clear cell renal cell carcinoma (ccRCC; cohort B). NCT03284385 was a parallel cohort, Simon two-stage, phase II study of adavosertib (300 mg QDAY by mouth on days 1-5 and 8-12 of each 21-day cycle) in patients with solid tumor malignancies harboring a pathogenic SETD2 mutation. The primary endpoint was the objective response rate. Correlative assays evaluated the loss of H3K36me3 by IHC, a downstream consequence of SETD2 loss, in archival tumor tissue. Eighteen patients were enrolled (9/cohort). The median age was 60 years (range 45-74). The median duration of treatment was 1.28 months (range 0-24+). No objective responses were observed in either cohort; accrual was halted following stage 1. Minor tumor regressions were observed in 4/18 (22%) evaluable patients. Stable disease (SD) was the best overall response in 10/18 (56%) patients, including three patients with SD > 4 months. One patient with ccRCC remains on treatment for >24 months. The most common adverse events of any grade were nausea (59%), anemia (41%), diarrhea (41%), and neutropenia (41%). Nine patients (50%) experienced a Grade ≥3 adverse event. Of eight evaluable archival tissue samples, six (75%) had a loss of H3K36me3 by IHC. Adavosertib failed to exhibit objective responses in SETD2-altered ccRCC and other solid tumor malignancies although prolonged SD was observed in a subset of patients. Combination approaches may yield greater depth of tumor response. SIGNIFICANCE: WEE1 inhibition with adavosertib monotherapy demonstrated limited clinical activity in patients with SETD2-altered solid tumors despite compelling preclinical data indicating a synthetic lethal effect, which did not translate into robust tumor regression. Loss of the H3K36me3 trimethylation mark caused by SETD2-deficiency was confirmed in the majority of evaluable tumors. A subset of patients derived clinical benefit as manifested by minor tumor regressions and prolonged SD.
- Published
- 2024
38. Controlling transport across artificial cell membranes
- Author
-
Parikh, Atul N
- Subjects
Biochemistry and Cell Biology ,Biological Sciences - Published
- 2024
39. Genome-wide association study identifies high-impact susceptibility loci for HCC in North America.
- Author
-
Hassan, Manal, Li, Donghui, Han, Younghun, Byun, Jinyoung, Hatia, Rikita, Long, Erping, Choi, Jiyeon, Kelley, Robin, Cleary, Sean, Lok, Anna, Bracci, Paige, Permuth, Jennifer, Bucur, Roxana, Yuan, Jian-Min, Singal, Amit, Jalal, Prasun, Ghobrial, R, Santella, Regina, Kono, Yuko, Shah, Dimpy, Nguyen, Mindie, Liu, Geoffrey, Parikh, Neehar, Kim, Richard, Wu, Hui-Chen, El-Serag, Hashem, Chang, Ping, Li, Yanan, Chun, Yun, Lee, Sunyoung, Gu, Jian, Hawk, Ernest, Sun, Ryan, Huff, Chad, Rashid, Asif, Amin, Hesham, Beretta, Laura, Wolff, Robert, Antwi, Samuel, Patt, Yehuda, Hwang, Lu-Yu, Klein, Alison, Zhang, Karen, Schmidt, Mikayla, White, Donna, Goss, John, Khaderi, Saira, Marrero, Jorge, Cigarroa, Francisco, Shah, Pankil, Kaseb, Ahmed, Roberts, Lewis, and Amos, Christopher
- Subjects
Humans ,Genome-Wide Association Study ,Liver Neoplasms ,Genetic Predisposition to Disease ,Carcinoma ,Hepatocellular ,Male ,Female ,Middle Aged ,North America ,Case-Control Studies ,Polymorphism ,Single Nucleotide ,Aged ,Genetic Loci ,White People - Abstract
BACKGROUND AND AIMS: Despite the substantial impact of environmental factors, individuals with a family history of liver cancer have an increased risk for HCC. However, genetic factors have not been studied systematically by genome-wide approaches in large numbers of individuals from European descent populations (EDP). APPROACH AND RESULTS: We conducted a 2-stage genome-wide association study (GWAS) on HCC not affected by HBV infections. A total of 1872 HCC cases and 2907 controls were included in the discovery stage, and 1200 HCC cases and 1832 controls in the validation. We analyzed the discovery and validation samples separately and then conducted a meta-analysis. All analyses were conducted in the presence and absence of HCV. The liability-scale heritability was 24.4% for overall HCC. Five regions with significant ORs (95% CI) were identified for nonviral HCC: 3p22.1, MOBP , rs9842969, (0.51, [0.40-0.65]); 5p15.33, TERT , rs2242652, (0.70, (0.62-0.79]); 19q13.11, TM6SF2 , rs58542926, (1.49, [1.29-1.72]); 19p13.11 MAU2 , rs58489806, (1.53, (1.33-1.75]); and 22q13.31, PNPLA3 , rs738409, (1.66, [1.51-1.83]). One region was identified for HCV-induced HCC: 6p21.31, human leukocyte antigen DQ beta 1, rs9275224, (0.79, [0.74-0.84]). A combination of homozygous variants of PNPLA3 and TERT showing a 6.5-fold higher risk for nonviral-related HCC compared to individuals lacking these genotypes. This observation suggests that gene-gene interactions may identify individuals at elevated risk for developing HCC. CONCLUSIONS: Our GWAS highlights novel genetic susceptibility of nonviral HCC among European descent populations from North America with substantial heritability. Selected genetic influences were observed for HCV-positive HCC. Our findings indicate the importance of genetic susceptibility to HCC development.
- Published
- 2024
40. ACR: A Benchmark for Automatic Cohort Retrieval
- Author
-
Thai, Dung Ngoc, Ardulov, Victor, Mena, Jose Ulises, Tiwari, Simran, Erofeev, Gleb, Eskander, Ramy, Tarabishy, Karim, Parikh, Ravi B, and Salloum, Wael
- Subjects
Computer Science - Artificial Intelligence - Abstract
Identifying patient cohorts is fundamental to numerous healthcare tasks, including clinical trial recruitment and retrospective studies. Current cohort retrieval methods in healthcare organizations rely on automated queries of structured data combined with manual curation, which are time-consuming, labor-intensive, and often yield low-quality results. Recent advancements in large language models (LLMs) and information retrieval (IR) offer promising avenues to revolutionize these systems. Major challenges include managing extensive eligibility criteria and handling the longitudinal nature of unstructured Electronic Medical Records (EMRs) while ensuring that the solution remains cost-effective for real-world application. This paper introduces a new task, Automatic Cohort Retrieval (ACR), and evaluates the performance of LLMs and commercial, domain-specific neuro-symbolic approaches. We provide a benchmark task, a query dataset, an EMR dataset, and an evaluation framework. Our findings underscore the necessity for efficient, high-quality ACR systems capable of longitudinal reasoning across extensive patient databases.
- Published
- 2024
41. EchoGuide: Active Acoustic Guidance for LLM-Based Eating Event Analysis from Egocentric Videos
- Author
-
Parikh, Vineet, Mahmud, Saif, Agarwal, Devansh, Li, Ke, Guimbretière, François, and Zhang, Cheng
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Self-recording eating behaviors is a step towards a healthy lifestyle recommended by many health professionals. However, the current practice of manually recording eating activities using paper records or smartphone apps is often unsustainable and inaccurate. Smart glasses have emerged as a promising wearable form factor for tracking eating behaviors, but existing systems primarily identify when eating occurs without capturing details of the eating activities (E.g., what is being eaten). In this paper, we present EchoGuide, an application and system pipeline that leverages low-power active acoustic sensing to guide head-mounted cameras to capture egocentric videos, enabling efficient and detailed analysis of eating activities. By combining active acoustic sensing for eating detection with video captioning models and large-scale language models for retrieval augmentation, EchoGuide intelligently clips and analyzes videos to create concise, relevant activity records on eating. We evaluated EchoGuide with 9 participants in naturalistic settings involving eating activities, demonstrating high-quality summarization and significant reductions in video data needed, paving the way for practical, scalable eating activity tracking., Comment: Accepted at ISWC '24
- Published
- 2024
42. Development and Validation of a Deep-Learning Model for Differential Treatment Benefit Prediction for Adults with Major Depressive Disorder Deployed in the Artificial Intelligence in Depression Medication Enhancement (AIDME) Study
- Author
-
Benrimoh, David, Armstrong, Caitrin, Mehltretter, Joseph, Fratila, Robert, Perlman, Kelly, Israel, Sonia, Kapelner, Adam, Parikh, Sagar V., Karp, Jordan F., Heller, Katherine, and Turecki, Gustavo
- Subjects
Quantitative Biology - Neurons and Cognition ,Computer Science - Machine Learning - Abstract
INTRODUCTION: The pharmacological treatment of Major Depressive Disorder (MDD) relies on a trial-and-error approach. We introduce an artificial intelligence (AI) model aiming to personalize treatment and improve outcomes, which was deployed in the Artificial Intelligence in Depression Medication Enhancement (AIDME) Study. OBJECTIVES: 1) Develop a model capable of predicting probabilities of remission across multiple pharmacological treatments for adults with at least moderate major depression. 2) Validate model predictions and examine them for amplification of harmful biases. METHODS: Data from previous clinical trials of antidepressant medications were standardized into a common framework and included 9,042 adults with moderate to severe major depression. Feature selection retained 25 clinical and demographic variables. Using Bayesian optimization, a deep learning model was trained on the training set, refined using the validation set, and tested once on the held-out test set. RESULTS: In the evaluation on the held-out test set, the model demonstrated achieved an AUC of 0.65. The model outperformed a null model on the test set (p = 0.01). The model demonstrated clinical utility, achieving an absolute improvement in population remission rate in hypothetical and actual improvement testing. While the model did identify one drug (escitalopram) as generally outperforming the other drugs (consistent with the input data), there was otherwise significant variation in drug rankings. On bias testing, the model did not amplify potentially harmful biases. CONCLUSIONS: We demonstrate the first model capable of predicting outcomes for 10 different treatment options for patients with MDD, intended to be used at or near the start of treatment to personalize treatment. The model was put into clinical practice during the AIDME randomized controlled trial whose results are reported separately.
- Published
- 2024
43. RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
- Author
-
Nasiriany, Soroush, Maddukuri, Abhiram, Zhang, Lance, Parikh, Adeet, Lo, Aaron, Joshi, Abhishek, Mandlekar, Ajay, and Zhu, Yuke
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Recent advancements in Artificial Intelligence (AI) have largely been propelled by scaling. In Robotics, scaling is hindered by the lack of access to massive robot datasets. We advocate using realistic physical simulation as a means to scale environments, tasks, and datasets for robot learning methods. We present RoboCasa, a large-scale simulation framework for training generalist robots in everyday environments. RoboCasa features realistic and diverse scenes focusing on kitchen environments. We provide thousands of 3D assets across over 150 object categories and dozens of interactable furniture and appliances. We enrich the realism and diversity of our simulation with generative AI tools, such as object assets from text-to-3D models and environment textures from text-to-image models. We design a set of 100 tasks for systematic evaluation, including composite tasks generated by the guidance of large language models. To facilitate learning, we provide high-quality human demonstrations and integrate automated trajectory generation methods to substantially enlarge our datasets with minimal human burden. Our experiments show a clear scaling trend in using synthetically generated robot data for large-scale imitation learning and show great promise in harnessing simulation data in real-world tasks. Videos and open-source code are available at https://robocasa.ai/, Comment: RSS 2024
- Published
- 2024
44. DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation
- Author
-
Wu, Yinjun, Keoliya, Mayank, Chen, Kan, Velingker, Neelay, Li, Ziyang, Getzen, Emily J, Long, Qi, Naik, Mayur, Parikh, Ravi B, and Wong, Eric
- Subjects
Computer Science - Machine Learning ,Statistics - Methodology - Abstract
Designing faithful yet accurate AI models is challenging, particularly in the field of individual treatment effect estimation (ITE). ITE prediction models deployed in critical settings such as healthcare should ideally be (i) accurate, and (ii) provide faithful explanations. However, current solutions are inadequate: state-of-the-art black-box models do not supply explanations, post-hoc explainers for black-box models lack faithfulness guarantees, and self-interpretable models greatly compromise accuracy. To address these issues, we propose DISCRET, a self-interpretable ITE framework that synthesizes faithful, rule-based explanations for each sample. A key insight behind DISCRET is that explanations can serve dually as database queries to identify similar subgroups of samples. We provide a novel RL algorithm to efficiently synthesize these explanations from a large search space. We evaluate DISCRET on diverse tasks involving tabular, image, and text data. DISCRET outperforms the best self-interpretable models and has accuracy comparable to the best black-box models while providing faithful explanations. DISCRET is available at https://github.com/wuyinjun-1993/DISCRET-ICML2024., Comment: Accepted at ICML 2024. 22 pages, 5 figures
- Published
- 2024
45. Analysing the Public Discourse around OpenAI's Text-To-Video Model 'Sora' using Topic Modeling
- Author
-
Parikh, Vatsal Vinay
- Subjects
Computer Science - Computers and Society ,Computer Science - Computation and Language ,Computer Science - Information Retrieval ,Computer Science - Machine Learning ,Computer Science - Social and Information Networks - Abstract
The recent introduction of OpenAI's text-to-video model Sora has sparked widespread public discourse across online communities. This study aims to uncover the dominant themes and narratives surrounding Sora by conducting topic modeling analysis on a corpus of 1,827 Reddit comments from five relevant subreddits (r/OpenAI, r/technology, r/singularity, r/vfx, and r/ChatGPT). The comments were collected over a two-month period following Sora's announcement in February 2024. After preprocessing the data, Latent Dirichlet Allocation (LDA) was employed to extract four key topics: 1) AI Impact and Trends in Sora Discussions, 2) Public Opinion and Concerns about Sora, 3) Artistic Expression and Video Creation with Sora, and 4) Sora's Applications in Media and Entertainment. Visualizations including word clouds, bar charts, and t-SNE clustering provided insights into the importance of topic keywords and the distribution of comments across topics. The results highlight prominent narratives around Sora's potential impact on industries and employment, public sentiment and ethical concerns, creative applications, and use cases in the media and entertainment sectors. While limited to Reddit data within a specific timeframe, this study offers a framework for understanding public perceptions of emerging generative AI technologies through online discourse analysis.
- Published
- 2024
46. Thinking Forward: Memory-Efficient Federated Finetuning of Language Models
- Author
-
Panchal, Kunjal, Parikh, Nisarg, Choudhary, Sunav, Zhang, Lijun, Brun, Yuriy, and Guan, Hui
- Subjects
Computer Science - Machine Learning - Abstract
Finetuning large language models (LLMs) in federated learning (FL) settings has become increasingly important as it allows resource-constrained devices to finetune a model using private data. However, finetuning LLMs using backpropagation requires excessive memory (especially from intermediate activations) for resource-constrained devices. While Forward-mode Auto-Differentiation (AD) can significantly reduce memory footprint from activations, we observe that directly applying it to LLM finetuning results in slow convergence and poor accuracy. In this paper, we introduce Spry, an FL algorithm that splits trainable weights of an LLM among participating clients, such that each client computes gradients using forward-mode AD that are closer estimations of the true gradients. Spry achieves a low memory footprint, high accuracy, and fast convergence. We formally prove that the global gradients in Spry are unbiased estimators of true global gradients for homogeneous data distributions across clients, while heterogeneity increases bias of the estimates. We also derive Spry's convergence rate, showing that the gradients decrease inversely proportional to the number of FL rounds, indicating the convergence up to the limits of heterogeneity. Empirically, Spry reduces the memory footprint during training by 1.4-7.1x in contrast to backpropagation, while reaching comparable accuracy, across a wide range of language tasks, models, and FL settings. Spry reduces the convergence time by 1.2-20.3x and achieves 5.2-13.5% higher accuracy against zero-order methods. When finetuning Llama2-7B with LoRA, compared to the peak memory consumption of 33.9GB of backpropagation, Spry only consumes 6.2GB of peak memory. For OPT13B, the reduction is from 76.5GB to 10.8GB. Spry makes feasible previously impossible FL deployments on commodity edge devices. Our source code is available at https://github.com/Astuary/Spry., Comment: Accepted to NeurIPS 2024
- Published
- 2024
47. Fully automated construction of three-dimensional finite element simulations from Optical Coherence Tomography
- Author
-
Straughan, Ross, Kadry, Karim, Parikh, Sahil A., Edelman, Elazer R., and Nezami, Farhad R.
- Subjects
Computer Science - Computational Engineering, Finance, and Science ,Quantitative Biology - Tissues and Organs - Abstract
Despite recent advances in diagnosis and treatment, atherosclerotic coronary artery diseases remain a leading cause of death worldwide. Various imaging modalities and metrics can detect lesions and predict patients at risk; however, identifying unstable lesions is still difficult. Current techniques cannot fully capture the complex morphology-modulated mechanical responses that affect plaque stability, leading to catastrophic failure and mute the benefit of device and drug interventions. Finite Element (FE) simulations utilizing intravascular imaging OCT (Optical Coherence Tomography) are effective in defining physiological stress distributions. However, creating 3D FE simulations of coronary arteries from OCT images is challenging to fully automate given OCT frame sparsity, limited material contrast, and restricted penetration depth. To address such limitations, we developed an algorithmic approach to automatically produce 3D FE-ready digital twins from labeled OCT images. The 3D models are anatomically faithful and recapitulate mechanically relevant tissue lesion components, automatically producing morphologies structurally similar to manually constructed models whilst including more minute details. A mesh convergence study highlighted the ability to reach stress and strain convergence with average errors of just 5.9% and 1.6% respectively in comparison to FE models with approximately twice the number of elements in areas of refinement. Such an automated procedure will enable analysis of large clinical cohorts at a previously unattainable scale and opens the possibility for in-silico methods for patient specific diagnoses and treatment planning for coronary artery disease.
- Published
- 2024
- Full Text
- View/download PDF
48. Targeted Multilingual Adaptation for Low-resource Language Families
- Author
-
Downey, C. M., Blevins, Terra, Serai, Dhwani, Parikh, Dwija, and Steinert-Threlkeld, Shane
- Subjects
Computer Science - Computation and Language - Abstract
The "massively-multilingual" training of multilingual models is known to limit their utility in any one language, and they perform particularly poorly on low-resource languages. However, there is evidence that low-resource languages can benefit from targeted multilinguality, where the model is trained on closely related languages. To test this approach more rigorously, we systematically study best practices for adapting a pre-trained model to a language family. Focusing on the Uralic family as a test case, we adapt XLM-R under various configurations to model 15 languages; we then evaluate the performance of each experimental setting on two downstream tasks and 11 evaluation languages. Our adapted models significantly outperform mono- and multilingual baselines. Furthermore, a regression analysis of hyperparameter effects reveals that adapted vocabulary size is relatively unimportant for low-resource languages, and that low-resource languages can be aggressively up-sampled during training at little detriment to performance in high-resource languages. These results introduce new best practices for performing language adaptation in a targeted setting.
- Published
- 2024
49. Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance
- Author
-
Bhattacharya, Anish, Rao, Nishanth, Parikh, Dhruv, Kunapuli, Pratik, Wu, Yuwei, Tao, Yuezhan, Matni, Nikolai, and Kumar, Vijay
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
We demonstrate the capabilities of an attention-based end-to-end approach for high-speed vision-based quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art learning architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional model-based approaches to navigation via independent perception, mapping, planning, and control modules breaks down due to increased sensor noise, compounding errors, and increased processing latency. Thus, learning-based, end-to-end vision-to-control networks have shown to have great potential for online control of these fast robots through cluttered environments. We train and compare convolutional, U-Net, and recurrent architectures against vision transformer (ViT) models for depth image-to-control in high-fidelity simulation, observing that ViT models are more effective than others as quadrotor speeds increase and in generalization to unseen environments, while the addition of recurrence further improves performance while reducing quadrotor energy cost across all tested flight speeds. We assess performance at speeds of up to 7m/s in simulation and hardware. To the best of our knowledge, this is the first work to utilize vision transformers for end-to-end vision-based quadrotor control., Comment: 11 pages, 18 figures, 3 tables (with supplementary)
- Published
- 2024
50. Transfer-LMR: Heavy-Tail Driving Behavior Recognition in Diverse Traffic Scenarios
- Author
-
Parikh, Chirag, Mishra, Ravi Shankar, Chandra, Rohan, and Sarvadevabhatla, Ravi Kiran
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recognizing driving behaviors is important for downstream tasks such as reasoning, planning, and navigation. Existing video recognition approaches work well for common behaviors (e.g. "drive straight", "brake", "turn left/right"). However, the performance is sub-par for underrepresented/rare behaviors typically found in tail of the behavior class distribution. To address this shortcoming, we propose Transfer-LMR, a modular training routine for improving the recognition performance across all driving behavior classes. We extensively evaluate our approach on METEOR and HDD datasets that contain rich yet heavy-tailed distribution of driving behaviors and span diverse traffic scenarios. The experimental results demonstrate the efficacy of our approach, especially for recognizing underrepresented/rare driving behaviors.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.