105,896 results on '"Ramachandran, A"'
Search Results
2. Shakespeare and the Ethics of the Global: An Interview with Preti Taneja
- Author
-
Campana, Joseph, Ramachandran, Ayesha, and Taneja, Preti
- Published
- 2024
- Full Text
- View/download PDF
3. Spatial and Scalar Multitudes: Thinking with World, Globe, and Planet
- Author
-
Campana, Joseph and Ramachandran, Ayesha
- Published
- 2024
- Full Text
- View/download PDF
4. Athlete and Auxologist: James Tanner's Interest in Physique and his Role in Establishing the Short Stature of Children as a Medical Classification
- Author
-
Ramachandran, Aishwarya and Vertinsky, Patricia
- Published
- 2023
- Full Text
- View/download PDF
5. Lines of Beauty: In Honor of David Quint
- Author
-
Ramachandran, Ayesha, Rubini, Rocco, and Van der Laan, Sarah
- Published
- 2021
- Full Text
- View/download PDF
6. Allegories of Influence: Spenser, Chaucer and Italian Romance
- Author
-
Ramachandran, Ayesha
- Published
- 2021
- Full Text
- View/download PDF
7. BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning
- Author
-
Fu, Yujuan Velvin, Ramachandran, Giridhar Kaushik, Park, Namu, Lybarger, Kevin, Xia, Fei, Uzuner, Ozlem, and Yetisgen, Meliha
- Subjects
Computer Science - Computation and Language - Abstract
Large language models (LLMs) such as ChatGPT are fine-tuned on large and diverse instruction-following corpora, and can generalize to new tasks. However, those instruction-tuned LLMs often perform poorly in specialized medical natural language understanding (NLU) tasks that require domain knowledge, granular text comprehension, and structured data extraction. To bridge the gap, we: (1) propose a unified prompting format for 7 important NLU tasks, % through span extraction and multi-choice question-answering (QA), (2) curate an instruction-tuning dataset, MNLU-Instruct, utilizing diverse existing open-source medical NLU corpora, and (3) develop BioMistral-NLU, a generalizable medical NLU model, through fine-tuning BioMistral on MNLU-Instruct. We evaluate BioMistral-NLU in a zero-shot setting, across 6 important NLU tasks, from two widely adopted medical NLU benchmarks: Biomedical Language Understanding Evaluation (BLUE) and Biomedical Language Understanding and Reasoning Benchmark (BLURB). Our experiments show that our BioMistral-NLU outperforms the original BioMistral, as well as the proprietary LLMs - ChatGPT and GPT-4. Our dataset-agnostic prompting strategy and instruction tuning step over diverse NLU tasks enhance LLMs' generalizability across diverse medical NLU tasks. Our ablation experiments show that instruction-tuning on a wider variety of tasks, even when the total number of training instances remains constant, enhances downstream zero-shot generalization., Comment: 3 figures an 5 tables
- Published
- 2024
8. POD-Attention: Unlocking Full Prefill-Decode Overlap for Faster LLM Inference
- Author
-
Kamath, Aditya K, Prabhu, Ramya, Mohan, Jayashree, Peter, Simon, Ramjee, Ramachandran, and Panwar, Ashish
- Subjects
Computer Science - Machine Learning ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Each request in LLM inference goes through two phases: compute-bound prefill and memory-bandwidth-bound decode. To improve GPU utilization, recent systems use hybrid batching that combines the prefill and decode phases of different requests into the same batch. Hybrid batching works well for linear operations as it amortizes the cost of loading model weights from HBM. However, attention computation in hybrid batches remains inefficient because existing attention kernels are optimized for either prefill or decode. In this paper, we present POD-Attention -- the first GPU kernel that efficiently computes attention for hybrid batches. POD-Attention aims to maximize the utilization of both compute and memory bandwidth by carefully allocating the GPU's resources such that prefill and decode operations happen concurrently on the same multiprocessor. We integrate POD-Attention in a state-of-the-art LLM inference scheduler Sarathi-Serve. POD-Attention speeds up attention computation by up to 75% (mean 28%) and increases LLM serving throughput by up to 22% in offline inference. In online inference, POD-Attention enables lower time-to-first-token (TTFT), time-between-tokens (TBT), and request execution latency versus Sarathi-Serve.
- Published
- 2024
9. Structure, dynamics and phase transitions in electric field assembled colloidal crystals and glasses
- Author
-
Barros, Indira, Ramachandran, Sayanth, and Chakraborty, Indrani
- Subjects
Condensed Matter - Soft Condensed Matter ,Condensed Matter - Materials Science - Abstract
Field-induced assembly of colloidal particles into structures of desired configurations is extremely relevant from the viewpoint of producing field-assembled micro-swimmers and reconfigurable smart materials. However, the behaviour of colloidal particles under the influence of alternating current (AC) electric fields remains a topic of ongoing investigation due to the complex effects of various control parameters. Here we examine the role of several factors including particle size, zeta potential, voltage and frequency of the applied field in the formation of different structural configurations ranging from crystals to glasses, and observe interesting and unexpected behaviours in the structure formation. Additionally, we investigate the dynamics of structure formation; the nature of diffusion and active motion in these out-of-equilibrium systems, and show how that leads to the formation of close-packed or open structures. Lastly, we investigate the frequency-driven disorder-order-disorder phase transition in colloidal crystals, which is a starting point for building reconfigurable systems. Our findings contribute to a deeper understanding of the significant roles of various factors in electric field-induced assembly of colloidal particles, as well as pave the way for potential applications in micro-robotics and next-generation materials., Comment: 10 pages, 7 figures
- Published
- 2024
10. ZK-DPPS: A Zero-Knowledge Decentralised Data Sharing and Processing Middleware
- Author
-
Jabbari, Amir, Ramachandran, Gowri, Malik, Sidra, and Jurdak, Raja
- Subjects
Computer Science - Cryptography and Security - Abstract
In the current digital landscape, supply chains have transformed into complex networks driven by the Internet of Things (IoT), necessitating enhanced data sharing and processing capabilities to ensure traceability and transparency. Leveraging Blockchain technology in IoT applications advances reliability and transparency in near-real-time insight extraction processes. However, it raises significant concerns regarding data privacy. Existing privacy-preserving approaches often rely on Smart Contracts for automation and Zero Knowledge Proofs (ZKP) for privacy. However, apart from being inflexible in adopting system changes while effectively protecting data confidentiality, these approaches introduce significant computational expenses and overheads that make them impractical for dynamic supply chain environments. To address these challenges, we propose ZK-DPPS, a framework that ensures zero-knowledge communications without the need for traditional ZKPs. In ZK-DPPS, privacy is preserved through a combination of Fully Homomorphic Encryption (FHE) for computations and Secure Multi-Party Computations (SMPC) for key reconstruction. To ensure that the raw data remains private throughout the entire process, we use FHE to execute computations directly on encrypted data. The "zero-knowledge" aspect of ZK-DPPS refers to the system's ability to process and share data insights without exposing sensitive information, thus offering a practical and efficient alternative to ZKP-based methods. We demonstrate the efficacy of ZK-DPPS through a simulated supply chain scenario, showcasing its ability to tackle the dual challenges of privacy preservation and computational trust in decentralised environments.
- Published
- 2024
11. On the order of Brauer classes capturing Brauer-Manin Obstruction
- Author
-
Biswas, Mridul, Ramachandran, Divyasree C, and Samanta, Biswanath
- Subjects
Mathematics - Number Theory ,Mathematics - Algebraic Geometry ,Primary 14G05, Secondary 14G12, 11G35, 14J20, 11G25, 14F22 - Abstract
Let $X$ be a smooth projective variety defined over a number field $k$ that has Brauer-Manin obstruction to the existence of rational points. We study the orders of the Brauer classes which capture the obstruction. We prove that one may require elements of arbitrarily large order to capture the obstruction. In other words, the exponent of the group which captures the obstruction has no upper bound., Comment: 8 pages. Comments are welcome
- Published
- 2024
12. First constraints on general neutrino interactions based on KATRIN data
- Author
-
Aker, M., Batzler, D., Beglarian, A., Beisenkötter, J., Biassoni, M., Bieringer, B., Biondi, Y., Block, F., Bornschein, B., Bornschein, L., Böttcher, M., Carminati, M., Chatrabhuti, A., Chilingaryan, S., Daniel, B. A., Descher, M., Barrero, D. Díaz, Doe, P. J., Dragoun, O., Drexlin, G., Edzards, F., Eitel, K., Ellinger, E., Engel, R., Enomoto, S., Felden, A., Fengler, C., Fiorini, C., Formaggio, J. A., Forstner, C., Fränkle, F. M., Gagliardi, G., Gauda, K., Gavin, A. S., Gil, W., Glück, F., Grössle, R., Gutknecht, N., Hannen, V., Hasselmann, L., Helbing, K., Henke, H., Heyns, S., Hiller, R., Hillesheimer, D., Hinz, D., Höhn, T., Huber, A., Jansen, A., Khosonthongkee, K., Köhler, C., Köllenberger, L., Kopmann, A., Kovač, N., La Cascio, L., Lasserre, T., Lauer, J., Le, T. L., Lebeda, O., Lehnert, B., Li, G., Lokhov, A., Machatschek, M., Mark, M., Marsteller, A., McMichael, K., Melzer, C., Mertens, S., Mohanty, S., Mostafa, J., Müller, K., Nava, A., Neumann, H., Niemes, S., Onillon, A., Parno, D. S., Pavan, M., Pinsook, U., Poon, A. W. P., Poyato, J. M. L., Priester, F., Ráliš, J., Ramachandran, S., Robertson, R. G. H., Rodenbeck, C., Röllig, M., Sack, R., Saenz, A., Salomon, R., Schäfer, P., Schlösser, K., Schlösser, M., Schlüter, L., Schneidewind, S., Schrank, M., Schürmann, J., Schütz, A. K., Schwemmer, A., Schwenck, A., Šefčík, M., Siegmann, D., Simon, F., Songwadhana, J., Spanier, F., Spreng, D., Sreethawong, W., Steidl, M., Štorek, J., Stribl, X., Sturm, M., Suwonjandee, N., Jerome, N. Tan, Telle, H. H., Thorne, L. A., Thümmler, T., Titov, N., Tkachev, I., Urban, K., Valerius, K., Vénos, D., Weinheimer, C., Welte, S., Wendel, J., Wetter, M., Wiesinger, C., Wilkerson, J. F., Wolf, J., Wüstling, S., Wydra, J., Xu, W., Zadorozhny, S., and Zeller, G.
- Subjects
Nuclear Experiment ,High Energy Physics - Experiment ,High Energy Physics - Phenomenology - Abstract
The precision measurement of the tritium $\beta$-decay spectrum performed by the KATRIN experiment provides a unique way to search for general neutrino interactions (GNI). All theoretical allowed GNI terms involving neutrinos are incorporated into a low-energy effective field theory, and can be identified by specific signatures in the measured tritium $\beta$-spectrum. In this paper an effective description of the impact of GNI on the $\beta$-spectrum is formulated and the first constraints on the effective GNI parameters are derived based on the 4 Mio. electrons collected in the second measurement campaign of KATRIN in 2019. In addition, constraints on selected types of interactions are investigated, thereby exploring the potential of KATRIN to search for more specific new physics cases, including a right-handed W boson, a charged Higgs or leptoquarks.
- Published
- 2024
13. Magnetically Tuned Metal-Insulator Transition in LaAlO$_3$/SrTiO$_3$ Nanowire Arrays
- Author
-
Ramachandran, Ranjani, Anand, Shashank, Eom, Kitae, Lee, Kyoungjun, Yang, Dengyu, Yu, Muqing, Biswas, Sayanwita, Nethwewala, Aditi, Eom, Chang-Beom, Carlson, Erica, Irvin, Patrick, and Levy, Jeremy
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Strongly Correlated Electrons ,Condensed Matter - Superconductivity - Abstract
A wide family of two dimensional (2D) systems, including stripe-phase superconductors, sliding Luttinger liquids, and anisotropic 2D materials, can be modeled by an array of coupled one-dimensional (1D) electron channels or nanowire arrays. Here we report experiments in arrays of conducting nanowires with gate and field tunable interwire coupling, that are programmed at the LaAlO$_3$/SrTiO$_3$ interface. We find a magnetically-tuned metal-to-insulator transition in which the transverse resistance of the nanowire array increases by up to four orders of magnitude. To explain this behavior, we develop a minimal model of a coupled two-wire system which agrees well with observed phenomena. These nanowire arrays can serve as a model systems to understand the origin of exotic behavior in correlated materials via analog quantum simulation.
- Published
- 2024
14. Electric and Magnetic Field-dependent Tunneling between Coupled Nanowires
- Author
-
Anand, Shashank, Ramachandran, Ranjani, Eom, Kitae, Lee, Kyoungjun, Yang, Dengyu, Yu, Muqing, Biswas, Sayanwita, Nethwewala, Aditi, Eom, Chang-Beom, Carlson, Erica, Irvin, Patrick, and Levy, Jeremy
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Strongly Correlated Electrons ,Condensed Matter - Superconductivity - Abstract
Coupled quasi-one-dimensional (quasi-1D) electron systems host rich emergent physics that cannot be accounted for by understanding isolated 1D electron systems alone. Open questions remain about how transport in these arrays can be manipulated by the application of external electric and magnetic fields. In this theoretical study, we consider a pair of coupled nanowires with non-interacting electrons. We find that a metal-insulator transition is induced by an out-of-plane magnetic field and a transverse potential bias on an array of such coupled wires. We demonstrate the existence of distinct conductance features and highlight the crucial role played by the field dependence of the interwire potential barrier on transport properties. These predictions agree well with transport experiments performed on coupled nanowires sketched on an LaAlO3/SrTiO3 interface. Since our model makes minimal assumptions, we expect our predictions to hold for a wide class of coupled 1D systems.
- Published
- 2024
15. Precision Thermodynamics of the Fermi polaron at strong coupling
- Author
-
Ramachandran, S., Jensen, S., and Alhassid, Y.
- Subjects
Condensed Matter - Quantum Gases ,Nuclear Theory - Abstract
The Fermi polaron problem, which describes a mobile impurity that interacts with a spin-polarized Fermi sea, is a paradigmatic system in quantum many-body physics and has been challenging to address quantitatively in its strong coupling regime. We present the first controlled thermodynamic calculations for the Fermi polaron at strong coupling using finite-temperature auxiliary-field quantum Monte Carlo (AFMC) methods in the framework of the canonical ensemble. Modeled as a spin-imbalanced system, the Fermi polaron has a Monte Carlo sign problem, but we show that it is moderate over a wide range of temperatures and coupling strengths beyond the unitary limit of the BCS-BEC crossover. We calculate the contact, a quantity which measures the strength of the short-range correlations, as a function of temperature at unitarity and as a function of the coupling strength at fixed temperature and find good agreement with a variational approach based on one particle-hole excitation of the Fermi sea. We compare our results for the contact with recent experiments and find good agreement at unitarity (within error bars) but discrepancies away from unitarity on the BEC side of the crossover. We also calculate the thermal energy gap at unitarity as a function of temperature., Comment: 8 pages, 7 figures
- Published
- 2024
16. AI Foundation Model for Heliophysics: Applications, Design, and Implementation
- Author
-
Roy, Sujit, Singh, Talwinder, Freitag, Marcus, Schmude, Johannes, Lal, Rohit, Hegde, Dinesha, Ranjan, Soumya, Lin, Amy, Gaur, Vishal, Vos, Etienne Eben, Ghosal, Rinki, Patro, Badri Narayana, Aydin, Berkay, Pogorelov, Nikolai, Moreno, Juan Bernabe, Maskey, Manil, and Ramachandran, Rahul
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Instrumentation and Methods for Astrophysics ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Deep learning-based methods have been widely researched in the areas of language and vision, demonstrating their capacity to understand long sequences of data and their usefulness in numerous helio-physics applications. Foundation models (FMs), which are pre-trained on a large-scale datasets, form the basis for a variety of downstream tasks. These models, especially those based on transformers in vision and language, show exceptional potential for adapting to a wide range of downstream applications. In this paper, we provide our perspective on the criteria for designing an FM for heliophysics and associated challenges and applications using the Solar Dynamics Observatory (SDO) dataset. We believe that this is the first study to design an FM in the domain of heliophysics., Comment: 31 Pages, 12 figures
- Published
- 2024
17. ASTRA: Accurate and Scalable ANNS-based Training of Extreme Classifiers
- Author
-
Mehta, Sonu, Mohan, Jayashree, Natarajan, Nagarajan, Ramjee, Ramachandran, and Varma, Manik
- Subjects
Computer Science - Machine Learning ,Computer Science - Information Retrieval - Abstract
`Extreme Classification'' (or XC) is the task of annotating data points (queries) with relevant labels (documents), from an extremely large set of $L$ possible labels, arising in search and recommendations. The most successful deep learning paradigm that has emerged over the last decade or so for XC is to embed the queries (and labels) using a deep encoder (e.g. DistilBERT), and use linear classifiers on top of the query embeddings. This architecture is of appeal because it enables millisecond-time inference using approximate nearest neighbor search (ANNS). The key question is how do we design training algorithms that are accurate as well as scale to $O(100M)$ labels on a limited number of GPUs. State-of-the-art XC techniques that demonstrate high accuracies (e.g., DEXML, Ren\'ee, DEXA) on standard datasets have per-epoch training time that scales as $O(L)$ or employ expensive negative sampling strategies, which are prohibitive in XC scenarios. In this work, we develop an accurate and scalable XC algorithm ASTRA with two key observations: (a) building ANNS index on the classifier vectors and retrieving hard negatives using the classifiers aligns the negative sampling strategy to the loss function optimized; (b) keeping the ANNS indices current as the classifiers change through the epochs is prohibitively expensive while using stale negatives (refreshed periodically) results in poor accuracy; to remedy this, we propose a negative sampling strategy that uses a mixture of importance sampling and uniform sampling. By extensive evaluation on standard XC as well as proprietary datasets with 120M labels, we demonstrate that ASTRA achieves SOTA precision, while reducing training time by 4x-15x relative to the second best.
- Published
- 2024
18. Mnemosyne: Parallelization Strategies for Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations
- Author
-
Agrawal, Amey, Chen, Junda, Goiri, Íñigo, Ramjee, Ramachandran, Zhang, Chaojie, Tumanov, Alexey, and Choukse, Esha
- Subjects
Computer Science - Machine Learning ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
As large language models (LLMs) evolve to handle increasingly longer contexts, serving inference requests for context lengths in the range of millions of tokens presents unique challenges. While existing techniques are effective for training, they fail to address the unique challenges of inference, such as varying prefill and decode phases and their associated latency constraints - like Time to First Token (TTFT) and Time Between Tokens (TBT). Furthermore, there are no long context inference solutions that allow batching requests to increase the hardware utilization today. In this paper, we propose three key innovations for efficient interactive long context LLM inference, without resorting to any approximation: adaptive chunking to reduce prefill overheads in mixed batching, Sequence Pipeline Parallelism (SPP) to lower TTFT, and KV Cache Parallelism (KVP) to minimize TBT. These contributions are combined into a 3D parallelism strategy, enabling Mnemosyne to scale interactive inference to context lengths at least up to 10 million tokens with high throughput enabled with batching. To our knowledge, Mnemosyne is the first to be able to achieve support for 10 million long context inference efficiently, while satisfying production-grade SLOs on TBT (30ms) on contexts up to and including 10 million.
- Published
- 2024
19. Visualizing Dynamics of Charges and Strings in (2+1)D Lattice Gauge Theories
- Author
-
Cochran, Tyler A., Jobst, Bernhard, Rosenberg, Eliott, Lensky, Yuri D., Gyawali, Gaurav, Eassa, Norhan, Will, Melissa, Abanin, Dmitry, Acharya, Rajeev, Beni, Laleh Aghababaie, Andersen, Trond I., Ansmann, Markus, Arute, Frank, Arya, Kunal, Asfaw, Abraham, Atalaya, Juan, Babbush, Ryan, Ballard, Brian, Bardin, Joseph C., Bengtsson, Andreas, Bilmes, Alexander, Bourassa, Alexandre, Bovaird, Jenna, Broughton, Michael, Browne, David A., Buchea, Brett, Buckley, Bob B., Burger, Tim, Burkett, Brian, Bushnell, Nicholas, Cabrera, Anthony, Campero, Juan, Chang, Hung-Shen, Chen, Zijun, Chiaro, Ben, Claes, Jahan, Cleland, Agnetta Y., Cogan, Josh, Collins, Roberto, Conner, Paul, Courtney, William, Crook, Alexander L., Curtin, Ben, Das, Sayan, Demura, Sean, De Lorenzo, Laura, Di Paolo, Agustin, Donohoe, Paul, Drozdov, Ilya, Dunsworth, Andrew, Eickbusch, Alec, Elbag, Aviv Moshe, Elzouka, Mahmoud, Erickson, Catherine, Ferreira, Vinicius S., Burgos, Leslie Flores, Forati, Ebrahim, Fowler, Austin G., Foxen, Brooks, Ganjam, Suhas, Gasca, Robert, Genois, Élie, Giang, William, Gilboa, Dar, Gosula, Raja, Dau, Alejandro Grajales, Graumann, Dietrich, Greene, Alex, Gross, Jonathan A., Habegger, Steve, Hansen, Monica, Harrigan, Matthew P., Harrington, Sean D., Heu, Paula, Higgott, Oscar, Hilton, Jeremy, Huang, Hsin-Yuan, Huff, Ashley, Huggins, William J., Jeffrey, Evan, Jiang, Zhang, Jones, Cody, Joshi, Chaitali, Juhas, Pavol, Kafri, Dvir, Kang, Hui, Karamlou, Amir H., Kechedzhi, Kostyantyn, Khaire, Trupti, Khattar, Tanuj, Khezri, Mostafa, Kim, Seon, Klimov, Paul V., Kobrin, Bryce, Korotkov, Alexander N., Kostritsa, Fedor, Kreikebaum, John Mark, Kurilovich, Vladislav D., Landhuis, David, Lange-Dei, Tiano, Langley, Brandon W., Lau, Kim-Ming, Ledford, Justin, Lee, Kenny, Lester, Brian J., Guevel, Loïck Le, Li, Wing Yan, Lill, Alexander T., Livingston, William P., Locharla, Aditya, Lundahl, Daniel, Lunt, Aaron, Madhuk, Sid, Maloney, Ashley, Mandrà, Salvatore, Martin, Leigh S., Martin, Orion, Maxfield, Cameron, McClean, Jarrod R., McEwen, Matt, Meeks, Seneca, Megrant, Anthony, Miao, Kevin C., Molavi, Reza, Molina, Sebastian, Montazeri, Shirin, Movassagh, Ramis, Neill, Charles, Newman, Michael, Nguyen, Anthony, Nguyen, Murray, Ni, Chia-Hung, Niu, Murphy Yuezhen, Oliver, William D., Ottosson, Kristoffer, Pizzuto, Alex, Potter, Rebecca, Pritchard, Orion, Quintana, Chris, Ramachandran, Ganesh, Reagor, Matthew J., Rhodes, David M., Roberts, Gabrielle, Sankaragomathi, Kannan, Satzinger, Kevin J., Schurkus, Henry F., Shearn, Michael J., Shorter, Aaron, Shutty, Noah, Shvarts, Vladimir, Sivak, Volodymyr, Small, Spencer, Smith, W. Clarke, Springer, Sofia, Sterling, George, Suchard, Jordan, Szasz, Aaron, Sztein, Alex, Thor, Douglas, Torunbalci, M. Mert, Vaishnav, Abeer, Vargas, Justin, Vdovichev, Sergey, Vidal, Guifre, Heidweiller, Catherine Vollgraff, Waltman, Steven, Wang, Shannon X., Ware, Brayden, White, Theodore, Wong, Kristi, Woo, Bryan W. K., Xing, Cheng, Yao, Z. Jamie, Yeh, Ping, Ying, Bicheng, Yoo, Juhwan, Yosri, Noureldin, Young, Grayson, Zalcman, Adam, Zhang, Yaxing, Zhu, Ningfeng, Zobris, Nicholas, Boixo, Sergio, Kelly, Julian, Lucero, Erik, Chen, Yu, Smelyanskiy, Vadim, Neven, Hartmut, Gammon-Smith, Adam, Pollmann, Frank, Knap, Michael, and Roushan, Pedram
- Subjects
Quantum Physics ,Condensed Matter - Strongly Correlated Electrons ,High Energy Physics - Lattice - Abstract
Lattice gauge theories (LGTs) can be employed to understand a wide range of phenomena, from elementary particle scattering in high-energy physics to effective descriptions of many-body interactions in materials. Studying dynamical properties of emergent phases can be challenging as it requires solving many-body problems that are generally beyond perturbative limits. We investigate the dynamics of local excitations in a $\mathbb{Z}_2$ LGT using a two-dimensional lattice of superconducting qubits. We first construct a simple variational circuit which prepares low-energy states that have a large overlap with the ground state; then we create particles with local gates and simulate their quantum dynamics via a discretized time evolution. As the effective magnetic field is increased, our measurements show signatures of transitioning from deconfined to confined dynamics. For confined excitations, the magnetic field induces a tension in the string connecting them. Our method allows us to experimentally image string dynamics in a (2+1)D LGT from which we uncover two distinct regimes inside the confining phase: for weak confinement the string fluctuates strongly in the transverse direction, while for strong confinement transverse fluctuations are effectively frozen. In addition, we demonstrate a resonance condition at which dynamical string breaking is facilitated. Our LGT implementation on a quantum processor presents a novel set of techniques for investigating emergent particle and string dynamics.
- Published
- 2024
20. Future-Proofing Medical Imaging with Privacy-Preserving Federated Learning and Uncertainty Quantification: A Review
- Author
-
Koutsoubis, Nikolas, Waqas, Asim, Yilmaz, Yasin, Ramachandran, Ravi P., Schabath, Matthew, and Rasool, Ghulam
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Artificial Intelligence (AI) has demonstrated significant potential in automating various medical imaging tasks, which could soon become routine in clinical practice for disease diagnosis, prognosis, treatment planning, and post-treatment surveillance. However, the privacy concerns surrounding patient data present a major barrier to the widespread adoption of AI in medical imaging, as large, diverse training datasets are essential for developing accurate, generalizable, and robust Artificial intelligence models. Federated Learning (FL) offers a solution that enables organizations to train AI models collaboratively without sharing sensitive data. federated learning exchanges model training information, such as gradients, between the participating sites. Despite its promise, federated learning is still in its developmental stages and faces several challenges. Notably, sensitive information can still be inferred from the gradients shared during model training. Quantifying AI models' uncertainty is vital due to potential data distribution shifts post-deployment, which can affect model performance. Uncertainty quantification (UQ) in FL is particularly challenging due to data heterogeneity across participating sites. This review provides a comprehensive examination of FL, privacy-preserving FL (PPFL), and UQ in FL. We identify key gaps in current FL methodologies and propose future research directions to enhance data privacy and trustworthiness in medical imaging applications., Comment: 21 pages, 5 figures, 4 tables, Review paper, preprint to Radiology AI. arXiv admin note: text overlap with arXiv:2406.12815
- Published
- 2024
21. Prithvi WxC: Foundation Model for Weather and Climate
- Author
-
Schmude, Johannes, Roy, Sujit, Trojak, Will, Jakubik, Johannes, Civitarese, Daniel Salles, Singh, Shraddha, Kuehnert, Julian, Ankur, Kumar, Gupta, Aman, Phillips, Christopher E, Kienzler, Romeo, Szwarcman, Daniela, Gaur, Vishal, Shinde, Rajat, Lal, Rohit, Da Silva, Arlindo, Diaz, Jorge Luis Guevara, Jones, Anne, Pfreundschuh, Simon, Lin, Amy, Sheshadri, Aditi, Nair, Udaysankar, Anantharaj, Valentine, Hamann, Hendrik, Watson, Campbell, Maskey, Manil, Lee, Tsengdar J, Moreno, Juan Bernabe, and Ramachandran, Rahul
- Subjects
Computer Science - Machine Learning ,Physics - Atmospheric and Oceanic Physics - Abstract
Triggered by the realization that AI emulators can rival the performance of traditional numerical weather prediction models running on HPC systems, there is now an increasing number of large AI models that address use cases such as forecasting, downscaling, or nowcasting. While the parallel developments in the AI literature focus on foundation models -- models that can be effectively tuned to address multiple, different use cases -- the developments on the weather and climate side largely focus on single-use cases with particular emphasis on mid-range forecasting. We close this gap by introducing Prithvi WxC, a 2.3 billion parameter foundation model developed using 160 variables from the Modern-Era Retrospective Analysis for Research and Applications, Version 2 (MERRA-2). Prithvi WxC employs an encoder-decoder-based architecture, incorporating concepts from various recent transformer models to effectively capture both regional and global dependencies in the input data. The model has been designed to accommodate large token counts to model weather phenomena in different topologies at fine resolutions. Furthermore, it is trained with a mixed objective that combines the paradigms of masked reconstruction with forecasting. We test the model on a set of challenging downstream tasks namely: Autoregressive rollout forecasting, Downscaling, Gravity wave flux parameterization, and Extreme events estimation. The pretrained model with 2.3 billion parameters, along with the associated fine-tuning workflows, has been publicly released as an open-source contribution via Hugging Face.
- Published
- 2024
22. Multi-wavelength spectroscopic analysis of the ULX Holmberg II X-1 and its nebula suggests the presence of a heavy black hole accreting from a B-type donor
- Author
-
Serantes, S. Reyero, Oskinova, L., Hamann, W. -R., Gómez-González, V. M., Todt, H., Pauli, D., Soria, R., Gies, D. R., Torrejón, J. M., Bulik, T., Ramachandran, V., Sander, A. A. C., Bozzo, E., and Poutanen, J.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
Ultra-luminous X-ray sources (ULXs) are high-mass X-ray binaries with an X-ray luminosity above $10^{39}$ erg s$^{-1}$. These ULXs can be powered by black holes that are more massive than $20M_\odot$, accreting in a standard regime, or lighter compact objects accreting supercritically. There are only a few ULXs with known optical or UV counterparts, and their nature is debated. Determining whether optical/UV radiation is produced by the donor star or by the accretion disc is crucial for understanding ULX physics and testing massive binary evolution. We conduct, for the first time, a fully consistent multi-wavelength spectral analysis of a ULX and its circumstellar nebula. We aim to establish the donor star type and test the presence of strong disc winds in the prototypical ULX Holmberg II X-1 (Ho II X-1). We intent to obtain a realistic spectral energy distribution of the ionising source, which is needed for robust nebula analysis. We acquired new UV spectra of Ho II X-1 with the HST and complemented them with archival optical and X-ray data. We explored the spectral energy distribution of the source and analysed the spectra using the stellar atmosphere code PoWR and the photoionisation code Cloudy. Our analysis of the X-ray, UV, and optical spectra of Ho II X-1 and its nebula consistently explains the observations. We do not find traces of disc wind signatures in the UV and the optical, rejecting previous claims of the ULX being a supercritical accretor. The optical/UV counterpart of HoII X-1 is explained by a B-type supergiant donor star. Thus, the observations are fully compatible with Ho II X-1 being a close binary consisting of an $\gtrsim 66\,M_\odot$ black hole accreting matter from an $\simeq 22 M_\odot$ B-supergiant companion. Also, we propose a possible evolution scenario for the system, suggesting that Ho II X-1 is a potential gravitational wave source progenitor., Comment: Accepted in A&A. 14 pages (12 main body + 2 appendix), 6 figures, 6 tables
- Published
- 2024
- Full Text
- View/download PDF
23. On Evaluation of Vision Datasets and Models using Human Competency Frameworks
- Author
-
Ramachandran, Rahul, Kulkarni, Tejal, Sharma, Charchit, Vijaykeerthy, Deepak, and Balasubramanian, Vineeth N
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Evaluating models and datasets in computer vision remains a challenging task, with most leaderboards relying solely on accuracy. While accuracy is a popular metric for model evaluation, it provides only a coarse assessment by considering a single model's score on all dataset items. This paper explores Item Response Theory (IRT), a framework that infers interpretable latent parameters for an ensemble of models and each dataset item, enabling richer evaluation and analysis beyond the single accuracy number. Leveraging IRT, we assess model calibration, select informative data subsets, and demonstrate the usefulness of its latent parameters for analyzing and comparing models and datasets in computer vision.
- Published
- 2024
24. CACER: Clinical Concept Annotations for Cancer Events and Relations
- Author
-
Fu, Yujuan, Ramachandran, Giridhar Kaushik, Halwani, Ahmad, McInnes, Bridget T., Xia, Fei, Lybarger, Kevin, Yetisgen, Meliha, and Uzuner, Özlem
- Subjects
Computer Science - Computation and Language - Abstract
Clinical notes contain unstructured representations of patient histories, including the relationships between medical problems and prescription drugs. To investigate the relationship between cancer drugs and their associated symptom burden, we extract structured, semantic representations of medical problem and drug information from the clinical narratives of oncology notes. We present Clinical Concept Annotations for Cancer Events and Relations (CACER), a novel corpus with fine-grained annotations for over 48,000 medical problems and drug events and 10,000 drug-problem and problem-problem relations. Leveraging CACER, we develop and evaluate transformer-based information extraction (IE) models such as BERT, Flan-T5, Llama3, and GPT-4 using fine-tuning and in-context learning (ICL). In event extraction, the fine-tuned BERT and Llama3 models achieved the highest performance at 88.2-88.0 F1, which is comparable to the inter-annotator agreement (IAA) of 88.4 F1. In relation extraction, the fine-tuned BERT, Flan-T5, and Llama3 achieved the highest performance at 61.8-65.3 F1. GPT-4 with ICL achieved the worst performance across both tasks. The fine-tuned models significantly outperformed GPT-4 in ICL, highlighting the importance of annotated training data and model optimization. Furthermore, the BERT models performed similarly to Llama3. For our task, LLMs offer no performance advantage over the smaller BERT models. The results emphasize the need for annotated training data to optimize models. Multiple fine-tuned transformer models achieved performance comparable to IAA for several extraction tasks., Comment: This is a pre-copy-editing, author-produced PDF of an article accepted for publication in JAMIA following peer review. The definitive publisher-authenticated version is available online at https://academic.oup.com/jamia/advance-article/doi/10.1093/jamia/ocae231/7748302
- Published
- 2024
- Full Text
- View/download PDF
25. Experimental and computational study of ethanolamine ices at astrochemical conditions
- Author
-
Ramachandran, R, Sil, Milan, Gorai, Prasanta, Meka, J K, Pavithraa, S, Lo, J -I, Chou, S -L, Wu, Y -J, Janardhan, P, Cheng, B -M, Bhardwaj, Anil, Rivilla, Vıctor M., Mason, N J, Sivaraman, B, and Das, Ankan
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - Solar and Stellar Astrophysics - Abstract
Ethanolamine (NH2CH2CH2OH) has recently been identified in the molecular cloud G+0.693-0.027, situated in the SgrB2 complex in the Galactic center. However, its presence in other regions, and in particular in star-forming sites, is still elusive. Given its likely role as a precursor to simple amino acids, understanding its presence in the star-forming region is required. Here, we present the experimentally obtained temperature-dependent spectral features and morphological behavior of pure ethanolamine ices under astrochemical conditions in the 2 - 12 micro meter (MIR) and 120 - 230 nm (VUV) regions for the first time. These features would help in understanding its photochemical behavior. In addition, we present the first chemical models specifically dedicated to ethanolamine. These models include all the discussed chemical routes from the literature, along with the estimated binding energies and activation energies from quantum chemical calculations reported in this work. We have found that surface reactions: CH2OH + NH2CH2 --> NH2CH2CH2OH and NH2 + C2H4OH --> NH2CH2CH2OH in warmer regions (60-90 K) could play a significant role in the formation of ethanolamine. Our modeled abundance of ethanolamine complements the upper limit of ethanolamine column density estimated in earlier observations in hot core/corino regions. Furthermore, we provide a theoretical estimation of the rotational and distortional constants for various species (such as HNCCO, NH2CHCO, and NH2CH2CO) related to ethanolamine that have not been studied in existing literature. This study could be valuable for identifying these species in the future., Comment: 18 pages, 6 figures. Accepted for the publication in The Astrophysical Journal
- Published
- 2024
26. Quantum error correction below the surface code threshold
- Author
-
Acharya, Rajeev, Aghababaie-Beni, Laleh, Aleiner, Igor, Andersen, Trond I., Ansmann, Markus, Arute, Frank, Arya, Kunal, Asfaw, Abraham, Astrakhantsev, Nikita, Atalaya, Juan, Babbush, Ryan, Bacon, Dave, Ballard, Brian, Bardin, Joseph C., Bausch, Johannes, Bengtsson, Andreas, Bilmes, Alexander, Blackwell, Sam, Boixo, Sergio, Bortoli, Gina, Bourassa, Alexandre, Bovaird, Jenna, Brill, Leon, Broughton, Michael, Browne, David A., Buchea, Brett, Buckley, Bob B., Buell, David A., Burger, Tim, Burkett, Brian, Bushnell, Nicholas, Cabrera, Anthony, Campero, Juan, Chang, Hung-Shen, Chen, Yu, Chen, Zijun, Chiaro, Ben, Chik, Desmond, Chou, Charina, Claes, Jahan, Cleland, Agnetta Y., Cogan, Josh, Collins, Roberto, Conner, Paul, Courtney, William, Crook, Alexander L., Curtin, Ben, Das, Sayan, Davies, Alex, De Lorenzo, Laura, Debroy, Dripto M., Demura, Sean, Devoret, Michel, Di Paolo, Agustin, Donohoe, Paul, Drozdov, Ilya, Dunsworth, Andrew, Earle, Clint, Edlich, Thomas, Eickbusch, Alec, Elbag, Aviv Moshe, Elzouka, Mahmoud, Erickson, Catherine, Faoro, Lara, Farhi, Edward, Ferreira, Vinicius S., Burgos, Leslie Flores, Forati, Ebrahim, Fowler, Austin G., Foxen, Brooks, Ganjam, Suhas, Garcia, Gonzalo, Gasca, Robert, Genois, Élie, Giang, William, Gidney, Craig, Gilboa, Dar, Gosula, Raja, Dau, Alejandro Grajales, Graumann, Dietrich, Greene, Alex, Gross, Jonathan A., Habegger, Steve, Hall, John, Hamilton, Michael C., Hansen, Monica, Harrigan, Matthew P., Harrington, Sean D., Heras, Francisco J. H., Heslin, Stephen, Heu, Paula, Higgott, Oscar, Hill, Gordon, Hilton, Jeremy, Holland, George, Hong, Sabrina, Huang, Hsin-Yuan, Huff, Ashley, Huggins, William J., Ioffe, Lev B., Isakov, Sergei V., Iveland, Justin, Jeffrey, Evan, Jiang, Zhang, Jones, Cody, Jordan, Stephen, Joshi, Chaitali, Juhas, Pavol, Kafri, Dvir, Kang, Hui, Karamlou, Amir H., Kechedzhi, Kostyantyn, Kelly, Julian, Khaire, Trupti, Khattar, Tanuj, Khezri, Mostafa, Kim, Seon, Klimov, Paul V., Klots, Andrey R., Kobrin, Bryce, Kohli, Pushmeet, Korotkov, Alexander N., Kostritsa, Fedor, Kothari, Robin, Kozlovskii, Borislav, Kreikebaum, John Mark, Kurilovich, Vladislav D., Lacroix, Nathan, Landhuis, David, Lange-Dei, Tiano, Langley, Brandon W., Laptev, Pavel, Lau, Kim-Ming, Guevel, Loïck Le, Ledford, Justin, Lee, Kenny, Lensky, Yuri D., Leon, Shannon, Lester, Brian J., Li, Wing Yan, Li, Yin, Lill, Alexander T., Liu, Wayne, Livingston, William P., Locharla, Aditya, Lucero, Erik, Lundahl, Daniel, Lunt, Aaron, Madhuk, Sid, Malone, Fionn D., Maloney, Ashley, Mandrá, Salvatore, Martin, Leigh S., Martin, Steven, Martin, Orion, Maxfield, Cameron, McClean, Jarrod R., McEwen, Matt, Meeks, Seneca, Megrant, Anthony, Mi, Xiao, Miao, Kevin C., Mieszala, Amanda, Molavi, Reza, Molina, Sebastian, Montazeri, Shirin, Morvan, Alexis, Movassagh, Ramis, Mruczkiewicz, Wojciech, Naaman, Ofer, Neeley, Matthew, Neill, Charles, Nersisyan, Ani, Neven, Hartmut, Newman, Michael, Ng, Jiun How, Nguyen, Anthony, Nguyen, Murray, Ni, Chia-Hung, O'Brien, Thomas E., Oliver, William D., Opremcak, Alex, Ottosson, Kristoffer, Petukhov, Andre, Pizzuto, Alex, Platt, John, Potter, Rebecca, Pritchard, Orion, Pryadko, Leonid P., Quintana, Chris, Ramachandran, Ganesh, Reagor, Matthew J., Rhodes, David M., Roberts, Gabrielle, Rosenberg, Eliott, Rosenfeld, Emma, Roushan, Pedram, Rubin, Nicholas C., Saei, Negar, Sank, Daniel, Sankaragomathi, Kannan, Satzinger, Kevin J., Schurkus, Henry F., Schuster, Christopher, Senior, Andrew W., Shearn, Michael J., Shorter, Aaron, Shutty, Noah, Shvarts, Vladimir, Singh, Shraddha, Sivak, Volodymyr, Skruzny, Jindra, Small, Spencer, Smelyanskiy, Vadim, Smith, W. Clarke, Somma, Rolando D., Springer, Sofia, Sterling, George, Strain, Doug, Suchard, Jordan, Szasz, Aaron, Sztein, Alex, Thor, Douglas, Torres, Alfredo, Torunbalci, M. Mert, Vaishnav, Abeer, Vargas, Justin, Vdovichev, Sergey, Vidal, Guifre, Villalonga, Benjamin, Heidweiller, Catherine Vollgraff, Waltman, Steven, Wang, Shannon X., Ware, Brayden, Weber, Kate, White, Theodore, Wong, Kristi, Woo, Bryan W. K., Xing, Cheng, Yao, Z. Jamie, Yeh, Ping, Ying, Bicheng, Yoo, Juhwan, Yosri, Noureldin, Young, Grayson, Zalcman, Adam, Zhang, Yaxing, Zhu, Ningfeng, and Zobrist, Nicholas
- Subjects
Quantum Physics - Abstract
Quantum error correction provides a path to reach practical quantum computing by combining multiple physical qubits into a logical qubit, where the logical error rate is suppressed exponentially as more qubits are added. However, this exponential suppression only occurs if the physical error rate is below a critical threshold. In this work, we present two surface code memories operating below this threshold: a distance-7 code and a distance-5 code integrated with a real-time decoder. The logical error rate of our larger quantum memory is suppressed by a factor of $\Lambda$ = 2.14 $\pm$ 0.02 when increasing the code distance by two, culminating in a 101-qubit distance-7 code with 0.143% $\pm$ 0.003% error per cycle of error correction. This logical memory is also beyond break-even, exceeding its best physical qubit's lifetime by a factor of 2.4 $\pm$ 0.3. We maintain below-threshold performance when decoding in real time, achieving an average decoder latency of 63 $\mu$s at distance-5 up to a million cycles, with a cycle time of 1.1 $\mu$s. To probe the limits of our error-correction performance, we run repetition codes up to distance-29 and find that logical performance is limited by rare correlated error events occurring approximately once every hour, or 3 $\times$ 10$^9$ cycles. Our results present device performance that, if scaled, could realize the operational requirements of large scale fault-tolerant quantum algorithms., Comment: 10 pages, 4 figures, Supplementary Information
- Published
- 2024
27. CortexCompile: Harnessing Cortical-Inspired Architectures for Enhanced Multi-Agent NLP Code Synthesis
- Author
-
Ramachandran, Gautham and Yang, Rick
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,I.2.2 ,I.2.7 - Abstract
Current approaches to automated code generation often rely on monolithic models that lack real-time adaptability and scalability. This limitation is particularly evident in complex programming tasks that require dynamic adjustment and efficiency. The integration of neuroscience principles into Natural Language Processing (NLP) has the potential to revolutionize automated code generation. This paper presents CortexCompile, a novel modular system inspired by the specialized functions of the human brain's cortical regions. By emulating the distinct roles of the Prefrontal Cortex, Parietal Cortex, Temporal Lobe, and Motor Cortex, CortexCompile achieves significant advancements in scalability, efficiency, and adaptability compared to traditional monolithic models like GPT-4o. The system's architecture features a Task Orchestration Agent that manages dynamic task delegation and parallel processing, facilitating the generation of highly accurate and optimized code across increasingly complex programming tasks. Experimental evaluations demonstrate that CortexCompile consistently outperforms GPT-4o in development time, accuracy, and user satisfaction, particularly in tasks involving real-time strategy games and first-person shooters. These findings underscore the viability of neuroscience-inspired architectures in addressing the limitations of current NLP models, paving the way for more efficient and human-like AI systems., Comment: 17 pages, 6 figures
- Published
- 2024
28. Imagen 3
- Author
-
Imagen-Team-Google, Baldridge, Jason, Bauer, Jakob, Bhutani, Mukul, Brichtova, Nicole, Bunner, Andrew, Chan, Kelvin, Chen, Yichang, Dieleman, Sander, Du, Yuqing, Eaton-Rosen, Zach, Fei, Hongliang, de Freitas, Nando, Gao, Yilin, Gladchenko, Evgeny, Colmenarejo, Sergio Gómez, Guo, Mandy, Haig, Alex, Hawkins, Will, Hu, Hexiang, Huang, Huilian, Igwe, Tobenna Peter, Kaplanis, Christos, Khodadadeh, Siavash, Kim, Yelin, Konyushkova, Ksenia, Langner, Karol, Lau, Eric, Luo, Shixin, Mokrá, Soňa, Nandwani, Henna, Onoe, Yasumasa, Oord, Aäron van den, Parekh, Zarana, Pont-Tuset, Jordi, Qi, Hang, Qian, Rui, Ramachandran, Deepak, Rane, Poorva, Rashwan, Abdullah, Razavi, Ali, Riachi, Robert, Srinivasan, Hansa, Srinivasan, Srivatsan, Strudel, Robin, Uria, Benigno, Wang, Oliver, Wang, Su, Waters, Austin, Wolff, Chris, Wright, Auriel, Xiao, Zhisheng, Xiong, Hao, Xu, Keyang, van Zee, Marc, Zhang, Junlin, Zhang, Katie, Zhou, Wenlei, Zolna, Konrad, Aboubakar, Ola, Akbulut, Canfer, Akerlund, Oscar, Albuquerque, Isabela, Anderson, Nina, Andreetto, Marco, Aroyo, Lora, Bariach, Ben, Barker, David, Ben, Sherry, Berman, Dana, Biles, Courtney, Blok, Irina, Botadra, Pankil, Brennan, Jenny, Brown, Karla, Buckley, John, Bunel, Rudy, Bursztein, Elie, Butterfield, Christina, Caine, Ben, Carpenter, Viral, Casagrande, Norman, Chang, Ming-Wei, Chang, Solomon, Chaudhuri, Shamik, Chen, Tony, Choi, John, Churbanau, Dmitry, Clement, Nathan, Cohen, Matan, Cole, Forrester, Dektiarev, Mikhail, Du, Vincent, Dutta, Praneet, Eccles, Tom, Elue, Ndidi, Feden, Ashley, Fruchter, Shlomi, Garcia, Frankie, Garg, Roopal, Ge, Weina, Ghazy, Ahmed, Gipson, Bryant, Goodman, Andrew, Górny, Dawid, Gowal, Sven, Gupta, Khyatti, Halpern, Yoni, Han, Yena, Hao, Susan, Hayes, Jamie, Hertz, Amir, Hirst, Ed, Hou, Tingbo, Howard, Heidi, Ibrahim, Mohamed, Ike-Njoku, Dirichi, Iljazi, Joana, Ionescu, Vlad, Isaac, William, Jana, Reena, Jennings, Gemma, Jenson, Donovon, Jia, Xuhui, Jones, Kerry, Ju, Xiaoen, Kajic, Ivana, Ayan, Burcu Karagol, Kelly, Jacob, Kothawade, Suraj, Kouridi, Christina, Ktena, Ira, Kumakaw, Jolanda, Kurniawan, Dana, Lagun, Dmitry, Lavitas, Lily, Lee, Jason, Li, Tao, Liang, Marco, Li-Calis, Maggie, Liu, Yuchi, Alberca, Javier Lopez, Lu, Peggy, Lum, Kristian, Ma, Yukun, Malik, Chase, Mellor, John, Mosseri, Inbar, Murray, Tom, Nematzadeh, Aida, Nicholas, Paul, Oliveira, João Gabriel, Ortiz-Jimenez, Guillermo, Paganini, Michela, Paine, Tom Le, Paiss, Roni, Parrish, Alicia, Peckham, Anne, Peswani, Vikas, Petrovski, Igor, Pfaff, Tobias, Pirozhenko, Alex, Poplin, Ryan, Prabhu, Utsav, Qi, Yuan, Rahtz, Matthew, Rashtchian, Cyrus, Rastogi, Charvi, Raul, Amit, Rebuffi, Sylvestre-Alvise, Ricco, Susanna, Riedel, Felix, Robinson, Dirk, Rohatgi, Pankaj, Rosgen, Bill, Rumbley, Sarah, Ryu, Moonkyung, Salgado, Anthony, Singla, Sahil, Schroff, Florian, Schumann, Candice, Shah, Tanmay, Shillingford, Brendan, Shivakumar, Kaushik, Shtatnov, Dennis, Singer, Zach, Sluzhaev, Evgeny, Sokolov, Valerii, Sottiaux, Thibault, Stimberg, Florian, Stone, Brad, Stutz, David, Su, Yu-Chuan, Tabellion, Eric, Tang, Shuai, Tao, David, Thomas, Kurt, Thornton, Gregory, Toor, Andeep, Udrescu, Cristian, Upadhyay, Aayush, Vasconcelos, Cristina, Vasiloff, Alex, Voynov, Andrey, Walker, Amanda, Wang, Luyu, Wang, Miaosen, Wang, Simon, Wang, Stanley, Wang, Qifei, Wang, Yuxiao, Weisz, Ágoston, Wiles, Olivia, Wu, Chenxia, Xu, Xingyu Federico, Xue, Andrew, Yang, Jianbo, Yu, Luo, Yurtoglu, Mete, Zand, Ali, Zhang, Han, Zhang, Jiageng, Zhao, Catherine, Zhaxybay, Adilet, Zhou, Miao, Zhu, Shengqi, Zhu, Zhenkai, Bloxwich, Dawn, Bordbar, Mahyar, Cobo, Luis C., Collins, Eli, Dai, Shengyang, Doshi, Tulsee, Dragan, Anca, Eck, Douglas, Hassabis, Demis, Hsiao, Sissie, Hume, Tom, Kavukcuoglu, Koray, King, Helen, Krawczyk, Jack, Li, Yeqing, Meier-Hellstern, Kathy, Orban, Andras, Pinsky, Yury, Subramanya, Amar, Vinyals, Oriol, Yu, Ting, and Zwols, Yori
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.
- Published
- 2024
29. Stable-BC: Controlling Covariate Shift with Stable Behavior Cloning
- Author
-
Mehta, Shaunak A., Ciftci, Yusuf Umut, Ramachandran, Balamurugan, Bansal, Somil, and Losey, Dylan P.
- Subjects
Computer Science - Robotics - Abstract
Behavior cloning is a common imitation learning paradigm. Under behavior cloning the robot collects expert demonstrations, and then trains a policy to match the actions taken by the expert. This works well when the robot learner visits states where the expert has already demonstrated the correct action; but inevitably the robot will also encounter new states outside of its training dataset. If the robot learner takes the wrong action at these new states it could move farther from the training data, which in turn leads to increasingly incorrect actions and compounding errors. Existing works try to address this fundamental challenge by augmenting or enhancing the training data. By contrast, in our paper we develop the control theoretic properties of behavior cloned policies. Specifically, we consider the error dynamics between the system's current state and the states in the expert dataset. From the error dynamics we derive model-based and model-free conditions for stability: under these conditions the robot shapes its policy so that its current behavior converges towards example behaviors in the expert dataset. In practice, this results in Stable-BC, an easy to implement extension of standard behavior cloning that is provably robust to covariate shift. We demonstrate the effectiveness of our algorithm in simulations with interactive, nonlinear, and visual environments. We also conduct experiments where a robot arm uses Stable-BC to play air hockey. See our website here: https://collab.me.vt.edu/Stable-BC/
- Published
- 2024
30. Measurement of the electric potential and the magnetic field in the shifted analysing plane of the KATRIN experiment
- Author
-
Aker, M., Batzler, D., Beglarian, A., Behrens, J., Beisenkötter, J., Biassoni, M., Bieringer, B., Biondi, Y., Block, F., Bobien, S., Böttcher, M., Bornschein, B., Bornschein, L., Caldwell, T. S., Carminati, M., Chatrabhuti, A., Chilingaryan, S., Daniel, B. A., Debowski, K., Descher, M., Barrero, D. Díaz, Doe, P. J., Dragoun, O., Drexlin, G., Edzards, F., Eitel, K., Ellinger, E., Engel, R., Enomoto, S., Felden, A., Fengler, C., Fiorini, C., Formaggio, J. A., Forstner, C., Fränkle, F. M., Gauda, K., Gavin, A. S., Gil, W., Glück, F., Grössle, R., Gumbsheimer, R., Hannen, V., Hasselmann, L., Haußmann, N., Helbing, K., Heyns, S., Hickford, S., Hiller, R., Hillesheimer, D., Hinz, D., Höhn, T., Huber, A., Jansen, A., Karl, C., Kellerer, J., Khosonthongkee, K., Köhler, C., Köllenberger, L., Kopmann, A., Kovač, N., Krause, H., La Cascio, L., Lasserre, T., Lauer, J., Le, T. L., Lebeda, O., Lehnert, B., Li, G., Lokhov, A., Machatschek, M., Mark, M., Marsteller, A., Martin, E. L., McMichael, K., Melzer, C., Mertens, S., Mohanty, S., Mostafa, J., Müller, K., Nava, A., Neumann, H., Niemes, S., Parno, D. S., Pavan, M., Pinsook, U., Poon, A. W. P., Poyato, J. M. L., Pozzi, S., Priester, F., Ráliš, J., Ramachandran, S., Robertson, R. G. H., Rodenbeck, C., Röllig, M., Sack, R., Saenz, A., Salomon, R., Schäfer, P., Schlösser, M., Schlösser, K., Schlüter, L., Schneidewind, S., Schrank, M., Schürmann, J., Schütz, A. K., Schwemmer, A., Schwenck, A., Šefčík, M., Siegmann, D., Simon, F., Spanier, F., Spreng, D., Sreethawong, W., Steidl, M., Štorek, J., Stribl, X., Sturm, M., Suwonjandee, N., Jerome, N. Tan, Telle, H. H., Thorne, L. A., Thümmler, T., Titov, N., Tkachev, I., Urban, K., Valerius, K., Vénos, D., Weinheimer, C., Welte, S., Wendel, J., Wiesinger, C., Wilkerson, J. F., Wolf, J., Wüstling, S., Wydra, J., Xu, W., Zadorozhny, S., and Zeller, G.
- Subjects
Physics - Instrumentation and Detectors - Abstract
The projected sensitivity of the effective electron neutrino-mass measurement with the KATRIN experiment is below 0.3 eV (90 % CL) after five years of data acquisition. The sensitivity is affected by the increased rate of the background electrons from KATRIN's main spectrometer. A special shifted-analysing-plane (SAP) configuration was developed to reduce this background by a factor of two. The complex layout of electromagnetic fields in the SAP configuration requires a robust method of estimating these fields. We present in this paper a dedicated calibration measurement of the fields using conversion electrons of gaseous $^\mathrm{83m}$Kr, which enables the neutrino-mass measurements in the SAP configuration., Comment: 19 pages, 11 figures
- Published
- 2024
31. Observations of Extremely Metal-Poor O Stars: Weak Winds and Constraints for Evolution Models
- Author
-
Telford, O. Grace, Chisholm, John, Sander, Andreas A. C., Ramachandran, Varsha, McQuinn, Kristen B. W., and Berg, Danielle A.
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
Metal-poor massive stars drive the evolution of low-mass galaxies, both locally and at high redshift. However, quantifying the feedback they impart to their local surroundings remains uncertain because models of stellar evolution, mass loss, and ionizing spectra are unconstrained by observations below 20% solar metallicity ($Z_\odot$). We present new Keck Cosmic Web Imager optical spectroscopy of three O stars in the nearby dwarf galaxies Leo P, Sextans A, and WLM, which have gas-phase oxygen abundances of 3-14% $Z_\odot$. To characterize their fundamental stellar properties and radiation-driven winds, we fit PoWR atmosphere models to the optical spectra simultaneously with Hubble Space Telescope far-ultraviolet (FUV) spectra and multi-wavelength photometry. We find that all three stars have effective temperatures consistent with their spectral types and surface gravities typical of main-sequence dwarf stars. Yet, the combination of those inferred parameters and luminosity for the two lower-$Z$ stars is not reproduced by stellar evolution models, even those that include rotation or binary interactions. The scenario of multiple-star systems is difficult to reconcile with all available data, suggesting that these observations pose a challenge to current evolution models. We highlight the importance of validating the relationship between stellar mass, temperature, and luminosity at very low $Z$ for accurate estimates of ionizing photon production and spectral hardness. Finally, all three stars' FUV wind profiles reveal low mass-loss rates and terminal wind velocities in tension with expectations from widely adopted radiation-driven wind models. These results provide empirical benchmarks for future development of mass-loss and evolution models for metal-poor stellar populations., Comment: Accepted at ApJ. 10 figures, 25 pages
- Published
- 2024
32. A Second-Order, Variable-Resolution, Weakly-Compressible Smoothed Particle Hydrodynamics Scheme
- Author
-
Muta, Abhinav, Negi, Pawan, and Ramachandran, Prabhu
- Subjects
Physics - Fluid Dynamics - Abstract
The smoothed particle hydrodynamics (SPH) method has been widely used to simulate incompressible and slightly compressible fluid flows. Adaptive refinement strategies to dynamically increase the resolution of the particles to capture sharp gradients in the flow have also been developed. However, most of the SPH schemes in the literature are not second-order convergent (SOC). Both second-order convergence and adaptive resolution are considered grand challenge problems in the SPH community. In this paper, we propose, for the first time, a second-order convergent (SOC) adaptive refinement strategy along with a SOC weakly-compressible SPH scheme. We employ the method of manufactured solutions to systematically develop the scheme and validate the solver. We demonstrate the order of convergence of the entire scheme using the Taylor-Green vortex problem and then go on to demonstrate the applicability of the method to simulate flow past a circular cylinder.
- Published
- 2024
33. Binarity at LOw Metallicity (BLOeM): a spectroscopic VLT monitoring survey of massive stars in the SMC
- Author
-
Shenar, T., Bodensteiner, J., Sana, H., Crowther, P. A., Lennon, D. J., Abdul-Masih, M., Almeida, L. A., Backs, F., Berlanas, S. R., Bernini-Peron, M., Bestenlehner, J. M., Bowman, D. M., Bronner, V. A., Britavskiy, N., de Koter, A., de Mink, S. E., Deshmukh, K., Evans, C. J., Fabry, M., Gieles, M., Gilkis, A., González-Torà, G., Gräfener, G., Götberg, Y., Hawcroft, C., Hénault-Brunet, V., Herrero, A., Holgado, G., Janssens, S., Johnston, C., Josiek, J., Justham, S., Kalari, V. M., Katabi, Z. Z., Keszthelyi, Z., Klencki, J., Kubát, J., Kubátová, B., Langer, N., Lefever, R. R., Ludwig, B., Mackey, J., Mahy, L., Apellániz, J. Maíz, Mandel, I., Maravelias, G., Marchant, P., Menon, A., Najarro, F., Oskinova, L. M., Ovadia, A. J. G. O'Grady R., Patrick, L. R., Pauli, D., Pawlak, M., Ramachandran, V., Renzo, M., Rocha, D. F., Sander, A. A. C., Sayada, T., Schneider, F. R. N., Schootemeijer, A., Schösser, E. C., Schürmann, C., Sen, K., Shahaf, S., Simón-Díaz, S., Stoop, M., van Loon, J. Th., Toonen, S., Tramper, F., Valli, R., van Son, L. A. C., Vigna-Gómez, A., Villaseñor, J. I., Vink, J. S., Wang, C., and Willcox, R.
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
Surveys in the Milky Way and Large Magellanic Cloud revealed that the majority of massive stars will interact with companions during their lives. However, knowledge of the binary properties of massive stars at low metallicity, which approaches the conditions of the Early Universe, remains sparse. We present the Binarity at LOw Metallicity (BLOeM) campaign - an ESO large programme designed to obtain 25 epochs of spectroscopy for 929 massive stars in the SMC - the lowest metallicity conditions in which multiplicity is probed to date (Z = 0.2 Zsun). BLOeM will provide (i) the binary fraction, (ii) the orbital configurations of systems with periods P < 3 yr, (iii) dormant OB+BH binaries, and (iv) a legacy database of physical parameters of massive stars at low metallicity. The stars are observed with the LR02 setup of the giraffe instrument of the Very Large Telescope (3960-4570A, resolving power R=6200; typical signal-to-noise ratio S/N=70-100). This paper utilises the first 9 epochs obtained over a three-month time. We describe the survey and data reduction, perform a spectral classification of the stacked spectra, and construct a Hertzsprung-Russell diagram of the sample via spectral-type and photometric calibrations. The sample covers spectral types from O4 to F5, spanning the effective temperature and luminosity ranges 6.5
- Published
- 2024
- Full Text
- View/download PDF
34. X-Shooting ULLYSES: Massive stars at low metallicity VII. Stellar and wind properties of B supergiants in the Small Magellanic Cloud
- Author
-
Bernini-Peron, M., Sander, A. A. C., Ramachandran, V., Oskinova, L. M., Vink, J. S., Verhamme, O., Najarro, F., Josiek, J., Brands, S. A., Crowther, P. A., Gómez-González, V. M. A., Gormaz-Matamala, A. C., Hawcroft, C., Kuiper, R., Mahy, L., Marcolino, W. L. F., Martins, L. P., Mehner, A., Parsons, T. N., Pauli, D., Shenar, T., Schootemeijer, A., Todt, H., van Loon, J. Th., and collaboration, the XShootU
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
Context. B supergiants (BSGs) represent an important connection between the main sequence and more extreme evolutionary stages of massive stars. Additionally, lying toward the cool end of the hot star regime, determining their wind properties is crucial to constrain the evolution and feedback of massive stars as, for instance, they might manifest the bi-stability jump phenomenon. Aims. We undertake a detailed analysis of a representative sample of 18 Small Magellanic Cloud (SMC) BSGs within the ULLYSES and XShootU datasets. Our UV and optical analysis spans BSGs from B0 to B8 - covering the bi-stability jump region. We aim to evaluate their evolutionary status and verify what their wind properties say about the bi-stability jump in a low-metallicity environment. Methods. We used the CMFGEN to model the spectra and photometry (from UV to infrared) of our sample. We compare our results with different evolutionary models, with previous determinations in the literature of OB stars, and with diverging mass-loss recipes at the bi-stability jump. Additionally, we provide the first BSG models in the SMC including X-rays. Results. (i) Within a single-stellar evolution framework, the evolutionary status of early BSGs seem less clear than that of late BSGs, which agree with H-shell burning models. (ii) UV analysis shows evidence that BSGs contain X-rays in their atmospheres, for which we provide constraints. In general, we find higher X-ray luminosity (close to the standard log(L_X/L) ~ -7) for early BSGs. For cooler BSGs, lower values are preferred, log(L_X/L) ~ -8.5. (iii) The obtained mass-loss rates suggest neither a jump nor a monotonic decrease with temperature. Instead, a rather constant trend is observed, which is at odds with the increase found for Galactic BSGs. (iv) The wind velocity behavior with temperature shows a sharp drop at ~19 kK, similar to what is observed for Galactic BSGs., Comment: 33 pages (23+10)+(22 at Zenodo), 34 figures (21+13)+(21 at Zenodo), 7 tables (3+4)+(1 at Zenodo), accepted for publication
- Published
- 2024
35. Nanoscale ferroelectric programming of van der Waals heterostructures
- Author
-
Yang, Dengyu, Cao, Qingrui, Akyuz, Erin, Hayden, John, Nordlander, Josh, Yu, Muqing, Ramachandran, Ranjani, Irvin, Patrick, Maria, Jon-Paul, Hunt, Benjamin M., and Levy, Jeremy
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics ,Physics - Applied Physics - Abstract
The ability to create superlattices in van der Waals (vdW) heterostructures via moir\'e interference heralded a new era in the science and technology of two-dimensional materials. Through precise control of the twist angle, flat bands and strongly correlated phases have been engineered. The precise twisting of vdW layers is in some sense a bottom-up approach--a single parameter can dial in a wide range of periodic structures. Here, we describe a top-down approach to engineering nanoscale potentials in vdW layers using a buried programmable ferroelectric layer. Ultra-low-voltage electron beam lithography (ULV-EBL) is used to program ferroelectric domains in a ferroelectric Al_{1-x}B_{x}N thin film through a graphene/hexagonal boron nitride (hBN) heterostructure that is transferred on top. We demonstrate ferroelectric field effects by creating a lateral p-n junction, and demonstrate spatial resolution down to 35 nm, limited by the resolution of our scanned probe characterization methods. This innovative, resist-free patterning method is predicted to achieve 10 nm resolution and enable arbitrary programming of vdW layers, opening a pathway to create new phases that are inaccessible by moir\'e techniques. The ability to "paint" different phases of matter on a single vdW "canvas" provides a wealth of new electronic and photonic functionalities., Comment: 9 pages, 4 figures and supplemental material
- Published
- 2024
36. A Self-Supervised Learning Pipeline for Demographically Fair Facial Attribute Classification
- Author
-
Ramachandran, Sreeraj and Rattani, Ajita
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computers and Society ,Computer Science - Machine Learning - Abstract
Published research highlights the presence of demographic bias in automated facial attribute classification. The proposed bias mitigation techniques are mostly based on supervised learning, which requires a large amount of labeled training data for generalizability and scalability. However, labeled data is limited, requires laborious annotation, poses privacy risks, and can perpetuate human bias. In contrast, self-supervised learning (SSL) capitalizes on freely available unlabeled data, rendering trained models more scalable and generalizable. However, these label-free SSL models may also introduce biases by sampling false negative pairs, especially at low-data regimes 200K images) under low compute settings. Further, SSL-based models may suffer from performance degradation due to a lack of quality assurance of the unlabeled data sourced from the web. This paper proposes a fully self-supervised pipeline for demographically fair facial attribute classifiers. Leveraging completely unlabeled data pseudolabeled via pre-trained encoders, diverse data curation techniques, and meta-learning-based weighted contrastive learning, our method significantly outperforms existing SSL approaches proposed for downstream image classification tasks. Extensive evaluations on the FairFace and CelebA datasets demonstrate the efficacy of our pipeline in obtaining fair performance over existing baselines. Thus, setting a new benchmark for SSL in the fairness of facial attribute classification., Comment: 13 pages, IJCB 2024
- Published
- 2024
37. Accuracy is Not All You Need
- Author
-
Dutta, Abhinav, Krishnan, Sanjeev, Kwatra, Nipun, and Ramjee, Ramachandran
- Subjects
Computer Science - Machine Learning - Abstract
When Large Language Models (LLMs) are compressed using techniques such as quantization, the predominant way to demonstrate the validity of such techniques is by measuring the model's accuracy on various benchmarks.If the accuracies of the baseline model and the compressed model are close, it is assumed that there was negligible degradation in quality.However, even when the accuracy of baseline and compressed model are similar, we observe the phenomenon of flips, wherein answers change from correct to incorrect and vice versa in proportion.We conduct a detailed study of metrics across multiple compression techniques, models and datasets, demonstrating that the behavior of compressed models as visible to end-users is often significantly different from the baseline model, even when accuracy is similar.We further evaluate compressed models qualitatively and quantitatively using MT-Bench and show that compressed models are significantly worse than baseline models in this free-form generative task.Thus, we argue that compression techniques should also be evaluated using distance metrics.We propose two such metrics, KL-Divergence and flips, and show that they are well correlated.
- Published
- 2024
38. Buckling by disordered growth
- Author
-
Ramachandran, Rahul G., Alert, Ricard, and Haas, Pierre A.
- Subjects
Condensed Matter - Soft Condensed Matter ,Physics - Biological Physics - Abstract
Buckling instabilities driven by tissue growth underpin key developmental events such as the folding of the brain. Tissue growth is disordered due to cell-to-cell variability, but the effects of this variability on buckling are unknown. Here, we analyse what is perhaps the simplest setup of this problem: the buckling of an elastic rod with fixed ends driven by spatially varying growth. Combining analytical calculations for simple growth fields and numerical sampling of random growth fields, we show that variability can increase as well as decrease the growth threshold for buckling, even when growth variability does not cause any residual stresses. For random growth, we find that the shift of the buckling threshold correlates with spatial moments of the growth field. Our results imply that biological systems can either trigger or avoid buckling by exploiting the spatial arrangement of growth variability., Comment: 6 pages, 3 figures; Supplemental Material: 7 pages, 1 figure
- Published
- 2024
39. Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems
- Author
-
Agrawal, Amey, Agarwal, Anmol, Kedia, Nitin, Mohan, Jayashree, Kundu, Souvik, Kwatra, Nipun, Ramjee, Ramachandran, and Tumanov, Alexey
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Serving large language models (LLMs) in production can incur substantial costs, which has prompted recent advances in inference system optimizations. Today, these systems are evaluated against conventional latency and throughput metrics (eg. TTFT, TBT, Normalised Latency and TPOT). However, these metrics fail to fully capture the nuances of LLM inference, leading to an incomplete assessment of user-facing performance crucial for real-time applications such as chat and translation. In this paper, we first identify the pitfalls of current performance metrics in evaluating LLM inference systems. We then propose Etalon, a comprehensive performance evaluation framework that includes fluidity-index -- a novel metric designed to reflect the intricacies of the LLM inference process and its impact on real-time user experience. Finally, we evaluate various existing open-source platforms and model-as-a-service offerings using Etalon, discussing their strengths and weaknesses. Etalon is available at https://github.com/project-etalon/etalon.
- Published
- 2024
40. Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
- Author
-
Yenamandra, Sriram, Ramachandran, Arun, Khanna, Mukul, Yadav, Karmesh, Vakil, Jay, Melnik, Andrew, Büttner, Michael, Harz, Leon, Brown, Lyon, Nandi, Gora Chand, PS, Arjun, Yadav, Gaurav Kumar, Kala, Rahul, Haschke, Robert, Luo, Yang, Zhu, Jinxin, Han, Yansen, Lu, Bingyi, Gu, Xuan, Liu, Qinyuan, Zhao, Yaping, Ye, Qiting, Dou, Chenxiao, Chua, Yansong, Kuzma, Volodymyr, Humennyy, Vladyslav, Partsey, Ruslan, Francis, Jonathan, Chaplot, Devendra Singh, Chhablani, Gunjan, Clegg, Alexander, Gervet, Theophile, Jain, Vidhi, Ramrakhya, Ram, Szot, Andrew, Wang, Austin, Yang, Tsung-Yen, Edsinger, Aaron, Kemp, Charlie, Shah, Binit, Kira, Zsolt, Batra, Dhruv, Mottaghi, Roozbeh, Bisk, Yonatan, and Paxton, Chris
- Subjects
Computer Science - Robotics ,Computer Science - Computer Vision and Pattern Recognition - Abstract
In order to develop robots that can effectively serve as versatile and capable home assistants, it is crucial for them to reliably perceive and interact with a wide variety of objects across diverse environments. To this end, we proposed Open Vocabulary Mobile Manipulation as a key benchmark task for robotics: finding any object in a novel environment and placing it on any receptacle surface within that environment. We organized a NeurIPS 2023 competition featuring both simulation and real-world components to evaluate solutions to this task. Our baselines on the most challenging version of this task, using real perception in simulation, achieved only an 0.8% success rate; by the end of the competition, the best participants achieved an 10.8\% success rate, a 13x improvement. We observed that the most successful teams employed a variety of methods, yet two common threads emerged among the best solutions: enhancing error detection and recovery, and improving the integration of perception with decision-making processes. In this paper, we detail the results and methodologies used, both in simulation and real-world settings. We discuss the lessons learned and their implications for future research. Additionally, we compare performance in real and simulated environments, emphasizing the necessity for robust generalization to novel settings.
- Published
- 2024
41. CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs
- Author
-
Ramachandran, Akshat, Kundu, Souvik, and Krishna, Tushar
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
We present CLAMP-ViT, a data-free post-training quantization method for vision transformers (ViTs). We identify the limitations of recent techniques, notably their inability to leverage meaningful inter-patch relationships, leading to the generation of simplistic and semantically vague data, impacting quantization accuracy. CLAMP-ViT employs a two-stage approach, cyclically adapting between data generation and model quantization. Specifically, we incorporate a patch-level contrastive learning scheme to generate richer, semantically meaningful data. Furthermore, we leverage contrastive learning in layer-wise evolutionary search for fixed- and mixed-precision quantization to identify optimal quantization parameters while mitigating the effects of a non-smooth loss landscape. Extensive evaluations across various vision tasks demonstrate the superiority of CLAMP-ViT, with performance improvements of up to 3% in top-1 accuracy for classification, 0.6 mAP for object detection, and 1.5 mIoU for segmentation at similar or better compression ratio over existing alternatives. Code is available at https://github.com/georgia-tech-synergy-lab/CLAMP-ViT.git, Comment: ECCV 2024
- Published
- 2024
42. Observation of near-scission 'polar' and 'equatorial' proton emission in heavy-ion induced fission
- Author
-
Singh, Pawan, Gupta, Y. K., Prajapati, G. K., Joshi, B. N., Prajapati, V. G., Sirswal, N., Ramachandran, K., Pradeep, A. S., Dagre, V. S., Kumar, M., Jhingan, A., Deshmukh, N., John, B. V., Nayak, B. K., Biswas, D. C., and Choudhury, R. K.
- Subjects
Nuclear Experiment ,Nuclear Theory - Abstract
Proton and $\alpha$-particle energy spectra were measured in coincidence with fission fragments at different relative angles in $^{16}$O (96 MeV) + $^{232}$Th reaction. The multiplicity spectra were analyzed within the framework of a Moving Source Disentangling Analysis (MSDA) to determine contributions from different emission stages. The MSDA conclusively shows ``Near Scission Emission (NSE)" as an essential component in the multiplicity spectra. In contrast to NSE $\alpha$ particles which emit mainly perpendicular (``equatorial emission"), the NSE protons are observed to be emitted perpendicular as well as parallel (``polar emission") to the fission axis with similar intensities ($\sim$20\% for each). Thus, around 40\% of total pre-scission protons are emitted near the scission stage, whereas the same fraction for $\alpha$ particles is only around 10\%. The inevitable presence of ``polar" and ``equatorial" NSE protons in a heavy-ion induced fission has been observed for the first time. Present results open up a new avenue to study the heavy-ion induced fission dynamics.
- Published
- 2024
43. X-Shooting ULLYSES: Massive stars at low metallicity. IV. Spectral analysis methods and exemplary results for O stars
- Author
-
Sander, A. A. C., Bouret, J. -C., Bernini-Peron, M., Puls, J., Backs, F., Berlanas, S. R., Bestenlehner, J. M., Brands, S. A., Herrero, A., Martins, F., Maryeva, O., Pauli, D., Ramachandran, V., Crowther, P. A., Gómez-González, V. M. A., Gormaz-Matamala, A. C., Hamann, W. -R., Hillier, D. J., Kuiper, R., Larkin, C. J. K., Lefever, R. R., Mehner, A., Najarro, F., Oskinova, L. M., Schösser, E. C., Shenar, T., Todt, H., ud-Doula, A., and Vink, J. S.
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies ,Astrophysics - Instrumentation and Methods for Astrophysics - Abstract
CONTEXT: The spectral analysis of hot, massive stars is a fundamental astrophysical method to obtain their intrinsic properties and their feedback. Quantitative spectroscopy for hot, massive stars requires detailed numerical modeling of the atmosphere and an iterative treatment to obtain the best solution within a given framework. AIMS: We present an overview of different techniques for the quantitative spectroscopy of hot stars employed within the X-Shooting ULLYSES collaboration, from grid-based approaches to tailored fits. By performing a blind test, we gain an overview about the similarities and differences of the resulting parameters. Our study aims to provide an overview of the parameter spread caused by different approaches. METHODS: For three different stars from the sample (SMC O5 star AzV 377, LMC O7 star Sk -69 50, and LMC O9 star Sk -66 171), we employ different atmosphere codes (CMFGEN, Fastwind, PoWR) and strategies to determine their best-fitting model. For our analyses, UV and optical spectra are used to derive the properties with some methods relying purely on optical data for comparison. To determine the overall spectral energy distribution, we further employ additional photometry from the literature. RESULTS: Effective temperatures for each of three sample stars agree within 3 kK while the differences in log g can be up to 0.2 dex. Luminosity differences of up to 0.1 dex result from different reddening assumptions, which seem to be larger for the methods employing a genetic algorithm. All sample stars are nitrogen-enriched. CONCLUSIONS: We find a reasonable agreement between the different methods. Tailored fitting tends to be able to minimize discrepancies obtained with more course or automatized treatments. UV spectral data is essential for the determination of realistic wind parameters. For one target (Sk -69 50), we find clear indications of an evolved status., Comment: 19+15 pages, 21+4 figures, accepted version (A&A 689, A30) including language editing, condensed abstract
- Published
- 2024
- Full Text
- View/download PDF
44. Subtractive Training for Music Stem Insertion using Latent Diffusion Models
- Author
-
Villa-Renteria, Ivan, Wang, Mason L., Shah, Zachary, Li, Zhe, Kim, Soohyun, Ramachandran, Neelesh, and Pilanci, Mert
- Subjects
Computer Science - Sound ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
We present Subtractive Training, a simple and novel method for synthesizing individual musical instrument stems given other instruments as context. This method pairs a dataset of complete music mixes with 1) a variant of the dataset lacking a specific stem, and 2) LLM-generated instructions describing how the missing stem should be reintroduced. We then fine-tune a pretrained text-to-audio diffusion model to generate the missing instrument stem, guided by both the existing stems and the text instruction. Our results demonstrate Subtractive Training's efficacy in creating authentic drum stems that seamlessly blend with the existing tracks. We also show that we can use the text instruction to control the generation of the inserted stem in terms of rhythm, dynamics, and genre, allowing us to modify the style of a single instrument in a full song while keeping the remaining instruments the same. Lastly, we extend this technique to MIDI formats, successfully generating compatible bass, drum, and guitar parts for incomplete arrangements.
- Published
- 2024
45. X-Shooting ULLYSES: Massive Stars at low metallicity VIII. Stellar and wind parameters of newly revealed stripped stars in Be binaries
- Author
-
Ramachandran, V., Sander, A. A. C., Pauli, D., Klencki, J., Backs, F., Tramper, F., Bernini-Peron, M., Crowther, P., Hamann, W. -R., Ignace, R., Kuiper, R., Oey, S., Oskinova, L. M., Shenar, T., Todt, H., Vink, J. S., Wang, L., Wofford, A., and collaboration, the XShootU
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
On the route towards merging neutron stars and stripped-envelope supernovae, binary population synthesis predicts a large number of post-interaction systems with massive stars that have stripped off their outer layers. Yet, observations of such stars in the intermediate-mass regime below the Wolf-Rayet masses are rare. Using X-Shooting ULLYSES (XShootU) data, we discovered three partially stripped star + Be/Oe binaries in the Magellanic Clouds. We analyzed the UV and optical spectra using the PoWR model atmosphere code by superimposing model spectra corresponding to each component. The estimated current masses of the partially stripped stars fall within the intermediate mass range of 4-8 $M_{\odot}$. These objects are overluminous for their stellar masses, matching core He-burning luminosities. Their Be/Oe secondaries have much higher masses than their stripped primaries (mass ratio > 2). All three partially stripped stars show significant nitrogen enrichment and carbon and oxygen depletion on their surfaces. Additionally, one of our sample stars exhibits significant helium enrichment. Our study provides the first comprehensive determination of the wind parameters of partially stripped stars in the intermediate mass range. The wind mass-loss rates of these stars are found to be on the order of $10^{-7} M_\odot$ yr$^{-1}$, which is over ten times higher than that of OB stars of the same luminosity. Current evolutionary models characterizing this phase typically employ OB or WR mass-loss rates, which underestimate or overestimate stripped stars' mass-loss rates by an order of magnitude. Binary evolution models indicate that the observed primaries had initial masses of 12-17 $M_{\odot}$, making them potential candidates for stripped-envelope supernovae that form neutron stars. If they survive the explosion, these systems may become Be X-ray binaries and later double neutron stars., Comment: Accepted for publication in Astronomy & Astrophysics
- Published
- 2024
46. Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation
- Author
-
Collins, Katherine M., Kim, Najoung, Bitton, Yonatan, Rieser, Verena, Omidshafiei, Shayegan, Hu, Yushi, Chen, Sherol, Dutta, Senjuti, Chang, Minsuk, Lee, Kimin, Liang, Youwei, Evans, Georgina, Singla, Sahil, Li, Gang, Weller, Adrian, He, Junfeng, Ramachandran, Deepak, and Dvijotham, Krishnamurthy Dj
- Subjects
Computer Science - Machine Learning ,Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Human feedback plays a critical role in learning and refining reward models for text-to-image generation, but the optimal form the feedback should take for learning an accurate reward function has not been conclusively established. This paper investigates the effectiveness of fine-grained feedback which captures nuanced distinctions in image quality and prompt-alignment, compared to traditional coarse-grained feedback (for example, thumbs up/down or ranking between a set of options). While fine-grained feedback holds promise, particularly for systems catering to diverse societal preferences, we show that demonstrating its superiority to coarse-grained feedback is not automatic. Through experiments on real and synthetic preference data, we surface the complexities of building effective models due to the interplay of model choice, feedback type, and the alignment between human judgment and computational interpretation. We identify key challenges in eliciting and utilizing fine-grained feedback, prompting a reassessment of its assumed benefits and practicality. Our findings -- e.g., that fine-grained feedback can lead to worse models for a fixed budget, in some settings; however, in controlled settings with known attributes, fine grained rewards can indeed be more helpful -- call for careful consideration of feedback attributes and potentially beckon novel modeling approaches to appropriately unlock the potential value of fine-grained feedback in-the-wild.
- Published
- 2024
47. Machine Learning Global Simulation of Nonlocal Gravity Wave Propagation
- Author
-
Gupta, Aman, Sheshadri, Aditi, Roy, Sujit, Gaur, Vishal, Maskey, Manil, and Ramachandran, Rahul
- Subjects
Physics - Atmospheric and Oceanic Physics ,Computer Science - Machine Learning ,Physics - Fluid Dynamics ,Physics - Geophysics - Abstract
Global climate models typically operate at a grid resolution of hundreds of kilometers and fail to resolve atmospheric mesoscale processes, e.g., clouds, precipitation, and gravity waves (GWs). Model representation of these processes and their sources is essential to the global circulation and planetary energy budget, but subgrid scale contributions from these processes are often only approximately represented in models using parameterizations. These parameterizations are subject to approximations and idealizations, which limit their capability and accuracy. The most drastic of these approximations is the "single-column approximation" which completely neglects the horizontal evolution of these processes, resulting in key biases in current climate models. With a focus on atmospheric GWs, we present the first-ever global simulation of atmospheric GW fluxes using machine learning (ML) models trained on the WINDSET dataset to emulate global GW emulation in the atmosphere, as an alternative to traditional single-column parameterizations. Using an Attention U-Net-based architecture trained on globally resolved GW momentum fluxes, we illustrate the importance and effectiveness of global nonlocality, when simulating GWs using data-driven schemes., Comment: 9 pages, 7 figures, no tables
- Published
- 2024
48. Euler factors of equivariant $L$--functions of Drinfeld modules and beyond
- Author
-
Popescu, Cristian D. and Ramachandran, Nandagopal
- Subjects
Mathematics - Number Theory ,11G09, 11M38, 11F80 - Abstract
In \cite{FGHP}, the first author and his collaborators proved an equivariant Tamagawa number formula for the special value at $s=0$ of a Goss--type $L$--function, equivariant with respect to a Galois group $G$, and associated to a Drinfeld module defined on $\Bbb F_q[t]$ and over a finite, integral extension of $\Bbb F_q[t]$. The formula in question was proved provided that the values at $0$ of the Euler factors of the equivariant $L$--function in question satisfy certain identities involving Fitting ideals of certain $G$--cohomologically trivial, finite $\Bbb F_q[t][G]$--modules associated to the Drinfeld module. In \cite{FGHP}, we prove these identities in the particular case of the Carlitz module. In this paper, we develop general techniques and prove the identities in question for arbitrary Drinfeld modules. Further, we indicate how these techniques can be extended to the more general case of higher dimensional abelian $t$--modules, which is relevant in the context of the proof of the equivariant Tamagawa number formula for abelian $t$--modules given by N. Green and the first author in \cite{Green-Popescu}. This paper is based on a lecture given by the first author at ICMAT Madrid in May 2023 and builds upon results obtained by the second author in his PhD thesis \cite{Ramachandran-thesis}.
- Published
- 2024
49. Hydrodynamic simulation of Cygnus OB2: the absence of a cluster wind termination shock
- Author
-
Vieu, Thibault, Larkin, Cormac J. K., Härer, Lucia, Reville, Brian, Sander, Andreas A. C., and Ramachandran, Varsha
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
We perform a large-scale hydrodynamic simulation of a massive star cluster whose stellar population mimics that of the Cygnus OB2 association. The main-sequence stars are first simulated during 1.6 Myr, until a quasi-stationary state is reached. At this time the three Wolf-Rayet stars observed in Cygnus OB2 are added to the simulation, which continues to 2 Myr. Using a high-resolution grid in the centre of the domain, we can resolve the most massive stars individually, which allows us to probe the kinetic structures at small (parsec) scales. We find that, although the cluster excavates a spherical "superbubble" cavity, the stellar population is too loosely distributed to blow a large-scale cluster wind termination shock, and that collective effects from wind-wind interactions are much less efficient than usually assumed. This challenges our understanding of the ultra-high energy emission observed from the region., Comment: 15 pages, 12 figures
- Published
- 2024
50. Direct neutrino-mass measurement based on 259 days of KATRIN data
- Author
-
Aker, M., Batzler, D., Beglarian, A., Behrens, J., Beisenkötter, J., Biassoni, M., Bieringer, B., Biondi, Y., Block, F., Bobien, S., Böttcher, M., Bornschein, B., Bornschein, L., Caldwell, T. S., Carminati, M., Chatrabhuti, A., Chilingaryan, S., Daniel, B. A., Debowski, K., Descher, M., Barrero, D. Díaz, Doe, P. J., Dragoun, O., Drexlin, G., Edzards, F., Eitel, K., Ellinger, E., Engel, R., Enomoto, S., Felden, A., Fengler, C., Fiorini, C., Formaggio, J. A., Forstner, C., Fränkle, F. M., Gauda, K., Gavin, A. S., Gil, W., Glück, F., Grohmann, S., Grössle, R., Gumbsheimer, R., Gutknecht, N., Hannen, V., Hasselmann, L., Haußmann, N., Helbing, K., Henke, H., Heyns, S., Hickford, S., Hiller, R., Hillesheimer, D., Hinz, D., Höhn, T., Huber, A., Jansen, A., Karl, C., Kellerer, J., Khosonthongkee, K., Kleifges, M., Klein, M., Kohpeiß, J., Köhler, C., Köllenberger, L., Kopmann, A., Kovač, N., Kovalík, A., Krause, H., La Cascio, L., Lasserre, T., Lauer, J., Le, T., Lebeda, O., Lehnert, B., Li, G., Lokhov, A., Machatschek, M., Mark, M., Marsteller, A., Martin, E. L., Melzer, C., Mertens, S., Mohanty, S., Mostafa, J., Müller, K., Nava, A., Neumann, H., Niemes, S., Onillon, A., Parno, D. S., Pavan, M., Pinsook, U., Poon, A. W. P., Poyato, J. M. Lopez, Pozzi, S., Priester, F., Ráliš, J., Ramachandran, S., Robertson, R. G. H., Rodenbeck, C., Röllig, M., Röttele, C., Ryšavý, M., Sack, R., Saenz, A., Salomon, R., Schäfer, P., Schlösser, M., Schlösser, K., Schlüter, L., Schneidewind, S., Schnurr, U., Schrank, M., Schürmann, J., Schütz, A., Schwemmer, A., Schwenck, A., Šefčík, M., Siegmann, D., Simon, F., Spanier, F., Spreng, D., Sreethawong, W., Steidl, M., Štorek, J., Stribl, X., Sturm, M., Suwonjandee, N., Jerome, N. Tan, Telle, H. H., Thorne, L. A., Thümmler, T., Tirolf, S., Titov, N., Tkachev, I., Urban, K., Valerius, K., Vénos, D., Weinheimer, C., Welte, S., Wendel, J., Wiesinger, C., Wilkerson, J. F., Wolf, J., Wüstling, S., Wydra, J., Xu, W., Zadorozhny, S., and Zeller, G.
- Subjects
Nuclear Experiment ,High Energy Physics - Experiment - Abstract
The fact that neutrinos carry a non-vanishing rest mass is evidence of physics beyond the Standard Model of elementary particles. Their absolute mass bears important relevance from particle physics to cosmology. In this work, we report on the search for the effective electron antineutrino mass with the KATRIN experiment. KATRIN performs precision spectroscopy of the tritium $\beta$-decay close to the kinematic endpoint. Based on the first five neutrino-mass measurement campaigns, we derive a best-fit value of $m_\nu^{2} = {-0.14^{+0.13}_{-0.15}}~\mathrm{eV^2}$, resulting in an upper limit of $m_\nu < {0.45}~\mathrm{eV}$ at 90 % confidence level. With six times the statistics of previous data sets, amounting to 36 million electrons collected in 259 measurement days, a substantial reduction of the background level and improved systematic uncertainties, this result tightens KATRIN's previous bound by a factor of almost two., Comment: 61 pages, 20 figures, 2 tables
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.