Author: "Ravi, P." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ravi, P."' showing total 57,613 results

Start Over Author "Ravi, P."

57,613 results on '"Ravi, P."'

1. Learning Fair Robustness via Domain Mixup

Author: Zhong, Meiyu and Tandon, Ravi
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Computers and Society
Abstract: Adversarial training is one of the predominant techniques for training classifiers that are robust to adversarial attacks. Recent work, however has found that adversarial training, which makes the overall classifier robust, it does not necessarily provide equal amount of robustness for all classes. In this paper, we propose the use of mixup for the problem of learning fair robust classifiers, which can provide similar robustness across all classes. Specifically, the idea is to mix inputs from the same classes and perform adversarial training on mixed up inputs. We present a theoretical analysis of this idea for the case of linear classifiers and show that mixup combined with adversarial training can provably reduce the class-wise robustness disparity. This method not only contributes to reducing the disparity in class-wise adversarial risk, but also the class-wise natural risk. Complementing our theoretical analysis, we also provide experimental results on both synthetic data and the real world dataset (CIFAR-10), which shows improvement in class wise disparities for both natural and adversarial risks.
Published: 2024

2. On the strong geodeticity in the corona type product of graphs

Author: Sonar, Bishal, Guragain, Satyam, and Srivastava, Ravi
Subjects: Mathematics - Combinatorics, 05C12, 05C38
Abstract: The paper focuses on studying strong geodetic sets and numbers in the context of corona-type products of graphs. Our primary focus is on three variations of the corona products: the generalized corona, generalized edge corona, and generalized neighborhood corona products. A strong geodetic set is a minimal subset of vertices that covers all vertices in the graph through unique geodesics connecting pairs from this subset. We obtain the strong geodetic set and number of the corona-type product graph using the strong 2-geodetic set and strong 2-geodetic number of the initial arbitrary graphs. We analyze how the structural properties of these corona products affect the strong geodetic number, providing new insights into geodetic coverage and the relationships between graph compositions. This work contributes to expanding research on the geodetic parameters of product graphs.
Published: 2024

3. Exploring the Performance of Genetic Algorithm and Variable Neighborhood Search for Solving the Single Depot Multiple Set Orienteering Problem: A Comparative Study

Author: Kant, Ravi, Agarwal, Sarthak, Gupta, Aakash, and Mishra, Abhishek
Subjects: Mathematics - Optimization and Control
Abstract: This article discusses the single Depot multiple Set Orienteering Problem (sDmSOP), a recently suggested generalization of the Set Orienteering Problem (SOP). This problem aims to discover a path for each traveler over a subset of vertices, where each vertex is associated with only one cluster, and the total profit made from the clusters visited is maximized while still fitting within the available budget constraints. The profit can be collected only by visiting at least one cluster vertex. According to the SOP, each vertex cluster must have at least one of its visits counted towards the profit for that cluster. Like to the SOP, the sDmSOP restricts the number of clusters visited based on the budget for tour expenses. To address this problem, we employ the Genetic Algorithm (GA) and Variable Neighborhood Search (VNS) meta-heuristic. The optimal solution for small-sized problems is also suggested by solving the Integer Linear Programming (ILP) formulation using the General Algebraic Modeling System (GAMS) 37.1.0 with CPLEX for the sDmSOP. Promising computational results are presented that demonstrate the practicability of the proposed GA, VNS meta-heuristic, and ILP formulation by demonstrating substantial improvements to the solutions generated by VNS than GA while simultaneously needing much less time to compute than CPLEX., Comment: 14 pages, 3 figures, 2 tables
Published: 2024

4. IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos

Author: Liu, Yunong, Eyzaguirre, Cristobal, Li, Manling, Khanna, Shubh, Niebles, Juan Carlos, Ravi, Vineeth, Mishra, Saumitra, Liu, Weiyu, and Wu, Jiajun
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Robotics
Abstract: Shape assembly is a ubiquitous task in daily life, integral for constructing complex 3D structures like IKEA furniture. While significant progress has been made in developing autonomous agents for shape assembly, existing datasets have not yet tackled the 4D grounding of assembly instructions in videos, essential for a holistic understanding of assembly in 3D space over time. We introduce IKEA Video Manuals, a dataset that features 3D models of furniture parts, instructional manuals, assembly videos from the Internet, and most importantly, annotations of dense spatio-temporal alignments between these data modalities. To demonstrate the utility of IKEA Video Manuals, we present five applications essential for shape assembly: assembly plan generation, part-conditioned segmentation, part-conditioned pose estimation, video object segmentation, and furniture assembly based on instructional video manuals. For each application, we provide evaluation metrics and baseline methods. Through experiments on our annotated data, we highlight many challenges in grounding assembly instructions in videos to improve shape assembly, including handling occlusions, varying viewpoints, and extended assembly sequences., Comment: NeurIPS 2024 Datasets and Benchmarks Track
Published: 2024

5. Generalizing Bell nonlocality without global causal assumptions

Author: Kunjwal, Ravi and Oreshkov, Ognyan
Subjects: Quantum Physics
Abstract: Bell scenarios are multipartite scenarios that exclude any communication between parties. This constraint leads to a strict hierarchy of correlation sets in such scenarios, namely, classical, quantum, and nonsignaling. However, without any constraints on communication between the parties, they can realize arbitrary correlations by exchanging only classical systems. Here we consider a multipartite scenario where the parties can engage in at most a single round of communication, i.e., each party is allowed to receive a system once, implement any local intervention on it, and send out the resulting system once. While no global assumption about causal relations between parties is assumed in this scenario, we do make a causal assumption local to each party, i.e., the input received by it causally precedes the output it sends out. We then introduce antinomicity, a notion of nonclassicality for correlations in such scenarios, and prove the existence of a strict hierarchy of correlation sets classified by their antinomicity. Antinomicity serves as a generalization of Bell nonlocality: when all the parties discard their output systems (i.e., in a nonsignaling scenario), it is mathematically equivalent to Bell nonlocality. Like Bell nonlocality, it can be understood as an instance of fine-tuning, one that is necessary in any classical model of cyclic causation that avoids time-travel antinomies but allows antinomic correlations. Furthermore, antinomicity resolves a long-standing puzzle, i.e., the failure of causal inequality violations as device-independent witnesses of nonclassicality. Antinomicity implies causal inequality violations, but not conversely., Comment: 7 pages, 2 figures, short version of arXiv:2307.02565 with an emphasis on device-independent aspects
Published: 2024

6. Fine-Grained Uncertainty Quantification via Collisions

Author: Friedbaum, Jesse, Adiga, Sudarshan, and Tandon, Ravi
Subjects: Computer Science - Machine Learning, Computer Science - Information Theory, Mathematics - Statistics Theory, Statistics - Machine Learning
Abstract: We propose a new approach for fine-grained uncertainty quantification (UQ) using a collision matrix. For a classification problem involving $K$ classes, the $K\times K$ collision matrix $S$ measures the inherent (aleatoric) difficulty in distinguishing between each pair of classes. In contrast to existing UQ methods, the collision matrix gives a much more detailed picture of the difficulty of classification. We discuss several possible downstream applications of the collision matrix, establish its fundamental mathematical properties, as well as show its relationship with existing UQ methods, including the Bayes error rate. We also address the new problem of estimating the collision matrix using one-hot labeled data. We propose a series of innovative techniques to estimate $S$. First, we learn a contrastive binary classifier which takes two inputs and determines if they belong to the same class. We then show that this contrastive classifier (which is PAC learnable) can be used to reliably estimate the Gramian matrix of $S$, defined as $G=S^TS$. Finally, we show that under very mild assumptions, $G$ can be used to uniquely recover $S$, a new result on stochastic matrices which could be of independent interest. Experimental results are also presented to validate our methods on several datasets.
Published: 2024

7. Depth-resolved measurement of solvation entropy, interfacial transport and charge-transfer kinetics of practical lithium-ion batteries

Author: Chalise, Divya, Lubner, Sean D., Kaur, Sumanjeet, Srinivasan, Venkat, and Prasher, Ravi S
Subjects: Physics - Chemical Physics
Abstract: Understanding the performance of electrochemical energy storage systems requires probing the electrochemical properties at each layer and interface during cell operation. While traditional onboard and operando methods can measure impedance, voltage, or capacity, they lack spatial resolution to pinpoint the properties to specific layers and interfaces. In this work, we describe an approach of using thermal waves to measure entropy change, transport resistance, and charge-transfer resistance with depth resolution of a few microns within an electrochemical cell. We achieve this by relating heat generation at multiple harmonics of an AC current to electrochemical processes and leveraging frequency dependence of thermal penetration depth for spatial resolution. We name this frequency domain spectroscopy of the thermal signatures of the electrochemical processes measured at multiple harmonics of the alternating current as Multi-harmonic ElectroThermal Spectroscopy (METS). This technique enables isolation and measurement of solvation entropy at individual electrode-electrolyte interfaces from the first harmonic (1{\omega}) thermal signature and resolution of the overall interfacial impedance into charge-transfer and interface transport resistance components from the second harmonic (2{\omega}) thermal signature. From this, we also demonstrate an operando measurement of the growth of the solid-electrolyte interphase (SEI) layer at the lithium-electrolyte interface and show that two chemically similar electrodes can have significantly different interfacial transport resistance based on the preparation of the electrodes. Additionally, the method is not specific to lithium-ion chemistry and can therefore be generalized for all electrochemical systems of interest.
Published: 2024

8. Star-Formation in Neutral Hydrogen Gas Reservoirs at Cosmic Noon

Author: Dharmender, Joshi, Ravi, Fumagalli, Michele, Noterdaeme, Pasquier, Chand, Hum, and Ho, Luis C.
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We aim to constrain the average star formation associated with neutral hydrogen gas reservoirs at cosmic noon. Using a unprecedented sample of 1716 high column density Damped Ly-$\alpha$ absorbers (DLAs) from the Sloan Digital Sky Survey with log($N$(HI) / cm$^{-2}$) $\ge$21, we generated the average Ly-$\alpha$ emission spectrum associated to DLAs, free from emission from the background quasar. We measured Ly$\alpha$ emission at $> 5.8\sigma$ level with luminosity $8.95\pm 1.54 \times \rm 10^{40}\ \text{erg}\ \text{s}^{-1}$ (corresponding to about 0.02 L$^{\star}$ at $z \sim$ 2-3) in systems with average log($N$(HI)/ $cm^{-2}$) $\approx$21.2 and at median redshift of $z \sim$ 2.64. The peak of the Ly$\alpha$ emission is apparently redshifted by $\sim$300 km s$^{-1}$ relative to the absorption redshift, which is seemingly due to suppression of blue Ly-$\alpha$ photons by radiative transfer through expanding gas. We infer that DLAs form stars with an average rate of (0.08 $\pm$ 0.01)/$\text{f}_\text{esc}\ \rm \text{M}_{\odot}\ \text{yr}^{-1}$, i.e, $\approx (0.54\pm 0.09)\rm \text{M}_{\odot}\ \text{yr}^{-1}$ for a typical escape fraction, $\text{f}_{\text{esc}} =0.15$, of Lyman-$\alpha$ emitting galaxies. DLA galaxies follows the star formation main sequence of star-forming galaxies at high redshift, suggesting that the DLA population is dominated by the lower mass end of Lyman-$\alpha$ emitting galaxies., Comment: 5 pages, 3 figures, accepted for publication in A&A (Letters)
Published: 2024

9. First spectropolarimetric observation of the neutron star low-mass X-ray binary GX 3+1

Author: Gnarini, Andrea, Farinelli, Ruben, Ursini, Francesco, Bianchi, Stefano, Capitanio, Fiamma, Matt, Giorgio, Ng, Mason, Tarana, Antonella, Bobrikova, Anna, Cocchi, Massimo, Fabiani, Sergio, Kaaret, Philip, Poutanen, Juri, and Ravi, Swati
Subjects: Astrophysics - High Energy Astrophysical Phenomena
Abstract: We report the first simultaneous X-ray spectropolarimetric observation of the bright atoll neutron star low-mass X-ray binary GX 3+1, performed by the Imaging X-ray Polarimetry Explorer (IXPE) joint with NICER and NuSTAR. The source does not exhibit significant polarization in the 2-8 keV energy band, with an upper limit of 1.3% at a 99% confidence level on the polarization degree. The observed spectra can be well described by a combination of thermal disk emission, the hard Comptonization component, and reflected photons off the accretion disk. In particular, from the broad Fe K$\alpha$ line profile, we were able to determine the inclination of the system ($i \approx 36^\circ$), which is crucial for comparing the observed polarization with theoretical models. Both the spectral and polarization properties of GX 3+1 are consistent with those of other atoll sources observed by IXPE. Therefore, we may expect a similar geometrical configuration for the accreting system and the hot Comptonizing region. The low polarization is also consistent with the low inclination of the system., Comment: 7 pages, 6 figures, 4 tables. Accepted for publication in Astronomy & Astrophysics. arXiv admin note: text overlap with arXiv:2408.02309
Published: 2024

10. Aspects of the chiral crossover transition in (2+1)-flavor QCD with M\'{o}bius domain-wall fermions

Author: Gavai, Rajiv V., Jaensch, Mischa E., Kaczmarek, Olaf, Karsch, Frithjof, Sarkar, Mugdha, Shanker, Ravi, Sharma, Sayantan, Sharma, Sipaz, and Ueding, Tristan
Subjects: High Energy Physics - Lattice, High Energy Physics - Phenomenology
Abstract: The non-singlet part of the chiral symmetry in QCD with two light flavors is known to be restored through a crossover transition at a pseudo-critical temperature. However, the temperature dependence of the singlet part of the chiral symmetry and whether it is effectively restored at the same temperature is not well understood. Using (2+1)-flavor QCD configurations generated using the M\"{o}bius domain-wall discretization on an $N_\tau=8$ lattice, we construct suitable observables where the singlet and non-singlet chiral symmetries are disentangled in order to study their temperature dependence across the crossover transition. From the peak of the disconnected part of the chiral susceptibility, we obtain a pseudo-critical temperature $T_{pc}=158.7{}_{{}-2.3}^{{}+2.6}$ MeV where the non-singlet part of the chiral symmetry is effectively restored. From a calculation of the topological susceptibility and its temperature dependence we find that the singlet $U_A(1)$ part of the chiral symmetry is not effectively restored at $T<186$ MeV., Comment: Number of pages: 10; Number of figures: 6
Published: 2024

11. Solid-state batteries enabled by ultra-high-frequency self-heating

Author: Zhang, Buyi, Chalise, Divya, Zeng, Yuqiang, Kaur, Sumanjeet, Dames, Chris, and Prasher, Ravi S.
Subjects: Physics - Applied Physics
Abstract: Solid-state batteries (SSBs) are promising next-generation batteries due to their high energy density and enhanced thermal stability and safety. However, their sluggish kinetics and transport at room temperature results in high internal impedance and critically reduces the attainable discharge energy density. Taking advantage of their strong temperature-dependent ionic conductivity, here we introduce ultra-high frequency ($>10^5$ Hz) self-heating (UHFSH) of SSBs, which can rapidly warm up the batteries from room temperature to operating temperature (~65 {\deg}C) in less than a minute. As proof of concept, UHFSH experiments were conducted on symmetric solid-state cells with lithium aluminum germanium phosphate (LAGP) electrolyte with different configurations. Using an experimentally validated model, pack-level simulations predict fast heating (50 K/min) and minimized heating energy consumption (less than 4%). Without any modification of the materials or structure of the batteries, our non-intrusive self-heating strategy enables the SSBs to discharge more than two-fold energy in 25 {\deg}C ambient.
Published: 2024

12. Classification and Morphological Analysis of DLBCL Subtypes in H\&E-Stained Slides

Author: Gupta, Ravi Kant, Jindal, Mohit, Jain, Garima, Sridhar, Epari, Yadav, Subhash, Jain, Hasmukh, Shet, Tanuja, Sakhdeo, Uma, Sengar, Manju, Nayak, Lingaraj, Bagal, Bhausaheb, Apkare, Umesh, and Sethi, Amit
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We address the challenge of automated classification of diffuse large B-cell lymphoma (DLBCL) into its two primary subtypes: activated B-cell-like (ABC) and germinal center B-cell-like (GCB). Accurate classification between these subtypes is essential for determining the appropriate therapeutic strategy, given their distinct molecular profiles and treatment responses. Our proposed deep learning model demonstrates robust performance, achieving an average area under the curve (AUC) of (87.4 pm 5.7)\% during cross-validation. It shows a high positive predictive value (PPV), highlighting its potential for clinical application, such as triaging for molecular testing. To gain biological insights, we performed an analysis of morphological features of ABC and GCB subtypes. We segmented cell nuclei using a pre-trained deep neural network and compared the statistics of geometric and color features for ABC and GCB. We found that the distributions of these features were not very different for the two subtypes, which suggests that the visual differences between them are more subtle. These results underscore the potential of our method to assist in more precise subtype classification and can contribute to improved treatment management and outcomes for patients of DLBCL.
Published: 2024

13. Efficient Whole Slide Image Classification through Fisher Vector Representation

Author: Gupta, Ravi Kant, Dharani, Dadi, Shanker, Shambhavi, and Sethi, Amit
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The advancement of digital pathology, particularly through computational analysis of whole slide images (WSI), is poised to significantly enhance diagnostic precision and efficiency. However, the large size and complexity of WSIs make it difficult to analyze and classify them using computers. This study introduces a novel method for WSI classification by automating the identification and examination of the most informative patches, thus eliminating the need to process the entire slide. Our method involves two-stages: firstly, it extracts only a few patches from the WSIs based on their pathological significance; and secondly, it employs Fisher vectors (FVs) for representing features extracted from these patches, which is known for its robustness in capturing fine-grained details. This approach not only accentuates key pathological features within the WSI representation but also significantly reduces computational overhead, thus making the process more efficient and scalable. We have rigorously evaluated the proposed method across multiple datasets to benchmark its performance against comprehensive WSI analysis and contemporary weakly-supervised learning methodologies. The empirical results indicate that our focused analysis of select patches, combined with Fisher vector representation, not only aligns with, but at times surpasses, the classification accuracy of standard practices. Moreover, this strategy notably diminishes computational load and resource expenditure, thereby establishing an efficient and precise framework for WSI analysis in the realm of digital pathology.
Published: 2024

14. Complex Bott Periodicity in algebraic geometry

Author: Larson, Hannah and Vakil, Ravi
Subjects: Mathematics - Algebraic Geometry
Abstract: We state and prove a form of Bott periodicity (for $U(n)$) in an algebraic setting (so, $GL(n)$) which makes sense over $\mathbb{Z}$, which also specializes to Bott periodicity in the usual sense (hence giving yet another proof of classical Bott periodicity).
Published: 2024

15. Clustered Patch Embeddings for Permutation-Invariant Classification of Whole Slide Images

Author: Gupta, Ravi Kant, Das, Shounak, and Sethi, Amit
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Whole Slide Imaging (WSI) is a cornerstone of digital pathology, offering detailed insights critical for diagnosis and research. Yet, the gigapixel size of WSIs imposes significant computational challenges, limiting their practical utility. Our novel approach addresses these challenges by leveraging various encoders for intelligent data reduction and employing a different classification model to ensure robust, permutation-invariant representations of WSIs. A key innovation of our method is the ability to distill the complex information of an entire WSI into a single vector, effectively capturing the essential features needed for accurate analysis. This approach significantly enhances the computational efficiency of WSI analysis, enabling more accurate pathological assessments without the need for extensive computational resources. This breakthrough equips us with the capability to effectively address the challenges posed by large image resolutions in whole-slide imaging, paving the way for more scalable and effective utilization of WSIs in medical diagnostics and research, marking a significant advancement in the field., Comment: arXiv admin note: text overlap with arXiv:2411.08530
Published: 2024

16. Atomic-scale mapping of superconductivity in the incoherent CDW mosaic phase of a transition metal dichalcogenide

Author: Sajan, Sandra, Guo, Haojie, Agarwal, Tarushi, Sánchez-Ramírez, Irián, Patra, Chandan, Vergniory, Maia G., de Juan, Fernando, Singh, Ravi Prakash, and Ugeda, Miguel M.
Subjects: Condensed Matter - Superconductivity, Condensed Matter - Mesoscale and Nanoscale Physics, Condensed Matter - Materials Science, Condensed Matter - Strongly Correlated Electrons
Abstract: The emergence of superconductivity in the octahedrally coordinated (1T) phase of TaS2 is preceded by the intriguing loss of long-range order in the charge density wave (CDW). Such decoherence, attainable by different methods, results in the formation of nm-sized coherent CDW domains bound by a two-dimensional network of domain walls (DW) - mosaic phase -, which has been proposed as the spatial origin of the superconductivity. Here, we report the atomic-scale characterization of the superconducting state of 1T-TaSSe, a model 1T compound exhibiting the CDW mosaic phase. We use high-resolution scanning tunneling spectroscopy and Andreev spectroscopy to probe the microscopic nature of the superconducting state in unambiguous connection with the electronic structure of the mosaic phase. Spatially resolved conductance maps at the Fermi level at the onset of superconductivity reveal that the density of states is mostly localized on the CDW domains compared to the domain walls, which suggests their dominant role in the formation of superconductivity. This scenario is confirmed within the superconducting dome at 340 mK, where superconductivity is fully developed, and the subtle spatial inhomogeneity of the superconducting gap remains unlinked to the domain wall network. Our results provide key new insights into the fundamental interplay between superconductivity and CDW in these relevant strongly correlated systems.
Published: 2024

17. Exploring the Role of LLMs for Supporting Older Adults: Opportunities and Concerns

Author: Kaliappan, Sidharth, Anand, Abhay Sheel, Saha, Koustuv, and Karkar, Ravi
Subjects: Computer Science - Human-Computer Interaction
Abstract: We explore some of the existing research in HCI around technology for older adults and examine the role of LLMs in enhancing it. We also discuss the digital divide and emphasize the need for inclusive technology design. At the same time, we also surface concerns regarding privacy, security, and the accuracy of information provided by LLMs, alongside the importance of user-centered design to make technology accessible and effective for the elderly. We show the transformative possibilities of LLM-supported interactions at the intersection of aging, technology, and human-computer interaction, advocating for further research and development in this area., Comment: This short paper was accepted at CHI 2024 Workshop on HCI and Aging: New Directions, New Principles
Published: 2024

18. Effective Virtual Reality Teleoperation of an Upper-body Humanoid with Modified Task Jacobians and Relaxed Barrier Functions for Self-Collision Avoidance

Author: Jorgensen, Steven Jens and Bhadeshiya, Ravi
Subjects: Computer Science - Robotics, Computer Science - Machine Learning
Abstract: We present an approach for retartgeting off-the-shelf Virtual Reality (VR) trackers to effectively teleoperate an upper-body humanoid while ensuring self-collision-free motions. Key to the effectiveness was the proper assignment of trackers to joint sets via modified task Jacobians and relaxed barrier functions for self-collision avoidance. The approach was validated on Apptronik's Astro hardware by demonstrating manipulation capabilities on a table-top environment with pick-and-place box packing and a two-handed box pick up and handover task., Comment: First Prize Winner of Horizons of an extended robotics reality Workshop at International Conference on Intelligent Robots and Systems, 2022
Published: 2024

19. LAuReL: Learned Augmented Residual Layer

Author: Menghani, Gaurav, Kumar, Ravi, and Kumar, Sanjiv
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: One of the core pillars of efficient deep learning methods is architectural improvements such as the residual/skip connection, which has led to significantly better model convergence and quality. Since then the residual connection has become ubiquitous in not just convolutional neural networks but also transformer-based architectures, the backbone of LLMs. In this paper we introduce \emph{Learned Augmented Residual Layer} (LAuReL) -- a novel generalization of the canonical residual connection -- with the goal to be an in-situ replacement of the latter while outperforming on both model quality and footprint metrics. Our experiments show that using \laurel can help boost performance for both vision and language models. For example, on the ResNet-50, ImageNet 1K task, it achieves $60\%$ of the gains from adding an extra layer, while only adding $0.003\%$ more parameters, and matches it while adding $2.6\times$ fewer parameters., Comment: Accepted at the 2nd Efficient Systems for Foundation Models Workshop at the International Conference on Machine Learning (ICML) 2024
Published: 2024

20. Entangled orbital, spin, and ferroelectric orders in $p$-electron magnet CsO$_2$

Author: Ono, Ryota, Kaushik, Ravi, Artyukhin, Sergey, Jansen, Martin, Solovyev, Igor, and Ewings, Russell A.
Subjects: Condensed Matter - Strongly Correlated Electrons
Abstract: Alkali superoxides differ from conventional transition metal magnets, exhibit magnetism from partially occupied oxygen molecular $\pi^*$-orbitals. Among them, CsO$_2$ stands out for its potential to exhibit novel quantum collective phenomena, such as an orbital order induced Tomonaga-Luttinger liquid state. Using ab-initio Hubbard models, superexchange theory, and experimental spin wave measurements, we propose that CsO$_2$ exhibits unconventional magnetoelectric characteristics at low temperature. Our analysis confirms a canted antiferromagnetic ground state and a spin-flop transition, with ferroelectricity is induced by breaking inversion and time-reversal symmetry in the spin-flop phase. Consequently, our analysis reveals a strong interplay not only between exchange interactions but also among magnetically-induced polarization and orbital order. The magnetic structure, stabilized by orbital order, induces magnetically-induced polarization through an antisymmetric mechanism. Overall, our results reveal the coexistence of three highly entangled orders in CsO$_2$, namely, orbital, spin and ferroelectricity., Comment: 13 pages, 9 figures
Published: 2024

21. Gen-AI for User Safety: A Survey

Author: Desai, Akshar Prabhu, Ravi, Tejasvi, Luqman, Mohammad, Kota, Nithya, and Yadav, Pranjul
Subjects: Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security
Abstract: Machine Learning and data mining techniques (i.e. supervised and unsupervised techniques) are used across domains to detect user safety violations. Examples include classifiers used to detect whether an email is spam or a web-page is requesting bank login information. However, existing ML/DM classifiers are limited in their ability to understand natural languages w.r.t the context and nuances. The aforementioned challenges are overcome with the arrival of Gen-AI techniques, along with their inherent ability w.r.t translation between languages, fine-tuning between various tasks and domains. In this manuscript, we provide a comprehensive overview of the various work done while using Gen-AI techniques w.r.t user safety. In particular, we first provide the various domains (e.g. phishing, malware, content moderation, counterfeit, physical safety) across which Gen-AI techniques have been applied. Next, we provide how Gen-AI techniques can be used in conjunction with various data modalities i.e. text, images, videos, audio, executable binaries to detect violations of user-safety. Further, also provide an overview of how Gen-AI techniques can be used in an adversarial setting. We believe that this work represents the first summarization of Gen-AI techniques for user-safety.
Published: 2024

22. Scalable DP-SGD: Shuffling vs. Poisson Subsampling

Author: Chua, Lynn, Ghazi, Badih, Kamath, Pritish, Kumar, Ravi, Manurangsi, Pasin, Sinha, Amer, and Zhang, Chiyuan
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security, Computer Science - Data Structures and Algorithms
Abstract: We provide new lower bounds on the privacy guarantee of the multi-epoch Adaptive Batch Linear Queries (ABLQ) mechanism with shuffled batch sampling, demonstrating substantial gaps when compared to Poisson subsampling; prior analysis was limited to a single epoch. Since the privacy analysis of Differentially Private Stochastic Gradient Descent (DP-SGD) is obtained by analyzing the ABLQ mechanism, this brings into serious question the common practice of implementing shuffling-based DP-SGD, but reporting privacy parameters as if Poisson subsampling was used. To understand the impact of this gap on the utility of trained machine learning models, we introduce a practical approach to implement Poisson subsampling at scale using massively parallel computation, and efficiently train models with the same. We compare the utility of models trained with Poisson-subsampling-based DP-SGD, and the optimistic estimates of utility when using shuffling, via our new lower bounds on the privacy guarantee of ABLQ with shuffling., Comment: To appear at NeurIPS 2024
Published: 2024

23. Industrial Machines Health Prognosis using a Transformer-based Framework

Author: Poland, David J, Puglisi, Lemuel, and Ravi, Daniele
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Machine Learning
Abstract: This article introduces Transformer Quantile Regression Neural Networks (TQRNNs), a novel data-driven solution for real-time machine failure prediction in manufacturing contexts. Our objective is to develop an advanced predictive maintenance model capable of accurately identifying machine system breakdowns. To do so, TQRNNs employ a two-step approach: (i) a modified quantile regression neural network to segment anomaly outliers while maintaining low time complexity, and (ii) a concatenated transformer network aimed at facilitating accurate classification even within a large timeframe of up to one hour. We have implemented our proposed pipeline in a real-world beverage manufacturing industry setting. Our findings demonstrate the model's effectiveness, achieving an accuracy rate of 70.84% with a 1-hour lead time for predicting machine breakdowns. Additionally, our analysis shows that using TQRNNs can increase high-quality production, improving product yield from 78.38% to 89.62%. We believe that predictive maintenance assumes a pivotal role in modern manufacturing, minimizing unplanned downtime, reducing repair costs, optimizing production efficiency, and ensuring operational stability. Its potential to generate substantial cost savings while enhancing sustainability and competitiveness underscores its importance in contemporary manufacturing practices., Comment: 10 pages, 5 figures. Accepted for presentation at the IEEE MetroAXRAINE conference
Published: 2024

24. Asymmetries and Circumstellar Interaction in the Type II SN 2024bch

Author: Andrews, Jennifer E., Shrestha, Manisha, Bostroem, K. Azalee, Dong, Yize, Pearson, Jeniveve, Fausnaugh, M. M., Sand, David J., Valenti, S., Ravi, Aravind P., Hoang, Emily, Hosseinzadeh, Griffin, Ilyin, Ilya, Janzen, Daryl, Lundquist, M. J., Meza, Nicolaz, Smith, Nathan, Jha, Saurabh W., Andrews, Moira, Farah, Joseph, Gonzalez, Estefania Padilla, Howell, D. Andrew, McCully, Curtis, Newsome, Megan, Pellegrino, Craig, Terreran, Giacomo, Wiggins, Patrick, Hsu, Brian, Christy, Collin T., Wang, Xiofeng, Liu, Jialian, and Chen, Liyang
Subjects: Astrophysics - High Energy Astrophysical Phenomena, Astrophysics - Solar and Stellar Astrophysics
Abstract: We present a comprehensive multi-epoch photometric and spectroscopic study of SN 2024bch, a nearby (19.9 Mpc) Type II supernova (SN) with prominent early high ionization emission lines. Optical spectra from 2.9 days after the estimated explosion reveal narrow lines of H I, He II, C IV, and N IV that disappear by day 6. High cadence photometry from the ground and TESS show that the SN brightened quickly and reached a peak M$_V \sim$ $-$17.8 mag within a week of explosion, and late-time photometry suggests a $^{56}$Ni mass of 0.050 M$_{\odot}$. High-resolution spectra from day 8 and 43 trace the unshocked circumstellar medium (CSM) and indicate a wind velocity of 30--40 km s$^{-1}$, a value consistent with a red supergiant (RSG) progenitor. Comparisons between models and the early spectra suggest a pre-SN mass-loss rate of $\dot{M} \sim 10^{-3}-10^{-2}\ M_\odot\ \mathrm{yr}^{-1}$, which is too high to be explained by quiescent mass loss from RSGs, but is consistent with some recent measurements of similar SNe. Persistent blueshifted H I and [O I] emission lines seen in the optical and NIR spectra could be produced by asymmetries in the SN ejecta, while the multi-component H$\alpha$ may indicate continued interaction with an asymmetric CSM well into the nebular phase. SN 2024bch provides another clue to the complex environments and mass-loss histories around massive stars., Comment: Submitted to ApJ
Published: 2024

25. Luminous Type II Short-Plateau SN 2023ufx: Asymmetric Explosion of a Partially-Stripped Massive Progenitor

Author: Ravi, Aravind P., Valenti, Stefano, Dong, Yize, Hiramatsu, Daichi, Barmentloo, Stan, Jerkstrand, Anders, Bostroem, K. Azalee, Pearson, Jeniveve, Shrestha, Manisha, Andrews, Jennifer E., Sand, David J., Hosseinzadeh, Griffin, Lundquist, Michael, Hoang, Emily, Mehta, Darshana, Retamal, Nicolas Meza, Martas, Aidan, Jha, Saurabh W., Janzen, Daryl, Subrayan, Bhagya, Howell, D. Andrew, McCully, Curtis, Farah, Joseph, Newsome, Megan, Gonzalez, Estefania Padilla, Terreran, Giacomo, Andrews, Moira, Filippenko, Alexei V., Brink, Thomas G., Zheng, Weikang, Yang, Yi, Vinko, Jozsef, Wheeler, J. Craig, Smith, Nathan, Rho, Jeonghee, Konyves-Toth, Reka, and Gutierrez, Claudia P.
Subjects: Astrophysics - High Energy Astrophysical Phenomena, Astrophysics - Solar and Stellar Astrophysics
Abstract: We present supernova (SN) 2023ufx, a unique Type IIP SN with the shortest known plateau duration ($t_\mathrm{PT}$ $\sim$47 days), a luminous V-band peak ($M_{V}$ = $-$18.42 $\pm$ 0.08 mag), and a rapid early decline rate ($s1$ = 3.47 $\pm$ 0.09 mag (50 days)$^{-1}$). By comparing observed photometry to a hydrodynamic MESA+STELLA model grid, we constrain the progenitor to be a massive red supergiant with M$_\mathrm{ZAMS}$ $\simeq$19 - 25 M$_{\odot}$. Independent comparisons with nebular spectral models also suggest an initial He-core mass of $\sim$6 M$_{\odot}$, and thus a massive progenitor. For a Type IIP, SN 2023ufx produced an unusually high amount of nickel ($^{56}$Ni) $\sim$0.14 $\pm$ 0.02 M$_{\odot}$, during the explosion. We find that the short plateau duration in SN 2023ufx can be explained with the presence of a small hydrogen envelope (M$_\mathrm{H_\mathrm{env}}$ $\simeq$1.2 M$_{\odot}$), suggesting partial stripping of the progenitor. About $\simeq$0.09 M$_{\odot}$ of CSM through mass loss from late-time stellar evolution of the progenitor is needed to fit the early time ($\lesssim$10 days) pseudo-bolometric light curve. Nebular line diagnostics of broad and multi-peak components of [O I] $\lambda\lambda$6300, 6364, H$\alpha$, and [Ca II] $\lambda \lambda$7291, 7323 suggest that the explosion of SN 2023ufx could be inherently asymmetric, preferentially ejecting material along our line-of-sight., Comment: Submitted to ApJ, 30 pages, 19 figures
Published: 2024

26. A High-Resolution, US-scale Digital Similar of Interacting Livestock, Wild Birds, and Human Ecosystems with Applications to Multi-host Epidemic Spread

Author: Adiga, Abhijin, Chopra, Ayush, Wilson, Mandy L., Ravi, S. S., Xie, Dawen, Swarup, Samarth, Lewis, Bryan, Raskar, Ramesh, and Marathe, Madhav V.
Subjects: Computer Science - Computational Engineering, Finance, and Science
Abstract: One Health issues, such as the spread of highly pathogenic avian influenza (HPAI), present significant challenges at the intersection of human, animal, and environmental health. Recent H5N1 outbreaks underscore the need for comprehensive modeling that capture the complex interactions between various entities in these interconnected ecosystems, encompassing livestock, wild birds, and human populations. To support such efforts, we present a synthetic spatiotemporal gridded dataset for the contiguous United States, referred to as a digital similar. The methodology for constructing this digital similar involves fusing diverse datasets using statistical and optimization techniques. The livestock component includes farm-level representations of multiple livestock types -- cattle, poultry, hogs, and sheep -- including further categorization into subtypes, such as milk and beef cows, chicken, turkeys, ducks, etc. It also includes location-level data for livestock-product processing centers. Weekly abundance data for key wild bird species involved in avian flu transmission are included along with temporal networks of movements. Gridded distributions of the human population, along with demographic and occupational features, capture the placement of agricultural workers and the general population. The digital similar is verified and validated in multiple ways.This dataset aims to provide a comprehensive basis for modeling complex phenomena at the wild-domestic-human interfaces.
Published: 2024

27. The Implicit Bias of Gradient Descent on Separable Multiclass Data

Author: Ravi, Hrithik, Scott, Clayton, Soudry, Daniel, and Wang, Yutong
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Implicit bias describes the phenomenon where optimization-based training algorithms, without explicit regularization, show a preference for simple estimators even when more complex estimators have equal objective values. Multiple works have developed the theory of implicit bias for binary classification under the assumption that the loss satisfies an exponential tail property. However, there is a noticeable gap in analysis for multiclass classification, with only a handful of results which themselves are restricted to the cross-entropy loss. In this work, we employ the framework of Permutation Equivariant and Relative Margin-based (PERM) losses [Wang and Scott, 2024] to introduce a multiclass extension of the exponential tail property. This class of losses includes not only cross-entropy but also other losses. Using this framework, we extend the implicit bias result of Soudry et al. [2018] to multiclass classification. Furthermore, our proof techniques closely mirror those of the binary case, thus illustrating the power of the PERM framework for bridging the binary-multiclass gap., Comment: Accepted to NeurIPS 2024
Published: 2024

28. The Interaction Layer: An Exploration for Co-Designing User-LLM Interactions in Parental Wellbeing Support Systems

Author: Viswanathan, Sruthi, Ibrahim, Seray, Shankar, Ravi, Binns, Reuben, Van Kleek, Max, and Slovak, Petr
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence
Abstract: Parenting brings emotional and physical challenges, from balancing work, childcare, and finances to coping with exhaustion and limited personal time. Yet, one in three parents never seek support. AI systems potentially offer stigma-free, accessible, and affordable solutions. Yet, user adoption often fails due to issues with explainability and reliability. To see if these issues could be solved using a co-design approach, we developed and tested NurtureBot, a wellbeing support assistant for new parents. 32 parents co-designed the system through Asynchronous Remote Communities method, identifying the key challenge as achieving a "successful chat". Aspart of co-design, parents role-played as NurturBot, rewriting its dialogues to improve user understanding, control, and outcomes. The refined prototype evaluated by 32 initial and 46 new parents, showed improved user experience and usability, with final CUQ score of 91.3/100, demonstrating successful interaction patterns. Our process revealed useful interaction design lessons for effective AI parenting support.
Published: 2024

29. X-ray and Radio Campaign of the Z-source GX 340+0 II: the X-ray polarization in the normal branch

Author: Bhargava, Yash, Russell, Thomas D., Ng, Mason, Balasubramanian, Arvind, Zhang, Liang, Ravi, Swati, Jadoliya, Vishal, Bhattacharyya, Sudip, Pahari, Mayukh, Homan, Jeroen, Marshall, Herman L., Chakrabarty, Deepto, Carotenuto, Francesco, and Kaushik, Aman
Subjects: Astrophysics - High Energy Astrophysical Phenomena
Abstract: We present the first X-ray polarization measurement of the neutron star low-mass X-ray binary and Z-source, GX 340$+$0, in the normal branch (NB) using a 200 ks observation with the Imaging X-ray Polarimetric Explorer (IXPE). This observation was performed in 2024 August. Along with IXPE, we also conducted simultaneous observations with NICER, AstroSat, Insight-HXMT, ATCA, and GMRT to investigate the broadband spectral and timing properties in the X-ray and radio wavelengths. During the campaign, the source traced a complete Z-track during the IXPE observation but spent most of the time in the NB. We measure X-ray polarization degree (PD) of $1.22\pm0.25\%$ in the 2-8 keV energy band with a polarization angle (PA) of $38\pm6^\circ$. The PD in the NB is observed to be weaker than in the horizontal branch (HB) but aligned in the same direction. The PD of the source exhibits a marginal increase with energy while the PA shows no energy dependence. The joint spectro-polarimetric modeling is consistent with the observed X-ray polarization originating from a single spectral component from the blackbody, the Comptonized emission, or reflection feature, while the disk emission does not contribute towards the X-ray polarization. GMRT observations at 1.26 GHz during HB had a tentative detection at 4.5$\pm$0.7 mJy while ATCA observations a day later during the NB detected the source at 0.70$\pm$0.05 mJy and 0.59$\pm$0.05 mJy in the 5.5 & 9 GHz bands, respectively, suggesting an evolving jet structure depending on the Z-track position., Comment: 17 pages, 5 figures, 4 tables; Submitted to ApJ
Published: 2024

30. MBExplainer: Multilevel bandit-based explanations for downstream models with augmented graph embeddings

Author: Golgoon, Ashkan, Franks, Ryan, Filom, Khashayar, and Kannan, Arjun Ravi
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computational Engineering, Finance, and Science, Mathematics - Numerical Analysis, Statistics - Machine Learning, 68T01, I.2
Abstract: In many industrial applications, it is common that the graph embeddings generated from training GNNs are used in an ensemble model where the embeddings are combined with other tabular features (e.g., original node or edge features) in a downstream ML task. The tabular features may even arise naturally if, e.g., one tries to build a graph such that some of the node or edge features are stored in a tabular format. Here we address the problem of explaining the output of such ensemble models for which the input features consist of learned neural graph embeddings combined with additional tabular features. We propose MBExplainer, a model-agnostic explanation approach for downstream models with augmented graph embeddings. MBExplainer returns a human-legible triple as an explanation for an instance prediction of the whole pipeline consisting of three components: a subgraph with the highest importance, the topmost important nodal features, and the topmost important augmented downstream features. A game-theoretic formulation is used to take the contributions of each component and their interactions into account by assigning three Shapley values corresponding to their own specific games. Finding the explanation requires an efficient search through the corresponding local search spaces corresponding to each component. MBExplainer applies a novel multilevel search algorithm that enables simultaneous pruning of local search spaces in a computationally tractable way. In particular, three interweaved Monte Carlo Tree Search are utilized to iteratively prune the local search spaces. MBExplainer also includes a global search algorithm that uses contextual bandits to efficiently allocate pruning budget among the local search spaces. We show the effectiveness of MBExplainer by presenting a set of comprehensive numerical examples on multiple public graph datasets for both node and graph classification tasks.
Published: 2024

31. EXACFS -- A CIL Method to mitigate Catastrophic Forgetting

Author: Balasubramanian, S, Subramaniam, M Sai, Talasu, Sai Sriram, Krishna, P Yedu, Sai, Manepalli Pranav Phanindra, Mukkamala, Ravi, and Gera, Darshan
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Deep neural networks (DNNS) excel at learning from static datasets but struggle with continual learning, where data arrives sequentially. Catastrophic forgetting, the phenomenon of forgetting previously learned knowledge, is a primary challenge. This paper introduces EXponentially Averaged Class-wise Feature Significance (EXACFS) to mitigate this issue in the class incremental learning (CIL) setting. By estimating the significance of model features for each learned class using loss gradients, gradually aging the significance through the incremental tasks and preserving the significant features through a distillation loss, EXACFS effectively balances remembering old knowledge (stability) and learning new knowledge (plasticity). Extensive experiments on CIFAR-100 and ImageNet-100 demonstrate EXACFS's superior performance in preserving stability while acquiring plasticity.
Published: 2024

32. On Memorization of Large Language Models in Logical Reasoning

Author: Xie, Chulin, Huang, Yangsibo, Zhang, Chiyuan, Yu, Da, Chen, Xinyun, Lin, Bill Yuchen, Li, Bo, Ghazi, Badih, and Kumar, Ravi
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) achieve good performance on challenging reasoning benchmarks, yet could also make basic reasoning mistakes. This contrasting behavior is puzzling when it comes to understanding the mechanisms behind LLMs' reasoning capabilities. One hypothesis is that the increasingly high and nearly saturated performance on common reasoning benchmarks could be due to the memorization of similar problems. In this paper, we systematically investigate this hypothesis with a quantitative measurement of memorization in reasoning tasks, using a dynamically generated logical reasoning benchmark based on Knights and Knaves (K&K) puzzles. We found that LLMs could interpolate the training puzzles (achieving near-perfect accuracy) after fine-tuning, yet fail when those puzzles are slightly perturbed, suggesting that the models heavily rely on memorization to solve those training puzzles. On the other hand, we show that while fine-tuning leads to heavy memorization, it also consistently improves generalization performance. In-depth analyses with perturbation tests, cross difficulty-level transferability, probing model internals, and fine-tuning with wrong answers suggest that the LLMs learn to reason on K&K puzzles despite training data memorization. This phenomenon indicates that LLMs exhibit a complex interplay between memorization and genuine reasoning abilities. Finally, our analysis with per-sample memorization score sheds light on how LLMs switch between reasoning and memorization in solving logical puzzles. Our code and data are available at https://memkklogic.github.io.
Published: 2024

33. S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving

Author: Wozniak, Maciej K., Govindarajan, Hariprasath, Klingner, Marvin, Maurice, Camille, Kiran, B Ravi, and Yogamani, Senthil
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Robotics
Abstract: Recent self-supervised clustering-based pre-training techniques like DINO and Cribo have shown impressive results for downstream detection and segmentation tasks. However, real-world applications such as autonomous driving face challenges with imbalanced object class and size distributions and complex scene geometries. In this paper, we propose S3PT a novel scene semantics and structure guided clustering to provide more scene-consistent objectives for self-supervised training. Specifically, our contributions are threefold: First, we incorporate semantic distribution consistent clustering to encourage better representation of rare classes such as motorcycles or animals. Second, we introduce object diversity consistent spatial clustering, to handle imbalanced and diverse object sizes, ranging from large background areas to small objects such as pedestrians and traffic signs. Third, we propose a depth-guided spatial clustering to regularize learning based on geometric information of the scene, thus further refining region separation on the feature level. Our learned representations significantly improve performance in downstream semantic segmentation and 3D object detection tasks on the nuScenes, nuImages, and Cityscapes datasets and show promising domain translation properties., Comment: Accepted for WACV 2025
Published: 2024

34. Tschirnhausen bundles of covers of the projective line

Author: Vakil, Ravi and Vemulapalli, Sameera
Subjects: Mathematics - Algebraic Geometry, Mathematics - Number Theory, Primary 14H60, Secondary 14H51, 14H30
Abstract: A degree $d$ genus $g$ cover of the complex projective line by a smooth curve $C$ yields a vector bundle on the projective line by pushforward of the structure sheaf. Which bundles are possible? Equivalently, which $\mathbb{P}^{d-2}$-bundles over $\mathbb{P}^1$ contain such covers? (In the language of many previous papers: what are the scrollar invariants of the cover?) We give a complete answer in degree $4$, which exhibits the expected pathologies. We describe a polytope (one per degree) which we propose gives the complete answer for primitive covers, i.e. covers that don't factor through a subcover. We show that all such bundles (for primitive covers) lie in this polytope, and that a ``positive proportion'' of the polytope arises from smooth covers. Moreover, we show the necessity of the primitivity assumption. Finally, we show that the map from the Hurwitz space of smooth covers to the space of bundles is not flat (for $d>3$ and $g \gg_d 0$)., Comment: Comments welcome!
Published: 2024

35. Model-agnostic basis functions for the 2-point correlation function of dark matter in linear theory

Author: Paranjape, Aseem and Sheth, Ravi K.
Subjects: Astrophysics - Cosmology and Nongalactic Astrophysics, Computer Science - Machine Learning
Abstract: We consider approximating the linearly evolved 2-point correlation function (2pcf) of dark matter $\xi_{\rm lin}(r;\boldsymbol{\theta})$ in a cosmological model with parameters $\boldsymbol{\theta}$ as the linear combination $\xi_{\rm lin}(r;\boldsymbol{\theta})\approx\sum_i\,b_i(r)\,w_i(\boldsymbol{\theta})$, where the functions $\mathcal{B}=\{b_i(r)\}$ form a $\textit{model-agnostic basis}$ for the linear 2pcf. This decomposition is important for model-agnostic analyses of the baryon acoustic oscillation (BAO) feature in the nonlinear 2pcf of galaxies that fix $\mathcal{B}$ and leave the coefficients $\{w_i\}$ free. To date, such analyses have made simple but sub-optimal choices for $\mathcal{B}$, such as monomials. We develop a machine learning framework for systematically discovering a $\textit{minimal}$ basis $\mathcal{B}$ that describes $\xi_{\rm lin}(r)$ near the BAO feature in a wide class of cosmological models. We use a custom architecture, denoted $\texttt{BiSequential}$, for a neural network (NN) that explicitly realizes the separation between $r$ and $\boldsymbol{\theta}$ above. The optimal NN trained on data in which only $\{\Omega_{\rm m},h\}$ are varied in a $\textit{flat}$ $\Lambda$CDM model produces a basis $\mathcal{B}$ comprising $9$ functions capable of describing $\xi_{\rm lin}(r)$ to $\sim0.6\%$ accuracy in $\textit{curved}$ $w$CDM models varying 7 parameters within $\sim5\%$ of their fiducial, flat $\Lambda$CDM values. Scales such as the peak, linear point and zero-crossing of $\xi_{\rm lin}(r)$ are also recovered with very high accuracy. We compare our approach to other compression schemes in the literature, and speculate that $\mathcal{B}$ may also encompass $\xi_{\rm lin}(r)$ in modified gravity models near our fiducial $\Lambda$CDM model. Using our basis functions in model-agnostic BAO analyses can potentially lead to significant statistical gains., Comment: 20 pages, 9 figures, to be submitted to JCAP. The implementation of the BiSequential architecture, along with a simple example notebook, is publicly available as part of the MLFundas repository at https://github.com/a-paranjape/mlfundas
Published: 2024

36. Putting Off the Catching Up: Online Joint Replenishment Problem with Holding and Backlog Costs

Author: Moseley, Benjamin, Niaparast, Aidin, and Ravi, R.
Subjects: Computer Science - Data Structures and Algorithms
Abstract: We study an online generalization of the classic Joint Replenishment Problem (JRP) that models the trade-off between ordering costs, holding costs, and backlog costs in supply chain planning systems. A retailer places orders to a supplier for multiple items over time: each request is for some item that the retailer needs in the future, and has an arrival time and a soft deadline. If a request is served before its deadline, the retailer pays a holding cost per unit of the item until the deadline. However, if a request is served after its deadline, the retailer pays a backlog cost per unit. Each service incurs a fixed joint service cost and a fixed item-dependent cost for every item included in a service. These fixed costs are the same irrespective of the units of each item ordered. The goal is to schedule services to satisfy all the online requests while minimizing the sum of the service costs, the holding costs, and the backlog costs. Constant competitive online algorithms have been developed for two special cases: the make-to-order version when the deadlines are equal to arrival times (Buchbinder et al., 2013), and the make-to-stock version with hard deadlines with zero holding costs (Bienkowski et al., 2014). Our general model with holding and backlog costs has not been investigated earlier, and no online algorithms are known even in the make-to-stock version with hard deadlines and non-zero holding costs. We develop a new online algorithm for the general version of online JRP with both holding and backlog costs and establish that it is 30-competitive. Along the way, we develop a 3-competitive algorithm for the single-item case that we build on to get our final result. Our algorithm uses a greedy strategy and its competitiveness is shown using a dual fitting analysis.
Published: 2024

37. SERN: Simulation-Enhanced Realistic Navigation for Multi-Agent Robotic Systems in Contested Environments

Author: Hossain, Jumman, Dey, Emon, Chugh, Snehalraj, Ahmed, Masud, Anwar, MS, Faridee, Abu-Zaher, Hoppes, Jason, Trout, Theron, Basak, Anjon, Chowdhury, Rafidh, Mistry, Rishabh, Kim, Hyun, Freeman, Jade, Suri, Niranjan, Raglin, Adrienne, Busart, Carl, Gregory, Timothy, Ravi, Anuradha, and Roy, Nirmalya
Subjects: Computer Science - Robotics, Computer Science - Multiagent Systems
Abstract: The increasing deployment of autonomous systems in complex environments necessitates efficient communication and task completion among multiple agents. This paper presents SERN (Simulation-Enhanced Realistic Navigation), a novel framework integrating virtual and physical environments for real-time collaborative decision-making in multi-robot systems. SERN addresses key challenges in asset deployment and coordination through a bi-directional communication framework using the AuroraXR ROS Bridge. Our approach advances the SOTA through accurate real-world representation in virtual environments using Unity high-fidelity simulator; synchronization of physical and virtual robot movements; efficient ROS data distribution between remote locations; and integration of SOTA semantic segmentation for enhanced environmental perception. Our evaluations show a 15% to 24% improvement in latency and up to a 15% increase in processing efficiency compared to traditional ROS setups. Real-world and virtual simulation experiments with multiple robots demonstrate synchronization accuracy, achieving less than 5 cm positional error and under 2-degree rotational error. These results highlight SERN's potential to enhance situational awareness and multi-agent coordination in diverse, contested environments., Comment: Under Review for ICRA 2025
Published: 2024

38. Efficient Feature Extraction Using Light-Weight CNN Attention-Based Deep Learning Architectures for Ultrasound Fetal Plane Classification

Author: Sivasubramanian, Arrun, Sasidharan, Divya, V, Sowmya, and Ravi, Vinayakumar
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Ultrasound fetal imaging is beneficial to support prenatal development because it is affordable and non-intrusive. Nevertheless, fetal plane classification (FPC) remains challenging and time-consuming for obstetricians since it depends on nuanced clinical aspects, which increases the difficulty in identifying relevant features of the fetal anatomy. Thus, to assist with its accurate feature extraction, a lightweight artificial intelligence architecture leveraging convolutional neural networks and attention mechanisms is proposed to classify the largest benchmark ultrasound dataset. The approach fine-tunes from lightweight EfficientNet feature extraction backbones pre-trained on the ImageNet1k. to classify key fetal planes such as the brain, femur, thorax, cervix, and abdomen. Our methodology incorporates the attention mechanism to refine features and 3-layer perceptrons for classification, achieving superior performance with the highest Top-1 accuracy of 96.25%, Top-2 accuracy of 99.80% and F1-Score of 0.9576. Importantly, the model has 40x fewer trainable parameters than existing benchmark ensemble or transformer pipelines, facilitating easy deployment on edge devices to help clinical practitioners with real-time FPC. The findings are also interpreted using GradCAM to carry out clinical correlation to aid doctors with diagnostics and improve treatment plans for expectant mothers., Comment: Submitted to Computers in Biology and Medicine journal
Published: 2024

39. 12-spin-qubit arrays fabricated on a 300 mm semiconductor manufacturing line

Author: George, Hubert C., Mądzik, Mateusz T., Henry, Eric M., Wagner, Andrew J., Islam, Mohammad M., Borjans, Felix, Connors, Elliot J., Corrigan, Joelle, Curry, Matthew, Harper, Michael K., Keith, Daniel, Lampert, Lester, Luthi, Florian, Mohiyaddin, Fahd A., Murcia, Sandra, Nair, Rohit, Nahm, Rambert, Nethwewala, Aditi, Neyens, Samuel, Raharjo, Roy D., Rogan, Carly, Savytskyy, Rostyslav, Watson, Thomas F., Ziegler, Josh, Zietz, Otto K., Pillarisetty, Ravi, Bishop, Nathaniel C., Bojarski, Stephanie A., Roberts, Jeanette, and Clarke, James S.
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics, Quantum Physics
Abstract: Intels efforts to build a practical quantum computer are focused on developing a scalable spin-qubit platform leveraging industrial high-volume semiconductor manufacturing expertise and 300 mm fabrication infrastructure. Here, we provide an overview of the design, fabrication, and demonstration of a new customized quantum test chip, which contains 12-quantum-dot spin-qubit linear arrays, code named Tunnel Falls. These devices are fabricated using immersion and extreme ultraviolet lithography (EUV), along with other standard high-volume manufacturing (HVM) processes, as well as production-level process control. We present key device features and fabrication details, as well as qubit characterization results confirming device functionality. These results corroborate our fabrication methods and are a crucial step towards scaling of extensible 2D qubit array schemes.
Published: 2024

40. Opportunities and Challenges of Generative-AI in Finance

Author: Desai, Akshar Prabhu, Mallya, Ganesh Satish, Luqman, Mohammad, Ravi, Tejasvi, Kota, Nithya, and Yadav, Pranjul
Subjects: Computer Science - Artificial Intelligence
Abstract: Gen-AI techniques are able to improve understanding of context and nuances in language modeling, translation between languages, handle large volumes of data, provide fast, low-latency responses and can be fine-tuned for various tasks and domains. In this manuscript, we present a comprehensive overview of the applications of Gen-AI techniques in the finance domain. In particular, we present the opportunities and challenges associated with the usage of Gen-AI techniques. We also illustrate the various methodologies which can be used to train Gen-AI techniques and present the various application areas of Gen-AI technologies in the finance ecosystem. To the best of our knowledge, this work represents the most comprehensive summarization of Gen-AI techniques within the financial domain. The analysis is designed for a deep overview of areas marked for substantial advancement while simultaneously pin-point those warranting future prioritization. We also hope that this work would serve as a conduit between finance and other domains, thus fostering the cross-pollination of innovative concepts and practices.
Published: 2024

41. Surface acoustic waves Brillouin photonics on a silicon nitride chip

Author: Klaver, Yvan, Morsche, Randy te, Botter, Roel A., Hashemi, Batoul, Frare, Bruno L. Segat, Mishra, Akhileshwar, Ye, Kaixuan, Mbonde, Hamidu, Ahmadi, Pooya Torab, Taleghani, Niloofar Majidian, Jonker, Evan, Braamhaar, Redlef B. G., Selvaganapathy, Ponnambalam Ravi, Mascher, Peter, van der Slot, Peter J. M., Bradley, Jonathan D. B., and Marpaung, David
Subjects: Physics - Optics, Physics - Applied Physics
Abstract: Seamlessly integrating stimulated Brillouin scattering (SBS) in a low-loss and mature photonic integration platform remains a complicated task. Virtually all current approaches fall short in simultaneously achieving strong SBS, low losses, and technological scalability. In this work we incorporate stong SBS into a standard silicon nitride platform by a simple deposition of a tellurium oxide layer, a commonly used material for acousto-optic modulators. In these heterogeneously integrated waveguides, we harness novel SBS interactions actuated by surface acoustic waves (SAWs) leading to more than two orders of magnitude gain enhancement. Three novel applications are demonstrated in this platform: (i) a silicon nitride Brillouin amplifier with 5 dB net optical gain, (ii) a compact intermodal stimulated Brillouin laser (SBL) capable of high purity radio frequency (RF) signal generation with 7 Hz intrinsic linewidth, and (iii) a widely tunable microwave photonic notch filter with ultra-narrow linewidth of 2.2 MHz enabled by Brillouin induced opacity. These advancements can unlock an array of new RF and optical technologies to be directly integrated in silicon nitride.
Published: 2024

42. Double Distributionally Robust Bid Shading for First Price Auctions

Author: Qu, Yanlin, Kant, Ravi, Chen, Yan, Kitts, Brendan, Gultekin, San, Flores, Aaron, and Blanchet, Jose
Subjects: Computer Science - Computer Science and Game Theory, Mathematics - Optimization and Control
Abstract: Bid shading has become a standard practice in the digital advertising industry, in which most auctions for advertising (ad) opportunities are now of first price type. Given an ad opportunity, performing bid shading requires estimating not only the value of the opportunity but also the distribution of the highest bid from competitors (i.e. the competitive landscape). Since these two estimates tend to be very noisy in practice, first-price auction participants need a bid shading policy that is robust against relatively significant estimation errors. In this work, we provide a max-min formulation in which we maximize the surplus against an adversary that chooses a distribution both for the value and the competitive landscape, each from a Kullback-Leibler-based ambiguity set. As we demonstrate, the two ambiguity sets are essential to adjusting the shape of the bid-shading policy in a principled way so as to effectively cope with uncertainty. Our distributionally robust bid shading policy is efficient to compute and systematically outperforms its non-robust counterpart on real datasets provided by Yahoo DSP.
Published: 2024

43. A graph product and its Application

Author: Sonar, Bishal and Srivastava, Ravi
Subjects: Mathematics - Combinatorics, 05C76, 05C50, 05C22
Abstract: The spectrum of Laplacian and signless Laplacian matrix for a graph product is obtained, where both underlying graphs are regular. As an application of this, we have been able to generate the Kirchhoff Index and Wiener Index and determine the number of spanning trees. Additionally, we derived the conditions necessary for obtaining a Laplacian and signless Laplacian integral product graph.
Published: 2024

44. Correction to Local Information Privacy and Its Applications to Data Aggregation

Author: Jiang, Bo, Li, Ming, and Tandon, Ravi
Subjects: Computer Science - Cryptography and Security
Abstract: In our previous works, we defined Local Information Privacy (LIP) as a context-aware privacy notion and presented the corresponding privacy-preserving mechanism. Then we claim that the mechanism satisfies epsilon-LIP for any epsilon>0 for arbitrary Px. However, this claim is not completely correct. In this document, we provide a correction to the valid range of privacy parameters of our previously proposed LIP mechanism. Further, we propose efficient algorithms to expand the range of valid privacy parameters. Finally, we discuss the impact of updated results on our original paper's experiments, the rationale of the proposed correction and corrected results.
Published: 2024

45. Differential Privacy on Trust Graphs

Author: Ghazi, Badih, Kumar, Ravi, Manurangsi, Pasin, and Wang, Serena
Subjects: Computer Science - Cryptography and Security, Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning, Computer Science - Social and Information Networks
Abstract: We study differential privacy (DP) in a multi-party setting where each party only trusts a (known) subset of the other parties with its data. Specifically, given a trust graph where vertices correspond to parties and neighbors are mutually trusting, we give a DP algorithm for aggregation with a much better privacy-utility trade-off than in the well-studied local model of DP (where each party trusts no other party). We further study a robust variant where each party trusts all but an unknown subset of at most $t$ of its neighbors (where $t$ is a given parameter), and give an algorithm for this setting. We complement our algorithms with lower bounds, and discuss implications of our work to other tasks in private learning and analytics.
Published: 2024

46. Can LLMs be Scammed? A Baseline Measurement Study

Author: Sehwag, Udari Madhushani, Patel, Kelly, Mosca, Francesca, Ravi, Vineeth, and Staddon, Jessica
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence
Abstract: Despite the importance of developing generative AI models that can effectively resist scams, current literature lacks a structured framework for evaluating their vulnerability to such threats. In this work, we address this gap by constructing a benchmark based on the FINRA taxonomy and systematically assessing Large Language Models' (LLMs') vulnerability to a variety of scam tactics. First, we incorporate 37 well-defined base scam scenarios reflecting the diverse scam categories identified by FINRA taxonomy, providing a focused evaluation of LLMs' scam detection capabilities. Second, we utilize representative proprietary (GPT-3.5, GPT-4) and open-source (Llama) models to analyze their performance in scam detection. Third, our research provides critical insights into which scam tactics are most effective against LLMs and how varying persona traits and persuasive techniques influence these vulnerabilities. We reveal distinct susceptibility patterns across different models and scenarios, underscoring the need for targeted enhancements in LLM design and deployment.
Published: 2024

47. Unlearn and Burn: Adversarial Machine Unlearning Requests Destroy Model Accuracy

Author: Huang, Yangsibo, Liu, Daogao, Chua, Lynn, Ghazi, Badih, Kamath, Pritish, Kumar, Ravi, Manurangsi, Pasin, Nasr, Milad, Sinha, Amer, and Zhang, Chiyuan
Subjects: Computer Science - Cryptography and Security
Abstract: Machine unlearning algorithms, designed for selective removal of training data from models, have emerged as a promising approach to growing privacy concerns. In this work, we expose a critical yet underexplored vulnerability in the deployment of unlearning systems: the assumption that the data requested for removal is always part of the original training set. We present a threat model where an attacker can degrade model accuracy by submitting adversarial unlearning requests for data not present in the training set. We propose white-box and black-box attack algorithms and evaluate them through a case study on image classification tasks using the CIFAR-10 and ImageNet datasets, targeting a family of widely used unlearning methods. Our results show extremely poor test accuracy following the attack: 3.6% on CIFAR-10 and 0.4% on ImageNet for white-box attacks, and 8.5% on CIFAR-10 and 1.3% on ImageNet for black-box attacks. Additionally, we evaluate various verification mechanisms to detect the legitimacy of unlearning requests and reveal the challenges in verification, as most of the mechanisms fail to detect stealthy attacks without severely impairing their ability to process valid requests. These findings underscore the urgent need for research on more robust request verification methods and unlearning protocols, should the deployment of machine unlearning systems become more prevalent in the future.
Published: 2024

48. Cross-Domain Evaluation of Few-Shot Classification Models: Natural Images vs. Histopathological Images

Author: Sekhar, Ardhendu, Bhattacharya, Aditya, Goyal, Vinayak, Goel, Vrinda, Bhangale, Aditya, Gupta, Ravi Kant, and Sethi, Amit
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this study, we investigate the performance of few-shot classification models across different domains, specifically natural images and histopathological images. We first train several few-shot classification models on natural images and evaluate their performance on histopathological images. Subsequently, we train the same models on histopathological images and compare their performance. We incorporated four histopathology datasets and one natural images dataset and assessed performance across 5-way 1-shot, 5-way 5-shot, and 5-way 10-shot scenarios using a selection of state-of-the-art classification techniques. Our experimental results reveal insights into the transferability and generalization capabilities of few-shot classification models between diverse image domains. We analyze the strengths and limitations of these models in adapting to new domains and provide recommendations for optimizing their performance in cross-domain scenarios. This research contributes to advancing our understanding of few-shot learning in the context of image classification across diverse domains.
Published: 2024

49. Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Author: Wu, Tong, Zhang, Shujian, Song, Kaiqiang, Xu, Silei, Zhao, Sanqiang, Agrawal, Ravi, Indurthi, Sathish Reddy, Xiang, Chong, Mittal, Prateek, and Zhou, Wenxuan
Subjects: Computer Science - Cryptography and Security, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large Language Models (LLMs) are susceptible to security and safety threats, such as prompt injection, prompt extraction, and harmful requests. One major cause of these vulnerabilities is the lack of an instruction hierarchy. Modern LLM architectures treat all inputs equally, failing to distinguish between and prioritize various types of instructions, such as system messages, user prompts, and data. As a result, lower-priority user prompts may override more critical system instructions, including safety protocols. Existing approaches to achieving instruction hierarchy, such as delimiters and instruction-based training, do not address this issue at the architectural level. We introduce the Instructional Segment Embedding (ISE) technique, inspired by BERT, to modern large language models, which embeds instruction priority information directly into the model. This approach enables models to explicitly differentiate and prioritize various instruction types, significantly improving safety against malicious prompts that attempt to override priority rules. Our experiments on the Structured Query and Instruction Hierarchy benchmarks demonstrate an average robust accuracy increase of up to 15.75% and 18.68%, respectively. Furthermore, we observe an improvement in instruction-following capability of up to 4.1% evaluated on AlpacaEval. Overall, our approach offers a promising direction for enhancing the safety and effectiveness of LLM architectures., Comment: Preprint
Published: 2024

50. How Unique is Whose Web Browser? The role of demographics in browser fingerprinting among US users

Author: Berke, Alex, Bacis, Enrico, Ghazi, Badih, Kamath, Pritish, Kumar, Ravi, Lassonde, Robin, Manurangsi, Pasin, and Syed, Umar
Subjects: Computer Science - Computers and Society
Abstract: Browser fingerprinting can be used to identify and track users across the Web, even without cookies, by collecting attributes from users' devices to create unique "fingerprints". This technique and resulting privacy risks have been studied for over a decade. Yet further research is limited because prior studies used data not publicly available. Additionally, data in prior studies lacked user demographics. Here we provide a first-of-its-kind dataset to enable further research. It includes browser attributes with users' demographics and survey responses, collected with informed consent from 8,400 US study participants. We use this dataset to demonstrate how fingerprinting risks differ across demographic groups. For example, we find lower income users are more at risk, and find that as users' age increases, they are both more likely to be concerned about fingerprinting and at real risk of fingerprinting. Furthermore, we demonstrate an overlooked risk: user demographics, such as gender, age, income level and race, can be inferred from browser attributes commonly used for fingerprinting, and we identify which browser attributes most contribute to this risk. Our data collection process also conducted an experiment to study what impacts users' likelihood to share browser data for open research, in order to inform future data collection efforts, with responses from 12,461 total participants. Female participants were significantly less likely to share their browser data, as were participants who were shown the browser data we asked to collect. Overall, we show the important role of user demographics in the ongoing work that intends to assess fingerprinting risks and improve user privacy, with findings to inform future privacy enhancing browser developments. The dataset and data collection tool we provide can be used to further study research questions not addressed in this work., Comment: In Proceedings on Privacy Enhancing Technologies 2025(1)
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

57,613 results on '"Ravi, P."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources