885 results on '"Sagar P."'
Search Results
2. A Hybrid Deep Learning CNN Model for Enhanced COVID-19 Detection from Computed Tomography (CT) Scan Images
- Author
-
Nettur, Suresh Babu, Karpurapu, Shanthi, Nettur, Unnati, Gajja, Likhit Sagar, Myneni, Sravanthy, Dusi, Akhil, and Posham, Lalithya
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Early detection of COVID-19 is crucial for effective treatment and controlling its spread. This study proposes a novel hybrid deep learning model for detecting COVID-19 from CT scan images, designed to assist overburdened medical professionals. Our proposed model leverages the strengths of VGG16, DenseNet121, and MobileNetV2 to extract features, followed by Principal Component Analysis (PCA) for dimensionality reduction, after which the features are stacked and classified using a Support Vector Classifier (SVC). We conducted comparative analysis between the proposed hybrid model and individual pre-trained CNN models, using a dataset of 2,108 training images and 373 test images comprising both COVID-positive and non-COVID images. Our proposed hybrid model achieved an accuracy of 98.93%, outperforming the individual models in terms of precision, recall, F1 scores, and ROC curve performance., Comment: Corresponding authors: Shanthi Karpurapu (shanthi.karpurapu@gmail.com), Suresh Babu Nettur (nettursuresh@gmail.com) Shanthi Karpurapu and Suresh Babu Nettur are co-first authors
- Published
- 2025
3. Evaporative cooling by pulse width modulation (PWM) of optical dipole traps
- Author
-
Maurya, S. Sagar, Sunil, Joel M., Bhartiya, Monu, Dutta, Pranab, Mangaonkar, Jay, Sawant, Rahul, and Rapol, Umakant D.
- Subjects
Physics - Atomic Physics - Abstract
We introduce a method for cooling atoms in an optical dipole trap using pulse-width modulation (PWM) technique, without reducing the laser power of the dipole trap. The PWM technique involves digital modulation of the trap at a fixed frequency. The effective time-averaged dipole potential is lowered by adjusting the duty cycle of the modulation, thereby implementing evaporative cooling. We show that, this technique effectively reduces temperature and enhances phase space density. A comparison with the standard method of evaporative cooling has also been made. Apart from the atom loss due to reduction of the effective trapping potential, we observe an additional loss channel originating from the lack of trapping potential during the trap off time. This atom loss is observed at different modulation frequencies which are an order of magnitude higher compared to trapping frequency of dipole trap. The PWM technique provides an alternative to traditional evaporative cooling in scenarios where it is preferred that the laser power of the trap should be constant.
- Published
- 2025
4. Lightweight Weighted Average Ensemble Model for Pneumonia Detection in Chest X-Ray Images
- Author
-
Nettur, Suresh Babu, Karpurapu, Shanthi, Nettur, Unnati, Gajja, Likhit Sagar, Myneni, Sravanthy, Dusi, Akhil, and Posham, Lalithya
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Pneumonia is a leading cause of illness and death in children, underscoring the need for early and accurate detection. In this study, we propose a novel lightweight ensemble model for detecting pneumonia in children using chest X-ray images. This ensemble model integrates two pre-trained convolutional neural networks (CNNs), MobileNetV2 and NASNetMobile, selected for their balance of computational efficiency and accuracy. These models were fine-tuned on a pediatric chest X-ray dataset and combined to enhance classification performance. Our proposed ensemble model achieved a classification accuracy of 98.63%, significantly outperforming individual models such as MobileNetV2 (97.10%) and NASNetMobile(96.25%) in terms of accuracy, precision, recall, and F1 score. Moreover, the ensemble model outperformed state-of-the-art architectures, including ResNet50, InceptionV3, and DenseNet201, while maintaining computational efficiency. The proposed lightweight ensemble model presents a highly effective and resource-efficient solution for pneumonia detection, making it particularly suitable for deployment in resource-constrained settings., Comment: Corresponding authors: Shanthi Karpurapu (shanthi.karpurapu@gmail.com), Suresh Babu Nettur (nettursuresh@gmail.com)
- Published
- 2025
5. UltraLightSqueezeNet: A Deep Learning Architecture for Malaria Classification with up to 54x fewer trainable parameters for resource constrained devices
- Author
-
Nettur, Suresh Babu, Karpurapu, Shanthi, Nettur, Unnati, Gajja, Likhit Sagar, Myneni, Sravanthy, Dusi, Akhil, and Posham, Lalithya
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Lightweight deep learning approaches for malaria detection have gained attention for their potential to enhance diagnostics in resource constrained environments. For our study, we selected SqueezeNet1.1 as it is one of the most popular lightweight architectures. SqueezeNet1.1 is a later version of SqueezeNet1.0 and is 2.4 times more computationally efficient than the original model. We proposed and implemented three ultra-lightweight architecture variants to SqueezeNet1.1 architecture, namely Variant 1 (one fire module), Variant 2 (two fire modules), and Variant 3 (four fire modules), which are even more compact than SqueezeNetV1.1 (eight fire modules). These models were implemented to evaluate the best performing variant that achieves superior computational efficiency without sacrificing accuracy in malaria blood cell classification. The models were trained and evaluated using the NIH Malaria dataset. We assessed each model's performance based on metrics including accuracy, recall, precision, F1-score, and Area Under the Curve (AUC). The results show that the SqueezeNet1.1 model achieves the highest performance across all metrics, with a classification accuracy of 97.12%. Variant 3 (four fire modules) offers a competitive alternative, delivering almost identical results (accuracy 96.55%) with a 6x reduction in computational overhead compared to SqueezeNet1.1. Variant 2 and Variant 1 perform slightly lower than Variant 3, with Variant 2 (two fire modules) reducing computational overhead by 28x, and Variant 1 (one fire module) achieving a 54x reduction in trainable parameters compared to SqueezeNet1.1. These findings demonstrate that our SqueezeNet1.1 architecture variants provide a flexible approach to malaria detection, enabling the selection of a variant that balances resource constraints and performance.
- Published
- 2025
6. Domain-Factored Untrained Deep Prior for Spectrum Cartography
- Author
-
Timilsina, Subash, Shrestha, Sagar, Cheng, Lei, and Fu, Xiao
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Spectrum cartography (SC) focuses on estimating the radio power propagation map of multiple emitters across space and frequency using limited sensor measurements. Recent advances in SC have shown that leveraging learned deep generative models (DGMs) as structural constraints yields state-of-the-art performance. By harnessing the expressive power of neural networks, these structural "priors" capture intricate patterns in radio maps. However, training DGMs requires substantial data, which is not always available, and distribution shifts between training and testing data can further degrade performance. To address these challenges, this work proposes using untrained neural networks (UNNs) for SC. UNNs, commonly applied in vision tasks to represent complex data without training, encode structural information of data in neural architectures. In our approach, a custom-designed UNN represents radio maps under a spatio-spectral domain factorization model, leveraging physical characteristics to reduce sample complexity of SC. Experiments show that the method achieves performance comparable to learned DGM-based SC, without requiring training data., Comment: 6 pages 6 figures
- Published
- 2025
7. A Comprehensive Metric for Resilience Evaluation of Power Distribution Systems under Cyber Attacks
- Author
-
Babu, Mitikiri Sagar, Babu, Victor Sam Moses, Srinivas, Vedantham Lakshmi, Chakraborty, Pratyush, and Pal, Mayukha
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Power distribution systems (PDS) serve as the backbone of our modern society, ensuring electricity reaches homes, businesses, and critical infrastructure. However, the increasing digitization and interconnectivity of these systems have exposed them to cyber threats. This study presents a comprehensive approach to evaluate and enhance the resilience of PDS under cyber attacks using the Common Vulnerability Scoring System (CVSS) and complex network parameters. By systematically assessing vulnerabilities and computing resilience once critical CVSS thresholds are reached, this work identifies key resilience metrics including the critical loads service requirements. The proposed methodology improves system resilience through strategic tie-line switching, which is validated on the modified IEEE 33-bus system. Four case studies are conducted, illustrating the performance of the proposed methodology under various cyber attack scenarios. The results demonstrate the effectiveness of the approach in quantifying and enhancing resilience, offering a valuable tool for PDS operators to mitigate risks and ensure continuous service delivery to critical loads during the exploitation of cyber threats.
- Published
- 2025
8. A Class of Non-Contracting Branch Groups with Non-Torsion Rigid Kernels
- Author
-
Saha, Sagar and Krishna, K. V.
- Subjects
Mathematics - Group Theory ,20E08, 20E18 - Abstract
In this work, we provide the first example of an infinite family of branch groups in the class of non-contracting self-similar groups. We show that these groups are very strongly fractal, not regular branch, and of exponential growth. Further, we prove that these groups do not have the congruence subgroup property by explicitly calculating the structure of their rigid kernels. This class of groups is also the first example of branch groups with non-torsion rigid kernels. As a consequence of these results, we also determine the Hausdorff dimension of these groups.
- Published
- 2025
9. Kolmogorov equations for 2D stochastic convective Brinkman-Forchheimer equations: Analysis and Applications
- Author
-
Gautam, Sagar and Mohan, Manil T.
- Subjects
Mathematics - Optimization and Control - Abstract
In this work, we consider the following 2D stochastic convective Brinkman-Forchheimer (SCBF) equations in a bounded smooth domain $\mathcal{O}$: \begin{align*} \mathrm{d}\boldsymbol{u}+\left[-\mu \Delta\boldsymbol{u}+(\boldsymbol{u}\cdot\nabla)\boldsymbol{u}+\alpha\boldsymbol{u}+\beta|\boldsymbol{u}|^{r-1}\boldsymbol{u}+\nabla p\right]\mathrm{d}t=\sqrt{\mathrm{Q}}\mathrm{W}, \ \nabla\cdot\boldsymbol{u}=0, \end{align*} where $\mu,\alpha,\beta>0$, $r\in\{1,2,3\}$, $\mathrm{Q}$ is a non-negative operator of trace class, $\mathrm{W}$ is a cylindrical Wiener process in a Hilbert space $\mathbb{H}$. Under the following assumption on the viscosity co-efficient $\mu$ and the Darcy co-efficient $\alpha$: for some positive constant $\gamma_1$, \begin{equation*} \mu(\mu+\alpha)^2>\gamma_1\max\{4\mathrm{Tr}(\mathrm{Q}),\mathrm{Tr}(\mathrm{A}^{2\delta}\mathrm{Q})\}, \end{equation*} where $\mathrm{A}$ is the Stokes operator and $\delta\in(0,\frac{1}{2})$, our primary goal is to solve the corresponding Kolmogorov equation in the space $\mathbb{L}^2(\mathbb{H};\eta),$ where $\eta$ is the unique invariant measure associated with 2D SCBF equations. Then, we establish the well-known ``carr\'e du champs'' identity. Some sharp estimates on the derivatives of the solution constitute the key component of the proofs. We take into consideration two control problems from the application point of view. The first is an infinite horizon control problem for which we establish the existence of a solution for the Hamilton-Jacobi-Bellman equation associated with it. Finally, by exploiting $m$-accretive theory, we demonstrate the existence of a unique solution for an obstacle problem associated with the Kolmogorov operator corresponding to the stopping-time problem for 2D SCBF equations.
- Published
- 2024
10. On the convective Brinkman-Forchheimer equations
- Author
-
Gautam, Sagar and Mohan, Manil T.
- Subjects
Mathematics - Analysis of PDEs - Abstract
The convective Brinkman--Forchheimer equations or the Navier--Stokes equations with damping in bounded or periodic domains $\subset\mathbb{R}^d$, $2\leq d\leq 4$ are considered in this work. The existence and uniqueness of a global weak solution in the Leray-Hopf sense satisfying the energy equality to the system: $$\partial_t\boldsymbol{u}-\mu \Delta\boldsymbol{u}+(\boldsymbol{u}\cdot\nabla)\boldsymbol{u}+\alpha\boldsymbol{u}+\beta|\boldsymbol{u}|^{r-1}\boldsymbol{u}+\nabla p=\boldsymbol{f},\ \nabla\cdot\boldsymbol{u}=0,$$ (for all values of $\beta>0$ and $\mu>0$, whenever the absorption exponent $r>3$ and $2\beta\mu \geq 1$, for the critical case $r=3$) is proved. We exploit the monotonicity as well as the demicontinuity properties of the linear and nonlinear operators and the Minty-Browder technique in the proofs. Finally, we discuss the existence of global-in-time strong solutions to such systems in periodic domains., Comment: arXiv admin note: text overlap with arXiv:2008.08577
- Published
- 2024
11. Evaluate Summarization in Fine-Granularity: Auto Evaluation with LLM
- Author
-
Yuan, Dong, Rastogi, Eti, Zhao, Fen, Goyal, Sagar, Naik, Gautam, and Rajagopal, Sree Prasanna
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Due to the exponential growth of information and the need for efficient information consumption the task of summarization has gained paramount importance. Evaluating summarization accurately and objectively presents significant challenges, particularly when dealing with long and unstructured texts rich in content. Existing methods, such as ROUGE (Lin, 2004) and embedding similarities, often yield scores that have low correlation with human judgements and are also not intuitively understandable, making it difficult to gauge the true quality of the summaries. LLMs can mimic human in giving subjective reviews but subjective scores are hard to interpret and justify. They can be easily manipulated by altering the models and the tones of the prompts. In this paper, we introduce a novel evaluation methodology and tooling designed to address these challenges, providing a more comprehensive, accurate and interpretable assessment of summarization outputs. Our method (SumAutoEval) proposes and evaluates metrics at varying granularity levels, giving objective scores on 4 key dimensions such as completeness, correctness, Alignment and readability. We empirically demonstrate, that SumAutoEval enhances the understanding of output quality with better human correlation.
- Published
- 2024
12. The Rendezvous Between Extreme Value Theory and Next-generation Networks
- Author
-
Sagar, Srinivas, Subhash, Athira, Liu, Chen-Feng, Elzanaty, Ahmed, Al-Badarneh, Yazan H., Kalyani, Sheetal, Alouini, Mohamed-Slim, Bennis, Mehdi, and Hanzo, Lajos
- Subjects
Computer Science - Information Theory - Abstract
Promising technologies such as massive multiple-input and multiple-output, reconfigurable intelligent reflecting surfaces, non-terrestrial networks, millimetre wave communication, ultra-reliable lowlatency communication are envisioned as the enablers for next-generation (NG) networks. In contrast to conventional communication systems meeting specific average performance requirements, NG systems are expected to meet quality-of-service requirements in extreme scenarios as well. In this regard, extreme value theory (EVT) provides a powerful framework for the design of communication systems. In this paper, we provide a comprehensive survey of advances in communication that utilize EVT to characterize the extreme order statistics of interest. We first give an overview of the history of EVT and then elaborate on the fundamental theorems. Subsequently, we discuss different problems of interest in NG communication systems and how EVT can be utilized for their analysis. We finally point out the open challenges and future directions of EVT in NG communication systems.
- Published
- 2024
13. Bounds for higher Steklov and mixed Steklov Neumann eigenvalues on domains with holes
- Author
-
Basak, Sagar and Verma, Sheela
- Subjects
Mathematics - Spectral Theory ,58J50, 35P15 - Abstract
In this article, we study Steklov eigenvalues and mixed Steklov Neumann eigenvalues on a smooth bounded domain in $\mathbb{R}^{n}$, $n \geq 2$, having a spherical hole. We focus on two main results related to Steklov eigenvalues. First, we obtain explicit expression for the second nonzero Steklov eigenvalue on concentric annular domain. Secondly, we derive a sharp upper bound of the first $n$ nonzero Steklov eigenvalues on a domain $\Omega \subset \mathbb{R}^{n}$ having symmetry of order $4$ and a ball removed from its center. This bound is given in terms of the corresponding Steklov eigenvalues on a concentric annular domain of the same volume as $\Omega$. Next, we consider the mixed Steklov Neumann eigenvalue problem on $4^{\text{th}}$ order symmetric domains in $\mathbb{R}^{n}$ having a spherical hole and obtain upper bound of the first $n$ nonzero eigenvalues. We also provide some examples to illustrate that symmetry assumption in our results is crucial. Finally, We make some numerical observations about these eigenvalues using FreeFEM++ and state them as conjectures.
- Published
- 2024
14. EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
- Author
-
Soni, Sagar, Dudhane, Akshay, Debary, Hiyam, Fiaz, Mustansar, Munir, Muhammad Akhtar, Danish, Muhammad Sohail, Fraccaro, Paolo, Watson, Campbell D, Klein, Levente J, Khan, Fahad Shahbaz, and Khan, Salman
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Automated analysis of vast Earth observation data via interactive Vision-Language Models (VLMs) can unlock new opportunities for environmental monitoring, disaster response, and resource management. Existing generic VLMs do not perform well on Remote Sensing data, while the recent Geo-spatial VLMs remain restricted to a fixed resolution and few sensor modalities. In this paper, we introduce EarthDial, a conversational assistant specifically designed for Earth Observation (EO) data, transforming complex, multi-sensory Earth observations into interactive, natural language dialogues. EarthDial supports multi-spectral, multi-temporal, and multi-resolution imagery, enabling a wide range of remote sensing tasks, including classification, detection, captioning, question answering, visual reasoning, and visual grounding. To achieve this, we introduce an extensive instruction tuning dataset comprising over 11.11M instruction pairs covering RGB, Synthetic Aperture Radar (SAR), and multispectral modalities such as Near-Infrared (NIR) and infrared. Furthermore, EarthDial handles bi-temporal and multi-temporal sequence analysis for applications like change detection. Our extensive experimental results on 37 downstream applications demonstrate that EarthDial outperforms existing generic and domain-specific models, achieving better generalization across various EO tasks.
- Published
- 2024
15. uGMRT observation of the unidentified PeVatron candidate LHAASO J2108+5157
- Author
-
Mahanta, Gunindra Krishna, Roy, Subhashis, Godambe, Sagar, Ghosal, Bitan, Bhatt, Nilay, and Bhattacharyya, Subir
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Recent observations by the Large High Altitude Air Shower Observatory (LHAASO) detected Ultra High Energy (UHE) photons in the range 100 TeV to 1.4 PeV from twelve sources including Crab nebula. The detection of these photons demands the presence of at least PeV energy particle in the source. It is important to understand particle acceleration and radiation emission processes in such source. One of those twelve sources, LHAASO J2108+5157 does not show any association or counterparts at any other wavelength. In search of counterpart, we surveyed the region with Giant Metrewave Radio Telescope (GMRT) at 650 MHz frequency. GMRT observation revel radio emission from an extended source within the PSF of LHAASO which shows disk-jet morphology. Considering the spatial association and extent of the source, it is plausible that particle acceleration to PeV energies originates from this source., Comment: A detailed manuscript of this work has been submitted to a peer-reviewed journal
- Published
- 2024
16. Driving Innovation in 6G Wireless Technologies: The OpenAirInterface Approach
- Author
-
Kaltenberger, Florian, Melodia, Tommaso, Ghauri, Irfan, Polese, Michele, Knopp, Raymond, Nguyen, Tien Thinh, Velumani, Sakthivel, Villa, Davide, Bonati, Leonardo, Schmidt, Robert, Arora, Sagar, Irazabal, Mikel, and Nikaein, Navid
- Subjects
Computer Science - Networking and Internet Architecture ,Electrical Engineering and Systems Science - Signal Processing - Abstract
The development of 6G wireless technologies is rapidly advancing, with the 3rd Generation Partnership Project (3GPP) entering the pre-standardization phase and aiming to deliver the first specifications by 2028. This paper explores the OpenAirInterface (OAI) project, an open-source initiative that plays a crucial role in the evolution of 5G and the future 6G networks. OAI provides a comprehensive implementation of 3GPP and O-RAN compliant networks, including Radio Access Network (RAN), Core Network (CN), and software-defined User Equipment (UE) components. The paper details the history and evolution of OAI, its licensing model, and the various projects under its umbrella, such as RAN, the CN, as well as the Operations, Administration and Maintenance (OAM) projects. It also highlights the development methodology, Continuous Integration/Continuous Delivery (CI/CD) processes, and end-to-end systems powered by OAI. Furthermore, the paper discusses the potential of OAI for 6G research, focusing on spectrum, reflective intelligent surfaces, and Artificial Intelligence (AI)/Machine Learning (ML) integration. The open-source approach of OAI is emphasized as essential for tackling the challenges of 6G, fostering community collaboration, and driving innovation in next-generation wireless technologies., Comment: This work has been submitted to the Elsesvier Computer Networks Journal for possible publication
- Published
- 2024
17. Meeting Utility Constraints in Differential Privacy: A Privacy-Boosting Approach
- Author
-
Jiang, Bo, Zhang, Wanrong, Lu, Donghang, Du, Jian, Sharma, Sagar, and Yan, Qiang
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Data Structures and Algorithms ,Computer Science - Information Theory - Abstract
Data engineering often requires accuracy (utility) constraints on results, posing significant challenges in designing differentially private (DP) mechanisms, particularly under stringent privacy parameter $\epsilon$. In this paper, we propose a privacy-boosting framework that is compatible with most noise-adding DP mechanisms. Our framework enhances the likelihood of outputs falling within a preferred subset of the support to meet utility requirements while enlarging the overall variance to reduce privacy leakage. We characterize the privacy loss distribution of our framework and present the privacy profile formulation for $(\epsilon,\delta)$-DP and R\'enyi DP (RDP) guarantees. We study special cases involving data-dependent and data-independent utility formulations. Through extensive experiments, we demonstrate that our framework achieves lower privacy loss than standard DP mechanisms under utility constraints. Notably, our approach is particularly effective in reducing privacy loss with large query sensitivity relative to the true answer, offering a more practical and flexible approach to designing differentially private mechanisms that meet specific utility constraints., Comment: published on IEEE S&P 2025
- Published
- 2024
18. Large collective power enhancement in dissipative charging of a quantum battery
- Author
-
Pokhrel, Sagar and Gea-Banacloche, Julio
- Subjects
Quantum Physics - Abstract
We consider a model for a quantum battery consisting of a collection of $N$ two-level atoms driven by a classical field and decaying to a common reservoir. In the extensive regime, where the energy $E$ scales as $N$ and the fluctuations $\Delta E/E \to 0$, our dissipative charging protocol yields a power proportional to $N^2$, a scaling that cannot be achieved in this regime by Hamiltonian protocols. The tradeoff for this enhanced charging power is a relative inefficiency, since a large fraction of the incoming energy is lost through spontaneous emission to the environment. Nevertheless, we find the system can store a large amount of coherence, and also release the stored energy coherently through spontaneous emission, again with a power scaling as $N^2$.
- Published
- 2024
19. Two-dimensional Constacyclic Codes over $\mathbb{F}_q$
- Author
-
Sagar, Vidya, Patel, Shikha, and Garani, Shayan Srinivasa
- Subjects
Computer Science - Information Theory - Abstract
We consider two-dimensional $(\lambda_1, \lambda_2)$-constacyclic codes over $\mathbb{F}_{q}$ of area $M N$, where $q$ is some power of prime $p$ with $\gcd(M,p)=1$ and $\gcd(N,p)=1$. With the help of common zero (CZ) set, we characterize 2-D constacyclic codes. Further, we provide an algorithm to construct an ideal basis of these codes by using their essential common zero (ECZ) sets. We describe the dual of 2-D constacyclic codes. Finally, we provide an encoding scheme for generating 2-D constacyclic codes. We present an example to illustrate that 2-D constacyclic codes can have better minimum distance compared to their cyclic counterparts with the same code size and code rate., Comment: 23 pages
- Published
- 2024
20. Vector Portals at Future Lepton Colliders
- Author
-
Airen, Sagar, Broadberry, Edward, Marques-Tavares, Gustavo, and Ricci, Lorenzo
- Subjects
High Energy Physics - Phenomenology - Abstract
We assess the sensitivity of future lepton colliders to weakly coupled vector dark portals (aka ``$ Z' $ bosons'') with masses ranging from tens of GeV to a few TeV. Our analysis focuses on dark photons and $ L_{\mu} - L_{\tau} $ gauge bosons. We consider both visible and invisible decay channels. We demonstrate that both high energy $\mu$ colliders and future $ e^+e^- $ colliders, using the FCC-ee $Z$-pole and $ZH$ operation modes as a benchmark, offer significant improvements in sensitivity. We find that both colliders can enhance the sensitivity to $ L_{\mu} - L_{\tau} $ bosons (for both visible and invisible decays) and to invisibly decaying dark photons by 1--2 orders of magnitude across the relevant mass range. Furthermore, we study the impact of forward $ \mu $ detectors at the $ \mu $-collider on the sensitivity to both models., Comment: 20 pages, 8 figures
- Published
- 2024
21. Dissipative Dynamical Phase Transition as a Complex Ising Model
- Author
-
Yan, Stephen W., Barberena, Diego, Fisher, Matthew P. A., and Vijay, Sagar
- Subjects
Quantum Physics ,Condensed Matter - Statistical Mechanics ,Condensed Matter - Strongly Correlated Electrons - Abstract
We investigate a quantum dynamical phase transition induced by the competition between local unitary evolution and dissipation in a qubit chain with a strong, on-site $\mathbb{Z}_2$ symmetry. While the steady-state of this evolution is always maximally-mixed, we show that the dynamical behavior of certain non-local observables on the approach to this steady-state is dictated by a quantum Ising model with a $\textit{complex}$ transverse-field (cTFIM). We investigate these observables analytically, uncovering a dynamical phase transition as the relative rate of unitary evolution and dissipation is tuned. We show that the weak-dissipation limit corresponds to a cTFIM with a large magnitude of the imaginary transverse-field, for which the many-body "ground-state" (with smallest real eigenvalue) is gapless, exhibiting quasi-long-range correlations of the local magnetization with a continuously-varying exponent. Correspondingly, the dynamics of the non-local observables show oscillatory behavior with an amplitude decaying exponentially in time. The strong-dissipation limit corresponds to a gapped ferromagnetic phase of the cTFIM, and non-local observables show exponential decay on the approach to equilibrium. This transition in (1+1)-dimensions has a peculiar, "two-sided" nature appearing as either first- or second-order depending on the phase from which the transition is approached, an analytic result which is corroborated by numerical studies. In higher dimensions, we present a field-theoretic understanding of the first-order nature of this transition, when approaching from the ferromagnetic phase of the cTFIM, though the nature of the phase with large imaginary transverse-field remains to be understood., Comment: 15 + 14 pages, 8 figures
- Published
- 2024
22. Bond exchange reactions as a paradigm for mitigating residual stress in polymer matrix fiber composites
- Author
-
Wang, Zhongtong, Wagner, Robert J., Chen, Tianke, Shah, Sagar P., Maiaru, Marianna, and Silberstein, Meredith N.
- Subjects
Condensed Matter - Soft Condensed Matter - Abstract
Polymer matrix fiber composites often suffer from residual stresses due to differences in coefficients of thermal expansion between the fibers and resins, as well as contractile strain of the resins during curing. To address residual stress driven composite failure, we propose the use of vitrimers as composite resins, which can undergo thermally activated, stress alleviating, bond exchange reactions (BERs). We conduct fiber Bragg grating measurements for a single glass fiber within bulk vitrimer. These show that the fiber strain in vitrimers with 5% catalyst is significantly lower than in those with 0% catalyst (minimal BER expected) during both curing and post-curing phases. We developed a finite deformation, micromechanically-inspired model that incorporates curing, thermal processes, and BERs, and then implemented this model it into finite element software to simulate stress evolution within single fiber composite systems. The combination of experimental and computational results reveals that BERs can effectively mitigate, but not eliminate, the residual stress in polymer matrix fiber composites.
- Published
- 2024
23. Accelerating Manufacturing Scale-Up from Material Discovery Using Agentic Web Navigation and Retrieval-Augmented AI for Process Engineering Schematics Design
- Author
-
Srinivas, Sakhinana Sagar, Das, Akash, Gupta, Shivam, and Runkana, Venkataramana
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Information Retrieval ,Computer Science - Multiagent Systems - Abstract
Process Flow Diagrams (PFDs) and Process and Instrumentation Diagrams (PIDs) are critical tools for industrial process design, control, and safety. However, the generation of precise and regulation-compliant diagrams remains a significant challenge, particularly in scaling breakthroughs from material discovery to industrial production in an era of automation and digitalization. This paper introduces an autonomous agentic framework to address these challenges through a twostage approach involving knowledge acquisition and generation. The framework integrates specialized sub-agents for retrieving and synthesizing multimodal data from publicly available online sources and constructs ontological knowledge graphs using a Graph Retrieval-Augmented Generation (Graph RAG) paradigm. These capabilities enable the automation of diagram generation and open-domain question answering (ODQA) tasks with high contextual accuracy. Extensive empirical experiments demonstrate the frameworks ability to deliver regulation-compliant diagrams with minimal expert intervention, highlighting its practical utility for industrial applications.
- Published
- 2024
24. What Do Machine Learning Researchers Mean by 'Reproducible'?
- Author
-
Raff, Edward, Benaroch, Michel, Samtani, Sagar, and Farris, Andrew L.
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Statistics - Machine Learning - Abstract
The concern that Artificial Intelligence (AI) and Machine Learning (ML) are entering a "reproducibility crisis" has spurred significant research in the past few years. Yet with each paper, it is often unclear what someone means by "reproducibility". Our work attempts to clarify the scope of "reproducibility" as displayed by the community at large. In doing so, we propose to refine the research to eight general topic areas. In this light, we see that each of these areas contains many works that do not advertise themselves as being about "reproducibility", in part because they go back decades before the matter came to broader attention., Comment: To appear in AAAI 2025, Senior Member Presentation Track
- Published
- 2024
25. RoboFail: Analyzing Failures in Robot Learning Policies
- Author
-
Sagar, Som and Senanayake, Ransalu
- Subjects
Computer Science - Robotics ,Computer Science - Machine Learning - Abstract
Despite being trained on increasingly large datasets, robot models often overfit to specific environments or datasets. Consequently, they excel within their training distribution but face challenges in generalizing to novel or unforeseen scenarios. This paper presents a method to proactively identify failure mode probabilities in robot manipulation policies, providing insights into where these models are likely to falter. To this end, since exhaustively searching over a large space of failures is infeasible, we propose a deep reinforcement learning-based framework, RoboFail. It is designed to detect scenarios prone to failure and quantify their likelihood, thus offering a structured approach to anticipate failures. By identifying these high-risk states in advance, RoboFail enables researchers and engineers to better understand the robustness limits of robot policies, contributing to the development of safer and more adaptable robotic systems., Comment: 14 Pages, 6 figures
- Published
- 2024
26. Weak measurement-based protocol for ergotropy protection in open quantum batteries
- Author
-
Malavazi, André H. A., Sagar, Rishav, Ahmadi, Borhan, and Dieguez, Pedro R.
- Subjects
Quantum Physics - Abstract
Quantum batteries are emerging as highly efficient energy storage devices that can exceed classical performance limits. Although there have been significant advancements in controlling these systems, challenges remain in stabilizing stored energy and minimizing losses due to inevitable environmental interaction. In this paper, we propose a protocol that employs selective weak measurements to protect quantum states from such influence and mitigate battery discharging. We establish thermodynamic constraints that allow this method to be implemented without disrupting the overall energy and ergotropy balance of the system. Our findings demonstrate that appropriately chosen measurement intensity can reduce unwanted discharging effects, thereby preserving ergotropy and improving the stability of quantum batteries. Additionally, we explore how weak measurements influence the coherent and incoherent components of ergotropy, providing new insights into the practical application of quantum coherence in energy storage technologies., Comment: 17 pages, 14 figures
- Published
- 2024
27. Approximating One-Sided and Two-Sided Nash Social Welfare With Capacities
- Author
-
Gokhale, Salil, Sagar, Harshul, Vaish, Rohit, and Yadav, Jatin
- Subjects
Computer Science - Computer Science and Game Theory - Abstract
We study the problem of maximizing Nash social welfare, which is the geometric mean of agents' utilities, in two well-known models. The first model involves one-sided preferences, where a set of indivisible items is allocated among a group of agents (commonly studied in fair division). The second model deals with two-sided preferences, where a set of workers and firms, each having numerical valuations for the other side, are matched with each other (commonly studied in matching-under-preferences literature). We study these models under capacity constraints, which restrict the number of items (respectively, workers) that an agent (respectively, a firm) can receive. We develop constant-factor approximation algorithms for both problems under a broad class of valuations. Specifically, our main results are the following: (a) For any $\epsilon > 0$, a $(6+\epsilon)$-approximation algorithm for the one-sided problem when agents have submodular valuations, and (b) a $1.33$-approximation algorithm for the two-sided problem when the firms have subadditive valuations. The former result provides the first constant-factor approximation algorithm for Nash welfare in the one-sided problem with submodular valuations and capacities, while the latter result improves upon an existing $\sqrt{OPT}$-approximation algorithm for additive valuations. Our result for the two-sided setting also establishes a computational separation between the Nash and utilitarian welfare objectives. We also complement our algorithms with hardness-of-approximation results., Comment: 28 pages, 1 figure
- Published
- 2024
28. Extending Video Masked Autoencoders to 128 frames
- Author
-
Gundavarapu, Nitesh Bharadwaj, Friedman, Luke, Goyal, Raghav, Hegde, Chaitra, Agustsson, Eirikur, Waghmare, Sagar M., Sirotenko, Mikhail, Yang, Ming-Hsuan, Weyand, Tobias, Gong, Boqing, and Sigal, Leonid
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Video understanding has witnessed significant progress with recent video foundation models demonstrating strong performance owing to self-supervised pre-training objectives; Masked Autoencoders (MAE) being the design of choice. Nevertheless, the majority of prior works that leverage MAE pre-training have focused on relatively short video representations (16 / 32 frames in length) largely due to hardware memory and compute limitations that scale poorly with video length due to the dense memory-intensive self-attention decoding. One natural strategy to address these challenges is to subsample tokens to reconstruct during decoding (or decoder masking). In this work, we propose an effective strategy for prioritizing tokens which allows training on longer video sequences (128 frames) and gets better performance than, more typical, random and uniform masking strategies. The core of our approach is an adaptive decoder masking strategy that prioritizes the most important tokens and uses quantized tokens as reconstruction objectives. Our adaptive strategy leverages a powerful MAGVIT-based tokenizer that jointly learns the tokens and their priority. We validate our design choices through exhaustive ablations and observe improved performance of the resulting long-video (128 frames) encoders over short-video (32 frames) counterparts. With our long-video masked autoencoder (LVMAE) strategy, we surpass state-of-the-art on Diving48 by 3.9 points and EPIC-Kitchens-100 verb classification by 2.5 points while relying on a simple core architecture and video-only pre-training (unlike some of the prior works that require millions of labeled video-text pairs or specialized encoders)., Comment: 10.5 pages of main paper, 25 pages total, 4 figures and 10 tables. To appear in NeurIPS'24
- Published
- 2024
29. Perturbations of Black Holes Surrounded by Anisotropic Matter Field
- Author
-
C, Sagar J, R, Karthik, Hegde, Katheek, Ajith, K. M., Punacha, Shreyas, and Kumara, A. Naveena
- Subjects
General Relativity and Quantum Cosmology - Abstract
Our research aims to probe the anisotropic matter field around black holes using black hole perturbation theory. Black holes in the universe are usually surrounded by matter or fields, and it is important to study the perturbation and the characteristic modes of a black hole that coexists with such a matter field. In this study, we focus on a family of black hole solutions to Einstein's equations that extend the Reissner-Nordstr\"{o}m spacetime to include an anisotropic matter field. In addition to mass and charge, this type of black hole possesses additional hair due to the negative radial pressure of the anisotropic matter. We investigate the perturbations of the massless scalar and electromagnetic fields and calculate the quasinormal modes (QNMs). We also study the critical orbits around the black hole and their properties to investigate the connection between the eikonal QNMs, black hole shadow radius, and Lyapunov exponent. Additionally, we analyze the grey-body factors and scattering coefficients using the perturbation results. Our findings indicate that the presence of anisotropic matter fields leads to a splitting in the QNM frequencies compared to the Schwarzschild case. This splitting feature is also reflected in the shadow radius, Lyapunov exponent, and grey-body factors., Comment: 35 pages, 10 figures
- Published
- 2024
30. FlowScope: Enhancing Decision Making by Time Series Forecasting based on Prediction Optimization using HybridFlow Forecast Framework
- Author
-
Boyeena, Nitin Sagar and Kumar, Begari Susheel
- Subjects
Computer Science - Machine Learning ,Computer Science - Computational Engineering, Finance, and Science ,Electrical Engineering and Systems Science - Signal Processing ,62M10 (Primary), 68T07 (Secondary) ,I.2.6 ,G.3 ,I.5 - Abstract
Time series forecasting is crucial in several sectors, such as meteorology, retail, healthcare, and finance. Accurately forecasting future trends and patterns is crucial for strategic planning and making well-informed decisions. In this case, it is crucial to include many forecasting methodologies. The strengths of Auto-regressive Integrated Moving Average (ARIMA) for linear time series, Seasonal ARIMA models (SARIMA) for seasonal time series, Exponential Smoothing State Space Models (ETS) for handling errors and trends, and Long Short-Term Memory (LSTM) Neural Network model for complex pattern recognition have been combined to create a comprehensive framework called FlowScope. SARIMA excels in capturing seasonal variations, whereas ARIMA ensures effective handling of linear time series. ETS models excel in capturing trends and correcting errors, whereas LSTM networks excel in reflecting intricate temporal connections. By combining these methods from both machine learning and deep learning, we propose a deep-hybrid learning approach FlowScope which offers a versatile and robust platform for predicting time series data. This empowers enterprises to make informed decisions and optimize long-term strategies for maximum performance. Keywords: Time Series Forecasting, HybridFlow Forecast Framework, Deep-Hybrid Learning, Informed Decisions., Comment: 12 pages and 6 figures
- Published
- 2024
31. Turbulent pipe flow with spherical particles: drag as a function of particle size and volume fraction
- Author
-
Leskovec, Martin, Zade, Sagar, Niazi, Mehdi, Costa, Pedro, Lundell, Fredrik, and Brandt, Luca
- Subjects
Physics - Fluid Dynamics - Abstract
Suspensions of finite-size solid particles in a turbulent pipe flow are found in many industrial and technical flows. Due to the ample parameter space consisting of particle size, concentration, density and Reynolds number, a complete picture of the particle-fluid interaction is still lacking. Pressure drop predictions are often made using viscosity models only considering the bulk solid volume fraction. For the case of turbulent pipe flow laden with neutrally buoyant spherical particles, we investigate the pressure drop and overall drag (friction factor), fluid velocity and particle distribution in the pipe. We use a combination of experimental (MRV) and numerical (DNS) techniques and a continuum flow model. We find that the particle size and the bulk flow rate influence the mean fluid velocity, velocity fluctuations and the particle distribution in the pipe for low flow rates. However, the effects of the added solid particles diminish as the flow rate increases. We created a master curve for drag change compared to single-phase flow for the particle-laden cases. This curve can be used to achieve more accurate friction factor predictions than the traditional modified viscosity approach that does not account for particle size.
- Published
- 2024
- Full Text
- View/download PDF
32. On the Universal Statistical Consistency of Expansive Hyperbolic Deep Convolutional Neural Networks
- Author
-
Ghosh, Sagar, Bose, Kushal, and Das, Swagatam
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
The emergence of Deep Convolutional Neural Networks (DCNNs) has been a pervasive tool for accomplishing widespread applications in computer vision. Despite its potential capability to capture intricate patterns inside the data, the underlying embedding space remains Euclidean and primarily pursues contractive convolution. Several instances can serve as a precedent for the exacerbating performance of DCNNs. The recent advancement of neural networks in the hyperbolic spaces gained traction, incentivizing the development of convolutional deep neural networks in the hyperbolic space. In this work, we propose Hyperbolic DCNN based on the Poincar\'{e} Disc. The work predominantly revolves around analyzing the nature of expansive convolution in the context of the non-Euclidean domain. We further offer extensive theoretical insights pertaining to the universal consistency of the expansive convolution in the hyperbolic space. Several simulations were performed not only on the synthetic datasets but also on some real-world datasets. The experimental results reveal that the hyperbolic convolutional architecture outperforms the Euclidean ones by a commendable margin.
- Published
- 2024
33. ExpressivityArena: Can LLMs Express Information Implicitly?
- Author
-
Tint, Joshua, Sagar, Som, Taparia, Aditya, Raines, Kelly, Pathiraja, Bimsara, Liu, Caleb, and Senanayake, Ransalu
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,I.2.7 - Abstract
While Large Language Models (LLMs) have demonstrated remarkable performance in certain dimensions, their ability to express implicit language cues that human use for effective communication remains unclear. This paper presents ExpressivityArena, a Python library for measuring the implicit communication abilities of LLMs. We provide a comprehensive framework to evaluate expressivity of arbitrary LLMs and explore its practical implications. To this end, we refine the definition and measurements of ``expressivity,'' and use our framework in a set of small experiments. These experiments test LLMs in creative and logical tasks such as poetry, coding, and emotion-based responses. They are then evaluated by an automated grader, through ExpressivityArena, which we verify to be the most pragmatic for testing expressivity. Building on these experiments, we deepen our understanding of the expressivity of LLMs by assessing their ability to remain expressive in conversations. Our findings indicate that LLMs are capable of generating and understanding expressive content, however, with some limitations. These insights will inform the future development and deployment of expressive LLMs. We provide the code for ExpressivityArena alongside our paper., Comment: 8 pages, 22 figures
- Published
- 2024
34. Generalizing upper bounds for the prime divisors of friendly number of 10
- Author
-
Mandal, Sagar and Mandal, Sourav
- Subjects
Mathematics - General Mathematics ,11Axx, 11A25 - Abstract
In this paper, we propose necessary upper bounds for all prime divisors of friends of 10. This is precisely a generalization of a recent paper \cite{SS} of the authors where they proposed upper bounds for the second, third, and fourth smallest prime divisors of friends of 10., Comment: 8 pages
- Published
- 2024
35. OpenFLAME: Building a large scale federated localization and mapping service
- Author
-
Bharadwaj, Sagar, Wang, Luke, Liang, Michael, Williams, Harrison, Liang, Ivan, Seshan, Srinivasan, and Rowe, Anthony
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
The widespread availability of maps has enabled the development of numerous location-based applications, including navigation, ride-sharing, fitness tracking, gaming, robotics, and augmented reality. Today, the maps that power these services are predominantly controlled by a few large corporations and mostly cover outdoor spaces. As the use of these applications expands and indoor localization technologies advance, we are seeing the need for a scalable, federated location management system that can extend into private spaces. We introduce OpenFLAME (Open Federated Localization and Mapping Engine), the first federated and decentralized localization service. OpenFLAME links servers that handle localization for specific regions, providing applications with a seamless global view. Creating a federated localization system poses challenges, such as discovering the appropriate servers for a region and integrating services managed by independent providers. To address these issues and ensure scalability, we leverage Domain Name System (DNS) for service discovery and implement map abstractions to retrieve and merge locations across different maps. Our trace-driven study demonstrates that federated localization across remote servers is feasible with acceptable query latencies. To highlight the potential of the system, we developed an augmented reality navigation application for a large indoor space, showing that OpenFLAME can successfully power location-based applications.
- Published
- 2024
36. Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions
- Author
-
Shrestha, Sagar and Fu, Xiao
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Understanding identifiability of latent content and style variables from unaligned multi-domain data is essential for tasks such as domain translation and data generation. Existing works on content-style identification were often developed under somewhat stringent conditions, e.g., that all latent components are mutually independent and that the dimensions of the content and style variables are known. We introduce a new analytical framework via cross-domain \textit{latent distribution matching} (LDM), which establishes content-style identifiability under substantially more relaxed conditions. Specifically, we show that restrictive assumptions such as component-wise independence of the latent variables can be removed. Most notably, we prove that prior knowledge of the content and style dimensions is not necessary for ensuring identifiability, if sparsity constraints are properly imposed onto the learned latent representations. Bypassing the knowledge of the exact latent dimension has been a longstanding aspiration in unsupervised representation learning -- our analysis is the first to underpin its theoretical and practical viability. On the implementation side, we recast the LDM formulation into a regularized multi-domain GAN loss with coupled latent variables. We show that the reformulation is equivalent to LDM under mild conditions -- yet requiring considerably less computational resource. Experiments corroborate with our theoretical claims.
- Published
- 2024
37. Constructing Emergent U(1) Symmetries in the Gamma-prime $\left(\bf \Gamma^{\prime} \right)$ model
- Author
-
Ramchandani, Sagar, Trebst, Simon, and Hickey, Ciarán
- Subjects
Condensed Matter - Strongly Correlated Electrons - Abstract
Frustrated magnets can elude the paradigm of conventional symmetry breaking and instead exhibit signatures of emergent symmetries at low temperatures. Such symmetries arise from "accidental" degeneracies within the ground state manifold and have been explored in a number of disparate models, in both two and three dimensions. Here we report the systematic construction of a family of classical spin models that, for a wide variety of lattice geometries with triangular motifs in one, two and three spatial dimensions, such as the kagome or hyperkagome lattices, exhibit an emergent, continuous U(1) symmetry. This is particularly surprising because the underlying Hamiltonian actually has very little symmetry - a bond-directional, off-diagonal exchange model inspired by the microscopics of spin-orbit entangled materials (the $\Gamma^{\prime}$-model). The construction thus allows for a systematic study of the interplay between the emergent continuous U(1) symmetry and the underlying discrete Hamiltonian symmetries in different lattices across different spatial dimensions. We discuss the impact of thermal and quantum fluctuations in lifting the accidental ground state degeneracy via the thermal and quantum order-by-disorder mechanisms, and how spatial dimensionality and lattice symmetries play a crucial role in shaping the physics of the model. Complementary Monte Carlo simulations, for representative one-, two-, and three-dimensional lattice geometries, provide a complete account of the thermodynamics and confirm our analytical expectations., Comment: 14 pages, 14 figures
- Published
- 2024
38. Deterministic Suffix-reading Automata
- Author
-
Keerthan, R, Srivathsan, B, Venkatesh, R, and Verma, Sagar
- Subjects
Computer Science - Formal Languages and Automata Theory ,F.1.1 - Abstract
We introduce deterministic suffix-reading automata (DSA), a new automaton model over finite words. Transitions in a DSA are labeled with words. From a state, a DSA triggers an outgoing transition on seeing a word ending with the transition's label. Therefore, rather than moving along an input word letter by letter, a DSA can jump along blocks of letters, with each block ending in a suitable suffix. This feature allows DSAs to recognize regular languages more concisely, compared to DFAs. In this work, we focus on questions around finding a "minimal" DSA for a regular language. The number of states is not a faithful measure of the size of a DSA, since the transition-labels contain strings of arbitrary length. Hence, we consider total-size (number of states + number of edges + total length of transition-labels) as the size measure of DSAs. We start by formally defining the model and providing a DSA-to-DFA conversion that allows to compare the expressiveness and succinctness of DSA with related automata models. Our main technical contribution is a method to derive DSAs from a given DFA: a DFA-to-DSA conversion. We make a surprising observation that the smallest DSA derived from the canonical DFA of a regular language L need not be a minimal DSA for L. This observation leads to a fundamental bottleneck in deriving a minimal DSA for a regular language. In fact, we prove that given a DFA and a number k, the problem of deciding if there exists an equivalent DSA of total-size at most k is NP-complete., Comment: In Proceedings GandALF 2024, arXiv:2410.21884
- Published
- 2024
- Full Text
- View/download PDF
39. Integrating Deep Feature Extraction and Hybrid ResNet-DenseNet Model for Multi-Class Abnormality Detection in Endoscopic Images
- Author
-
Sagar, Aman, Mehta, Preeti, Shrivastva, Monika, and Kumari, Suchi
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
This paper presents a deep learning framework for the multi-class classification of gastrointestinal abnormalities in Video Capsule Endoscopy (VCE) frames. The aim is to automate the identification of ten GI abnormality classes, including angioectasia, bleeding, and ulcers, thereby reducing the diagnostic burden on gastroenterologists. Utilizing an ensemble of DenseNet and ResNet architectures, the proposed model achieves an overall accuracy of 94\% across a well-structured dataset. Precision scores range from 0.56 for erythema to 1.00 for worms, with recall rates peaking at 98% for normal findings. This study emphasizes the importance of robust data preprocessing techniques, including normalization and augmentation, in enhancing model performance. The contributions of this work lie in developing an effective AI-driven tool that streamlines the diagnostic process in gastroenterology, ultimately improving patient care and clinical outcomes., Comment: 10 pages, 5 figures, CVIP challenge report including the validation results
- Published
- 2024
40. LLM-Assisted Red Teaming of Diffusion Models through 'Failures Are Fated, But Can Be Faded'
- Author
-
Sagar, Som, Taparia, Aditya, and Senanayake, Ransalu
- Subjects
Computer Science - Machine Learning - Abstract
In large deep neural networks that seem to perform surprisingly well on many tasks, we also observe a few failures related to accuracy, social biases, and alignment with human values, among others. Therefore, before deploying these models, it is crucial to characterize this failure landscape for engineers to debug or audit models. Nevertheless, it is infeasible to exhaustively test for all possible combinations of factors that could lead to a model's failure. In this paper, we improve the "Failures are fated, but can be faded" framework (arXiv:2406.07145)--a post-hoc method to explore and construct the failure landscape in pre-trained generative models--with a variety of deep reinforcement learning algorithms, screening tests, and LLM-based rewards and state generation. With the aid of limited human feedback, we then demonstrate how to restructure the failure landscape to be more desirable by moving away from the discovered failure modes. We empirically demonstrate the effectiveness of the proposed method on diffusion models. We also highlight the strengths and weaknesses of each algorithm in identifying failure modes., Comment: 13 pages, 11 figures. arXiv admin note: substantial text overlap with arXiv:2406.07145
- Published
- 2024
41. Univariate Conditional Variational Autoencoder for Morphogenic Patterns Design in Frontal Polymerization-Based Manufacturing
- Author
-
Liu, Qibang, Cai, Pengfei, Abueidda, Diab, Vyas, Sagar, Koric, Seid, Gomez-Bombarelli, Rafael, and Geubelle, Philippe
- Subjects
Physics - Computational Physics ,Computer Science - Machine Learning - Abstract
Under some initial and boundary conditions, the rapid reaction-thermal diffusion process taking place during frontal polymerization (FP) destabilizes the planar mode of front propagation, leading to spatially varying, complex hierarchical patterns in thermoset polymeric materials. Although modern reaction-diffusion models can predict the patterns resulting from unstable FP, the inverse design of patterns, which aims to retrieve process conditions that produce a desired pattern, remains an open challenge due to the non-unique and non-intuitive mapping between process conditions and manufactured patterns. In this work, we propose a probabilistic generative model named univariate conditional variational autoencoder (UcVAE) for the inverse design of hierarchical patterns in FP-based manufacturing. Unlike the cVAE, which encodes both the design space and the design target, the UcVAE encodes only the design space. In the encoder of the UcVAE, the number of training parameters is significantly reduced compared to the cVAE, resulting in a shorter training time while maintaining comparable performance. Given desired pattern images, the trained UcVAE can generate multiple process condition solutions that produce high-fidelity hierarchical patterns.
- Published
- 2024
42. FloRa: Flow Table Low-Rate Overflow Reconnaissance and Detection in SDN
- Author
-
Mudgal, Ankur, Verma, Abhishek, Singh, Munesh, Sahoo, Kshira Sagar, Elmroth, Erik, and Bhuyan, Monowar
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
Software Defined Networking (SDN) has evolved to revolutionize next-generation networks, offering programmability for on-the-fly service provisioning, primarily supported by the OpenFlow (OF) protocol. The limited storage capacity of Ternary Content Addressable Memory (TCAM) for storing flow tables in OF switches introduces vulnerabilities, notably the Low-Rate Flow Table Overflow (LOFT) attacks. LOFT exploits the flow table's storage capacity by occupying a substantial amount of space with malicious flow, leading to a gradual degradation in the flow-forwarding performance of OF switches. To mitigate this threat, we propose FloRa, a machine learning-based solution designed for monitoring and detecting LOFT attacks in SDN. FloRa continuously examines and determines the status of the flow table by closely examining the features of the flow table entries. Upon detecting an attack FloRa promptly activates the detection module. The module monitors flow properties, identifies malicious flows, and blacklists them, facilitating their eviction from the flow table. Incorporating novel features such as Packet Arrival Frequency, Content Relevance Score, and Possible Spoofed IP along with Cat Boost employed as the attack detection method. The proposed method reduces CPU overhead, memory overhead, and classification latency significantly and achieves a detection accuracy of 99.49%, which is more than the state-of-the-art methods to the best of our knowledge. This approach not only protects the integrity of the flow tables but also guarantees the uninterrupted flow of legitimate traffic. Experimental results indicate the effectiveness of FloRa in LOFT attack detection, ensuring uninterrupted data forwarding and continuous availability of flow table resources in SDN., Comment: IEEE Transactions on Network and Service Management (2024)
- Published
- 2024
- Full Text
- View/download PDF
43. Oscillatory equilibrium in asymmetric evolutionary games: Generalizing evolutionarily stable strategy
- Author
-
Dubey, Vikash Kumar, Chakraborty, Suman, and Chakraborty, Sagar
- Subjects
Quantitative Biology - Populations and Evolution ,Nonlinear Sciences - Adaptation and Self-Organizing Systems - Abstract
The concept of evolutionarily stability and its relation with the fixed points of the replicator equation are important aspects of evolutionary game dynamics. In the light of the fact that oscillating state of a population and individuals (or players) of different roles are quite natural occurrences, we ask the question how the concept of evolutionarily stability can be generalized so as to associate game-theoretic meaning to oscillatory behaviours of players asymmetrically interacting, i.e., if there are both intraspecific and interspecific interactions between two subpopulations in the population. We guide our scheme of generalization such that the evolutionary stability is related to the dynamic stability of the corresponding periodic orbits of a time-discrete replicator dynamics. We name the generalization of evolutionarily stable state as two-species heterogeneity stable orbit. Furthermore, we invoke the principle of decrease of relative entropy in order to associate the generalization of evolutionary stability with an information-theoretic meaning. This particular generalization is aptly termed as two-species information stable orbit.
- Published
- 2024
44. Ten years of searching for relics of AGN jet feedback through RAD@home citizen science
- Author
-
Hota, Ananda, Dabhade, Pratik, Machado, Prasun, Kumar, Avinash, Avinash, Ck., Manaswini, Ninisha, Das, Joydeep, Sethi, Sagar, Sahoo, Sumanta, Dubal, Shilpa, Bhoga, Sai Arun Dharmik, Navaneeth, P. K., Konar, C., Pal, Sabyasachi, Vaddi, Sravani, Apoorva, Prakash, Rajoria, Megha, and Purohit, Arundhati
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Understanding the evolution of galaxies cannot exclude the important role played by the central supermassive black hole and the circumgalactic medium (CGM). Simulations have strongly suggested the negative feedback of AGN Jet/wind/outflows on the ISM/CGM of a galaxy leading to the eventual decline of star formation. However, no "smoking gun" evidence exists so far where relics of feedback, observed in any band, are consistent with the time scale of a major decline in star formation, in any sample of galaxies. Relics of any AGN-driven outflows will be observed as a faint and fuzzy structure which may be difficult to characterise by automated algorithms but trained citizen scientists can possibly perform better through their intuitive vision with additional heterogeneous data available anywhere on the Internet. RAD@home, launched on 15th April 2013, is not only the first Indian Citizen Science Research (CSR) platform in astronomy but also the only CSR publishing discoveries using any Indian telescope. We briefly report 11 CSR discoveries collected over the last eleven years. While searching for such relics we have spotted cases of offset relic lobes from elliptical and spiral, episodic radio galaxies with overlapping lobes as the host galaxy is in motion, large diffuse spiral-shaped emission, cases of jet-galaxy interaction, kinks and burls on the jets, a collimated synchrotron thread etc. Such exotic sources push the boundaries of our understanding of classical Seyferts and radio galaxies with jets and the process of discovery prepares the next generation for science with the upgraded GMRT and Square Kilometre Array Observatory (SKAO)., Comment: 14 pages, 8 figures. Accepted for publication in the Springer-Nature conference proceedings for "ISRA 2023: The Relativistic Universe: From Classical to Quantum Proceedings of the International Symposium on Recent Developments in Relativistic Astrophysics". Comments and collaborations, most welcome! Please visit #RADatHomeIndia website at radathomeindia.org
- Published
- 2024
45. Participatory Budget Allocation Method for Approval Ballots
- Author
-
Page, Rutvik, Doifode, Arnav, Tembhurne, Jitendra, and Ukey, Aishwarya Sagar Anand
- Subjects
Computer Science - Computational Engineering, Finance, and Science - Abstract
In this paper, we study the problem of Participatory Budgeting (PB) with approval ballots, inspired by Multi-Winner Voting schemes. We present generalized preference aggregation methods for participatory budgeting, especially for finding seemingly fair budget allocations. To achieve this, we generalize such preference aggregation methods from the well-known methods, namely the Sequential Chamberlin Courant rule and the Sequential Monroe Rule in the realm of social choice theory. Further, we provide an experimental evaluation of the preference aggregation methods using an impartial culture method of preference generation and study the extent to which such polynomial time algorithms satisfy one of the most popular notions of fairness called proportional representation., Comment: 8 pages, 3 figures
- Published
- 2024
46. Pixtral 12B
- Author
-
Agrawal, Pravesh, Antoniak, Szymon, Hanna, Emma Bou, Bout, Baptiste, Chaplot, Devendra, Chudnovsky, Jessica, Costa, Diogo, De Monicault, Baudouin, Garg, Saurabh, Gervet, Theophile, Ghosh, Soham, Héliou, Amélie, Jacob, Paul, Jiang, Albert Q., Khandelwal, Kartik, Lacroix, Timothée, Lample, Guillaume, Casas, Diego Las, Lavril, Thibaut, Scao, Teven Le, Lo, Andy, Marshall, William, Martin, Louis, Mensch, Arthur, Muddireddy, Pavankumar, Nemychnikova, Valera, Pellat, Marie, Von Platen, Patrick, Raghuraman, Nikhil, Rozière, Baptiste, Sablayrolles, Alexandre, Saulnier, Lucile, Sauvestre, Romain, Shang, Wendy, Soletskyi, Roman, Stewart, Lawrence, Stock, Pierre, Studnia, Joachim, Subramanian, Sandeep, Vaze, Sagar, Wang, Thomas, and Yang, Sophia
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Computation and Language - Abstract
We introduce Pixtral-12B, a 12--billion-parameter multimodal language model. Pixtral-12B is trained to understand both natural images and documents, achieving leading performance on various multimodal benchmarks, surpassing a number of larger models. Unlike many open-source models, Pixtral is also a cutting-edge text model for its size, and does not compromise on natural language performance to excel in multimodal tasks. Pixtral uses a new vision encoder trained from scratch, which allows it to ingest images at their natural resolution and aspect ratio. This gives users flexibility on the number of tokens used to process an image. Pixtral is also able to process any number of images in its long context window of 128K tokens. Pixtral 12B substanially outperforms other open models of similar sizes (Llama-3.2 11B \& Qwen-2-VL 7B). It also outperforms much larger open models like Llama-3.2 90B while being 7x smaller. We further contribute an open-source benchmark, MM-MT-Bench, for evaluating vision-language models in practical scenarios, and provide detailed analysis and code for standardized evaluation protocols for multimodal LLMs. Pixtral-12B is released under Apache 2.0 license.
- Published
- 2024
47. Hysteresis design of non-stoichiometric Fe2P-type alloys with giant magnetocaloric Effect
- Author
-
Ghorai, Sagar, Clulow, Rebecca, Cedervall, Johan, Huang, Shuo, Ericsson, Tore, Häggström, Lennart, Skini, Ridha, Shtender, Vitalii, Vitos, Levente, Eriksson, Olle, Scheibel, Franziska, Gutfleisch, Oliver, Sahlberg, Martin, and Svedlindh, Peter
- Subjects
Condensed Matter - Materials Science ,Physics - Applied Physics - Abstract
The non-stoichiometric Fe$_2$P-type (FeMnP$_{0.5}$Si$_{0.5}$)$_{1-x}$(FeV)$_{x}$ alloys ( $x=0, 0.01$, $0.02$, and $0.03$) have been investigated as potential candidates for magnetic refrigeration near room temperature. The magnetic ordering temperature decreases with increasing FeV concentration, $x$, which can be ascribed to decreased ferromagnetic coupling strength between the magnetic atoms. The strong magnetoelastic coupling in these alloys results in large values of the isothermal entropy change ($\Delta S_M$); $15.7$ J/kgK, at $2$ T magnetic field for the $x = 0$ alloy. $\Delta S_M$ decreases with increasing $x$. Results from M{\"o}ssbauer spectroscopy reveal that the average hyperfine field (in the ferromagnetic state) and average center shift (in the paramagnetic state) have the same decreasing trend as $\Delta S_M$. The thermal hysteresis ($\Delta T_{hyst}$) of the magnetic phase transition decreases with increasing $x$, while the mechanical stability of the alloys improves due to the reduced lattice volume change across the magnetoelastic phase transition. The adiabatic temperature change $\Delta T_{ad}$, which highly depends on $\Delta T_{hyst}$, is $1.7$ K at $1.9$ T applied field for the $x = 0.02$ alloy.
- Published
- 2024
48. Exploring the central region of NGC 1365 in the ultraviolet domain
- Author
-
Kurian, Kshama Sara, Stalin, C. S., Wylezalek, Dominika, Lyubenova, Mariya, Adhikari, Tek Prasad, Devaraj, Ashish, Sagar, Ram, Patig, Markus-Kissler, and Mondal, Santanu
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Active galactic nuclei (AGN) feedback and its impact on their host galaxies are critical to our understanding of galaxy evolution. Here, we present a combined analysis of new high resolution ultraviolet (UV) data from the Ultraviolet Imaging Telescope (UVIT) on AstroSat and archival optical spectroscopic data from VLT/MUSE, for the Seyfert galaxy, NGC 1365. Concentrating on the central 5 kpc region, the UVIT images in the far and near UV show bright star forming knots in the circumnuclear ring as well as a faint central source. After correcting for extinction, we found the star formation rate (SFR) surface density of the circumnuclear 2 kpc ring to be similar to other starbursts, despite the presence of an AGN outflow, as seen in [OIII] 5007 Angstrom. On the other hand, we found fainter UV and thus lower SFR in the direction south-east of the AGN relative to north-west in agreement with observations at other wavelengths from JWST and ALMA. The AGN outflow velocity is found to be lesser than the escape velocity, suggesting that the outflowing gas will rain back into the galaxy. The deep UV data has also revealed diffuse UV emission in the direction of the AGN outflow. By combining [OIII] and UV data, we found the diffuse emission to be of AGN origin., Comment: 12 Pages, 7 figures, Accepted for publication in ApJ
- Published
- 2024
49. Abstractive Summarization of Low resourced Nepali language using Multilingual Transformers
- Author
-
Dhakal, Prakash and Baral, Daya Sagar
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Automatic text summarization in Nepali language is an unexplored area in natural language processing (NLP). Although considerable research has been dedicated to extractive summarization, the area of abstractive summarization, especially for low-resource languages such as Nepali, remains largely unexplored. This study explores the use of multilingual transformer models, specifically mBART and mT5, for generating headlines for Nepali news articles through abstractive summarization. The research addresses key challenges associated with summarizing texts in Nepali by first creating a summarization dataset through web scraping from various Nepali news portals. These multilingual models were then fine-tuned using different strategies. The performance of the fine-tuned models were then assessed using ROUGE scores and human evaluation to ensure the generated summaries were coherent and conveyed the original meaning. During the human evaluation, the participants were asked to select the best summary among those generated by the models, based on criteria such as relevance, fluency, conciseness, informativeness, factual accuracy, and coverage. During the evaluation with ROUGE scores, the 4-bit quantized mBART with LoRA model was found to be effective in generating better Nepali news headlines in comparison to other models and also it was selected 34.05% of the time during the human evaluation, outperforming all other fine-tuned models created for Nepali News headline generation.
- Published
- 2024
50. Performance Evaluation of Tokenizers in Large Language Models for the Assamese Language
- Author
-
Tamang, Sagar and Bora, Dibya Jyoti
- Subjects
Computer Science - Computation and Language - Abstract
Training of a tokenizer plays an important role in the performance of deep learning models. This research aims to understand the performance of tokenizers in five state-of-the-art (SOTA) large language models (LLMs) in the Assamese language of India. The research is important to understand the multi-lingual support for a low-resourced language such as Assamese. Our research reveals that the tokenizer of SUTRA from Two AI performs the best with an average Normalized Sequence Length (NSL) value of 0.45, closely followed by the tokenizer of GPT-4o from Open AI with an average NSL value of 0.54, followed by Gemma 2, Meta Llama 3.1, and Mistral Large Instruct 2407 with an average NSL value of 0.82, 1.4, and 1.48 respectively.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.