188,688 results on '"A. A. Patil"'
Search Results
2. SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
- Author
-
Yoon, Jaehong, Yu, Shoubin, Patil, Vaidehi, Yao, Huaxiu, and Bansal, Mohit
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Recent advances in diffusion models have significantly enhanced their ability to generate high-quality images and videos, but they have also increased the risk of producing unsafe content. Existing unlearning/editing-based methods for safe generation remove harmful concepts from models but face several challenges: (1) They cannot instantly remove harmful concepts without training. (2) Their safe generation capabilities depend on collected training data. (3) They alter model weights, risking degradation in quality for content unrelated to toxic concepts. To address these, we propose SAFREE, a novel, training-free approach for safe T2I and T2V, that does not alter the model's weights. Specifically, we detect a subspace corresponding to a set of toxic concepts in the text embedding space and steer prompt embeddings away from this subspace, thereby filtering out harmful content while preserving intended semantics. To balance the trade-off between filtering toxicity and preserving safe concepts, SAFREE incorporates a novel self-validating filtering mechanism that dynamically adjusts the denoising steps when applying the filtered embeddings. Additionally, we incorporate adaptive re-attention mechanisms within the diffusion latent space to selectively diminish the influence of features related to toxic concepts at the pixel level. In the end, SAFREE ensures coherent safety checking, preserving the fidelity, quality, and safety of the output. SAFREE achieves SOTA performance in suppressing unsafe content in T2I generation compared to training-free baselines and effectively filters targeted concepts while maintaining high-quality images. It also shows competitive results against training-based methods. We extend SAFREE to various T2I backbones and T2V tasks, showcasing its flexibility and generalization. SAFREE provides a robust and adaptable safeguard for ensuring safe visual generation., Comment: The first two authors contributed equally; Project page: https://safree-safe-t2i-t2v.github.io/
- Published
- 2024
3. Time Series Viewmakers for Robust Disruption Prediction
- Author
-
Chayapathy, Dhruva, Siebert, Tavis, Spangher, Lucas, Moharir, Akshata Kishore, Patil, Om Manoj, and Rea, Cristina
- Subjects
Computer Science - Machine Learning - Abstract
Machine Learning guided data augmentation may support the development of technologies in the physical sciences, such as nuclear fusion tokamaks. Here we endeavor to study the problem of detecting disruptions i.e. plasma instabilities that can cause significant damages, impairing the reliability and efficiency required for their real world viability. Machine learning (ML) prediction models have shown promise in detecting disruptions for specific tokamaks, but they often struggle in generalizing to the diverse characteristics and dynamics of different machines. This limits the effectiveness of ML models across different tokamak designs and operating conditions, which is a critical barrier to scaling fusion technology. Given the success of data augmentation in improving model robustness and generalizability in other fields, this study explores the use of a novel time series viewmaker network to generate diverse augmentations or "views" of training data. Our results show that incorporating views during training improves AUC and F2 scores on DisruptionBench tasks compared to standard or no augmentations. This approach represents a promising step towards developing more broadly applicable ML models for disruption avoidance, which is essential for advancing fusion technology and, ultimately, addressing climate change through reliable and sustainable energy production.
- Published
- 2024
4. Achieving multi uav best viewpoint coordination in obstructed environments
- Author
-
Baglioni, Mirko, Patil, Apurva, Sentis, Luis, and Jamshidnejad, Anahita
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
Wildfire suppression is a complex task that poses high risks to humans. Using robotic teams for wildfire suppression enhances the safety and efficiency of detecting, monitoring, and extinguishing fires. We propose a control architecture based on task hierarchical control for the autonomous steering of a system of flying robots in wildfire suppression. We incorporate a novel line-of-sight obstacle avoidance method that calculates the best viewpoints and ensures an occlusion-free view for the suppression robot during the mission. Path integral control generates optimal trajectories towards the goals. We conduct an ablation study to assess the effectiveness of our approach by comparing it to scenarios where these key components are excluded, in order to validate the approach in simulations using Matlab and Unity. The results demonstrate significant performance improvements, with 44.0 % increase in effectiveness with the new line-of-sight obstacle avoidance task and up to 39.6 % improvement when using path integral control., Comment: 6 pages, 5 figures, submitted to joint ACC and L-CSS
- Published
- 2024
5. Investigating the sightline of a highly scattered FRB through a filamentary structure in the local Universe
- Author
-
Shin, Kaitlyn, Leung, Calvin, Simha, Sunil, Andersen, Bridget C., Fonseca, Emmanuel, Nimmo, Kenzie, Bhardwaj, Mohit, Brar, Charanjot, Chatterjee, Shami, Cook, Amanda M., Gaensler, B. M., Joseph, Ronniy C., Jow, Dylan, Kaczmarek, Jane, Kahinga, Lordrick, Kaspi, Victoria M., Kharel, Bikash, Lanman, Adam E., Lazda, Mattias, Main, Robert A., Mas-Ribas, Lluis, Masui, Kiyoshi W., Mena-Parra, Juan, Michilli, Daniele, Pandhi, Ayush, Patil, Swarali Shivraj, Pearlman, Aaron B., Pleunis, Ziggy, Prochaska, J. Xavier, Rafiei-Ravandi, Masoud, Sammons, Mawson W., Sand, Ketan R., Smith, Kendrick, and Stairs, Ingrid
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
Fast radio bursts (FRBs) are unique probes of extragalactic ionized baryonic structure as each signal, through its burst properties, holds information about the ionized matter it encounters along its sightline. FRB 20200723B is a burst with a scattering timescale of $\tau_\mathrm{400\,MHz} >$1 second at 400 MHz and a dispersion measure of DM $\sim$ 244 pc cm$^{-3}$. Observed across the entire CHIME/FRB frequency band, it is the single-component burst with the largest scattering timescale yet observed by CHIME/FRB. The combination of its high scattering timescale and relatively low dispersion measure present an uncommon opportunity to use FRB 20200723B to explore the properties of the cosmic web it traversed. With an $\sim$arcminute-scale localization region, we find the most likely host galaxy is NGC 4602 (with PATH probability $P(O|x)=0.985$), which resides $\sim$30 Mpc away within a sheet filamentary structure on the outskirts of the Virgo Cluster. We place an upper limit on the average free electron density of this filamentary structure of $\langle n_e \rangle < 4.6^{+9.6}_{-2.0} \times 10^{-5}$ cm$^{-3}$, broadly consistent with expectations from cosmological simulations. We investigate whether the source of scattering lies within the same galaxy as the FRB, or at a farther distance from an intervening structure along the line of sight. Comparing with Milky Way pulsar observations, we suggest the scattering may originate from within the host galaxy of FRB 20200723B., Comment: 20 pages, 6 figures, submitted. Comments welcome!
- Published
- 2024
6. Retrieval-Augmented Decision Transformer: External Memory for In-context RL
- Author
-
Schmied, Thomas, Paischer, Fabian, Patil, Vihang, Hofmarcher, Markus, Pascanu, Razvan, and Hochreiter, Sepp
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In-context learning (ICL) is the ability of a model to learn a new task by observing a few exemplars in its context. While prevalent in NLP, this capability has recently also been observed in Reinforcement Learning (RL) settings. Prior in-context RL methods, however, require entire episodes in the agent's context. Given that complex environments typically lead to long episodes with sparse rewards, these methods are constrained to simple environments with short episodes. To address these challenges, we introduce Retrieval-Augmented Decision Transformer (RA-DT). RA-DT employs an external memory mechanism to store past experiences from which it retrieves only sub-trajectories relevant for the current situation. The retrieval component in RA-DT does not require training and can be entirely domain-agnostic. We evaluate the capabilities of RA-DT on grid-world environments, robotics simulations, and procedurally-generated video games. On grid-worlds, RA-DT outperforms baselines, while using only a fraction of their context length. Furthermore, we illuminate the limitations of current in-context RL methods on complex environments and discuss future directions. To facilitate future research, we release datasets for four of the considered environments.
- Published
- 2024
7. Precise Model Benchmarking with Only a Few Observations
- Author
-
Fogliato, Riccardo, Patil, Pratik, Akpinar, Nil-Jana, and Monfort, Mathew
- Subjects
Computer Science - Machine Learning ,Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition ,Statistics - Applications - Abstract
How can we precisely estimate a large language model's (LLM) accuracy on questions belonging to a specific topic within a larger question-answering dataset? The standard direct estimator, which averages the model's accuracy on the questions in each subgroup, may exhibit high variance for subgroups (topics) with small sample sizes. Synthetic regression modeling, which leverages the model's accuracy on questions about other topics, may yield biased estimates that are too unreliable for large subgroups. We prescribe a simple yet effective solution: an empirical Bayes (EB) estimator that balances direct and regression estimates for each subgroup separately, improving the precision of subgroup-level estimates of model performance. Our experiments on multiple datasets show that this approach consistently provides more precise estimates of the LLM performance compared to the direct and regression approaches, achieving substantial reductions in the mean squared error. Confidence intervals for EB estimates also have near-nominal coverage and are narrower compared to those for the direct estimator. Additional experiments on tabular and vision data validate the benefits of this EB approach., Comment: To appear at EMNLP 2024
- Published
- 2024
8. Measuring the ISM Content of Nearby, Luminous, Type 1 and Type 2 QSOs through CO and [C II]
- Author
-
Luo, Yuanze, Petric, A. O., Janssen, R. M. J., Fadda, D., Flagey, N., Omont, A., Jacob, A. M., Rowlands, K., Alatalo, K., Billot, N., Heckman, T., Husemann, B., Kakkad, D., Lacy, M., Marshall, J., Minchin, R., Minsley, R., Nesvadba, N., Otter, J. A., Patil, P., and Urrutia, T.
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We present observations of CO(1--0) and CO(2--1) lines from the Institut de radioastronomie millim\'etrique (IRAM) 30m telescope toward 20 nearby, optically luminous type 2 quasars (QSO2s) and observations of [C II] 158$\mu$m line from the Stratospheric Observatory For Infrared Astronomy (SOFIA) for 5 QSO2s in the CO sample and 5 type 1 quasars (QSO1s). In the traditional evolutionary scenario explaining different types of QSOs, obscured QSO2s emerge from gas-rich mergers observed as luminous infrared galaxies (LIRGs) and then turn into unobscured QSO1s as the black holes clear out the obscuring material in a blow-out phase. We test the validity of this theoretical prediction by comparing the gas fractions and star formation efficiencies among LIRGs and QSOs. We find that CO luminosity, CO-derived gas masses and gas fractions in QSO1s are consistent with those estimated for QSO2s, while LIRGs exhibit a closer resemblance to QSO2s in terms of CO-derived gas masses and gas fractions, and [C II] luminosity. However, comparisons between [C II] luminosity and star formation tracers such as the CO and infrared luminosity imply additional sources of [C II] emission in QSO1s likely tracing neutral atomic or ionized gas. All three types of galaxies have statistically indistinguishable distributions of star formation efficiency. Our results are consistent with part of the evolutionary scenario where nearby QSO2s could emerge from LIRGs, but they are unlikely to be the precursors of nearby QSO1s., Comment: 32 pages, 10 figures, 7 tables; the complete set of Figure 1 is appended to the end of document. Under review (revisions submitted) by ApJ
- Published
- 2024
9. Toward Debugging Deep Reinforcement Learning Programs with RLExplorer
- Author
-
Bouchoucha, Rached, Yahmed, Ahmed Haj, Patil, Darshan, Rajendran, Janarthanan, Nikanjam, Amin, Chandar, Sarath, and Khomh, Foutse
- Subjects
Computer Science - Software Engineering ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Deep reinforcement learning (DRL) has shown success in diverse domains such as robotics, computer games, and recommendation systems. However, like any other software system, DRL-based software systems are susceptible to faults that pose unique challenges for debugging and diagnosing. These faults often result in unexpected behavior without explicit failures and error messages, making debugging difficult and time-consuming. Therefore, automating the monitoring and diagnosis of DRL systems is crucial to alleviate the burden on developers. In this paper, we propose RLExplorer, the first fault diagnosis approach for DRL-based software systems. RLExplorer automatically monitors training traces and runs diagnosis routines based on properties of the DRL learning dynamics to detect the occurrence of DRL-specific faults. It then logs the results of these diagnoses as warnings that cover theoretical concepts, recommended practices, and potential solutions to the identified faults. We conducted two sets of evaluations to assess RLExplorer. Our first evaluation of faulty DRL samples from Stack Overflow revealed that our approach can effectively diagnose real faults in 83% of the cases. Our second evaluation of RLExplorer with 15 DRL experts/developers showed that (1) RLExplorer could identify 3.6 times more defects than manual debugging and (2) RLExplorer is easily integrated into DRL applications., Comment: Accepted for publication in The International Conference on Software Maintenance and Evolution (ICSME 2024)
- Published
- 2024
10. MMP: Towards Robust Multi-Modal Learning with Masked Modality Projection
- Author
-
Nezakati, Niki, Reza, Md Kaykobad, Patil, Ameya, Solh, Mashhour, and Asif, M. Salman
- Subjects
Computer Science - Machine Learning ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Multimodal learning seeks to combine data from multiple input sources to enhance the performance of different downstream tasks. In real-world scenarios, performance can degrade substantially if some input modalities are missing. Existing methods that can handle missing modalities involve custom training or adaptation steps for each input modality combination. These approaches are either tied to specific modalities or become computationally expensive as the number of input modalities increases. In this paper, we propose Masked Modality Projection (MMP), a method designed to train a single model that is robust to any missing modality scenario. We achieve this by randomly masking a subset of modalities during training and learning to project available input modalities to estimate the tokens for the masked modalities. This approach enables the model to effectively learn to leverage the information from the available modalities to compensate for the missing ones, enhancing missing modality robustness. We conduct a series of experiments with various baseline models and datasets to assess the effectiveness of this strategy. Experiments demonstrate that our approach improves robustness to different missing modality scenarios, outperforming existing methods designed for missing modalities or specific modality combinations.
- Published
- 2024
11. Learning Wheelchair Tennis Navigation from Broadcast Videos with Domain Knowledge Transfer and Diffusion Motion Planning
- Author
-
Wu, Zixuan, Zaidi, Zulfiqar, Patil, Adithya, Xiao, Qingyu, and Gombolay, Matthew
- Subjects
Computer Science - Robotics - Abstract
In this paper, we propose a novel and generalizable zero-shot knowledge transfer framework that distills expert sports navigation strategies from web videos into robotic systems with adversarial constraints and out-of-distribution image trajectories. Our pipeline enables diffusion-based imitation learning by reconstructing the full 3D task space from multiple partial views, warping it into 2D image space, closing the planning loop within this 2D space, and transfer constrained motion of interest back to task space. Additionally, we demonstrate that the learned policy can serve as a local planner in conjunction with position control. We apply this framework in the wheelchair tennis navigation problem to guide the wheelchair into the ball-hitting region. Our pipeline achieves a navigation success rate of 97.67% in reaching real-world recorded tennis ball trajectories with a physical robot wheelchair, and achieve a success rate of 68.49% in a real-world, real-time experiment on a full-sized tennis court., Comment: This manuscript has been submitted to 2025 IEEE International Conference on Robotics & Automation (ICRA)
- Published
- 2024
12. Collective action and entanglement of magnetically active liquid crystal elastomer ribbons
- Author
-
Dana, Asaf, Benson, Christian, Kalairaj, Manivannan Sivaperuman, Hellikson, Kayla, George, Sasha M., Chimene, David C., Gibson, Jared A., Tasmim, Seelay, Kohl, Phillip A., Li, Youli, Abdelrahman, Mustafa K., Patil, Vishal P., and Ware, Taylor H.
- Subjects
Condensed Matter - Soft Condensed Matter - Abstract
Interactions between active individuals in animal collectives lead to emergent responses that remain elusive in synthetic soft matter. Here, shape-morphing polymers are used to create bio-inspired transient solids that self-assemble with controlled mechanical properties and disassemble on demand. Dilute-suspensions of magnetic, heat-responsive liquid crystal elastomer ribbons mechanically interlock, inducing reversible aggregation. A mathematical model is developed that sheds light on the role of topological mechanisms in aggregation. Aggregation was favored for ribbons with moderate curvature at 25C above crosslinking temperature as compared to flat ribbons or higher curvature ribbons at higher temperatures. The ribbon suspensions reversibly transition between fluid- and solid-like states, exhibiting up to 6 orders-of-magnitude increase in the storage moduli of the entangled aggregates compared with the liquid dispersions. Controlled dissociation is induced by imparting kinetic energy to the individual ribbons at high magnetic field rotation speeds (> 200 RPM). Ribbon shape and the medium in which dissociation occurs were shown to govern disassembly. Imparting dynamic collective behaviors into synthetic systems may enable a range of potential applications from bio-inspired soft robotics to injectable biomaterials., Comment: 16 pages, 6 figures
- Published
- 2024
13. Pulling back the curtain on shocks and star-formation in NGC 1266 with Gemini-NIFS
- Author
-
Otter, Justin Atsushi, Alatalo, Katherine, Rowlands, Kate, McDermid, Richard M., Davis, Timothy A., Federrath, Christoph, French, K. Decker, Heckman, Timothy, Ogle, Patrick, Kakkad, Darshan, Luo, Yuanze, Nyland, Kristina, Tripathi, Akshat, Patil, Pallavi, Petric, Andreea, Smercina, Adam, Skarbinski, Maya, Lanz, Lauranne, Larson, Kristin, Appleton, Philip N., Aalto, Susanne, Olander, Gustav, Sazonova, Elizaveta, and Smith, J. D. T.
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We present Gemini near-infrared integral field spectrograph (NIFS) K-band observations of the central 400 pc of NGC 1266, a nearby (D$\approx$30 Mpc) post-starburst galaxy with a powerful multi-phase outflow and a shocked ISM. We detect 7 H$_2$ ro-vibrational emission lines excited thermally to $T$$\sim$2000 K, and weak Br$\gamma$ emission, consistent with a fast C-shock. With these bright H$_2$ lines, we observe the spatial structure of the shock with an unambiguous tracer for the first time. The Br$\gamma$ emission is concentrated in the central $\lesssim$100 pc, indicating that any remaining star-formation in NGC 1266 is in the nucleus while the surrounding cold molecular gas has little on-going star-formation. Though it is unclear what fraction of this Br$\gamma$ emission is from star-formation or the AGN, assuming it is entirely due to star-formation we measure an instantaneous star-formation rate of 0.7 M$_\odot$ yr$^{-1}$, though the star-formation rate may be significantly higher in the presence of additional extinction. NGC 1266 provides a unique laboratory to study the complex interactions between AGN, outflows, shocks, and star-formation, all of which are necessary to unravel the evolution of the post-starburst phase., Comment: ApJ accepted
- Published
- 2024
14. Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models
- Author
-
Zhang, Mike, Qu, Kaixian, Patil, Vaishakh, Cadena, Cesar, and Hutter, Marco
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Large Language Models (LLM) have emerged as a tool for robots to generate task plans using common sense reasoning. For the LLM to generate actionable plans, scene context must be provided, often through a map. Recent works have shifted from explicit maps with fixed semantic classes to implicit open vocabulary maps based on queryable embeddings capable of representing any semantic class. However, embeddings cannot directly report the scene context as they are implicit, requiring further processing for LLM integration. To address this, we propose an explicit text-based map that can represent thousands of semantic classes while easily integrating with LLMs due to their text-based nature by building upon large-scale image recognition models. We study how entities in our map can be localized and show through evaluations that our text-based map localizations perform comparably to those from open vocabulary maps while using two to four orders of magnitude less memory. Real-robot experiments demonstrate the grounding of an LLM with the text-based map to solve user tasks.
- Published
- 2024
15. Precise Asymptotics of Bagging Regularized M-estimators
- Author
-
Koriyama, Takuya, Patil, Pratik, Du, Jin-Hong, Tan, Kai, and Bellec, Pierre C.
- Subjects
Mathematics - Statistics Theory ,Statistics - Machine Learning - Abstract
We characterize the squared prediction risk of ensemble estimators obtained through subagging (subsample bootstrap aggregating) regularized M-estimators and construct a consistent estimator for the risk. Specifically, we consider a heterogeneous collection of $M \ge 1$ regularized M-estimators, each trained with (possibly different) subsample sizes, convex differentiable losses, and convex regularizers. We operate under the proportional asymptotics regime, where the sample size $n$, feature size $p$, and subsample sizes $k_m$ for $m \in [M]$ all diverge with fixed limiting ratios $n/p$ and $k_m/n$. Key to our analysis is a new result on the joint asymptotic behavior of correlations between the estimator and residual errors on overlapping subsamples, governed through a (provably) contractible nonlinear system of equations. Of independent interest, we also establish convergence of trace functionals related to degrees of freedom in the non-ensemble setting (with $M = 1$) along the way, extending previously known cases for square loss and ridge, lasso regularizers. When specialized to homogeneous ensembles trained with a common loss, regularizer, and subsample size, the risk characterization sheds some light on the implicit regularization effect due to the ensemble and subsample sizes $(M,k)$. For any ensemble size $M$, optimally tuning subsample size yields sample-wise monotonic risk. For the full-ensemble estimator (when $M \to \infty$), the optimal subsample size $k^\star$ tends to be in the overparameterized regime $(k^\star \le \min\{n,p\})$, when explicit regularization is vanishing. Finally, joint optimization of subsample size, ensemble size, and regularization can significantly outperform regularizer optimization alone on the full data (without any subagging)., Comment: 72 pages, 14 figures, 6 tables
- Published
- 2024
16. Characterizing the Molecular Gas in Infrared Bright Galaxies with CARMA
- Author
-
Alatalo, Katherine, Petric, Andreea O., Lanz, Lauranne, Rowlands, Kate, U, Vivian, Larson, Kirsten L., Armus, Lee, Barcos-Muñoz, Loreto, Evans, Aaron S., Koda, Jin, Luo, Yuanze, Medling, Anne M., Nyland, Kristina E., Otter, Justin A., Patil, Pallavi, Peñaloza, Fernando, Salim, Diane, Sanders, David B., Sazonova, Elizaveta, Skarbinski, Maya, Song, Yiqing, Treister, Ezequiel, and Urry, C. Meg
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We present the CO(1-0) maps of 28 infrared-bright galaxies from the Great Observatories All-Sky Luminous Infrared Galaxy Survey (GOALS) taken with the Combined Array for Research in Millimeter Astronomy (CARMA). We detect 100GHz continuum in 16 of 28 galaxies, which trace both active galactic nuclei (AGNs) and compact star-forming cores. The GOALS galaxies show a variety of molecular gas morphologies, though in the majority of cases, the average velocity fields show a gradient consistent with rotation. We fit the full continuum SEDs of each of the source using either MAGPHYS or SED3FIT (if there are signs of an AGN) to derive the total stellar mass, dust mass, and star formation rates of each object. We adopt a value determined from luminous and ultraluminous infrared galaxies (LIRGs and ULIRGs) of $\alpha_{\rm CO}=1.5^{+1.3}_{-0.8}~M_\odot$ (K km s$^{-1}$ pc$^2)^{-1}$, which leads to more physical values for $f_{\rm mol}$ and the gas-to-dust ratio. Mergers tend to have the highest gas-to-dust ratios. We assume the cospatiality of the molecular gas and star formation, and plot the sample on the Schmidt-Kennicutt relation, we find that they preferentially lie above the line set by normal star-forming galaxies. This hyper-efficiency is likely due to the increased turbulence in these systems, which decreases the freefall time compared to star-forming galaxies, leading to "enhanced" star formation efficiency. Line wings are present in a non-negligible subsample (11/28) of the CARMA GOALS sources and are likely due to outflows driven by AGNs or star formation, gas inflows, or additional decoupled gas components., Comment: 29 pages, 4 tables, 11 figures, Accepted by the Astrophysical Journal
- Published
- 2024
17. Rapid Assessment of Stable Crystal Structures in Single Phase High Entropy Alloys Via Graph Neural Network Based Surrogate Modelling
- Author
-
Beaver, Nicholas, Dive, Aniruddha, Wong, Marina, Shimanuki, Keita, Patil, Ananya, Ferrell, Anthony, and Kivy, Mohsen B.
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Disordered Systems and Neural Networks - Abstract
In an effort to develop a rapid, reliable, and cost-effective method for predicting the structure of single-phase high entropy alloys, a Graph Neural Network (ALIGNN-FF) based approach was introduced. This method was successfully tested on 132 different high entropy alloys, and the results were analyzed and compared with density functional theory and valence electron concentration calculations. Additionally, the effects of various factors, including lattice parameters and the number of supercells with unique atomic configurations, on the prediction accuracy were investigated. The ALIGNN-FF based approach was subsequently used to predict the structure of a novel cobalt-free 3d high entropy alloy, and the result was experimentally verified., Comment: 16 pages, 8 Figures. To be published in Results Materials
- Published
- 2024
18. Implicit Regularization Paths of Weighted Neural Representations
- Author
-
Du, Jin-Hong and Patil, Pratik
- Subjects
Computer Science - Machine Learning ,Mathematics - Statistics Theory ,Statistics - Machine Learning - Abstract
We study the implicit regularization effects induced by (observation) weighting of pretrained features. For weight and feature matrices of bounded operator norms that are infinitesimally free with respect to (normalized) trace functionals, we derive equivalence paths connecting different weighting matrices and ridge regularization levels. Specifically, we show that ridge estimators trained on weighted features along the same path are asymptotically equivalent when evaluated against test vectors of bounded norms. These paths can be interpreted as matching the effective degrees of freedom of ridge estimators fitted with weighted features. For the special case of subsampling without replacement, our results apply to independently sampled random features and kernel features and confirm recent conjectures (Conjectures 7 and 8) of the authors on the existence of such paths in Patil et al. We also present an additive risk decomposition for ensembles of weighted estimators and show that the risks are equivalent along the paths when the ensemble size goes to infinity. As a practical consequence of the path equivalences, we develop an efficient cross-validation method for tuning and apply it to subsampled pretrained representations across several models (e.g., ResNet-50) and datasets (e.g., CIFAR-100)., Comment: 19 pages for main and 19 pages for appendix
- Published
- 2024
19. HER2 and FISH Status Prediction in Breast Biopsy H&E-Stained Images Using Deep Learning
- Author
-
Sekhar, Ardhendu, Goel, Vrinda, Jain, Garima, Patil, Abhijeet, Gupta, Ravi Kant, Bameta, Tripti, Rane, Swapnil, and Sethi, Amit
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
The current standard for detecting human epidermal growth factor receptor 2 (HER2) status in breast cancer patients relies on HER2 amplification, identified through fluorescence in situ hybridization (FISH) or immunohistochemistry (IHC). However, hematoxylin and eosin (H\&E) tumor stains are more widely available, and accurately predicting HER2 status using H\&E could reduce costs and expedite treatment selection. Deep Learning algorithms for H&E have shown effectiveness in predicting various cancer features and clinical outcomes, including moderate success in HER2 status prediction. In this work, we employed a customized weak supervision classification technique combined with MoCo-v2 contrastive learning to predict HER2 status. We trained our pipeline on 182 publicly available H&E Whole Slide Images (WSIs) from The Cancer Genome Atlas (TCGA), for which annotations by the pathology team at Yale School of Medicine are publicly available. Our pipeline achieved an Area Under the Curve (AUC) of 0.85 across four different test folds. Additionally, we tested our model on 44 H&E slides from the TCGA-BRCA dataset, which had an HER2 score of 2+ and included corresponding HER2 status and FISH test results. These cases are considered equivocal for IHC, requiring an expensive FISH test on their IHC slides for disambiguation. Our pipeline demonstrated an AUC of 0.81 on these challenging H&E slides. Reducing the need for FISH test can have significant implications in cancer treatment equity for underserved populations.
- Published
- 2024
20. Predicting Solar Energy Generation with Machine Learning based on AQI and Weather Features
- Author
-
Shah, Arjun, Viswanath, Varun, Gandhi, Kashish, and Patil, Nilesh Madhukar
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
This paper addresses the pressing need for an accurate solar energy prediction model, which is crucial for efficient grid integration. We explore the influence of the Air Quality Index and weather features on solar energy generation, employing advanced Machine Learning and Deep Learning techniques. Our methodology uses time series modeling and makes novel use of power transform normalization and zero-inflated modeling. Various Machine Learning algorithms and Conv2D Long Short-Term Memory model based Deep Learning models are applied to these transformations for precise predictions. Results underscore the effectiveness of our approach, demonstrating enhanced prediction accuracy with Air Quality Index and weather features. We achieved a 0.9691 $R^2$ Score, 0.18 MAE, 0.10 RMSE with Conv2D Long Short-Term Memory model, showcasing the power transform technique's innovation in enhancing time series forecasting for solar energy generation. Such results help our research contribute valuable insights to the synergy between Air Quality Index, weather features, and Deep Learning techniques for solar energy prediction., Comment: Accepted at AISD2024 : Second International Workshop on Artificial Intelligence: Empowering Sustainable Development
- Published
- 2024
- Full Text
- View/download PDF
21. Towards a theory of phase transitions in quantum control landscapes
- Author
-
Beato, Nicolò, Patil, Pranay, and Bukov, Marin
- Subjects
Quantum Physics ,Condensed Matter - Other Condensed Matter ,Condensed Matter - Quantum Gases ,Condensed Matter - Statistical Mechanics ,Condensed Matter - Strongly Correlated Electrons - Abstract
Control landscape phase transitions (CLPTs) occur as abrupt changes in the cost function landscape upon varying a control parameter, and can be revealed by non-analytic points in statistical order parameters. A prime example are quantum speed limits (QSL) which mark the onset of controllability as the protocol duration is increased. Here we lay the foundations of an analytical theory for CLPTs by developing Dyson, Magnus, and cumulant expansions for the cost function that capture the behavior of CLPTs with a controlled precision. Using linear and quadratic stability analysis, we reveal that CLPTs can be associated with different types of instabilities of the optimal protocol. This allows us to explicitly relate CLPTs to critical structural rearrangements in the extrema of the control landscape: utilizing path integral methods from statistical field theory, we trace back the critical scaling of the order parameter at the QSL to the topological and geometric properties of the set of optimal protocols, such as the number of connected components and its dimensionality. We verify our predictions by introducing a numerical sampling algorithm designed to explore this optimal set via a homotopic stochastic update rule. We apply this new toolbox explicitly to analyze CLPTs in the single- and two-qubit control problems whose landscapes are analytically tractable, and compare the landscapes for bang-bang and continuous protocols. Our work provides the first steps towards a systematic theory of CLPTs and paves the way for utilizing statistical field theory techniques for generic complex control landscapes., Comment: 38 pages, 24 figures
- Published
- 2024
22. Undominated monopoly regulation
- Author
-
Mishra, Debasis and Patil, Sanket
- Subjects
Economics - Theoretical Economics - Abstract
We study undominated mechanisms with transfers for regulating a monopolist who privately observes the marginal cost of production. We show that in any undominated mechanism, there is a quantity floor, which depends only on the primitives, and the regulator's operation decision is stochastic only if the monopolist produces at the quantity floor. We provide a near-complete characterization of the set of undominated mechanisms and use it to (a) provide a foundation for deterministic mechanisms, (b) show that the efficient mechanism is dominated, and (c) derive a max-min optimal regulatory mechanism.
- Published
- 2024
23. Hybrid Semantic Search: Unveiling User Intent Beyond Keywords
- Author
-
Ahluwalia, Aman, Sutradhar, Bishwajit, Ghosh, Karishma, Yadav, Indrapal, Sheetal, Arpan, and Patil, Prashant
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence - Abstract
This paper addresses the limitations of traditional keyword-based search in understanding user intent and introduces a novel hybrid search approach that leverages the strengths of non-semantic search engines, Large Language Models (LLMs), and embedding models. The proposed system integrates keyword matching, semantic vector embeddings, and LLM-generated structured queries to deliver highly relevant and contextually appropriate search results. By combining these complementary methods, the hybrid approach effectively captures both explicit and implicit user intent.The paper further explores techniques to optimize query execution for faster response times and demonstrates the effectiveness of this hybrid search model in producing comprehensive and accurate search outcomes.
- Published
- 2024
24. Nonlinear Quantum Optics at a Topological Interface Enabled by Defect Engineering
- Author
-
Hallacy, L., Martin, N. J., Mehrabad, M. Jalali, Hallett, D., Chen, X., Dost, R., Foster, A., Brunswick, L., Fenzl, A., Clarke, E., Patil, P. K., Fox, A. M, Skolnick, M. S., and Wilson, L. R.
- Subjects
Physics - Optics ,Quantum Physics - Abstract
The integration of topology into photonics has generated a new design framework for constructing robust and unidirectional waveguides, which are not feasible with traditional photonic devices. Here, we overcome current barriers to the successful integration of quantum emitters such as quantum dots (QDs) into valley-Hall (VH) topological waveguides, utilising photonic defects at the topological interface to stabilise the local charge environment and inverse design for efficient topological-conventional mode conversion. By incorporating QDs within defects of VH-photonic crystals, we demonstrate the first instances of single-photon resonant fluorescence and resonant transmission spectroscopy of a quantum emitter at a topological waveguide interface. Our results bring together topological photonics with optical nonlinear effects at the single-photon level, offering a new avenue to investigate the interaction between topology and quantum nonlinear systems.
- Published
- 2024
25. Stabilizer Entanglement Distillation and Efficient Fault-Tolerant Encoder
- Author
-
Shi, Yu, Patil, Ashlesha, and Guha, Saikat
- Subjects
Quantum Physics - Abstract
Entanglement is essential for quantum information processing but is limited by noise. We address this by developing high-yield entanglement distillation protocols with several advancements. (1) We extend the 2-to-1 recurrence entanglement distillation protocol to higher-rate n-to-(n-1) protocols that can correct any single-qubit errors. These protocols are evaluated through numerical simulations focusing on fidelity and yield. We also outline a method to adapt any classical error-correcting code for entanglement distillation, where the code can correct both bit-flip and phase-flip errors by incorporating Hadamard gates. (2) We propose a constant-depth decoder for stabilizer codes that transforms logical states into physical ones using single-qubit measurements. This decoder is applied to entanglement distillation protocols, reducing circuit depth and enabling protocols derived from advanced quantum error-correcting codes. We demonstrate this by evaluating the circuit complexity for entanglement distillation protocols based on surface codes and quantum convolutional codes. (3) Our stabilizer entanglement distillation techniques advance quantum computing. We propose a fault-tolerant protocol for constant-depth encoding and decoding of arbitrary quantum states, applicable to quantum low-density parity-check (qLDPC) codes and surface codes. This protocol is feasible with state-of-the-art reconfigurable atom arrays and surpasses the limits of conventional logarithmic depth encoders. Overall, our study integrates stabilizer formalism, measurement-based quantum computing, and entanglement distillation, advancing both quantum communication and computing., Comment: 19 pages, 7 figures
- Published
- 2024
26. LSST: Learned Single-Shot Trajectory and Reconstruction Network for MR Imaging
- Author
-
Aggarwal, Hemant Kumar, Chatterjee, Sudhanya, Shanbhag, Dattesh, Patil, Uday, and Hari, K. V. S.
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Single-shot magnetic resonance (MR) imaging acquires the entire k-space data in a single shot and it has various applications in whole-body imaging. However, the long acquisition time for the entire k-space in single-shot fast spin echo (SSFSE) MR imaging poses a challenge, as it introduces T2-blur in the acquired images. This study aims to enhance the reconstruction quality of SSFSE MR images by (a) optimizing the trajectory for measuring the k-space, (b) acquiring fewer samples to speed up the acquisition process, and (c) reducing the impact of T2-blur. The proposed method adheres to physics constraints due to maximum gradient strength and slew-rate available while optimizing the trajectory within an end-to-end learning framework. Experiments were conducted on publicly available fastMRI multichannel dataset with 8-fold and 16-fold acceleration factors. An experienced radiologist's evaluation on a five-point Likert scale indicates improvements in the reconstruction quality as the ACL fibers are sharper than comparative methods.
- Published
- 2024
27. Covariant Jacobi-Legendre expansion for total energy calculations within the projector-augmented-wave formalism
- Author
-
Focassio, Bruno, Domina, Michelangelo, Patil, Urvesh, Fazzio, Adalberto, and Sanvito, Stefano
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Strongly Correlated Electrons - Abstract
Machine-learning models can be trained to predict the converged electron charge density of a density functional theory (DFT) calculation. In general, the value of the density at a given point in space is invariant under global translations and rotations having that point as a centre. Hence, one can construct locally invariant machine-learning density predictors. However, the widely used projector augmented wave (PAW) implementation of DFT requires the evaluation of the one-center augmentation contributions, that are not rotationally invariant. Building on our recently proposed Jacobi-Legendre charge-density scheme, we construct a covariant Jacobi-Legendre model capable of predicting the local occupancies needed to compose the augmentation charge density. Our formalism is then applied to the prediction of the energy barrier for the 1H-to-1T phase transition of two-dimensional MoS$_2$. With extremely modest training, the model is capable of performing a non-self-consistent nudged elastic band calculation at virtually the same accuracy as a fully DFT-converged one, thus saving thousands of self-consistent DFT steps. Furthermore, at variance with machine-learning force fields, the charge density is here available for any nudged elastic band image, so that we can trace the evolution of the electronic structure across the phase transition., Comment: 12 pages, 4 figures
- Published
- 2024
28. A Methodology for Improving the Quality of the Research Article Publications in Engineering Institutions in India: A Case Study
- Author
-
Rajkumar Bhimgonda Patil, Prachi Vinod Ingle, and Padmakar A. Deshmukh
- Abstract
Research article publication is often considered a critical indicator of academic institutions' success and productivity. It improves the institution's reputation, attracts talented students and faculty members, and increases the institution's chances of receiving funding opportunities from different funding agencies. This paper provides a reliable and sustainable methodology for improving the quality and quantity of research article publications for engineering institutions in India. The various tools, techniques, and initiatives that promote research culture and improve its outcome in terms of research papers are also discussed. A case study of Pimpri Chinchwad College of Engineering (PCCOE), Pune, India, depicts how predictive, prescriptive, descriptive, and diagnostic data analytics approaches help to identify the barriers in the research article publications in academic institutions and provides the ways to overcome them. It also helps to set the publication targets and develop the path to perceive the targets. The outcomes and effectiveness of the case study are discussed using the papers published in Scopus, Web of Science, and Google Scholar databases. The challenges, opportunities, and recommendations are also provided for the smooth and effective implementation of the developed methodologies.
- Published
- 2024
- Full Text
- View/download PDF
29. Neoadjuvant Osimertinib for the Treatment of Stage I-IIIA Epidermal Growth Factor Receptor–Mutated Non–Small Cell Lung Cancer: A Phase II Multicenter Study
- Author
-
Blakely, Collin M, Urisman, Anatoly, Gubens, Matthew A, Mulvey, Claire K, Allen, Greg M, Shiboski, Stephen C, Rotow, Julia K, Chakrabarti, Turja, Kerr, D Lucas, Aredo, Jacqueline V, Bacaltos, Bianca, Gee, Megan, Tan, Lisa, Jones, Kirk D, Devine, W Patrick, Doebele, Robert C, Aisner, Dara L, Patil, Tejas, Schenk, Erin L, Bivona, Trever G, Riess, Jonathan W, Coleman, Melissa, Kratz, Johannes R, and Jablons, David M
- Subjects
Biomedical and Clinical Sciences ,Clinical Sciences ,Oncology and Carcinogenesis ,Lung ,Women's Health ,Clinical Research ,Clinical Trials and Supportive Activities ,Cancer ,Minority Health ,Lung Cancer ,Patient Safety ,6.4 Surgery ,6.1 Pharmaceuticals ,Humans ,Acrylamides ,Female ,Carcinoma ,Non-Small-Cell Lung ,Aniline Compounds ,Male ,Lung Neoplasms ,Middle Aged ,ErbB Receptors ,Aged ,Neoadjuvant Therapy ,Mutation ,Neoplasm Staging ,Adult ,Protein Kinase Inhibitors ,Antineoplastic Agents ,Indoles ,Pyrimidines ,Oncology & Carcinogenesis ,Oncology and carcinogenesis - Abstract
PurposeTo assess the safety and efficacy of the third-generation epidermal growth factor receptor (EGFR) tyrosine kinase inhibitor osimertinib as neoadjuvant therapy in patients with surgically resectable stage I-IIIA EGFR-mutated non-small cell lung cancer (NSCLC).Patients and methodsThis was a multi-institutional phase II trial of neoadjuvant osimertinib for patients with surgically resectable stage I-IIIA (American Joint Committee on Cancer [AJCC] V7) EGFR-mutated (L858R or exon 19 deletion) NSCLC (ClinicalTrials.gov identifier: NCT03433469). Patients received osimertinib 80 mg orally once daily for up to two 28-day cycles before surgical resection. The primary end point was major pathological response (MPR) rate. Secondary safety and efficacy end points were also assessed. Exploratory end points included pretreatment and post-treatment tumor mutation profiling.ResultsA total of 27 patients were enrolled and treated with neoadjuvant osimertinib for a median 56 days before surgical resection. Twenty-four (89%) patients underwent subsequent surgery; three (11%) patients were converted to definitive chemoradiotherapy. The MPR rate was 14.8% (95% CI, 4.2 to 33.7). No pathological complete responses were observed. The ORR was 52%, and the median DFS was 40.9 months. One treatment-related serious adverse event (AE) occurred (3.7%). No patients were unable to undergo surgical resection or had surgery delayed because of an AE. The most common co-occurring tumor genomic alterations were in TP53 (42%) and RBM10 (21%).ConclusionTreatment with neoadjuvant osimertinib in surgically resectable (stage IA-IIIA, AJCC V7) EGFR-mutated NSCLC did not meet its primary end point for MPR rate. However, neoadjuvant osimertinib did not lead to unanticipated AEs, surgical delays, nor result in a significant unresectability rate.
- Published
- 2024
30. A proximity proteomics pipeline with improved reproducibility and throughput.
- Author
-
Zhong, Xiaofang, Li, Qiongyu, Polacco, Benjamin, Patil, Trupti, Marley, Aaron, Foussard, Helene, Khare, Prachi, Vartak, Rasika, Xu, Jiewei, DiBerto, Jeffrey, Roth, Bryan, Eckhardt, Manon, von Zastrow, Mark, Krogan, Nevan, and Hüttenhain, Ruth
- Subjects
APEX2-based Proximity Labeling ,G Protein-Coupled Receptor ,Protein–Protein Interaction ,Proximity Proteomics ,Subcellular Proteomics ,Proteomics ,Biotinylation ,Reproducibility of Results ,Humans ,Proteome ,Mass Spectrometry ,HEK293 Cells - Abstract
Proximity labeling (PL) via biotinylation coupled with mass spectrometry (MS) captures spatial proteomes in cells. Large-scale processing requires a workflow minimizing hands-on time and enhancing quantitative reproducibility. We introduced a scalable PL pipeline integrating automated enrichment of biotinylated proteins in a 96-well plate format. Combining this with optimized quantitative MS based on data-independent acquisition (DIA), we increased sample throughput and improved protein identification and quantification reproducibility. We applied this pipeline to delineate subcellular proteomes across various compartments. Using the 5HT2A serotonin receptor as a model, we studied temporal changes of proximal interaction networks induced by receptor activation. In addition, we modified the pipeline for reduced sample input to accommodate CRISPR-based gene knockout, assessing dynamics of the 5HT2A network in response to perturbation of selected interactors. This PL approach is universally applicable to PL proteomics using biotinylation-based PL enzymes, enhancing throughput and reproducibility of standard protocols.
- Published
- 2024
31. Radiance Fields for Robotic Teleoperation
- Author
-
Wilder-Smith, Maximum, Patil, Vaishakh, and Hutter, Marco
- Subjects
Computer Science - Robotics - Abstract
Radiance field methods such as Neural Radiance Fields (NeRFs) or 3D Gaussian Splatting (3DGS), have revolutionized graphics and novel view synthesis. Their ability to synthesize new viewpoints with photo-realistic quality, as well as capture complex volumetric and specular scenes, makes them an ideal visualization for robotic teleoperation setups. Direct camera teleoperation provides high-fidelity operation at the cost of maneuverability, while reconstruction-based approaches offer controllable scenes with lower fidelity. With this in mind, we propose replacing the traditional reconstruction-visualization components of the robotic teleoperation pipeline with online Radiance Fields, offering highly maneuverable scenes with photorealistic quality. As such, there are three main contributions to state of the art: (1) online training of Radiance Fields using live data from multiple cameras, (2) support for a variety of radiance methods including NeRF and 3DGS, (3) visualization suite for these methods including a virtual reality scene. To enable seamless integration with existing setups, these components were tested with multiple robots in multiple configurations and were displayed using traditional tools as well as the VR headset. The results across methods and robots were compared quantitatively to a baseline of mesh reconstruction, and a user study was conducted to compare the different visualization methods. For videos and code, check out https://leggedrobotics.github.io/rffr.github.io/., Comment: 8 pages, 10 figures, Accepted to IROS 2024
- Published
- 2024
32. Minimum Time Consensus of Multi-agent System under Fuel Constraints
- Author
-
Rautela, Akansha, Patil, Deepak, Mulla, Ameer, and Kar, Indra Narayan
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
This work addresses the problem of finding a consensus point in the state space ($\mathbb{R}^2$) for a multi-agent system that is comprised of $N$ identical double integrator agents. It is assumed that each agent operates under constrained control input (i.e., $|u_i(t)| \leq 1$ $\forall i = 1, \hdots N$). Further, a fixed fuel budget is also assumed i.e., the total amount of cumulative input that can be expended is limited by $\int_0^{t_f}|u(t)|dt \le \beta$. First, the attainable set $\mathcal{A}(t,x_0,\beta)$ at time $t$, which is the set of all states that an agent can attain starting from initial conditions $x_0$ under the fuel budget constraints at time $t$ is computed for every agent. This attainable set is a convex set for all $t\ge0$. Then the minimum time to consensus is the minimum time $\bar{t}$ at which attainable sets of all agents intersect, and the consensus point is the point of intersection. A closed-form expression for the minimum time consensus point is provided for the case of three agents. Then, using Helly's theorem, the intersection will be non-empty at a time when all the $N \choose 3$ triplets of agents have non-empty intersection. The computation of minimum time consensus for all $N \choose 3$ triplets is performed independently and can be distributed among all the $N$ agents. Finally, the overall minimum time to consensus is given by the triplet that has the highest minimum time to consensus. Further, the intersection of all the attainable sets of this triplet gives the minimum time consensus point for all $N$ agents.
- Published
- 2024
33. Improving Domain Adaptation Through Class Aware Frequency Transformation
- Author
-
Kumar, Vikash, Patil, Himanshu, Lal, Rohit, and Chakraborty, Anirban
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In this work, we explore the usage of the Frequency Transformation for reducing the domain shift between the source and target domain (e.g., synthetic image and real image respectively) towards solving the Domain Adaptation task. Most of the Unsupervised Domain Adaptation (UDA) algorithms focus on reducing the global domain shift between labelled source and unlabelled target domains by matching the marginal distributions under a small domain gap assumption. UDA performance degrades for the cases where the domain gap between source and target distribution is large. In order to bring the source and the target domains closer, we propose a novel approach based on traditional image processing technique Class Aware Frequency Transformation (CAFT) that utilizes pseudo label based class consistent low-frequency swapping for improving the overall performance of the existing UDA algorithms. The proposed approach, when compared with the state-of-the-art deep learning based methods, is computationally more efficient and can easily be plugged into any existing UDA algorithm to improve its performance. Additionally, we introduce a novel approach based on absolute difference of top-2 class prediction probabilities (ADT2P) for filtering target pseudo labels into clean and noisy sets. Samples with clean pseudo labels can be used to improve the performance of unsupervised learning algorithms. We name the overall framework as CAFT++. We evaluate the same on the top of different UDA algorithms across many public domain adaptation datasets. Our extensive experiments indicate that CAFT++ is able to achieve significant performance gains across all the popular benchmarks., Comment: Accepted at the International Journal of Computer Vision
- Published
- 2024
- Full Text
- View/download PDF
34. HDL-GPT: High-Quality HDL is All You Need
- Author
-
Kumar, Bhuvnesh, Nanda, Saurav, Parthasarathy, Ganapathy, Patil, Pawan, Tsai, Austin, and Choudhary, Parivesh
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
This paper presents Hardware Description Language Generative Pre-trained Transformers (HDL-GPT), a novel approach that leverages the vast repository of open-source High Definition Language (HDL) codes to train superior quality large code models. The core premise of this paper is the hypothesis that high-quality HDL is all you need to create models with exceptional performance and broad zero-shot generalization abilities. The paper elucidates the methods employed for the curation and augmentation of large corpora from open-source HDL code, transforming highly variable quality data into high-quality data through careful prompting and context maintenance. We demonstrate that the careful selection, filtering, and augmentation of data across HDLs can yield powerful models that surpass current state-of-the-art models. We also explore the impact of different fine-tuning methods on the quality of results. We describe experimental results across a range of fine-tuned SOTA LLMs, substantiating our claims. We demonstrate improvements of 50% to 200% over SOTA HDL models on current benchmarks in tasks ranging from HDL circuit explanations, code generation, formal and simulation testbench creation, triaging bugs, and fixing them. HDL-GPT opens new avenues for the development of advanced model training techniques for circuit design tasks., Comment: DAC 2024 Invited Paper
- Published
- 2024
35. Exploring the Design of Collaborative Applications via the Lens of NDN Workspace
- Author
-
Yu, Tianyuan, Ma, Xinyu, Patil, Varun, Kocaogullar, Yekta, and Zhang, Lixia
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
Metaverse applications desire to communicate with semantically identified objects among a diverse set of cyberspace entities, such as cameras for collecting images from, sensors for sensing environment, and users collaborating with each other, all could be nearby or far away, in a timely and secure way. However, supporting the above function faces networking challenges. Today's metaverse implementations are, by and large, use secure transport connections to communicate with cloud servers instead of letting participating entities communicate directly. In this paper, we use the design and implementation of NDN Workspace, a web-based, multi-user collaborative app to showcase a new way to networking that supports many-to-many secure data exchanges among communicating entities directly. NDN Workspace users establish trust relations among each other, exchange URI-identified objects directly, and can collaborate through intermittent connectivity, all in the absence of cloud servers. Its data-centric design offers an exciting new approach to metaverse app development.
- Published
- 2024
36. Secure Web Objects: Building Blocks for Metaverse Interoperability and Decentralization
- Author
-
Yu, Tianyuan, Ma, Xinyu, Patil, Varun, Kocaogullar, Yekta, Zhang, Yulong, Burke, Jeff, Kutscher, Dirk, and Zhang, Lixia
- Subjects
Computer Science - Networking and Internet Architecture ,Computer Science - Distributed, Parallel, and Cluster Computing ,H.3.5 - Abstract
This position paper explores how to support the Web's evolution through an underlying data-centric approach that better matches the data-orientedness of modern and emerging applications. We revisit the original vision of the Web as a hypermedia system that supports document composability and application interoperability via name-based data access. We propose the use of secure web objects (SWO), a data-oriented communication approach that can reduce complexity, centrality, and inefficiency, particularly for collaborative and local-first applications, such as the Metaverse and other collaborative applications. SWO are named, signed, application-defined objects that are secured independently of their containers or communications channels, an approach that leverages the results from over a decade-long data-centric networking research. This approach does not require intermediation by aggregators of identity, storage, and other services that are common today. We present a brief design overview, illustrated through prototypes for two editors of shared hypermedia documents: one for 3D and one for LaTeX. We also discuss our findings and suggest a roadmap for future research., Comment: 9 pages
- Published
- 2024
37. SAM Fewshot Finetuning for Anatomical Segmentation in Medical Images
- Author
-
Xie, Weiyi, Willems, Nathalie, Patil, Shubham, Li, Yang, and Kumar, Mayank
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,I.4.6 ,I.5.4 ,I.5.1 - Abstract
We propose a straightforward yet highly effective few-shot fine-tuning strategy for adapting the Segment Anything (SAM) to anatomical segmentation tasks in medical images. Our novel approach revolves around reformulating the mask decoder within SAM, leveraging few-shot embeddings derived from a limited set of labeled images (few-shot collection) as prompts for querying anatomical objects captured in image embeddings. This innovative reformulation greatly reduces the need for time-consuming online user interactions for labeling volumetric images, such as exhaustively marking points and bounding boxes to provide prompts slice by slice. With our method, users can manually segment a few 2D slices offline, and the embeddings of these annotated image regions serve as effective prompts for online segmentation tasks. Our method prioritizes the efficiency of the fine-tuning process by exclusively training the mask decoder through caching mechanisms while keeping the image encoder frozen. Importantly, this approach is not limited to volumetric medical images, but can generically be applied to any 2D/3D segmentation task. To thoroughly evaluate our method, we conducted extensive validation on four datasets, covering six anatomical segmentation tasks across two modalities. Furthermore, we conducted a comparative analysis of different prompting options within SAM and the fully-supervised nnU-Net. The results demonstrate the superior performance of our method compared to SAM employing only point prompts (approximately 50% improvement in IoU) and performs on-par with fully supervised methods whilst reducing the requirement of labeled data by at least an order of magnitude., Comment: 9 pages, Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2024
- Published
- 2024
38. Crystal Growth, Terahertz Generation and Optical Characterization of Sodium Mesitylene Sulphonate (SMS)Crystal
- Author
-
Murtunge, Yamuna, Patil, Vidhyadhar, Puranik, Ruturaj, S, Jayakrishnan S, Bansal, D, Maity, Arijit, Venkatramani, Ravindra, Kulkarni, S. B., Thamizhavel, A, and Prabhu, S. S.
- Subjects
Physics - Optics - Abstract
An optically high-quality single crystal of sodium mesitylene sulfonate crystal was successfully grown by a slow evaporation method using methanol as solvent at room temperature. Single-crystal XRD has characterized the material and belongs to a monoclinic structure with a C2 space group. Functional groups were determined using Fourier-transformed infrared spectroscopy. The optical quality of the generated crystal was evaluated using UV-Vis NIR spectral analysis, which is transparent in the range of 300-1500 nm. We report the optical properties using terahertz time-domain spectroscopy (THz-TDS) and THz generation using crystal.
- Published
- 2024
39. A Teacher Is Worth A Million Instructions
- Author
-
Kothari, Nikhil, Nayak, Ravindra, Shetty, Shreyas, Patil, Amey, and Garera, Nikesh
- Subjects
Computer Science - Machine Learning - Abstract
Large Language Models(LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent limitations in training methods create substantial difficulties to train relatively smaller models with 7B and 13B parameters. In our research, we suggest an improved training method for these models by utilising knowledge from larger models, such as a mixture of experts (8x7B) architectures. The scale of these larger models allows them to capture a wide range of variations from data alone, making them effective teachers for smaller models. Moreover, we implement a novel post-training domain alignment phase that employs domain-specific expert models to boost domain-specific knowledge during training while preserving the model's ability to generalise. Fine-tuning Mistral 7B and 2x7B with our method surpasses the performance of state-of-the-art language models with more than 7B and 13B parameters: achieving up to $7.9$ in MT-Bench and $93.04\%$ on AlpacaEval., Comment: 7 pages, 4 figures
- Published
- 2024
40. Adaptive Deep Neural Network-Based Control Barrier Functions
- Author
-
Sweatland, Hannah M., Patil, Omkar Sudhir, and Dixon, Warren E.
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
Safety constraints of nonlinear control systems are commonly enforced through the use of control barrier functions (CBFs). Uncertainties in the dynamic model can disrupt forward invariance guarantees or cause the state to be restricted to an overly conservative subset of the safe set. In this paper, adaptive deep neural networks (DNNs) are combined with CBFs to produce a family of controllers that ensure safety while learning the system's dynamics in real-time without the requirement for pre-training. By basing the least squares adaptation law on a state derivative estimator-based identification error, the DNN parameter estimation error is shown to be uniformly ultimately bounded. The convergent bound on the parameter estimation error is then used to formulate CBF-constraints in an optimization-based controller to guarantee safety despite model uncertainty. Furthermore, the developed method is applicable for use under intermittent loss of state-feedback. Comparative simulation results demonstrate the ability of the developed method to ensure safety in an adaptive cruise control problem and when feedback is lost, unlike baseline methods., Comment: 7 pages, 2 figures, 28 references
- Published
- 2024
41. Development of Volume Produced Negative Ion Source using a CCRF Discharge
- Author
-
Singh, Pawandeep, Dahiya, Swati, Pandey, Avnish, Patil, Yashashri, and Karkari, Shantanu
- Subjects
Physics - Plasma Physics - Abstract
This work shows the development of a volume-produced negative ion source that consists of annular parallel plates driven by a 13.56 MHz capacitively coupled radio frequency in a push-pull configuration. This source shows advantages in controlling plasma conditions by varying the pressure, power, and applied axial magnetic field. It is found that the push-pull configuration allows the plasma potential to remain in the range of 20 to 40 Volts. Conversely, the application of a magnetic field helps serves to augment the production of negative ions in the central hollow part of the annular plate. Further, a plausible explanation to the obtained experimental results is presented., Comment: 5 pages, 7 figures
- Published
- 2024
42. Sheath effects with thermal electrons on the resonance frequency of a DC-biased hairpin probe
- Author
-
Singh, Pawandeep, Pandey, Avnish, Dahiya, Swati, Patil, Yashashri, Sirse, Nishant, and Karkari, Shantanu
- Subjects
Physics - Plasma Physics - Abstract
The dielectric constant of a sheath, whether ionic or electronic, formed around the cylindrical limbs of a hairpin probe, is often considered the same as that of a vacuum. However, this assumption does not hold true for electron sheaths and electron-permeating ionic sheaths, resulting in a deviation of the sheath dielectric constant from that of a vacuum. This deviation significantly influences the effective dielectric between the cylindrical limbs. As a result, it impacts the theoretically estimated resonance frequency characteristic curve of a DC-biased hairpin probe. In this study, we investigate the influence of electron temperature on the sheath dielectric and, consequently, on the resonance frequency characteristic curve. The findings shows that electron temperature primarily determines the resonance frequency characteristic curve. With increasing electron temperature, the peak in the resonance frequency characteristic curve shifts towards higher positive probe bias values and exhibits a broadening near the maxima instead of a sharp peak. This broadening near the maxima has also been validated with an experimentally measured resonance frequency characteristic curve in a capacitively coupled argon discharge., Comment: 9 pages, 13 figures
- Published
- 2024
43. Advancements in Orthopaedic Arm Segmentation: A Comprehensive Review
- Author
-
Swami, Abhishek, Farande, Snehal, Patil, Atharv, Parle, Atharva, Mane, Vivekanand, and Thorat, Prathamesh
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,68T07 - Abstract
The most recent advances in medical imaging that have transformed diagnosis, especially in the case of interpreting X-ray images, are actively involved in the healthcare sector. The advent of digital image processing technology and the implementation of deep learning models such as Convolutional Neural Networks (CNNs) have made the analysis of X-rays much more accurate and efficient. In this article, some essential techniques such as edge detection, region-growing technique, and thresholding approach, and the deep learning models such as variants of YOLOv8-which is the best object detection and segmentation framework-are reviewed. We further investigate that the traditional image processing techniques like segmentation are very much simple and provides the alternative to the advanced methods as well. Our review gives useful knowledge on the practical usage of the innovative and traditional approaches of manual X-ray interpretation. The discovered information will help professionals and researchers to gain more profound knowledge in digital interpretation techniques in medical imaging., Comment: 29 pages, 20 figures
- Published
- 2024
- Full Text
- View/download PDF
44. Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval
- Author
-
Chavan, Rohan, Patil, Gaurav, Madle, Vishal, and Joshi, Raviraj
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Stopwords are commonly used words in a language that are often considered to be of little value in determining the meaning or significance of a document. These words occur frequently in most texts and don't provide much useful information for tasks like sentiment analysis and text classification. English, which is a high-resource language, takes advantage of the availability of stopwords, whereas low-resource Indian languages like Marathi are very limited, standardized, and can be used in available packages, but the number of available words in those packages is low. Our work targets the curation of stopwords in the Marathi language using the MahaCorpus, with 24.8 million sentences. We make use of the TF-IDF approach coupled with human evaluation to curate a strong stopword list of 400 words. We apply the stop word removal to the text classification task and show its efficacy. The work also presents a simple recipe for stopword curation in a low-resource language. The stopwords are integrated into the mahaNLP library and publicly available on https://github.com/l3cube-pune/MarathiNLP ., Comment: Accepted at I2CT 2024
- Published
- 2024
- Full Text
- View/download PDF
45. Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM
- Author
-
Muddu, Sri Raghava, Rangaraju, Rupasai, Siledar, Tejpalsingh, Nath, Swaroop, Bhattacharyya, Pushpak, Nath, Swaprava, Banerjee, Suman, Patil, Amey, Chelliah, Muthusamy, Singh, Sudhanshu Shekhar, and Garera, Nikesh
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limitations. To mitigate, we propose a scalable framework called Xl-OpSumm that generates summaries incrementally. However, the existing test set, AMASUM has only 560 reviews per product on average. Due to the lack of a test set with thousands of reviews, we created a new test set called Xl-Flipkart by gathering data from the Flipkart website and generating summaries using GPT-4. Through various automatic evaluations and extensive analysis, we evaluated the framework's efficiency on two datasets, AMASUM and Xl-Flipkart. Experimental results show that our framework, Xl-OpSumm powered by Llama-3-8B-8k, achieves an average ROUGE-1 F1 gain of 4.38% and a ROUGE-L F1 gain of 3.70% over the next best-performing model.
- Published
- 2024
46. A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
- Author
-
Fogliato, Riccardo, Patil, Pratik, Monfort, Mathew, and Perona, Pietro
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Statistics - Applications - Abstract
Model performance evaluation is a critical and expensive task in machine learning and computer vision. Without clear guidelines, practitioners often estimate model accuracy using a one-time completely random selection of the data. However, by employing tailored sampling and estimation strategies, one can obtain more precise estimates and reduce annotation costs. In this paper, we propose a statistical framework for model evaluation that includes stratification, sampling, and estimation components. We examine the statistical properties of each component and evaluate their efficiency (precision). One key result of our work is that stratification via k-means clustering based on accurate predictions of model performance yields efficient estimators. Our experiments on computer vision datasets show that this method consistently provides more precise accuracy estimates than the traditional simple random sampling, even with substantial efficiency gains of 10x. We also find that model-assisted estimators, which leverage predictions of model accuracy on the unlabeled portion of the dataset, are generally more efficient than the traditional estimates based solely on the labeled data., Comment: To appear at ECCV 2024
- Published
- 2024
47. Model fusion for efficient learning of nonlinear dynamical systems
- Author
-
Kedia, Vatsal, Pinnamaraju, Vivek S., and Patil, Dinesh
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
In the context of model-based control of industrial processes, it is a common practice to develop a data-driven linear dynamical model around a specified operating point. However, in applications involving wider operating conditions, representation of the dynamics using a single linear dynamic model is often inadequate, requiring either a nonlinear model or multiple linear models to accommodate the nonlinear behaviour. While the development of the former suffers from the requirements of extensive experiments spanning multiple levels, significant compromise in the nominal product quality and dealing with unmeasured disturbances over wider operating conditions, the latter faces the challenge of model switch scheduling and inadequate description of dynamics for the operating regions in-between. To overcome these challenges, we propose an efficient approach to obtain a parsimonious nonlinear dynamic model by developing multiple linear models from data at multiple operating points, lifting the data features obtained from individual model simulations to adequately accommodate the underlying nonlinear behaviour and finally, sparse optimization techniques to obtain a parsimonious model. The performance and effectiveness of the proposed algorithm is demonstrated through simulation case studies.
- Published
- 2024
48. Sloshing and spiral structures breeding a putative radio mini-halo in the environment of a cool-core cluster Abell 795
- Author
-
Kadam, S. K., Salunkhe, Sameer, Vagshette, N. D., Paul, Surajit, Sonkamble, Satish S., Pawar, P. K., and Patil, M. K.
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
Spiral structures and cold fronts in X-rays are frequently observed in cool core galaxy clusters. However, studies on radio mini-haloes associated with such spirals and their physical connections are rare. Here, we present the detection of an extended diffuse radio emission entrained in the X-ray spiral structure in a known cool core cluster Abell 795 (A795). Though the cool core is a sign of the relaxed nature of the clusters, our re-analysed 30 ks Chandra X-ray data of cluster A795 confirms the presence of an interesting log spiral structure of X-ray deficit region complemented by an X-ray excess counter spiral in the residual map, exposing its dynamical activity. Our new analysis of 150 $\&$ 325 MHz GMRT archival data of the cluster confirms the detection of a $\sim180$ kpc ultra-steep ($\alpha\sim-2.7$) diffuse radio structure which was previously reported as a candidate radio mini halo from low sensitive survey maps. This radio emission spans the entire spiral structure ($\sim186$ kpc), enclosed by two previously reported cold fronts. Furthermore, SDSS DR13 optical spectra, as well as GALEX's FUV data, show a considerably low total star formation rate of 2.52 M$_{\odot}$ yr$^{-1}$ and having no significant variation in metallicity distribution. We argued that the two-phase (hot and cold) plasma at the cluster core with differential velocity has probably caused the spiral formation and has redistributed the secondary electrons from the central BCG or the pre-accelerated electrons which have been (re-)accelerated by the sloshing turbulence to form the observed candidate radio mini-halo structure. This has been supported by a few previous studies that indicate spiral formation and sloshing turbulence may quench star formation and facilitate smooth metallicity distribution by mixing the gas in the core., Comment: 11 pages, 9 figures, Accepted for publication in MNRAS
- Published
- 2024
- Full Text
- View/download PDF
49. Post-Minkowskian Theory Meets the Spinning Effective-One-Body Approach for Bound-Orbit Waveforms
- Author
-
Buonanno, Alessandra, Mogull, Gustav, Patil, Raj, and Pompili, Lorenzo
- Subjects
General Relativity and Quantum Cosmology ,High Energy Physics - Theory - Abstract
Driven by advances in scattering amplitudes and worldline-based methods, recent years have seen significant progress in our ability to calculate gravitational two-body scattering observables. These observables effectively encapsulate the gravitational two-body problem in the weak-field and high-velocity regime (post-Minkowskian, PM), with applications to the bound two-body problem and gravitational-wave modeling. We leverage PM data to construct a complete inspiral-merger-ringdown waveform model for non-precessing spinning black holes within the effective-one-body (EOB) formalism: SEOBNR-PM. This model is closely based on the highly successful SEOBNRv5 model, used by the LIGO-Virgo-KAGRA Collaboration, with its key new feature being an EOB Hamiltonian derived by matching the two-body scattering angle in a perturbative PM expansion. The model performs remarkably well, showing a median mismatch against 441 numerical-relativity (NR) simulations that is somewhat lower than a similarly calibrated version of SEOBNRv5. Comparisons of the binding energy with NR also demonstrate better agreement than SEOBNRv5, despite the latter containing additional calibration to NR simulations., Comment: 5 pages, 4 figures; supplemental material; attached ancillary Mathematica file
- Published
- 2024
50. The 2024 Brain Tumor Segmentation (BraTS) Challenge: Glioma Segmentation on Post-treatment MRI
- Author
-
de Verdier, Maria Correia, Saluja, Rachit, Gagnon, Louis, LaBella, Dominic, Baid, Ujjwall, Tahon, Nourel Hoda, Foltyn-Dumitru, Martha, Zhang, Jikai, Alafif, Maram, Baig, Saif, Chang, Ken, D'Anna, Gennaro, Deptula, Lisa, Gupta, Diviya, Haider, Muhammad Ammar, Hussain, Ali, Iv, Michael, Kontzialis, Marinos, Manning, Paul, Moodi, Farzan, Nunes, Teresa, Simon, Aaron, Sollmann, Nico, Vu, David, Adewole, Maruf, Albrecht, Jake, Anazodo, Udunna, Chai, Rongrong, Chung, Verena, Faghani, Shahriar, Farahani, Keyvan, Kazerooni, Anahita Fathi, Iglesias, Eugenio, Kofler, Florian, Li, Hongwei, Linguraru, Marius George, Menze, Bjoern, Moawad, Ahmed W., Velichko, Yury, Wiestler, Benedikt, Altes, Talissa, Basavasagar, Patil, Bendszus, Martin, Brugnara, Gianluca, Cho, Jaeyoung, Dhemesh, Yaseen, Fields, Brandon K. K., Garrett, Filip, Gass, Jaime, Hadjiiski, Lubomir, Hattangadi-Gluth, Jona, Hess, Christopher, Houk, Jessica L., Isufi, Edvin, Layfield, Lester J., Mastorakos, George, Mongan, John, Nedelec, Pierre, Nguyen, Uyen, Oliva, Sebastian, Pease, Matthew W., Rastogi, Aditya, Sinclair, Jason, Smith, Robert X., Sugrue, Leo P., Thacker, Jonathan, Vidic, Igor, Villanueva-Meyer, Javier, White, Nathan S., Aboian, Mariam, Conte, Gian Marco, Dale, Anders, Sabuncu, Mert R., Seibert, Tyler M., Weinberg, Brent, Abayazeed, Aly, Huang, Raymond, Turk, Sevcan, Rauschecker, Andreas M., Farid, Nikdokht, Vollmuth, Philipp, Nada, Ayman, Bakas, Spyridon, Calabrese, Evan, and Rudie, Jeffrey D.
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Gliomas are the most common malignant primary brain tumors in adults and one of the deadliest types of cancer. There are many challenges in treatment and monitoring due to the genetic diversity and high intrinsic heterogeneity in appearance, shape, histology, and treatment response. Treatments include surgery, radiation, and systemic therapies, with magnetic resonance imaging (MRI) playing a key role in treatment planning and post-treatment longitudinal assessment. The 2024 Brain Tumor Segmentation (BraTS) challenge on post-treatment glioma MRI will provide a community standard and benchmark for state-of-the-art automated segmentation models based on the largest expert-annotated post-treatment glioma MRI dataset. Challenge competitors will develop automated segmentation models to predict four distinct tumor sub-regions consisting of enhancing tissue (ET), surrounding non-enhancing T2/fluid-attenuated inversion recovery (FLAIR) hyperintensity (SNFH), non-enhancing tumor core (NETC), and resection cavity (RC). Models will be evaluated on separate validation and test datasets using standardized performance metrics utilized across the BraTS 2024 cluster of challenges, including lesion-wise Dice Similarity Coefficient and Hausdorff Distance. Models developed during this challenge will advance the field of automated MRI segmentation and contribute to their integration into clinical practice, ultimately enhancing patient care., Comment: 10 pages, 4 figures, 1 table
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.