14,851 results on '"Divya, P."'
Search Results
2. Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
- Author
-
Ruan, Penghui, Wang, Pichao, Saxena, Divya, Cao, Jiannong, and Shi, Yuhui
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Despite advancements in Text-to-Video (T2V) generation, producing videos with realistic motion remains challenging. Current models often yield static or minimally dynamic outputs, failing to capture complex motions described by text. This issue stems from the internal biases in text encoding, which overlooks motions, and inadequate conditioning mechanisms in T2V generation models. To address this, we propose a novel framework called DEcomposed MOtion (DEMO), which enhances motion synthesis in T2V generation by decomposing both text encoding and conditioning into content and motion components. Our method includes a content encoder for static elements and a motion encoder for temporal dynamics, alongside separate content and motion conditioning mechanisms. Crucially, we introduce text-motion and video-motion supervision to improve the model's understanding and generation of motion. Evaluations on benchmarks such as MSR-VTT, UCF-101, WebVid-10M, EvalCrafter, and VBench demonstrate DEMO's superior ability to produce videos with enhanced motion dynamics while maintaining high visual quality. Our approach significantly advances T2V generation by integrating comprehensive motion understanding directly from textual descriptions. Project page: https://PR-Ryan.github.io/DEMO-project/, Comment: Accepted at NeurIPS 2024, code available at https://github.com/PR-Ryan/DEMO
- Published
- 2024
3. Palisade -- Prompt Injection Detection Framework
- Author
-
Kokkula, Sahasra, R, Somanathan, R, Nandavardhan, Aashishkumar, and Divya, G
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
The advent of Large Language Models LLMs marks a milestone in Artificial Intelligence, altering how machines comprehend and generate human language. However, LLMs are vulnerable to malicious prompt injection attacks, where crafted inputs manipulate the models behavior in unintended ways, compromising system integrity and causing incorrect outcomes. Conventional detection methods rely on static, rule-based approaches, which often fail against sophisticated threats like abnormal token sequences and alias substitutions, leading to limited adaptability and higher rates of false positives and false negatives.This paper proposes a novel NLP based approach for prompt injection detection, emphasizing accuracy and optimization through a layered input screening process. In this framework, prompts are filtered through three distinct layers rule-based, ML classifier, and companion LLM before reaching the target model, thereby minimizing the risk of malicious interaction.Tests show the ML classifier achieves the highest accuracy among individual layers, yet the multi-layer framework enhances overall detection accuracy by reducing false negatives. Although this increases false positives, it minimizes the risk of overlooking genuine injected prompts, thus prioritizing security.This multi-layered detection approach highlights LLM vulnerabilities and provides a comprehensive framework for future research, promoting secure interactions between humans and AI systems.
- Published
- 2024
4. The Galaxy Zoo Catalogs for the Galaxy And Mass Assembly (GAMA) Survey
- Author
-
Holwerda, Benne W., Robertson, Clayton, Cook, Kyle, Pimbblet, Kevin A., Casura, Sarah, Sansom, Anne E., Patel, Divya, Butrum, Trevor, Glass, David H. W., Kelvin, Lee, Baldry, Ivan K., De Propris, Roberto, Bamford, Steven, Masters, Karen, Stone, Maria, Hardin, Tim, Walmsley, Mike, Liske, Jochen, and Adnan, S M Rafee
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Galaxy Zoo is an online project to classify morphological features in extra-galactic imaging surveys with public voting. In this paper, we compare the classifications made for two different surveys, the Dark Energy Spectroscopic Instrument (DESI) imaging survey and a part of the Kilo-Degree Survey (KiDS), in the equatorial fields of the Galaxy And Mass Assembly (GAMA) survey. Our aim is to cross-validate and compare the classifications based on different imaging quality and depth. We find that generally the voting agrees globally but with substantial scatter i.e. substantial differences for individual galaxies. There is a notable higher voting fraction in favor of ``smooth'' galaxies in the DESI+\rev{{\sc zoobot}} classifications, most likely due to the difference between imaging depth. DESI imaging is shallower and slightly lower resolution than KiDS and the Galaxy Zoo images do not reveal details such as disk features \rev{and thus are missed in the {\sc zoobot} training sample}. \rev{We check against expert visual classifications and find good agreement with KiDS-based Galaxy Zoo voting.} We reproduce the results from Porter-Temple+ (2022), on the dependence of stellar mass, star-formation, and specific star-formation on the number of spiral arms. This shows that once corrected for redshift, the DESI Galaxy Zoo and KiDS Galaxy Zoo classifications agree well on population properties. The zoobot cross-validation increases confidence in its ability to compliment Galaxy Zoo classifications and its ability for transfer learning across surveys., Comment: 20 pages, 22 figures, 8 tables, accepted for publication in PASA
- Published
- 2024
5. Efficient Feature Extraction Using Light-Weight CNN Attention-Based Deep Learning Architectures for Ultrasound Fetal Plane Classification
- Author
-
Sivasubramanian, Arrun, Sasidharan, Divya, V, Sowmya, and Ravi, Vinayakumar
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Ultrasound fetal imaging is beneficial to support prenatal development because it is affordable and non-intrusive. Nevertheless, fetal plane classification (FPC) remains challenging and time-consuming for obstetricians since it depends on nuanced clinical aspects, which increases the difficulty in identifying relevant features of the fetal anatomy. Thus, to assist with its accurate feature extraction, a lightweight artificial intelligence architecture leveraging convolutional neural networks and attention mechanisms is proposed to classify the largest benchmark ultrasound dataset. The approach fine-tunes from lightweight EfficientNet feature extraction backbones pre-trained on the ImageNet1k. to classify key fetal planes such as the brain, femur, thorax, cervix, and abdomen. Our methodology incorporates the attention mechanism to refine features and 3-layer perceptrons for classification, achieving superior performance with the highest Top-1 accuracy of 96.25%, Top-2 accuracy of 99.80% and F1-Score of 0.9576. Importantly, the model has 40x fewer trainable parameters than existing benchmark ensemble or transformer pipelines, facilitating easy deployment on edge devices to help clinical practitioners with real-time FPC. The findings are also interpreted using GradCAM to carry out clinical correlation to aid doctors with diagnostics and improve treatment plans for expectant mothers., Comment: Submitted to Computers in Biology and Medicine journal
- Published
- 2024
6. Chirality induced phase separation in active circle swimmers
- Author
-
Kushwaha, Divya and Mishra, Shradha
- Subjects
Condensed Matter - Soft Condensed Matter - Abstract
Many microswimmers are inherently chiral, and this chirality can introduce fascinating behaviors in a collection of microswimmers. The dynamics become even more intriguing when two types of microswimmers with distinct chirality are mixed. Our study considers a mixture of self-propelled particles with opposite chirality, examining how the system characteristics evolve as the magnitude of chirality is tuned. In weakly chiral systems, the particles exhibit similar behavior, leading to a globally flocking phase where both types of particles are well mixed. However, in an intermediate range of chirality, the particles demix and follow their trajectories, creating a competition between chirality and self-propulsion. This competition results in interesting phases within the system. We explore the characteristics of these different phases in detail, focusing on the roles of self-propulsion speed and chirality., Comment: 13 pages,10 figures
- Published
- 2024
7. PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters
- Author
-
Ghazimatin, Azin, Garmash, Ekaterina, Penha, Gustavo, Sheets, Kristen, Achenbach, Martin, Semerci, Oguz, Galvez, Remi, Tannenberg, Marcus, Mantravadi, Sahitya, Narayanan, Divya, Kalaydzhyan, Ofeliya, Cole, Douglas, Carterette, Ben, Clifton, Ann, Bennett, Paul N., Hauff, Claudia, and Lalmas, Mounia
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence ,68P20 ,H.3.3 - Abstract
Listeners of long-form talk-audio content, such as podcast episodes, often find it challenging to understand the overall structure and locate relevant sections. A practical solution is to divide episodes into chapters--semantically coherent segments labeled with titles and timestamps. Since most episodes on our platform at Spotify currently lack creator-provided chapters, automating the creation of chapters is essential. Scaling the chapterization of podcast episodes presents unique challenges. First, episodes tend to be less structured than written texts, featuring spontaneous discussions with nuanced transitions. Second, the transcripts are usually lengthy, averaging about 16,000 tokens, which necessitates efficient processing that can preserve context. To address these challenges, we introduce PODTILE, a fine-tuned encoder-decoder transformer to segment conversational data. The model simultaneously generates chapter transitions and titles for the input transcript. To preserve context, each input text is augmented with global context, including the episode's title, description, and previous chapter titles. In our intrinsic evaluation, PODTILE achieved an 11% improvement in ROUGE score over the strongest baseline. Additionally, we provide insights into the practical benefits of auto-generated chapters for listeners navigating episode content. Our findings indicate that auto-generated chapters serve as a useful tool for engaging with less popular podcasts. Finally, we present empirical evidence that using chapter titles can enhance effectiveness of sparse retrieval in search tasks., Comment: 9 pages, 4 figures, CIKM industry track 2024
- Published
- 2024
- Full Text
- View/download PDF
8. Enabling a multifunctional telecommunications fiber optic network: Ultrastable optical frequency transfer and attosecond timing in deployed multicore fiber
- Author
-
Hoghooghi, Nazanin, Mazur, Mikael, Fontaine, Nicolas, Liu, Yifan, Lee, Dahyeon, McLemore, Charles, Nakamura, Takuma, Hayashi, Tetsuya, Di Sciullo, Giammarco, Shaji, Divya, Mecozzi, Antonio, Antonelli, Cristian, and Quinlan, Franklyn
- Subjects
Physics - Optics - Abstract
The telecommunications industry's deployment of billions of kilometers of optical fiber has created a vast global network that can be exploited for additional applications such as environmental sensing, quantum networking and international clock comparisons. However, for reasons such as the unidirectionality of long-haul fiber links, telecom fiber networks cannot always be adapted for important applications beyond data transmission. Fortunately, new multicore optical fibers create the opportunity for application coexistence with data traffic, creating expansive multifunctional networks. Towards that end, we propose and demonstrate the faithful transfer of ultrastable optical signals through multicore fiber in a way that is compatible with the unidirectionality of long-haul fiber optic systems, demonstrating a fractional frequency instability of 3x10-19 at 10,000 seconds. This opens the door towards intercontinental optical clock comparisons, with applications in fundamental physics and the redefinition of the second.
- Published
- 2024
9. KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors
- Author
-
Chen, Benson, Danel, Tomasz, McEnaney, Patrick J., Jain, Nikhil, Novikov, Kirill, Akki, Spurti Umesh, Turnbull, Joshua L., Pandya, Virja Atul, Belotserkovskii, Boris P., Weaver, Jared Bryce, Biswas, Ankita, Nguyen, Dat, Dreiman, Gabriel H. S., Sultan, Mohammad, Stanley, Nathaniel, Whalen, Daniel M, Kanichar, Divya, Klein, Christoph, Fox, Emily, and Watts, R. Edward
- Subjects
Quantitative Biology - Quantitative Methods ,Computer Science - Machine Learning - Abstract
DNA-Encoded Libraries (DEL) are combinatorial small molecule libraries that offer an efficient way to characterize diverse chemical spaces. Selection experiments using DELs are pivotal to drug discovery efforts, enabling high-throughput screens for hit finding. However, limited availability of public DEL datasets hinders the advancement of computational techniques designed to process such data. To bridge this gap, we present KinDEL, one of the first large, publicly available DEL datasets on two kinases: Mitogen-Activated Protein Kinase 14 (MAPK14) and Discoidin Domain Receptor Tyrosine Kinase 1 (DDR1). Interest in this data modality is growing due to its ability to generate extensive supervised chemical data that densely samples around select molecular structures. Demonstrating one such application of the data, we benchmark different machine learning techniques to develop predictive models for hit identification; in particular, we highlight recent structure-based probabilistic approaches. Finally, we provide biophysical assay data, both on- and off-DNA, to validate our models on a smaller subset of molecules. Data and code for our benchmarks can be found at: https://github.com/insitro/kindel.
- Published
- 2024
10. Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach
- Author
-
Bajpai, Divya Jyoti and Hanawal, Manjesh Kumar
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Recent advances in Deep Neural Networks (DNNs) have demonstrated outstanding performance across various domains. However, their large size is a challenge for deployment on resource-constrained devices such as mobile, edge, and IoT platforms. To overcome this, a distributed inference setup can be used where a small-sized DNN (initial few layers) can be deployed on mobile, a bigger version on the edge, and the full-fledged, on the cloud. A sample that has low complexity (easy) could be then inferred on mobile, that has moderate complexity (medium) on edge, and higher complexity (hard) on the cloud. As the complexity of each sample is not known beforehand, the following question arises in distributed inference: how to decide complexity so that it is processed by enough layers of DNNs. We develop a novel approach named DIMEE that utilizes Early Exit (EE) strategies developed to minimize inference latency in DNNs. DIMEE aims to improve the accuracy, taking into account the offloading cost from mobile to edge/cloud. Experimental validation on GLUE datasets, encompassing various NLP tasks, shows that our method significantly reduces the inference cost (> 43%) while maintaining a minimal drop in accuracy (< 0.3%) compared to the case where all the inference is made in cloud., Comment: 8 pages, 3 figures
- Published
- 2024
11. CAPEEN: Image Captioning with Early Exits and Knowledge Distillation
- Author
-
Bajpai, Divya Jyoti and Hanawal, Manjesh Kumar
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
Deep neural networks (DNNs) have made significant progress in recognizing visual elements and generating descriptive text in image-captioning tasks. However, their improved performance comes from increased computational burden and inference latency. Early Exit (EE) strategies can be used to enhance their efficiency, but their adaptation presents challenges in image captioning as it requires varying levels of semantic information for accurate predictions. To overcome this, we introduce CAPEEN to improve the performance of EE strategies using knowledge distillation. Inference in CAPEEN is completed at intermediary layers if prediction confidence exceeds a predefined value learned from the training data. To account for real-world deployments, where target distributions could drift from that of training samples, we introduce a variant A-CAPEEN to adapt the thresholds on the fly using Multiarmed bandits framework. Experiments on the MS COCO and Flickr30k datasets show that CAPEEN gains speedup of 1.77x while maintaining competitive performance compared to the final layer, and A-CAPEEN additionally offers robustness against distortions. The source code is available at https://github.com/Div290/CapEEN, Comment: To appear in EMNLP (finding) 2024
- Published
- 2024
12. DAdEE: Unsupervised Domain Adaptation in Early Exit PLMs
- Author
-
Bajpai, Divya Jyoti and Hanawal, Manjesh Kumar
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Pre-trained Language Models (PLMs) exhibit good accuracy and generalization ability across various tasks using self-supervision, but their large size results in high inference latency. Early Exit (EE) strategies handle the issue by allowing the samples to exit from classifiers attached to the intermediary layers, but they do not generalize well, as exit classifiers can be sensitive to domain changes. To address this, we propose Unsupervised Domain Adaptation in EE framework (DADEE) that employs multi-level adaptation using knowledge distillation. DADEE utilizes GAN-based adversarial adaptation at each layer to achieve domain-invariant representations, reducing the domain gap between the source and target domain across all layers. The attached exits not only speed up inference but also enhance domain adaptation by reducing catastrophic forgetting and mode collapse, making it more suitable for real-world scenarios. Experiments on tasks such as sentiment analysis, entailment classification, and natural language inference demonstrate that DADEE consistently outperforms not only early exit methods but also various domain adaptation methods under domain shift scenarios. The anonymized source code is available at https://github.com/Div290/DAdEE., Comment: To appear in EMNLP (findings) 2024
- Published
- 2024
13. Crafting Narrative Closures: Zero-Shot Learning with SSM Mamba for Short Story Ending Generation
- Author
-
Sharma, Divyam and Santhanam, Divya
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Writing stories is an engaging yet challenging endeavor. Often, authors encounter moments of creative block, where the path forward in their narrative becomes obscured. This paper is designed to address such moments by providing an innovative solution: A tool that completes stories based on given prompts. By inputting a short story prompt, users can receive a conclusion to their story, articulated in one sentence or more, thereby enhancing the storytelling process with AI-driven creativity. This tool aims not only to assist authors in navigating writer's block but also to offer a fun and interactive way for anyone to expand on story ideas spontaneously. Through this paper, we explore the intersection of artificial intelligence and creative writing, pushing the boundaries of how stories can be crafted and concluded. To create our final text-generation models, we used a pre-trained GPT-3.5 model and a newly created finetuned SSM-Mamba model, both of which perform well on a comprehensive list of metrics including BERT score, METEOR, BLEU, ROUGE, and Perplexity. The SSM model has also been made public for the NLP community on HuggingFace models as an open source contribution, which for the timebeing is a first of its kind state-space model for story-generation task on HuggingFace., Comment: 9 pages
- Published
- 2024
14. Solvent-cosolvent attraction is sufficient to induce polymer collapse in good solvent mixtures
- Author
-
Garg, Hitesh, Nayar, Divya, and Vemparala, Satyavani
- Subjects
Condensed Matter - Soft Condensed Matter ,Physics - Biological Physics ,Physics - Chemical Physics - Abstract
Cononsolvency occurs when two miscible, competing good solvents for a polymer are mixed, resulting in a loss of solubility. In this study, we demonstrate through simulations, supported by theory, that cononsolvency can be driven solely by solvent-cosolvent attraction ($\epsilon_{sc}$). The primary mechanism underlying this behavior is the emergent depletion effect, which is amplified by solvent-cosolvent interactions. The polymer reaches a compact state when the solvent and cosolvent fractions are equal ($x_s = x_c = 0.5$), a finding that aligns with predictions from Flory-Huggins theory and the random phase approximation. We show that this cononsolvency behavior is observed for different cosolvent sizes, provided the cosolvent density remains below the depletion threshold and the sizes of solvent and cosolvent particles are not smaller than the monomer size. Additionally, we investigate the role of temperature and find that cononsolvency weakens as temperature increases, due to a reduction in the depletion effect. Finally, we show that when preferential cosolvent attraction is introduced in this simple model, it leads to cononsolvency driven by bridging interactions, occurring at lower cosolvent fractions ($x_c < 0.5$)., Comment: 10 pages and an ancillary file
- Published
- 2024
15. Bayesian estimation for novel geometric INGARCH model
- Author
-
Andrews, Divya Kuttenchalil and Balakrishna, N.
- Subjects
Statistics - Methodology ,Statistics - Computation - Abstract
This paper introduces an integer-valued generalized autoregressive conditional heteroskedasticity (INGARCH) model based on the novel geometric distribution and discusses some of its properties. The parameter estimation problem of the models are studied by conditional maximum likelihood and Bayesian approach using Hamiltonian Monte Carlo (HMC) algorithm. The results of the simulation studies and real data analysis affirm the good performance of the estimators and the model.
- Published
- 2024
16. Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done!
- Author
-
Patel, Divya, Patel, Pathik, Chander, Ankush, Dasgupta, Sourish, and Chakraborty, Tanmoy
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning ,Computer Science - Neural and Evolutionary Computing ,I.2.7 - Abstract
Large Language Models (LLMs) have succeeded considerably in In-Context-Learning (ICL) based summarization. However, saliency is subject to the users' specific preference histories. Hence, we need reliable In-Context Personalization Learning (ICPL) capabilities within such LLMs. For any arbitrary LLM to exhibit ICPL, it needs to have the ability to discern contrast in user profiles. A recent study proposed a measure for degree-of-personalization called EGISES for the first time. EGISES measures a model's responsiveness to user profile differences. However, it cannot test if a model utilizes all three types of cues provided in ICPL prompts: (i) example summaries, (ii) user's reading histories, and (iii) contrast in user profiles. To address this, we propose the iCOPERNICUS framework, a novel In-COntext PERsonalization learNIng sCrUtiny of Summarization capability in LLMs that uses EGISES as a comparative measure. As a case-study, we evaluate 17 state-of-the-art LLMs based on their reported ICL performances and observe that 15 models' ICPL degrades (min: 1.6%; max: 3.6%) when probed with richer prompts, thereby showing lack of true ICPL.
- Published
- 2024
17. Controlling the band structure and quench dynamics in one-dimensional optomechanical array driven by a phase modulated laser
- Author
-
Mishra, Divya and Kumar, Parvendra
- Subjects
Quantum Physics ,Physics - Optics - Abstract
We theoretically investigated an array of coupled optomechanical cavities driven by a phase-modulated laser. We show that phase modulation enables the control of band structure and switching of the relative weights of photons and phonons in hybrid eigenmodes. Finally, we show how phase affects the population of hybrid modes and quench dynamics., Comment: 10 pages, 7 figures
- Published
- 2024
18. Wavelength-dependent anisotropic light-matter interaction in 2D ferroelectric In2Se3
- Author
-
Jangra, Divya, De, Binoy Krishna, Sharma, Pragati, Chakraborty, Koushik, Parate, Shubham, Yogi, Arvind Kumar, Mittal, Ranjan, Gupta, Mayanak K, Nukala, Pavan, Velpula, Praveen Kumar, and Sathe, Vasant G.
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
The anisotropic light-matter interactions in 2D materials have garnered significant attention for their potential to develop futuristic polarization-based optoelectronic devices, such as photodetectors and photo-actuators. In this study, we investigate the polarization-dependent interactions in ferroelectric 3R alpha-In2Se3 using Angle-Resolved Polarized Raman Spectroscopy (ARPRS) with different excitation lasers. Our experimental findings supported by complementary Density Functional Theory calculations demonstrate that the light-matter interactions depend not only on the crystallographic orientation but also on the excitation energy. Scanning transmission electron microscopy (STEM) confirms the highly anisotropic 3R crystal structure of alpha-In2Se3. This anisotropy in crystal structure facilitates significant optical anisotropy, driven by a complex interplay of electron-photon and electron-phonon interactions, which is reflected in the complex nature of the Raman tensor elements. These anisotropy interactions extend to the materials electrical response under light illumination. Remarkably, the anisotropic photo-response can be tuned by both polarization and wavelength of the incident light, making In2Se3 a promising material for advanced polarization-sensitive photodetection applications., Comment: 19 pages, 6 figures, supporting information
- Published
- 2024
19. ImPoster: Text and Frequency Guidance for Subject Driven Action Personalization using Diffusion Models
- Author
-
Kothandaraman, Divya, Kulkarni, Kuldeep, Shekhar, Sumit, Srinivasan, Balaji Vasan, and Manocha, Dinesh
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We present ImPoster, a novel algorithm for generating a target image of a 'source' subject performing a 'driving' action. The inputs to our algorithm are a single pair of a source image with the subject that we wish to edit and a driving image with a subject of an arbitrary class performing the driving action, along with the text descriptions of the two images. Our approach is completely unsupervised and does not require any access to additional annotations like keypoints or pose. Our approach builds on a pretrained text-to-image latent diffusion model and learns the characteristics of the source and the driving image by finetuning the diffusion model for a small number of iterations. At inference time, ImPoster performs step-wise text prompting i.e. it denoises by first moving in the direction of the image manifold corresponding to the driving image followed by the direction of the image manifold corresponding to the text description of the desired target image. We propose a novel diffusion guidance formulation, image frequency guidance, to steer the generation towards the manifold of the source subject and the driving action at every step of the inference denoising. Our frequency guidance formulations are derived from the frequency domain properties of images. We extensively evaluate ImPoster on a diverse set of source-driving image pairs to demonstrate improvements over baselines. To the best of our knowledge, ImPoster is the first approach towards achieving both subject-driven as well as action-driven image personalization. Code and data is available at https://github.com/divyakraman/ImPosterDiffusion2024.
- Published
- 2024
20. Inductive Spatial Temporal Prediction Under Data Drift with Informative Graph Neural Network
- Author
-
Zheng, Jialun, Saxena, Divya, Cao, Jiannong, Yang, Hanchen, and Ruan, Penghui
- Subjects
Computer Science - Machine Learning - Abstract
Inductive spatial temporal prediction can generalize historical data to predict unseen data, crucial for highly dynamic scenarios (e.g., traffic systems, stock markets). However, external events (e.g., urban structural growth, market crash) and emerging new entities (e.g., locations, stocks) can undermine prediction accuracy by inducing data drift over time. Most existing studies extract invariant patterns to counter data drift but ignore pattern diversity, exhibiting poor generalization to unseen entities. To address this issue, we design an Informative Graph Neural Network (INF-GNN) to distill diversified invariant patterns and improve prediction accuracy under data drift. Firstly, we build an informative subgraph with a uniquely designed metric, Relation Importance (RI), that can effectively select stable entities and distinct spatial relationships. This subgraph further generalizes new entities' data via neighbors merging. Secondly, we propose an informative temporal memory buffer to help the model emphasize valuable timestamps extracted using influence functions within time intervals. This memory buffer allows INF-GNN to discern influential temporal patterns. Finally, RI loss optimization is designed for pattern consolidation. Extensive experiments on real-world dataset under substantial data drift demonstrate that INF-GNN significantly outperforms existing alternatives.
- Published
- 2024
21. On orthogonality preserving and reversing operators
- Author
-
Khurana, Divya
- Subjects
Mathematics - Functional Analysis - Abstract
We study approximately orthogonality (in the sense of Dragomir) preserving and reversing operators. We obtain a complete characterization of approximate orthogonality preserving and reversing operators for a class of operators. We also study the locally approximate orthogonality preserving and reversing operators defined on some finite-dimensional Banach spaces.
- Published
- 2024
22. On symmetric and approximately symmetric operators
- Author
-
Khurana, Divya
- Subjects
Mathematics - Functional Analysis - Abstract
We introduce the notion of local orthogonality preserving operators to study the right-symmetry of operators. As a consequence of our work, we show that any smooth compact operator defined on a smooth and reflexive Banach space is either a rank one operator or it is not right-symmetric. We show that there are no right-symmetric smooth compact operators defined on a smooth and reflexive Banach space that fails to have any non-zero left-symmetric point. We also study approximately orthogonality preserving and reversing operators (in the sense of Chmieli\'{n}ski and Dragomir). We show that on a finite-dimensional Banach space, an operator is approximately orthogonality preserving (reversing) in the sense of Dragomir if and only if it is an injective operator.
- Published
- 2024
23. Aligning Judgment Using Task Context and Explanations to Improve Human-Recommender System Performance
- Author
-
Srivastava, Divya and Feigh, Karen M.
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Recommender systems, while a powerful decision making tool, are often operationalized as black box models, such that their AI algorithms are not accessible or interpretable by human operators. This in turn can cause confusion and frustration for the operator and result in unsatisfactory outcomes. While the field of explainable AI has made remarkable strides in addressing this challenge by focusing on interpreting and explaining the algorithms to human operators, there are remaining gaps in the human's understanding of the recommender system. This paper investigates the relative impact of using context, properties of the decision making task and environment, to align human and AI algorithm understanding of the state of the world, i.e. judgment, to improve joint human-recommender performance as compared to utilizing post-hoc algorithmic explanations. We conducted an empirical, between-subjects experiment in which participants were asked to work with an automated recommender system to complete a decision making task. We manipulated the method of transparency (shared contextual information to support shared judgment vs algorithmic explanations) and record the human's understanding of the task, the recommender system, and their overall performance. We found that both techniques yielded equivalent agreement on final decisions. However, those who saw task context had less tendency to over-rely on the recommender system and were able to better pinpoint in what conditions the AI erred. Both methods improved participants' confidence in their own decision making, and increased mental demand equally and frustration negligibly. These results present an alternative approach to improving team performance to post-hoc explanations and illustrate the impact of judgment on human cognition in working with recommender systems., Comment: arXiv admin note: text overlap with arXiv:2310.11370
- Published
- 2024
24. An Analog and Digital Hybrid Attention Accelerator for Transformers with Charge-based In-memory Computing
- Author
-
Moradifirouzabadi, Ashkan, Dodla, Divya Sri, and Kang, Mingu
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Machine Learning - Abstract
The attention mechanism is a key computing kernel of Transformers, calculating pairwise correlations across the entire input sequence. The computing complexity and frequent memory access in computing self-attention put a huge burden on the system especially when the sequence length increases. This paper presents an analog and digital hybrid processor to accelerate the attention mechanism for transformers in 65nm CMOS technology. We propose an analog computing-in-memory (CIM) core, which prunes ~75% of low-score tokens on average during runtime at ultra-low power and delay. Additionally, a digital processor performs precise computations only for ~25% unpruned tokens selected by the analog CIM core, preventing accuracy degradation. Measured results show peak energy efficiency of 14.8 and 1.65 TOPS/W, and peak area efficiency of 976.6 and 79.4 GOPS/mm$^\mathrm{2}$ in the analog core and the system-on-chip (SoC), respectively., Comment: 4 pages, 9 figures, to be published at ESSERC 2024
- Published
- 2024
- Full Text
- View/download PDF
25. Deciphering Cardiac Destiny: Unveiling Future Risks Through Cutting-Edge Machine Learning Approaches
- Author
-
Divya, G., SravanKumar, M. Naga, JayaDharani, T., Pavan, B., and Praveen, K.
- Subjects
Computer Science - Computers and Society - Abstract
Cardiac arrest remains a leading cause of death worldwide, necessitating proactive measures for early detection and intervention. This project aims to develop and assess predictive models for the timely identification of cardiac arrest incidents, utilizing a comprehensive dataset of clinical parameters and patient histories. Employing machine learning (ML) algorithms like XGBoost, Gradient Boosting, and Naive Bayes, alongside a deep learning (DL) approach with Recurrent Neural Networks (RNNs), we aim to enhance early detection capabilities. Rigorous experimentation and validation revealed the superior performance of the RNN model, which effectively captures complex temporal dependencies within the data. Our findings highlight the efficacy of these models in accurately predicting cardiac arrest likelihood, emphasizing the potential for improved patient care through early risk stratification and personalized interventions. By leveraging advanced analytics, healthcare providers can proactively mitigate cardiac arrest risk, optimize resource allocation, and improve patient outcomes. This research highlights the transformative potential of machine learning and deep learning techniques in managing cardiovascular risk and advances the field of predictive healthcare analytics.
- Published
- 2024
26. A Comprehensive Catalog of UVIT Observations I: Catalog Description and First Release of Source Catalog (UVIT DR1)
- Author
-
Piridi, Sonika, Kumar, Ranjan, Pandey, Divya, and Pradhan, Ananta C.
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - Instrumentation and Methods for Astrophysics ,Astrophysics - Solar and Stellar Astrophysics - Abstract
We present the first comprehensive source catalog (UVIT DR1) of ultraviolet (UV) photometry in four far-UV (FUV $\sim$1300$-$1800 \AA) and five near-UV (NUV $\sim$2000$-$3000 \AA) filters of the Ultraviolet Imaging Telescope (UVIT) on board {\em AstroSat}. UVIT DR1 includes bright UV sources in 291 fields that UVIT detected during its first two years of pointed observation, encompassing an area of 58 square degrees. We used the {\sc ccdlab} pipeline to reduce the L1 data, source-extractor for source detection, and four photometric procedures to determine the magnitudes of the detected sources. We provided the 3$\sigma$ and 5$\sigma$ detection limits for all the filters of UVIT. We describe the details of observation, source extraction methods, and photometry procedures applied to prepare the catalog. In the final UVIT DR1 catalog, we have point sources, extended sources, clumps from nearby galaxies, There are 239,520 unique sources in the combined UVIT DR1, of which 70,488 sources have FUV magnitudes, and 211,410 have NUV magnitudes. We cross-matched and compared non-crowded sources of UVIT with the {\em Galaxy Evolution Explorer (GALEX)} and {\em Gaia} source catalogs. We provide a clean catalog of the unique sources in various UVIT filters that will help further multi-wavelength scientific analysis of the objects., Comment: 23 pages, 10 figures, 10 tables, accepted for publication in the Astrophysical Journal Supplement Series
- Published
- 2024
27. Overview of School-Based Telehealth Network Grant Program Services Delivered to Students in Rural Schools
- Author
-
Marcia M. Ward, Divya Bhagianadh, Fred Ullrich, Kimberly A. S. Merchant, and Carlos Mena
- Abstract
Telehealth can expand and enhance access to school-based health care, but its use has been relatively limited. Recognizing that school-based health care is still not reaching many students, the Health Resources and Services Administration (HRSA) funded the School Based Telehealth Network Grant Program to expand telehealth in rural school-based settings to help to increase the availability and use of these services. The 19 grantees delivered telehealth to over 200 schools across 17 states, choosing which services they would deliver and how. Looking across the services, these fell into three categories -- primary/urgent care, behavioral health, and other more specialized services. The majority of grantees offered multiple telehealth services with the combination of behavioral health and primary/urgent care the most common. The current study adds to the literature by elucidating that telehealth in schools can address multiple clinical conditions through separate services even though doing so involves using various combinations of clinicians providing different services.
- Published
- 2024
- Full Text
- View/download PDF
28. How will advanced AI systems impact democracy?
- Author
-
Summerfield, Christopher, Argyle, Lisa, Bakker, Michiel, Collins, Teddy, Durmus, Esin, Eloundou, Tyna, Gabriel, Iason, Ganguli, Deep, Hackenburg, Kobi, Hadfield, Gillian, Hewitt, Luke, Huang, Saffron, Landemore, Helene, Marchal, Nahema, Ovadya, Aviv, Procaccia, Ariel, Risse, Mathias, Schneier, Bruce, Seger, Elizabeth, Siddarth, Divya, Sætra, Henrik Skaug, Tessler, MH, and Botvinick, Matthew
- Subjects
Computer Science - Computers and Society ,Computer Science - Artificial Intelligence - Abstract
Advanced AI systems capable of generating humanlike text and multimodal content are now widely available. In this paper, we discuss the impacts that generative artificial intelligence may have on democratic processes. We consider the consequences of AI for citizens' ability to make informed choices about political representatives and issues (epistemic impacts). We ask how AI might be used to destabilise or support democratic mechanisms like elections (material impacts). Finally, we discuss whether AI will strengthen or weaken democratic principles (foundational impacts). It is widely acknowledged that new AI systems could pose significant challenges for democracy. However, it has also been argued that generative AI offers new opportunities to educate and learn from citizens, strengthen public discourse, help people find common ground, and to reimagine how democracies might work better., Comment: 25 pages
- Published
- 2024
29. Multi-domain Network Slice Partitioning: A Graph Neural Network Algorithm
- Author
-
Wu, Zhouxiang, Ishigaki, Genya, Gour, Riti, Li, Congzhou, Khanure, Divya, and Jue, Jason P.
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
In the context of multi-domain network slices, multiple domains need to work together to provide a service. The problem of determining which part of the service fits within which domain is referred to as slice partitioning. The partitioning of multi-domain network slices poses a challenging problem, particularly when striving to strike the right balance between inter-domain and intra-domain costs, as well as ensuring optimal load distribution within each domain. To approach the optimal partition solution while maintaining load balance between domains, a framework has been proposed. This framework not only generates partition plans with various characteristics but also employs a Graph Neural Network solver, which significantly reduces the plan generation time. The proposed approach is promising in generating partition plans for multi-domain network slices and is expected to improve the overall performance of the network.
- Published
- 2024
30. A Multi-Agent Reinforcement Learning Scheme for SFC Placement in Edge Computing Networks
- Author
-
Li, Congzhou, Wu, Zhouxiang, Khanure, Divya, and Jue, Jason P.
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
In the 5G era and beyond, it is favorable to deploy latency-sensitive and reliability-aware services on edge computing networks in which the computing and network resources are more limited compared to cloud and core networks but can respond more promptly. These services can be composed as Service Function Chains (SFCs) which consist of a sequence of ordered Virtual Network Functions (VNFs). To achieve efficient edge resources allocation for SFC requests and optimal profit for edge service providers, we formulate the SFC placement problem in an edge environment and propose a multi-agent Reinforcement Learning (RL) scheme to address the problem. The proposed scheme employs a set of RL agents to collaboratively make SFC placement decisions, such as path selection, VNF configuration, and VNF deployment. Simulation results show our model can improve the profit of edge service providers by 12\% compared with a heuristic solution.
- Published
- 2024
31. On the Initial Value Problem for Hyperbolic Systems with Discontinuous Coefficients
- Author
-
Joseph, Kayyunnapara Divya
- Subjects
Mathematics - Analysis of PDEs ,2020 Mathematics Subject Classification. 35R05, 35L45, 35D30 - Abstract
Hyperbolic systems of the first and higher-order partial differential equations appear in many multiphysics problems. We will be dealing with a wave propagation problem in a piece-wise homogeneous medium. Mathematically, the problem is reduced to analyzing two systems of partial differential equations posed on two domains with a common boundary. The differential equations may not be satisfied on the boundary (or part of the boundary), but some interface conditions are satisfied. These interface conditions depend on a specific physical problem. We aim to prove the existence and regularity of the solution for the case of hyperbolic systems of first-order equations with different domains separated by a hyperplane, where we need to formulate the interface conditions. We do this for the initial value problem in 1D-space variable when the coefficient matrix has discontinuity on $m$ lines. More specifically, we find explicit solutions to the case when the coefficient matrix is piecewise constant with a discontinuity along $1$ line or $2$ lines. We also prove the existence of solution to the general initial value problem. We then formulate the weak solution of initial value problem for the corresponding symmetric hyperbolic system in $n $D-space variables with interface conditions, get the energy estimates for this system, and prove the existence of solution to the system.
- Published
- 2024
32. New Insights into Type-I Solar Noise Storms from High Angular Resolution Spectroscopic Imaging with the upgraded Giant Metrewave Radio Telescope
- Author
-
Mondal, Surajit, Kansabanik, Devojyoti, Oberoi, Divya, and Dey, Soham
- Subjects
Astrophysics - Solar and Stellar Astrophysics - Abstract
Type-I solar noise storms are perhaps the most commonly observed active radio emissions from the Sun at meter-wavelengths. Noise storms have a long-lived and wideband continuum background with superposed islands of much brighter narrowband and short-lived emissions, known as type-I bursts. There is a serious paucity of studies focusing on the morphology of these two types of emissions, primarily because of the belief that coronal scattering will always wash out any features at small angular scales. However, it is important to { investigate} their spatial structures in detail to make a spatio-temporal connection with observations at extreme-ultraviolet/ X-ray bands to understand the detailed nature of these emissions. In this work, we use high angular resolution observations from the upgraded Giant Metrewave Radio Telescope to demonstrate that it is possible to detect structures with angular scales as small as $\sim 9\arcsec$, about three times smaller than the smallest structure reported to date from noise storms. Our observations also suggest while the individual type-I bursts are narrowband in nature, the bursts are probably caused by traveling disturbance(s) inducing magnetic reconnections at different coronal heights, and thus leading to correlated change in the morphology of the type-I bursts observed at a wide range of frequencies., Comment: Accepted for publication in the Astrophysical Journal
- Published
- 2024
33. Basset-Boussinesq history force and inertia are relevant for unsteady particle settling dynamics
- Author
-
Jaroslawski, Tomek, Jaganathan, Divya, Govindarajan, Rama, and McKeon, Beverley
- Subjects
Physics - Fluid Dynamics - Abstract
Our experiments on a sphere falling under gravity in Stokes flow show significant history effects. We observe an algebraic, not exponential, relaxation rate to the terminal velocity, validating the solution to the Basset-Boussinesq-Oseen equation. Unlike in steady Stokes theory, our experiments and theory reveal a vortex ring forming around the sphere and drifting away. As the Reynolds number nears unity, the vortex ring lags behind the sphere, departing from Stokesian theory, though the sphere's algebraic response persists. These findings are critical for interactions in the Stokes limit.
- Published
- 2024
34. The Median of Sierpinski Triangle Graphs
- Author
-
Balakrishnan, Kannan, Changat, Manoj, Dhanyamol, M V., Hinz, Andreas M., Koley, Hrishik, and Lekha, Divya Sindhu
- Subjects
Mathematics - Combinatorics - Abstract
The median $M$ of a graph $G$ is the set of vertices with a minimum total distance to all other vertices in the graph. In this paper, we determine the median of Sierpi\'{n}ski triangle graphs. Sierpi\'{n}ski triangle graphs, also known as Sierpi\'{n}ski gasket graphs of order $n$ are graphs formed by contracting all non-clique edges from the Sierpi\'{n}ski graphs of order ($n+1$).
- Published
- 2024
35. Neutrino Nonstandard Interactions and Lepton Flavor Universality violation at SND@LHC via charm production
- Author
-
Bhattacharya, Bhubanjyoti, Datta, Alakabha, Graverini, Elena, Mukherjee, Lopamudra, Sachdeva, Divya, and Waite, John
- Subjects
High Energy Physics - Phenomenology ,High Energy Physics - Experiment - Abstract
In this work, we explore the effect of neutrino nonstandard interactions (NSI) involving the charm quark at SND@LHC. Using an effective description of new physics in terms of four-fermion operators involving a charm quark, we constrain the Wilson coefficients of the effective interaction from two and three-body charmed meson decays. In our fit, we include charmed meson decays not only to pseudoscalar final states but also to vector final states and include decays to the $\eta$ and $\eta^\prime$ final states. We also consider constraints from charmed baryon decays. We then study the effect of new physics in neutrino scattering processes, involving charm production at SND@LHC, for various benchmark new physics couplings obtained from the low energy fits. Finally, we also study the effects of lepton universality violation (LUV) assuming that the new physics coupling is not lepton universal., Comment: 23 pages, 3 figures; included neutrino production modes that affect results, typos corrected, references added
- Published
- 2024
36. Overcoming Growth-Induced Forgetting in Task-Agnostic Continual Learning
- Author
-
Zhao, Yuqing, Saxena, Divya, Cao, Jiannong, Liu, Xiaoyun, and Song, Changlin
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
In continual learning (CL), model growth enhances adaptability over new data, improving knowledge retention for more tasks. However, improper model growth can lead to severe degradation of previously learned knowledge, an issue we name as growth-induced forgetting (GIFt), especially in task-agnostic CL using entire grown model for inference. Existing works, despite adopting model growth and random initialization for better adaptability, often fail to recognize the presence of GIFt caused by improper model growth. This oversight limits comprehensive control of forgetting and hinders full utilization of model growth. We are the first in CL to identify this issue and conduct an in-depth study on root cause of GIFt, where layer expansion stands out among model growth strategies, widening layers without affecting model functionality. Yet, direct adoption of layer expansion presents challenges. It lacks data-driven control and initialization of expanded parameters to balance adaptability and knowledge retention. This paper presents a novel SparseGrow approach to overcome the issue of GIFt while enhancing adaptability over new data. SparseGrow employs data-driven sparse layer expansion to control efficient parameter usage during growth, reducing GIFt from excessive growth and functionality changes. It also combines sparse growth with on-data initialization at training late-stage to create partially 0-valued expansions that fit learned distribution, enhancing retention and adaptability. To further minimize forgetting, freezing is applied by calculating the sparse mask, allowing data-driven preservation of important parameters. Through experiments across datasets with various settings, cases, and task numbers, we demonstrate the necessity of layer expansion and showcase the effectiveness of SparseGrow in overcoming GIFt, highlighting its adaptability and knowledge retention for incremental tasks.
- Published
- 2024
37. Singular solutions of a system of a non-strictly hyperbolic system
- Author
-
Joseph, Kayyunnapara Divya
- Subjects
Mathematics - Analysis of PDEs ,35A20, 35L40, 35F50, 35F55 - Abstract
Systems of the first order partial differential equations with singular solutions appear in many multiphysics problems and the weak formulation of solutions involve in many cases product of distributions. In this paper we study such a system derived from Eulerian droplet model for air particle flow. This is a 2 x 2 non - strictly hyperbolic system of conservation laws with linear damping. We first study a regularized viscous system with variable viscosity term and obtain a weak asymptotic solution with general initial data and also get solution in the Colombeau algebra. We also study the vanishing viscosity limit and show that this limit is a distribution solution. Further we study the large time asymptotic behaviour of the viscous system. This important system, is not very well studied due to complexities in the analysis. As far as we know the only work done on this system is for Riemann type of initial data. The significance of this paper is that we work on the system having general initial data and not just initial data of the Riemann type., Comment: Preprint
- Published
- 2024
38. Weak asymptotic solution of one dimensional zero pressure dynamics system in the quarter plane
- Author
-
Joseph, Kayyunnapara Divya
- Subjects
Mathematics - Analysis of PDEs ,35A20, 35L50, 35R05 - Abstract
In this paper we study a system of equations which appear in the modelling of many physical phenomena. Initially this system appeared in description of the large scale structure formation. Recently it is derived as a second order queueing model. We construct weakly asymptotic solutions of the initial boundary value problem for the system and interaction of waves in the quarter plane $\{(x,t): x>0,t>0\}$ with boundary Riemann solution centered at $(0,0)$ and Riemann solution centered at a point $(x_0,0), x_0>0$.
- Published
- 2024
- Full Text
- View/download PDF
39. Wave Interaction For A System In Elastodynamics With Initial Data Lying On The Level Set Of One Of The Riemann Invariants
- Author
-
Joseph, Kayyunnapara Divya
- Subjects
Mathematics - Analysis of PDEs ,35A20, 35L50, 35R05 - Abstract
This paper is concerned with the study of interaction of waves originating from the Riemann problem centred at two different points for a system of equations modelling propagation of elastic waves. The system consists of two equations for $(u,\sigma)$, where, u is the velocity and $\sigma$ is the stress and is strictly hyperbolic and nonconservative. Study of interaction of waves is one of the most important steps in the construction of global solution with initial data in the space of functions of bounded variation using approximation procedure like the Glimm's scheme. This amounts to constructing a solution with initial data consisting of three states $(u_L, \sigma_L), (u_m, \sigma_m),$ and $(u_R, \sigma_R)$. Usually this analysis is done for the states which are in a small neighbourhood of a fixed state. Here we get explicit formula for the solution of the system when the data lies in the level sets of Riemann invariants. The speciality of the work is that we donot assume smallness conditions on the initial data.
- Published
- 2024
- Full Text
- View/download PDF
40. Riemann type initial boundary value problem for a system in elastodynamics
- Author
-
Joseph, Kayyunnapara Divya and Dinesh, P. A
- Subjects
Mathematics - Analysis of PDEs ,35A20, 35L50, 35R05 - Abstract
This paper is a continuation of our previous paper \cite{d1} on the initial boundary value problem for a nonconservative system appearing in elastodynamics in the space time domain $x>0,t>0$. There, the initial and boundary data were assumed to lie on the level sets of one of the Riemann invariants where as in this paper we consider the general initial and boundary data of Riemann type, formulate the boundary value problem based on the Riemann problem and construct explicitly the solution.
- Published
- 2024
- Full Text
- View/download PDF
41. Auptimize: Optimal Placement of Spatial Audio Cues for Extended Reality
- Author
-
Cho, Hyunsung, Wang, Alexander, Kartik, Divya, Xie, Emily Liying, Yan, Yukang, and Lindlbauer, David
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Sound ,Electrical Engineering and Systems Science - Audio and Speech Processing ,H.5.1 ,H.5.2 ,H.5.5 - Abstract
Spatial audio in Extended Reality (XR) provides users with better awareness of where virtual elements are placed, and efficiently guides them to events such as notifications, system alerts from different windows, or approaching avatars. Humans, however, are inaccurate in localizing sound cues, especially with multiple sources due to limitations in human auditory perception such as angular discrimination error and front-back confusion. This decreases the efficiency of XR interfaces because users misidentify from which XR element a sound is coming. To address this, we propose Auptimize, a novel computational approach for placing XR sound sources, which mitigates such localization errors by utilizing the ventriloquist effect. Auptimize disentangles the sound source locations from the visual elements and relocates the sound sources to optimal positions for unambiguous identification of sound cues, avoiding errors due to inter-source proximity and front-back confusion. Our evaluation shows that Auptimize decreases spatial audio-based source identification errors compared to playing sound cues at the paired visual-sound locations. We demonstrate the applicability of Auptimize for diverse spatial audio-based interactive XR scenarios., Comment: UIST 2024
- Published
- 2024
- Full Text
- View/download PDF
42. RollingCache: Using Runtime Behavior to Defend Against Cache Side Channel Attacks
- Author
-
Ojha, Divya and Dwarkadas, Sandhya
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Hardware Architecture - Abstract
Shared caches are vulnerable to side channel attacks through contention in cache sets. Besides being a simple source of information leak, these side channels form useful gadgets for more sophisticated attacks that compromise the security of shared systems. The fundamental design aspect that contention attacks exploit is the deterministic nature of the set of addresses contending for a cache set. In this paper, we present RollingCache, a cache design that defends against contention attacks by dynamically changing the set of addresses contending for cache sets. Unlike prior defenses, RollingCache does not rely on address encryption/decryption, data relocation, or cache partitioning. We use one level of indirection to implement dynamic mapping controlled by the whole-cache runtime behavior. Our solution does not depend on having defined security domains, and can defend against an attacker running on the same or another core. We evaluate RollingCache on ChampSim using the SPEC-2017 benchmark suite. Our security evaluation shows that our dynamic mapping removes the deterministic ability to identify the source of contention. The performance evaluation shows an impact of 1.67\% over a mix of workloads, with a corresponding
- Published
- 2024
43. Initial Boundary Value Problem For A Non-conservative System In Elastodynamics
- Author
-
Joseph, Kayyunnapara Divya and Dinesh, P. A
- Subjects
Mathematics - Analysis of PDEs ,35A20, 35L50, 35R05 - Abstract
This paper is concerned with the initial boundary value problem for a nonconservative system of hyperbolic equation appearing in elastodynamics in the space time domain $x > 0, t > 0$. The number of boundary conditions to be prescribed at the boundary $x = 0$, depend on the number of characteristics entering the domain. Since our system is nonlinear the characteristic speeds depends on the unknown and the direction of the characteristics curves are known apriori. As it is well known, the boundary condition has to be understood in a generalised way. One of the standard way is using vanishing viscosity method. We use this method to construct solution for a particular class of initial and boundary data, namely the initial and boundary datas that lie on the level sets of one of the Riemann invariants., Comment: 13 pages
- Published
- 2024
- Full Text
- View/download PDF
44. Personhood credentials: Artificial intelligence and the value of privacy-preserving tools to distinguish who is real online
- Author
-
Adler, Steven, Hitzig, Zoë, Jain, Shrey, Brewer, Catherine, Chang, Wayne, DiResta, Renée, Lazzarin, Eddy, McGregor, Sean, Seltzer, Wendy, Siddarth, Divya, Soliman, Nouran, South, Tobin, Spelliscy, Connor, Sporny, Manu, Srivastava, Varya, Bailey, John, Christian, Brian, Critch, Andrew, Falcon, Ronnie, Flanagan, Heather, Duffy, Kim Hamilton, Ho, Eric, Leibowicz, Claire R., Nadhamuni, Srikanth, Rozenshtein, Alan Z., Schnurr, David, Shapiro, Evan, Strahm, Lacey, Trask, Andrew, Weinberg, Zoe, Whitney, Cedric, and Zick, Tom
- Subjects
Computer Science - Computers and Society - Abstract
Anonymity is an important principle online. However, malicious actors have long used misleading identities to conduct fraud, spread disinformation, and carry out other deceptive schemes. With the advent of increasingly capable AI, bad actors can amplify the potential scale and effectiveness of their operations, intensifying the challenge of balancing anonymity and trustworthiness online. In this paper, we analyze the value of a new tool to address this challenge: "personhood credentials" (PHCs), digital credentials that empower users to demonstrate that they are real people -- not AIs -- to online services, without disclosing any personal information. Such credentials can be issued by a range of trusted institutions -- governments or otherwise. A PHC system, according to our definition, could be local or global, and does not need to be biometrics-based. Two trends in AI contribute to the urgency of the challenge: AI's increasing indistinguishability from people online (i.e., lifelike content and avatars, agentic activity), and AI's increasing scalability (i.e., cost-effectiveness, accessibility). Drawing on a long history of research into anonymous credentials and "proof-of-personhood" systems, personhood credentials give people a way to signal their trustworthiness on online platforms, and offer service providers new tools for reducing misuse by bad actors. In contrast, existing countermeasures to automated deception -- such as CAPTCHAs -- are inadequate against sophisticated AI, while stringent identity verification solutions are insufficiently private for many use-cases. After surveying the benefits of personhood credentials, we also examine deployment risks and design challenges. We conclude with actionable next steps for policymakers, technologists, and standards bodies to consider in consultation with the public., Comment: 63 pages, 7 figures, 5 tables; minor additions to acknowledgments and wording changes for clarity; corrected typo
- Published
- 2024
45. 3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance
- Author
-
Kang, Taewon, Kothandaraman, Divya, Manocha, Dinesh, and Lin, Ming C.
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent 3D novel view synthesis (NVS) methods are limited to single-object-centric scenes and struggle with complex environments. They often require extensive 3D data for training, lacking generalization beyond the training distribution. Conversely, 3D-free methods can generate text-controlled views of complex, in-the-wild scenes using a pretrained stable diffusion model without the need for a large amount of 3D-based training data, but lack camera control. In this paper, we introduce a method capable of generating camera-controlled viewpoints from a single input image, by combining the benefits of 3D-free and 3D-based approaches. Our method excels in handling complex and diverse scenes without extensive training or additional 3D and multiview data. It leverages widely available pretrained NVS models for weak guidance, integrating this knowledge into a 3D-free view synthesis approach to achieve the desired results. Experimental results demonstrate that our method outperforms existing models in both qualitative and quantitative evaluations, providing high-fidelity and consistent novel view synthesis at desired camera angles across a wide variety of scenes., Comment: 13 pages, 12 figures, v3: corrected typos in figures
- Published
- 2024
46. Impact Analysis of Data Drift Towards The Development of Safety-Critical Automotive System
- Author
-
Hossain, Md Shahi Amran, Ahammed, Abu Shad, Biswas, Divya Prakash, and Obermaisser, Roman
- Subjects
Mathematics - Logic - Abstract
A significant part of contemporary research in autonomous vehicles is dedicated to the development of safety critical systems where state-of-the-art artificial intelligence (AI) algorithms, like computer vision (CV), can play a major role. Vision models have great potential for the real-time detection of numerous traffic signs and obstacles, which is essential to avoid accidents and protect human lives. Despite vast potential, computer vision-based systems have critical safety concerns too if the traffic condition drifts over time. This paper represents an analysis of how data drift can affect the performance of vision models in terms of traffic sign detection. The novelty in this research is provided through a YOLO-based fusion model that is trained with drifted data from the CARLA simulator and delivers a robust and enhanced performance in object detection. The enhanced model showed an average precision of 97.5\% compared to the 58.27\% precision of the original model. A detailed performance review of the original and fusion models is depicted in the paper, which promises to have a significant impact on safety-critical automotive systems.
- Published
- 2024
47. Accelerating Full Waveform Inversion By Transfer Learning
- Author
-
Singh, Divya Shyam, Herrmann, Leon, Sun, Qing, Bürchner, Tim, Dietrich, Felix, and Kollmannsberger, Stefan
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Full waveform inversion (FWI) is a powerful tool for reconstructing material fields based on sparsely measured data obtained by wave propagation. For specific problems, discretizing the material field with a neural network (NN) improves the robustness and reconstruction quality of the corresponding optimization problem. We call this method NN-based FWI. Starting from an initial guess, the weights of the NN are iteratively updated to fit the simulated wave signals to the sparsely measured data set. For gradient-based optimization, a suitable choice of the initial guess, i.e., a suitable NN weight initialization, is crucial for fast and robust convergence. In this paper, we introduce a novel transfer learning approach to further improve NN-based FWI. This approach leverages supervised pretraining to provide a better NN weight initialization, leading to faster convergence of the subsequent optimization problem. Moreover, the inversions yield physically more meaningful local minima. The network is pretrained to predict the unknown material field using the gradient information from the first iteration of conventional FWI. In our computational experiments on two-dimensional domains, the training data set consists of reference simulations with arbitrarily positioned elliptical voids of different shapes and orientations. We compare the performance of the proposed transfer learning NN-based FWI with three other methods: conventional FWI, NN-based FWI without pretraining and conventional FWI with an initial guess predicted from the pretrained NN. Our results show that transfer learning NN-based FWI outperforms the other methods in terms of convergence speed and reconstruction quality.
- Published
- 2024
48. Differential torsion theories on Eilenberg-Moore categories of monads
- Author
-
Ahuja, Divya and Kour, Surjeet
- Subjects
Mathematics - Category Theory ,13N15, 16S90, 18C20, 18E40 - Abstract
Let $\mathcal C$ be a Grothendieck category and $U$ be a monad on $\mathcal C$ that is exact and preserves colimits. In this article, we prove that every hereditary torsion theory on the Eilenberg-Moore category of modules over a monad $U$ is differential. Further, if $\delta:U\longrightarrow U$ denotes a derivation on a monad $U$, then we show that every $\delta$-derivation on a $U$-module $M$ extends uniquely to a $\delta$-derivation on the module of quotients of $M$.
- Published
- 2024
49. Giant gate-controlled room temperature odd-parity magnetoresistance in magnetized bilayer graphene
- Author
-
Sahani, Divya, Das, Sunit, Watanabe, Kenji, Taniguchi, Takashi, Agarwal, Amit, and Bid, Aveek
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Materials Science - Abstract
Magnetotransport measurements are crucial for understanding the Fermi surface properties, magnetism, and topology in quantum materials. Here, we report the discovery of giant room temperature odd-parity magnetoresistance (OMR) in a bilayer graphene (BLG) heterostructure interfaced with Cr$_2$Te$_2$Ge$_6$ (CGT). Using magnetotransport measurements, we demonstrate that the BLG/CGT heterostructure exhibits a significant antisymmetric longitudinal magnetoresistance, indicative of intrinsic time-reversal symmetry (TRS) breaking in the system. We show that the OMR is tunable via electrostatic gating. Additionally, the OMR is pronounced near the band edges and diminishes with increasing charge carrier density in graphene. Our theoretical analysis reveals that this phenomenon arises from the coupling of the out-of-plane components of Berry curvature and orbital magnetic moment to the applied magnetic field in a TRS-broken system. Our findings establish OMR as a significant probe for TRS breaking in quantum materials in which the crystal symmetries preclude the appearance of anomalous Hall effect., Comment: 34 pages
- Published
- 2024
50. Integrated Hardware Architecture and Device Placement Search
- Author
-
Wang, Irene, Tarnawski, Jakub, Phanishayee, Amar, and Mahajan, Divya
- Subjects
Computer Science - Machine Learning ,Computer Science - Hardware Architecture ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Distributed execution of deep learning training involves a dynamic interplay between hardware accelerator architecture and device placement strategy. This is the first work to explore the co-optimization of determining the optimal architecture and device placement strategy through novel algorithms, improving the balance of computational resources, memory usage, and data distribution. Our architecture search leverages tensor and vector units, determining their quantity and dimensionality, and on-chip and off-chip memory configurations. It also determines the microbatch size and decides whether to recompute or stash activations, balancing the memory footprint of training and storage size. For each explored architecture configuration, we use an Integer Linear Program (ILP) to find the optimal schedule for executing operators on the accelerator. The ILP results then integrate with a dynamic programming solution to identify the most effective device placement strategy, combining data, pipeline, and tensor model parallelism across multiple accelerators. Our approach achieves higher throughput on large language models compared to the state-of-the-art TPUv4 and the Spotlight accelerator search framework. The entire source code of PHAZE is available at https://github.com/msr-fiddle/phaze., Comment: Accepted at the 41st International Conference on Machine Learning (ICML), 2024
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.