12,401 results on '"Kishore, P"'
Search Results
2. Enhancing Text Generation in Joint NLG/NLU Learning Through Curriculum Learning, Semi-Supervised Training, and Advanced Optimization Techniques
- Author
-
Shaik, Rahimanuddin and Kishore, Katikela Sreeharsha
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Text generation is the automated process of producing written or spoken language using computational methods. It involves generating coherent and contextually relevant text based on predefined rules or learned patterns. However, challenges in text generation arise from maintaining coherence, ensuring diversity and creativity, and avoiding biases or inappropriate content. This research paper developed a novel approach to improve text generation in the context of joint Natural Language Generation (NLG) and Natural Language Understanding (NLU) learning. The data is prepared by gathering and preprocessing annotated datasets, including cleaning, tokenization, stemming, and stop-word removal. Feature extraction techniques such as POS tagging, Bag of words, and Term Frequency-Inverse Document Frequency (TF-IDF) are applied. Transformer-based encoders and decoders, capturing long range dependencies and improving source-target sequence modelling. Pre-trained language models like Optimized BERT are incorporated, along with a Hybrid Redfox Artificial Hummingbird Algorithm (HRAHA). Reinforcement learning with policy gradient techniques, semi-supervised training, improved attention mechanisms, and differentiable approximations like straight-through Gumbel SoftMax estimator are employed to fine-tune the models and handle complex linguistic tasks effectively. The proposed model is implemented using Python.
- Published
- 2024
3. Time Series Viewmakers for Robust Disruption Prediction
- Author
-
Chayapathy, Dhruva, Siebert, Tavis, Spangher, Lucas, Moharir, Akshata Kishore, Patil, Om Manoj, and Rea, Cristina
- Subjects
Computer Science - Machine Learning - Abstract
Machine Learning guided data augmentation may support the development of technologies in the physical sciences, such as nuclear fusion tokamaks. Here we endeavor to study the problem of detecting disruptions i.e. plasma instabilities that can cause significant damages, impairing the reliability and efficiency required for their real world viability. Machine learning (ML) prediction models have shown promise in detecting disruptions for specific tokamaks, but they often struggle in generalizing to the diverse characteristics and dynamics of different machines. This limits the effectiveness of ML models across different tokamak designs and operating conditions, which is a critical barrier to scaling fusion technology. Given the success of data augmentation in improving model robustness and generalizability in other fields, this study explores the use of a novel time series viewmaker network to generate diverse augmentations or "views" of training data. Our results show that incorporating views during training improves AUC and F2 scores on DisruptionBench tasks compared to standard or no augmentations. This approach represents a promising step towards developing more broadly applicable ML models for disruption avoidance, which is essential for advancing fusion technology and, ultimately, addressing climate change through reliable and sustainable energy production.
- Published
- 2024
4. A Generative AI Technique for Synthesizing a Digital Twin for U.S. Residential Solar Adoption and Generation
- Author
-
Kishore, Aparna, Thorve, Swapna, and Marathe, Madhav
- Subjects
Computer Science - Artificial Intelligence - Abstract
Residential rooftop solar adoption is considered crucial for reducing carbon emissions. The lack of photovoltaic (PV) data at a finer resolution (e.g., household, hourly levels) poses a significant roadblock to informed decision-making. We discuss a novel methodology to generate a highly granular, residential-scale realistic dataset for rooftop solar adoption across the contiguous United States. The data-driven methodology consists of: (i) integrated machine learning models to identify PV adopters, (ii) methods to augment the data using explainable AI techniques to glean insights about key features and their interactions, and (iii) methods to generate household-level hourly solar energy output using an analytical model. The resulting synthetic datasets are validated using real-world data and can serve as a digital twin for modeling downstream tasks. Finally, a policy-based case study utilizing the digital twin for Virginia demonstrated increased rooftop solar adoption with the 30\% Federal Solar Investment Tax Credit, especially in Low-to-Moderate-Income communities., Comment: 41 pages including references and supplementary
- Published
- 2024
5. Probing quantum correlations in non-degenerate hyper-Raman process
- Author
-
Das, Moumita, Sen, Biswajit, Sensharma, Ankur, Thapliyal, Kishore, and Pathak, Anirban
- Subjects
Quantum Physics - Abstract
Possibilities of observing single mode and intermodal quantum correlations (e.g., antibunching, steering and entanglement) are studied for a probed-hyper-Raman system with specific attention on the impact of a probe on the single and multi-mode quantum correlations generated in a hyper-Raman active system. The physical system studied here considers that the probe interacts continuously with the non-degenerate pump modes in the hyper-Raman active system via a nonlinear coupling. The investigation has revealed that quantum correlations in the Raman systems can be controlled using the probe. Further, it is observed that the quantum steering between the pump and anti-Stokes modes can be influenced significantly by controlling the interaction between the system and the probe. Unlike steering, probe could neither deteriorate the nonclassical correlations, namely intermodal entanglement and photon antibunching, nor induce them. Though the witness of the corresponding nonclassical effect depends on the initial state of the probe as well as the coupling strength, Comment: 8 figures
- Published
- 2024
6. Hate Speech Detection Using Cross-Platform Social Media Data In English and German Language
- Author
-
Shahi, Gautam Kishore and Majchrzak, Tim A.
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Social and Information Networks - Abstract
Hate speech has grown into a pervasive phenomenon, intensifying during times of crisis, elections, and social unrest. Multiple approaches have been developed to detect hate speech using artificial intelligence, but a generalized model is yet unaccomplished. The challenge for hate speech detection as text classification is the cost of obtaining high-quality training data. This study focuses on detecting bilingual hate speech in YouTube comments and measuring the impact of using additional data from other platforms in the performance of the classification model. We examine the value of additional training datasets from cross-platforms for improving the performance of classification models. We also included factors such as content similarity, definition similarity, and common hate words to measure the impact of datasets on performance. Our findings show that adding more similar datasets based on content similarity, hate words, and definitions improves the performance of classification models. The best performance was obtained by combining datasets from YouTube comments, Twitter, and Gab with an F1-score of 0.74 and 0.68 for English and German YouTube comments.
- Published
- 2024
7. Language-Embedded Gaussian Splats (LEGS): Incrementally Building Room-Scale Representations with a Mobile Robot
- Author
-
Yu, Justin, Hari, Kush, Srinivas, Kishore, El-Refai, Karim, Rashid, Adam, Kim, Chung Min, Kerr, Justin, Cheng, Richard, Irshad, Muhammad Zubair, Balakrishna, Ashwin, Kollar, Thomas, and Goldberg, Ken
- Subjects
Computer Science - Robotics - Abstract
Building semantic 3D maps is valuable for searching for objects of interest in offices, warehouses, stores, and homes. We present a mapping system that incrementally builds a Language-Embedded Gaussian Splat (LEGS): a detailed 3D scene representation that encodes both appearance and semantics in a unified representation. LEGS is trained online as a robot traverses its environment to enable localization of open-vocabulary object queries. We evaluate LEGS on 4 room-scale scenes where we query for objects in the scene to assess how LEGS can capture semantic meaning. We compare LEGS to LERF and find that while both systems have comparable object query success rates, LEGS trains over 3.5x faster than LERF. Results suggest that a multi-camera setup and incremental bundle adjustment can boost visual reconstruction quality in constrained robot trajectories, and suggest LEGS can localize open-vocabulary and long-tail object queries with up to 66% accuracy.
- Published
- 2024
8. Strengthening of the f mode due to subsurface magnetic fields in simulations of convection
- Author
-
Kishore, G., Singh, Nishant K, Käpylä, Petri, and Roth, Markus
- Subjects
Astrophysics - Solar and Stellar Astrophysics - Abstract
Previous studies have found that localized strengthening of the f mode recedes the emergence of active regions on the Sun by one day to three days. To help interpret these observations, we have performed nonlinear simulations of convection with imposed magnetic fields at different depths. We find that the f mode is strengthened when a super-equipartition magnetic field is imposed near the top of the domain. However, neither a magnetic field of equal strength near the bottom of the domain nor an equipartition magnetic field near the top of the domain have a significant effect. Our results suggest that the magnetic precursors of active regions are present near the surface of the Sun for much longer than would be expected if active regions were formed by flux tubes rising from deep within the convection zone. Application to observations should account for the fact that the effects we observe are transient., Comment: 12 pages, 8 figures, 2 tables
- Published
- 2024
9. SuperCoder2.0: Technical Report on Exploring the feasibility of LLMs as Autonomous Programmer
- Author
-
Gautam, Anmol, Kumar, Kishore, Jha, Adarsh, NS, Mukunda, and Bhola, Ishaan
- Subjects
Computer Science - Software Engineering ,Computer Science - Artificial Intelligence - Abstract
We present SuperCoder2.0, an advanced autonomous system designed to enhance software development through artificial intelligence. The system combines an AI-native development approach with intelligent agents to enable fully autonomous coding. Key focus areas include a retry mechanism with error output traceback, comprehensive code rewriting and replacement using Abstract Syntax Tree (ast) parsing to minimize linting issues, code embedding technique for retrieval-augmented generation, and a focus on localizing methods for problem-solving rather than identifying specific line numbers. The methodology employs a three-step hierarchical search space reduction approach for code base navigation and bug localization:utilizing Retrieval Augmented Generation (RAG) and a Repository File Level Map to identify candidate files, (2) narrowing down to the most relevant files using a File Level Schematic Map, and (3) extracting 'relevant locations' within these files. Code editing is performed through a two-part module comprising CodeGeneration and CodeEditing, which generates multiple solutions at different temperature values and replaces entire methods or classes to maintain code integrity. A feedback loop executes repository-level test cases to validate and refine solutions. Experiments conducted on the SWE-bench Lite dataset demonstrate SuperCoder2.0's effectiveness, achieving correct file localization in 84.33% of cases within the top 5 candidates and successfully resolving 34% of test instances. This performance places SuperCoder2.0 fourth globally on the SWE-bench leaderboard. The system's ability to handle diverse repositories and problem types highlights its potential as a versatile tool for autonomous software development. Future work will focus on refining the code editing process and exploring advanced embedding models for improved natural language to code mapping.
- Published
- 2024
10. PIP-Loco: A Proprioceptive Infinite Horizon Planning Framework for Quadrupedal Robot Locomotion
- Author
-
Shirwatkar, Aditya, Saxena, Naman, Chandra, Kishore, and Kolathaya, Shishir
- Subjects
Computer Science - Robotics ,Computer Science - Machine Learning - Abstract
A core strength of Model Predictive Control (MPC) for quadrupedal locomotion has been its ability to enforce constraints and provide interpretability of the sequence of commands over the horizon. However, despite being able to plan, MPC struggles to scale with task complexity, often failing to achieve robust behavior on rapidly changing surfaces. On the other hand, model-free Reinforcement Learning (RL) methods have outperformed MPC on multiple terrains, showing emergent motions but inherently lack any ability to handle constraints or perform planning. To address these limitations, we propose a framework that integrates proprioceptive planning with RL, allowing for agile and safe locomotion behaviors through the horizon. Inspired by MPC, we incorporate an internal model that includes a velocity estimator and a Dreamer module. During training, the framework learns an expert policy and an internal model that are co-dependent, facilitating exploration for improved locomotion behaviors. During deployment, the Dreamer module solves an infinite-horizon MPC problem, adapting actions and velocity commands to respect the constraints. We validate the robustness of our training framework through ablation studies on internal model components and demonstrate improved robustness to training noise. Finally, we evaluate our approach across multi-terrain scenarios in both simulation and hardware., Comment: Preprint under review
- Published
- 2024
11. DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks
- Author
-
Monsefi, Amin Karimi, Sailaja, Kishore Prakash, Alilooee, Ali, Lim, Ser-Nam, and Ramnath, Rajiv
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In this paper, we introduce DetailCLIP: A Detail-Oriented CLIP to address the limitations of contrastive learning-based vision-language models, particularly CLIP, in handling detail-oriented and fine-grained tasks like segmentation. While CLIP and its variants excel in the global alignment of image and text representations, they often struggle to capture the fine-grained details necessary for precise segmentation. To overcome these challenges, we propose a novel framework that employs patch-level comparison of self-distillation and pixel-level reconstruction losses, enhanced with an attention-based token removal mechanism. This approach selectively retains semantically relevant tokens, enabling the model to focus on the image's critical regions aligned with the specific functions of our model, including textual information processing, patch comparison, and image reconstruction, ensuring that the model learns high-level semantics and detailed visual features. Our experiments demonstrate that DetailCLIP surpasses existing CLIP-based and traditional self-supervised learning (SSL) models in segmentation accuracy and exhibits superior generalization across diverse datasets. DetailCLIP represents a significant advancement in vision-language modeling, offering a robust solution for tasks that demand high-level semantic understanding and detailed feature extraction. https://github.com/KishoreP1/DetailCLIP.
- Published
- 2024
12. Searching for Tidal Orbital Decay in Hot Jupiters
- Author
-
Alvarado III, Efrain, Bostow, Kate B., Patra, Kishore C., Jacobus, Cooper H., Baer-Way, Raphael A., Jennings, Connor F., Pichay, Neil R., deGraw, Asia A., Vidal, Edgar P., Chander, Vidhi, Altunin, Ivan A., Brendel, Victoria M., Ehrich, Kingsley E., Sunseri, James D., May, Michael B., Punjabi, Druv H., Gendreau-Distler, Eli A., Risin, Sophia, Brink, Thomas G., Zheng, WeiKang, and Filippenko, Alexei V.
- Subjects
Astrophysics - Earth and Planetary Astrophysics - Abstract
We study transits of several ``hot Jupiter'' systems - including WASP-12 b, WASP-43 b, WASP-103 b, HAT-P-23 b, KELT-16 b, WD 1856+534 b, and WTS-2 b - with the goal of detecting tidal orbital decay and extending the baselines of transit times. We find no evidence of orbital decay in any of the observed systems except for that of the extensively studied WASP-12 b. Although the orbit of WASP-12 b is unequivocally decaying, we find no evidence for acceleration of said orbital decay, with measured $\ddot{P} = (-7 \pm 8) \times 10^{-14} \rm ~s^{-1}$, against the expected acceleration decay of $\ddot{P} \approx -10^{-23} \rm ~s^{-1}$. In the case of WD 1856+534 b, there is a tentative detection of orbital growth with $\dot{P} = (5.0 \pm 1.5) \times 10^{-10}$. While statistically significant, we err on the side of caution and wait for longer follow-up observations to consider the measured $\dot{P}$ real. For most systems, we provide a 95\%-confidence lower limit on the tidal quality factor, $Q_\star'$. The possibility of detecting orbital decay in hot Jupiters via long-term radial velocity (RV) measurements is also explored. We find that $\sim 1 \rm ~m~s^{-1}$ precision in RVs will be required to detect orbital decay of WASP-12 b with only 3 yr of observations. Currently available RV measurements and precision are unable to detect orbital decay in any of the systems studied here., Comment: 14 pages, 11 figures; Accepted in MNRAS on 2024 August 30. Received 2024 August 29; in original form 2024 February 13
- Published
- 2024
13. Optical Coherence Tomography Angiography-OCTA dataset for the study of Diabetic Retinopathy
- Author
-
Bidwai, Pooja, Gite, Shilpa, Pradhan, Biswajeet, Gupta, Aditi, and pahuja, Kishore
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
This study presents a dataset consisting of 268 retinal images from 179 individuals, including 133 left-eye and 135 right-eye images, collected from Natasha Eye Care and Research Institute in Pune, Maharashtra, India. The images were captured using a nonmydriatic Optical Coherence Tomography Angiography (OCTA) device, specifically the Optovue Avanti Edition machine as per the protocol mentioned in this paper. Two ophthalmologists then annotated the images. This dataset can be used by researchers and doctors to develop automated diagnostic tools for early detection of diabetic retinopathy (DR)., Comment: 11 pages
- Published
- 2024
14. Physics case for quarkonium studies at the Electron Ion Collider
- Author
-
Boer, Daniël, Flett, Chris A., Flore, Carlo, Kikoła, Daniel, Lansberg, Jean-Philippe, Nefedov, Maxim, Van Hulse, Charlotte, Bhattacharya, Shohini, Bor, Jelle, Butenschoen, Mathias, Ceccopieri, Federico, Chen, Longjie, Cheung, Vincent, D'Alesio, Umberto, Echevarria, Miguel, Hatta, Yoshitaka, Hyde, Charles E., Kishore, Raj, Kosarzewski, Leszek, Lorcé, Cédric, Li, Wenliang, Li, Xuan, Maxia, Luca, Metz, Andreas, Mukherjee, Asmita, Camacho, Carlos Muñoz, Murgia, Francesco, Nadel-Turonski, Pawel, Pisano, Cristian, Qiu, Jian-Wei, Rajesh, Sangem, Rinaldi, Matteo, West, Jennifer Rittenhouse, Saleev, Vladimir, Santiesteban, Nathaly, Setyadi, Chalis, Taels, Pieter, Tu, Zhoudunmin, Vitev, Ivan, Vogt, Ramona, Watanabe, Kazuhiro, Yao, Xiaojun, Yedelkina, Yelyzaveta, and Yoshida, Shinsuke
- Subjects
High Energy Physics - Phenomenology ,High Energy Physics - Experiment ,Nuclear Experiment ,Nuclear Theory - Abstract
The physics case for quarkonium-production studies accessible at the US Electron Ion Collider is described., Comment: Latex, 84 pages. Review prepared for Progress in Particle and Nuclear Physics
- Published
- 2024
15. Minute-Cadence Observations of the LAMOST Fields with the TMTS: IV -- Catalog of Cataclysmic Variables from the First 3-yr Survey
- Author
-
Liu, Qichun, Lin, Jie, Wang, Xiaofeng, Dai, Zhibin, Sun, Yongkang, Xi, Gaobo, Mo, Jun, Liu, Jialian, Yan, Shengyu, Filippenko, Alexei V., Brink, Thomas G., Yang, Yi, Patra, Kishore C., Cai, Yongzhi, Chen, Zhihao, Chen, Liyang, Guo, Fangzhou, Jiang, Xiaojun, Li, Gaici, Li, Wenxiong, Lin, Weili, Miao, Cheng, Ma, Xiaoran, Peng, Haowei, Xia, Qiqi, Xiang, Danfeng, and Zhang, Jicheng
- Subjects
Astrophysics - Solar and Stellar Astrophysics - Abstract
The Tsinghua University--Ma Huateng Telescopes for Survey (TMTS) started to monitor the LAMOST plates in 2020, leading to the discovery of numerous short-period eclipsing binaries, peculiar pulsators, flare stars, and other variable objects. Here, we present the uninterrupted light curves for a sample of 64 cataclysmic variables (CVs) observed/discovered using the TMTS during its first three-year observations, and we introduce new CVs and new light-variation periods (from known CVs) revealed through the TMTS observations. Thanks to the high-cadence observations of TMTS, diverse light variations, including superhumps, quasi-periodic oscillations, large-amplitude orbital modulations, and rotational modulations, are able to be detected in our CV samples, providing key observational clues for understanding the fast-developing physical processes in various CVs. All of these short-timescale light-curve features help further classify the subtypes of CV systems. We highlight the light-curve features observed in our CV sample and discuss further implications of minute-cadence light curves for CV identifications and classifications. Moreover, we examine the H$\alpha$ emission lines in the spectra from our nonmagnetic CV samples (i.e., dwarf novae and nova-like subclasses) and find that the distribution of H$\alpha$ emission strength shows significant differences between the sources with orbital periods above and below the period gap, which agrees with the trend seen from the SDSS nonmagnetic CV sample., Comment: 27 pages, 12 figures in main text, accepted for the publication in Universe
- Published
- 2024
16. TMD evolution effect on $\cos2\phi$ azimuthal asymmetry in a back-to-back production of $J/\psi$ and jet at the EIC
- Author
-
Kishore, Raj, Mukherjee, Asmita, Pawar, Amol, Rajesh, Sangem, and Siddiqah, Mariyah
- Subjects
High Energy Physics - Phenomenology - Abstract
A back-to-back semi-inclusive $J/\psi + jet$ production is a promising process to study gluon transverse momentum distribution (TMDs) at the future electron-ion collider (EIC). A back-to-back configuration allows a higher transverse momentum for $J/\psi$. We present an extension of a previous work where we studied $\cos2\phi$ azimuthal asymmetry within the TMD factorization framework for this process. We present and compare the effect of TMD evolution on the asymmetry, in two approaches that differ in the parameterization of the perturbative tails of the TMDs and the non-perturbative factors. We show that the asymmetry depends on the parameterizations of the non-perturbative Sudakov factors in the larger $b_T$ region and on the perturbative part of the evolution kernel. We use NRQCD to estimate the $J/\psi$ production and show the effect of using different long-distance matrix element (LDME) sets. Overall, the asymmetry after incorporating TMD evolution is small, but increases with the transverse momentum imbalance of the $J/\psi$-jet pair., Comment: 16 pages, 7 figures
- Published
- 2024
17. Diffusion Guided Language Modeling
- Author
-
Lovelace, Justin, Kishore, Varsha, Chen, Yiwei, and Weinberger, Kilian Q.
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Current language models demonstrate remarkable proficiency in text generation. However, for many applications it is desirable to control attributes, such as sentiment, or toxicity, of the generated language -- ideally tailored towards each specific use case and target audience. For auto-regressive language models, existing guidance methods are prone to decoding errors that cascade during generation and degrade performance. In contrast, text diffusion models can easily be guided with, for example, a simple linear sentiment classifier -- however they do suffer from significantly higher perplexity than auto-regressive alternatives. In this paper we use a guided diffusion model to produce a latent proposal that steers an auto-regressive language model to generate text with desired properties. Our model inherits the unmatched fluency of the auto-regressive approach and the plug-and-play flexibility of diffusion. We show that it outperforms previous plug-and-play guidance methods across a wide range of benchmark data sets. Further, controlling a new attribute in our framework is reduced to training a single logistic regression classifier., Comment: ACL Findings 2024
- Published
- 2024
18. Dissipation dynamics and persistence of thiamethoxam 25wg in mango fruit and soil
- Author
-
Jemimah, N, Kavitha, K, Sridevi, G, Aparna, B, and Kishore, P
- Published
- 2021
- Full Text
- View/download PDF
19. What determines groundwater depletion in India? A meso level panel analysis
- Author
-
Kishore, P., Singh, D.R., Chand, P., and Prakash, P.
- Published
- 2020
- Full Text
- View/download PDF
20. Conforming mesh modeling of multi-physics effect on residual stress in multi-layer powder bed fusion process
- Author
-
Kishore, Mysore Nagaraja, Qian, Dong, Soshi, Masakazu, and Li, Wei
- Subjects
Manufacturing Engineering ,Engineering ,Powder bed fusion ,Computational fluid dynamics ,Finite element method ,Discrete element method ,Conforming mesh ,Residual stress ,Industrial Engineering & Automation ,Manufacturing engineering ,Mechanical engineering - Abstract
The current research aims to predict the residual stress accumulation and evolution in the powder bed fusion processed multi-layer thin wall structures through a conforming mesh modeling approach. It involves the discrete element method (DEM) interfaced with the volume of fluid (VOF) method using computational fluid dynamics (CFD) coupled with the finite element method (FEM). The conforming mesh approach developed in the research predicts multi-physics, its induced porosity, and the cumulative effect on the residual stress in the powder bed fusion processed Ti-6Al-4V thin wall structures. The results of the residual stress in the multi-layered component from this method were further quantitatively compared with the non-conforming finite element method. The results show the conforming mesh approach was not only effective in capturing the layer geometry, and defects induced during the printing, but also predicted the residual stress in the region of the defect more accurately than the non-conforming mesh methods.
- Published
- 2024
21. Sign-balance of excedances over mod-k-alternating permutations and gamma-positivity
- Author
-
Dey, Hiranya Kishore and Mahato, Iswar
- Subjects
Mathematics - Combinatorics ,05A15, 15A15, 05E99 - Abstract
A permutation is called mod-k-alternating if its entries are restricted to having the same remainder as the index, modulo some integer $k \geq 1.$ In this paper, we find the sign-balance for mod-k-alternating permutations with respect to the statistic excedance. Moreover, we study the sign-balance for excedances over mod-k-alternating derangements. The results are obtained by constructing suitable matrices and connecting their determinants with the signed excedance enumeration of mod-k-alternating permutations. As an application of the signed excedance enumeration, we prove that when $n \equiv k \pmod {2k}$, the excedance enumerating polynomials over the even and odd mod-k-alternating permutations, starting with a fixed remainder, are gamma-positive., Comment: Comments are welcome
- Published
- 2024
22. Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines
- Author
-
Li, Yuchen, Kirchmeyer, Alexandre, Mehta, Aashay, Qin, Yilong, Dadachev, Boris, Papineni, Kishore, Kumar, Sanjiv, and Risteski, Andrej
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Autoregressive language models are the currently dominant paradigm for text generation, but they have some fundamental limitations that cannot be remedied by scale-for example inherently sequential and unidirectional generation. While alternate classes of models have been explored, we have limited mathematical understanding of their fundamental power and limitations. In this paper we focus on Generative Masked Language Models (GMLMs), a non-autoregressive paradigm in which we train a model to fit conditional probabilities of the data distribution via masking, which are subsequently used as inputs to a Markov Chain to draw samples from the model, These models empirically strike a promising speed-quality trade-off as each step can be typically parallelized by decoding the entire sequence in parallel. We develop a mathematical framework for analyzing and improving such models which sheds light on questions of sample complexity and inference speed and quality. Empirically, we adapt the T5 model for iteratively-refined parallel decoding, achieving 2-3x speedup in machine translation with minimal sacrifice in quality compared with autoregressive models. We run careful ablation experiments to give recommendations on key design choices, and make fine-grained observations on the common error modes in connection with our theory. Our mathematical analyses and empirical observations characterize both potentials and limitations of this approach, and can be applied to future works on improving understanding and performance of GMLMs. Our codes are released at https://github.com/google-research/google-research/tree/master/padir, Comment: ICML 2024
- Published
- 2024
23. Multi-Platform Framing Analysis: A Case Study of Kristiansand Quran Burning
- Author
-
Jung, Anna-Katharina, Shahi, Gautam Kishore, Fromm, Jennifer, Røysland, Kari Anne, and Gronert, Kim Henrik
- Subjects
Computer Science - Social and Information Networks - Abstract
The framing of events in various media and discourse spaces is crucial in the era of misinformation and polarization. Many studies, however, are limited to specific media or networks, disregarding the importance of cross-platform diffusion. This study overcomes that limitation by conducting a multi-platform framing analysis on Twitter, YouTube, and traditional media analyzing the 2019 Koran burning in Kristiansand, Norway. It examines media and policy frames and uncovers network connections through shared URLs. The findings show that online news emphasizes the incident's legality, while social media focuses on its morality, with harsh hate speech prevalent in YouTube comments. Additionally, YouTube is identified as the most self-contained community, whereas Twitter is the most open to external inputs.
- Published
- 2024
24. Experimental photon addition and subtraction in multi-mode and entangled optical fields
- Author
-
Thapliyal, Kishore, Peřina Jr., Jan, Haderka, Ondřej, Michálek, Václav, and Machulka, Radek
- Subjects
Quantum Physics - Abstract
Multiple photon addition and subtraction applied to multi-mode thermal and sub-Poissonian fields as well as twin beams is mutually compared using one experimental setup. Twin beams with tight spatial correlations detected by an intensified CCD camera with high spatial resolution are used to prepare the initial fields. Up to three photons are added or subtracted to arrive at the nonclassical and non-Gaussian states. Only the photon-subtracted thermal states remain classical. In general, the experimental photon-added states exhibit greater nonclassicality and non-Gaussianity than the comparable photon-subtracted states. Once photons are added or subtracted in twin beams, both processes result in comparable properties of the obtained states owing to twin-beam photon pairing., Comment: Multiphoton addition and subtraction on multi-mode thermal, sub-Poissonian states and twin beam are carried out in one experimental setup
- Published
- 2024
- Full Text
- View/download PDF
25. An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
- Author
-
Mundra, Nandini, Kishore, Aditya Nanda, Dabre, Raj, Puduppully, Ratish, Kunchukuttan, Anoop, and Khapra, Mitesh M.
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Language Models (LMs) excel in natural language processing tasks for English but show reduced performance in most other languages. This problem is commonly tackled by continually pre-training and fine-tuning these models for said languages. A significant issue in this process is the limited vocabulary coverage in the original model's tokenizer, leading to inadequate representation of new languages and necessitating an expansion of the tokenizer. The initialization of the embeddings corresponding to new vocabulary items presents a further challenge. Current strategies require cross-lingual embeddings and lack a solid theoretical foundation as well as comparisons with strong baselines. In this paper, we first establish theoretically that initializing within the convex hull of existing embeddings is a good initialization, followed by a novel but simple approach, Constrained Word2Vec (CW2V), which does not require cross-lingual embeddings. Our study evaluates different initialization methods for expanding RoBERTa and LLaMA 2 across four languages and five tasks. The results show that CW2V performs equally well or even better than more advanced techniques. Additionally, simpler approaches like multivariate initialization perform on par with these advanced methods indicating that efficient large-scale multilingual continued pretraining can be achieved even with simpler initialization methods., Comment: Under review
- Published
- 2024
26. Aspects of dS/CFT Holography
- Author
-
Dey, Indranil, Nanda, Kanhu Kishore, Roy, Akashdeep, and Trivedi, Sandip P.
- Subjects
High Energy Physics - Theory ,General Relativity and Quantum Cosmology - Abstract
It has been suggested that a $dS_{d+1}$ spacetime of radius $R_{ds}$ has a holographic dual, living at future space-like infinity ${\cal I}^+$, with the bulk wave function being dual to the partition function of the boundary theory, [arXiv:astro-ph/0210603v5]. We consider some aspects of this correspondence. For under damped scalars with mass $M^2R_{ds}^2>{d^2\over4}$, belonging to the principal series, we show that for the Bunch Davies vacuum a suitable source in the boundary theory can be identified in terms of the coherent state representation of the wave function. We argue that terms in the resulting correlation functions, which are independent of the late time cut-off, satisfy the Ward identities of a conformal field theory. We also discuss other ways to identify sources, both in the under damped and the over damped case, where $M^2R_{ds}^2<{d^2\over4}$, and argue that these too can lead to correlators satisfying the Ward identities of a CFT. Some comments on the violation of reflection positivity, and the cut-off dependent terms, along with some explicit checks and sample calculations, are also included.
- Published
- 2024
27. Radii for sections of functions convex in one direction
- Author
-
Dash, Prachi Prajna, Prajapat, Jugal Kishore, and Kumari, Naveen
- Subjects
Mathematics - Complex Variables ,30C45, 30C20, 30C75, 30C80 - Abstract
Let $\mathcal{G}(\alpha)$ denote the family of functions $ f(z)$ in the open unit disk $\mathbb D :=\{z\in\mathbb{C}: |z|<1\}$ that satisfy $ f(0)=0= f'(0)=1$ and \[\Re \left(1+ \dfrac{z f''(z)}{ f'(z)}\right)<1+\dfrac{\alpha}{2} , \quad z\in \mathbb D.\] We determine the disks $|z|<\rho_n$ in which sections $ s_n(z; f)$ of $ f(z)$ are convex, starlike, and close-to-convex of order $\beta\;(0\le \beta< 1)$. Further, we obtain certain inequalities of sections in the considered class of functions.
- Published
- 2024
28. Certain subclass of harmonic functions associated with univalent functions
- Author
-
Dash, Prachi Prajna and Prajapat, Jugal Kishore
- Subjects
Mathematics - Complex Variables ,30C45, 30C50 - Abstract
In this paper, we define a subclass of sense-preserving harmonic functions associated with a class of analytic functions satisfying a differential inequality. We then establish a close relation between both subclasses. Further, we obtain some characteristic properties, including radius properties, convolution, coefficient estimates and their properties, growth estimates, and convex combination for the functions in the defined subclass. At last, we produce conditions for some special functions as well as harmonic univalent polynomials to belong to the defined subclass of harmonic functions.
- Published
- 2024
29. On certain analytic functions defined by differential inequality
- Author
-
Dash, Prachi Prajna and Prajapat, Jugal Kishore
- Subjects
Mathematics - Complex Variables ,30C45 - Abstract
For the family of analytic functions $f(z)$ in the open unit disk $\mathbb{D}$ with $f(0)=f'(0)-1=0$, satisfying the differential equation \begin{equation*} zf'(z) - f(z) = \dfrac{1}{2} z^2 \phi(z), \quad |\phi(z)| \leq 1, \end{equation*} we obtain radii of convexity, starlikeness, and close-to-convexity of partial sums of $f(z)$. We also study the generalization of this family having the form \begin{equation*} zf'(z)-f(z) = \lambda z^2 \phi(z), \quad |\phi(z)| \leq 1, \end{equation*} where $\lambda > 0,$ and obtain some useful properties of these functions.
- Published
- 2024
30. FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography
- Author
-
Yang, Julia, Barnett, Alina Jade, Donnelly, Jon, Kishore, Satvik, Fang, Jerry, Schwartz, Fides Regina, Chen, Chaofan, Lo, Joseph Y., and Rudin, Cynthia
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable ("black box") deep learning models are unsuitable and there is a call in these fields to make interpretable models. Recent work in interpretable computer vision provides transparency to these formerly black boxes by utilizing prototypes for case-based explanations, achieving high accuracy in applications including mammography. However, these models struggle with precise feature localization, reasoning on large portions of an image when only a small part is relevant. This paper addresses this gap by proposing a novel multi-scale interpretable deep learning model for mammographic mass margin classification. Our contribution not only offers an interpretable model with reasoning aligned with radiologist practices, but also provides a general architecture for computer vision with user-configurable prototypes from coarse- to fine-grained prototypes., Comment: 8 pages, 6 figures, Accepted for oral presentation at the 2024 CVPR Workshop on Domain adaptation, Explainability, Fairness in AI for Medical Image Analysis (DEF-AI-MIA)
- Published
- 2024
31. How large is the character degree sum compared to the character table sum for a finite group?
- Author
-
Ayyer, Arvind, Dey, Hiranya Kishore, and Paul, Digjoy
- Subjects
Mathematics - Representation Theory ,Mathematics - Combinatorics ,20C15, 05A15, 05A16, 05A17, 05E10 - Abstract
In 1961, Solomon gave upper and lower bounds for the sum of all the entries in the character table of a finite group in terms of elementary properties of the group. In a different direction, we consider the ratio of the character table sum to the sum of the entries in the first column, also known as the character degree sum, in this work. First, we propose that this ratio is at most two for many natural groups. Secondly, we extend a conjecture of Fields to postulate that this ratio is at least one with equality if and only if the group is abelian. We establish the validity of this property and conjecture for all finite irreducible Coxeter groups. In addition, we prove the conjecture for generalized symmetric groups. The main tool we use is that the sum of a column in the character table of an irreducible Coxeter group (resp. generalized symmetric group) is given by the number of square roots (resp. absolute square roots) of the corresponding conjugacy class representative. As a byproduct of our results, we show that the asymptotics of character table sums is the same as the number of involutions in symmetric, hyperoctahedral and demihyperoctahedral groups. We also derive explicit generating functions for the character table sums for these latter groups as infinite products of continued fractions. In the same spirit, we prove similar generating function formulas for the number of square roots and absolute square roots in $n$ for the generalized symmetric groups $G(r,1,n)$., Comment: 28 pages, 3 appendices
- Published
- 2024
32. CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
- Author
-
Romero, David, Lyu, Chenyang, Wibowo, Haryo Akbarianto, Lynn, Teresa, Hamed, Injy, Kishore, Aditya Nanda, Mandal, Aishik, Dragonetti, Alina, Abzaliev, Artem, Tonja, Atnafu Lambebo, Balcha, Bontu Fufa, Whitehouse, Chenxi, Salamea, Christian, Velasco, Dan John, Adelani, David Ifeoluwa, Meur, David Le, Villa-Cueva, Emilio, Koto, Fajri, Farooqui, Fauzan, Belcavello, Frederico, Batnasan, Ganzorig, Vallejo, Gisela, Caulfield, Grainne, Ivetta, Guido, Song, Haiyue, Ademtew, Henok Biadglign, Maina, Hernán, Lovenia, Holy, Azime, Israel Abebe, Cruz, Jan Christian Blaise, Gala, Jay, Geng, Jiahui, Ortiz-Barajas, Jesus-German, Baek, Jinheon, Dunstan, Jocelyn, Alemany, Laura Alonso, Nagasinghe, Kumaranage Ravindu Yasas, Benotti, Luciana, D'Haro, Luis Fernando, Viridiano, Marcelo, Estecha-Garitagoitia, Marcos, Cabrera, Maria Camila Buitrago, Rodríguez-Cantelar, Mario, Jouitteau, Mélanie, Mihaylov, Mihail, Imam, Mohamed Fazli Mohamed, Adilazuarda, Muhammad Farid, Gochoo, Munkhjargal, Otgonbold, Munkh-Erdene, Etori, Naome, Niyomugisha, Olivier, Silva, Paula Mónica, Chitale, Pranjal, Dabre, Raj, Chevi, Rendi, Zhang, Ruochen, Diandaru, Ryandito, Cahyawijaya, Samuel, Góngora, Santiago, Jeong, Soyeong, Purkayastha, Sukannya, Kuribayashi, Tatsuki, Jayakumar, Thanmay, Torrent, Tiago Timponi, Ehsan, Toqeer, Araujo, Vladimir, Kementchedjhieva, Yova, Burzo, Zara, Lim, Zheng Wei, Yong, Zheng Xin, Ignat, Oana, Nwatu, Joan, Mihalcea, Rada, Solorio, Thamar, and Aji, Alham Fikri
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recent efforts have tried to increase the number of languages covered on VQA datasets, they still lack diversity in low-resource languages. More importantly, although these datasets often extend their linguistic range via translation or some other approaches, they usually keep images the same, resulting in narrow cultural representation. To address these limitations, we construct CVQA, a new Culturally-diverse multilingual Visual Question Answering benchmark, designed to cover a rich set of languages and cultures, where we engage native speakers and cultural experts in the data collection process. As a result, CVQA includes culturally-driven images and questions from across 28 countries on four continents, covering 26 languages with 11 scripts, providing a total of 9k questions. We then benchmark several Multimodal Large Language Models (MLLMs) on CVQA, and show that the dataset is challenging for the current state-of-the-art models. This benchmark can serve as a probing evaluation suite for assessing the cultural capability and bias of multimodal models and hopefully encourage more research efforts toward increasing cultural awareness and linguistic diversity in this field.
- Published
- 2024
33. EASI-Tex: Edge-Aware Mesh Texturing from Single Image
- Author
-
Perla, Sai Raj Kishore, Wang, Yizhi, Mahdavi-Amiri, Ali, and Zhang, Hao
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We present a novel approach for single-image mesh texturing, which employs a diffusion model with judicious conditioning to seamlessly transfer an object's texture from a single RGB image to a given 3D mesh object. We do not assume that the two objects belong to the same category, and even if they do, there can be significant discrepancies in their geometry and part proportions. Our method aims to rectify the discrepancies by conditioning a pre-trained Stable Diffusion generator with edges describing the mesh through ControlNet, and features extracted from the input image using IP-Adapter to generate textures that respect the underlying geometry of the mesh and the input texture without any optimization or training. We also introduce Image Inversion, a novel technique to quickly personalize the diffusion model for a single concept using a single image, for cases where the pre-trained IP-Adapter falls short in capturing all the details from the input image faithfully. Experimental results demonstrate the efficiency and effectiveness of our edge-aware single-image mesh texturing approach, coined EASI-Tex, in preserving the details of the input texture on diverse 3D objects, while respecting their geometry., Comment: ACM Transactions on Graphics (Proceedings of SIGGRAPH), 2024. Project Page: https://sairajk.github.io/easi-tex/
- Published
- 2024
34. The Intermediate-Mass Black Hole Reverberation Mapping Project: Initial Results for a candidate IMBH in a nearby Seyfert 1 Galaxy
- Author
-
Zuo, Wenwen, Guo, Hengxiao, Sun, Jingbo, Yuan, Qi, Lira, Paulina, Gu, Minfeng, Edwards, Philip G., Gupta, Alok C., Kishore, Shubham, Stevens, Jamie, An, Tao, Cai, Zhen-Yi, Feng, Haicheng, Ho, Luis C., Ilić, Dragana, Kovačević, Andjelka B., Li, ShaSha, Mezcua, Mar, Popović, Luka Č., Sun, Mouyuan, Tripathi, Tushar, U., Vivian, Vince, Oliver, Wang, Jianguo, Wang, Junxian, Wang, Shu, Wu, Xuebing, and Zheng, Zhenya
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
To investigate the short-term variability and determine the size of the optical continuum emitting size of intermediate-mass black holes (IMBHs), we carried out high-cadence, multi-band photometric monitoring of a Seyfert 1 galaxy J0249-0815 across two nights, together with a one-night single-band preliminary test. The presence of the broad Ha component in our target was confirmed by recent Palomar/P200 spectroscopic observations, 23 years after Sloan Digital Sky Survey, ruling out the supernovae origin of the broad Ha line. The photometric experiment was primarily conducted utilizing four-channel imagers MuSCAT 3 & 4 mounted on 2-meter telescopes within the Las Cumbres Observatory Global Telescope Network. Despite the expectation of variability, we observed no significant variation (<1.4%) on timescales of 6-10 hours. This non-detection is likely due to substantial host galaxy light diluting the subtle AGN variability. Dual-band preliminary tests and tailored simulations may enhance the possibility of detecting variability and lag in future IMBH reverberation campaigns., Comment: 14 pages, 6 figures, submitted to ApJ, comments welcome
- Published
- 2024
35. Evidence of jet activity from the secondary black hole in the OJ287 binary system
- Author
-
Valtonen, Mauri J., Zola, Staszek, Gupta, Alok C., Kishore, Shubham, Gopakumar, Achamveedu, Jorstad, Svetlana G., Wiita, Paul J., Gu, Minfeng, Nilsson, Kari, Marscher, Alan P., Zhang, Zhongli, Hudec, Rene, Matsumoto, Katsura, Drozdz, Marek, Ogloza, Waldemar, Berdyugin, Andrei V., Reichart, Daniel E., Mugrauer, Markus, Dey, Lankeswar, Pursimo, Tapio, Lehto, Harry J., Ciprini, Stefano, Nakaoka, T., Uemura, M., Imazawa, Ryo, Zejmo, Michal, Kouprianov, Vladimir V., Davidson, Jr., James W., Sadun, Alberto, Strobl, Jan, Weaver, Z. R., and Jelinek, Martin
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
We report the study of a huge optical intraday flare on November 12, 2021, at 2 am UT, in the blazar OJ287. In the binary black hole model it is associated with an impact of the secondary black hole on the accretion disk of the primary. Our multifrequency observing campaign was set up to search for such a signature of the impact, based on a prediction made eight years earlier. The first I-band results of the flare have already been reported by \cite{2024ApJ...960...11K}. Here we combine these data with our monitoring in the R-band. There is a big change in the R-I spectral index by $1.0\pm0.1$ between the normal background and the flare, suggesting a new component of radiation. The polarization variation during the rise of the flare suggests the same. The limits on the source size place it most reasonably in the jet of the secondary black hole. We then ask why we have not seen this phenomenon before. We show that OJ287 was never before observed with sufficient sensitivity on the night when the flare should have happened according to the binary model. We also study the probability that this flare is just an oversized example of intraday variability, using the Krakow-dataset of intense monitoring between 2015 and 2023. We find that the occurrence of a flare of this size and rapidity is unlikely. In the Appendix, we give the full orbit-linked historical light curve of OJ287 as well as the dense monitoring sample of Krakow., Comment: to Appear in Astrophysical Journal Letters
- Published
- 2024
36. Deep Oscillatory Neural Network
- Author
-
Rohan, Nurani Rajagopal, C, Vigneswaran, Ghosh, Sayan, Rajendran, Kishore, A, Gaurav, and Chakravarthy, V Srinivasa
- Subjects
Computer Science - Neural and Evolutionary Computing ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
We propose a novel, brain-inspired deep neural network model known as the Deep Oscillatory Neural Network (DONN). Deep neural networks like the Recurrent Neural Networks indeed possess sequence processing capabilities but the internal states of the network are not designed to exhibit brain-like oscillatory activity. With this motivation, the DONN is designed to have oscillatory internal dynamics. Neurons of the DONN are either nonlinear neural oscillators or traditional neurons with sigmoidal or ReLU activation. The neural oscillator used in the model is the Hopf oscillator, with the dynamics described in the complex domain. Input can be presented to the neural oscillator in three possible modes. The sigmoid and ReLU neurons also use complex-valued extensions. All the weight stages are also complex-valued. Training follows the general principle of weight change by minimizing the output error and therefore has an overall resemblance to complex backpropagation. A generalization of DONN to convolutional networks known as the Oscillatory Convolutional Neural Network is also proposed. The two proposed oscillatory networks are applied to a variety of benchmark problems in signal and image/video processing. The performance of the proposed models is either comparable or superior to published results on the same data sets.
- Published
- 2024
37. An exact enumeration of vertex connectivity of the enhanced power graphs of finite nilpotent groups
- Author
-
Bera, Sudip and Dey, Hiranya Kishore
- Subjects
Mathematics - Combinatorics ,Mathematics - Group Theory ,05C25, 20D15, 05A15 - Abstract
The enhanced power graph of a group $G$ is a graph with vertex set $G,$ where two distinct vertices $x$ and $y$ are adjacent if and only if there exists an element $w$ in $G$ such that both $x$ and $y$ are powers of $w.$ In this paper, we determine the vertex connectivity of the enhanced power graph of any finite nilpotent group., Comment: 11 pages, Comments are welcome. arXiv admin note: text overlap with arXiv:2108.05175
- Published
- 2024
38. Multiple quantum exceptional, diabolical, and hybrid points in multimode bosonic systems: II. Nonconventional PT-symmetric dynamics and unidirectional coupling
- Author
-
Peřina Jr., Jan, Thapliyal, Kishore, Chimczak, Grzegorz, Kowalewska-Kudłaszyk, Anna, and Miranowicz, Adam
- Subjects
Quantum Physics - Abstract
We analyze the existence and degeneracies of quantum exceptional, diabolical, and hybrid points of simple bosonic systems, composed of up to six modes with damping and/or amplification and exhibiting nonconventional dynamics. They involve the configurations in which the dynamics typical for PT-symmetric systems is observed only in a subspace of the whole Liouville space of the system states (nonconventional PT-symmetric dynamics) as well as those containing unidirectional coupling. The system dynamics described by quadratic non-Hermitian Hamiltonians is governed by the Heisenberg-Langevin equations. Conditions for the observation of inherited quantum hybrid points with up to sixth-order exceptional and second-order diabolical degeneracies are revealed, though relevant only for short-time dynamics. This raises the question of whether higher-order inherited singularities exist in bosonic systems that exhibit physically meaningful behavior at arbitrary times. On the other hand, for short times, unidirectional coupling of various types enables the concatenation of simple bosonic systems with second- and third-order exceptional degeneracies on demand. This approach allows for the creation of arbitrarily high exceptional degeneracies observed in systems with diverse structures. Methods for numerical identifying the quantum exceptional and hybrid points, and determining their degeneracies are discussed. Rich dynamics of higher-order field-operator moments is analyzed from the point of view of the presence of exceptional and diabolical points with their degeneracies in general., Comment: Partial PT-symmetry like dynamics (nonconventional PT-symmetry) as well as non-Hermitian bosonic systems with unidirectional coupling are investigated. The work is in continuation of the paper entitled "Multiple quantum exceptional, diabolical, and hybrid points in multimode bosonic systems: I. Inherited and genuine singularities"
- Published
- 2024
39. Multiple quantum exceptional, diabolical, and hybrid points in multimode bosonic systems: I. Inherited and genuine singularities
- Author
-
Thapliyal, Kishore, Peřina Jr., Jan, Chimczak, Grzegorz, Kowalewska-Kudłaszyk, Anna, and Miranowicz, Adam
- Subjects
Quantum Physics - Abstract
The existence and degeneracies of quantum exceptional, diabolical, and hybrid (i.e., diabolically degenerated exceptional) singularities of simple bosonic systems composed of up to five modes with damping and/or amplification are analyzed. Their dynamics governed by quadratic non-Hermitian Hamiltonians is followed using the Heisenberg-Langevin equations. Conditions for the observation of inherited quantum hybrid points, observed directly in the dynamics of field operators, having up to third-order exceptional and second-order diabolical degeneracies are revealed. Exceptional and diabolical genuine points and their degeneracies observed in the dynamics of second-order field-operator moments are analyzed. Surprisingly, exceptional degeneracies of only second and third orders are revealed. Nevertheless the analyzed bosonic systems exhibit rich dynamics, also owing to their common second-order diabolical degeneracies., Comment: Inherited EPs with second- and third-order singularities are observed in the bosonic systems with the usual (bidirectional) coupling. In the following paper entitled "Multiple quantum exceptional, diabolical, and hybrid points in multimode bosonic systems: II. Nonconventional PT-symmetric dynamics and unidirectional coupling", the investigation are further extended to other bosonic systems
- Published
- 2024
40. Comparison of brace to observation in stable, radiological developmental dysplasia of the hip: a protocol for a global multicentre non-inferiority randomised trial.
- Author
-
Zomar, Bryn, Bone, Jeffrey, Nguyen, Vuong, Mulpuri, Kishore, Kelley, Simon, and Schaeffer, Emily
- Subjects
hip ,paediatric orthopaedics ,radiology & imaging ,randomized controlled trial ,ultrasound ,Humans ,Braces ,Infant ,Developmental Dysplasia of the Hip ,Multicenter Studies as Topic ,Watchful Waiting ,Equivalence Trials as Topic ,Female ,Radiography ,Infant ,Newborn ,Randomized Controlled Trials as Topic ,Ultrasonography ,Hip Dislocation ,Congenital ,Male - Abstract
INTRODUCTION: Brace treatment is common to address radiological dysplasia in infants with developmental dysplasia of the hip (DDH); however, it is unclear whether bracing provides significant benefit above careful observation by ultrasound. If observation alone is non-inferior to bracing for radiological dysplasia, unnecessary treatment may be avoided. Therefore, the purpose of this study is to determine whether observation is non-inferior to bracing for infants with radiological dysplasia. METHODS AND ANALYSIS: This will be a multicentre, global, randomised, non-inferiority trial performed under the auspices of a global prospective registry for infants and children diagnosed with DDH. Patients will be included if they present with radiological dysplasia (centred hip, alpha angle 43-60°, percent femoral head coverage greater than 35% measured on ultrasound) of a clinically stable hip under 3 months old. Patients will be excluded if they present with clinical hip instability, have received prior treatment or have known/suspected neuromuscular, collagen, chromosomal or lower-extremity congenital abnormalities or syndromic-associated hip abnormalities. Patients will be enrolled and randomised to undergo observation alone or brace treatment with a Pavlik harness for a minimum of 6 weeks. Follow-up visits will occur at 6 weeks, 1 year and 2 years post-enrolment. The primary outcome will be the norm-referenced acetabular index measured on the 2-year radiograph with a 3° non-inferiority margin. A total of 514 patients will be included.The study is anticipated to start in April 2024 and end in September 2028.The primary outcome will be compared between arms with a mixed-effects model with a random intercept for study centre, and a single covariate for the treatment group. If the lower bound of the 95% CI lies within 3° of the mean, we will treat this as evidence for non-inferiority. ETHICS AND DISSEMINATION: Ethics approval has been obtained from the lead sites ethics board (University of British Columbia, Childrens and Womens Research Ethics Board). Ethics approval will be obtained from the local ethics committees or institutional review boards at each institution prior to patient enrolment. It is intended that the results of this study shall be published in peer-reviewed journals and presented at suitable conferences. TRIAL REGISTRATION NUMBER: NCT05869851.
- Published
- 2024
41. A Kinome-Wide Synthetic Lethal CRISPR/Cas9 Screen Reveals That mTOR Inhibition Prevents Adaptive Resistance to CDK4/CDK6 Blockade in HNSCC.
- Author
-
Goto, Yusuke, Koshizuka, Keiichi, Ando, Toshinori, Izumi, Hiroki, Wu, Xingyu, Sato, Kuniaki, Ishikawa, Tomohiko, Ford, Kyle, Feng, Xiaodong, Wang, Zhiyong, Arang, Nadia, Allevato, Michael, Kishore, Ayush, Mali, Prashant, and Gutkind, Jorge
- Subjects
Humans ,Cyclin-Dependent Kinase 4 ,Cyclin-Dependent Kinase 6 ,CRISPR-Cas Systems ,Squamous Cell Carcinoma of Head and Neck ,Piperazines ,Pyridines ,Mice ,Animals ,Drug Resistance ,Neoplasm ,Head and Neck Neoplasms ,Cell Line ,Tumor ,MTOR Inhibitors ,Protein Kinase Inhibitors ,TOR Serine-Threonine Kinases ,Cyclin E ,Xenograft Model Antitumor Assays ,Synthetic Lethal Mutations ,Oncogene Proteins - Abstract
UNLABELLED: The comprehensive genomic analysis of the head and neck squamous cell carcinoma (HNSCC) oncogenome revealed the frequent loss of p16INK4A (CDKN2A) and amplification of cyclin D1 genes in most human papillomavirus-negative HNSCC lesions. However, cyclin-dependent kinase 4 and 6 (CDK4/6) inhibitors have shown modest effects in the clinic. The aberrant activation of the PI3K/mTOR pathway is highly prevalent in HNSCC, and recent clinical trials have shown promising clinical efficacy of mTOR inhibitors (mTORi) in the neoadjuvant and adjuvant settings but not in patients with advanced HNSCC. By implementing a kinome-wide CRISPR/Cas9 screen, we identified cell-cycle inhibition as a synthetic lethal target for mTORis. A combination of mTORi and palbociclib, a CDK4/6-specific inhibitor, showed strong synergism in HNSCC-derived cells in vitro and in vivo. Remarkably, we found that an adaptive increase in cyclin E1 (CCNE1) expression upon palbociclib treatment underlies the rapid acquired resistance to this CDK4/6 inhibitor. Mechanistically, mTORi inhibits the formation of eIF4G-CCNE1 mRNA complexes, with the consequent reduction in mRNA translation and CCNE1 protein expression. Our findings suggest that mTORi reverts the adaptive resistance to palbociclib. This provides a multimodal therapeutic option for HNSCC by cotargeting mTOR and CDK4/6, which in turn may halt the emergence of palbociclib resistance. SIGNIFICANCE: A kinome-wide CRISPR/Cas9 screen identified cell-cycle inhibition as a synthetic lethal target of mTORis. A combination of mTORi and palbociclib, a CDK4/6-specific inhibitor, showed strong synergistic effects in HNSCC. Mechanistically, mTORis inhibited palbociclib-induced increase in CCNE1.
- Published
- 2024
42. Recommenadation aided Caching using Combinatorial Multi-armed Bandits
- Author
-
J, Pavamana K and Singh, Chandramani Kishore
- Subjects
Computer Science - Machine Learning ,Computer Science - Information Retrieval ,Computer Science - Networking and Internet Architecture - Abstract
We study content caching with recommendations in a wireless network where the users are connected through a base station equipped with a finite-capacity cache. We assume a fixed set of contents with unknown user preferences and content popularities. The base station can cache a subset of the contents and can also recommend subsets of the contents to different users in order to encourage them to request the recommended contents. Recommendations, depending on their acceptability, can thus be used to increase cache hits. We first assume that the users' recommendation acceptabilities are known and formulate the cache hit optimization problem as a combinatorial multi-armed bandit (CMAB). We propose a UCB-based algorithm to decide which contents to cache and recommend and provide an upper bound on the regret of this algorithm. Subsequently, we consider a more general scenario where the users' recommendation acceptabilities are also unknown and propose another UCB-based algorithm that learns these as well. We numerically demonstrate the performance of our algorithms and compare these to state-of-the-art algorithms.
- Published
- 2024
43. Evaluating Telugu Proficiency in Large Language Models_ A Comparative Analysis of ChatGPT and Gemini
- Author
-
Kishore, Katikela Sreeharsha and Shaik, Rahimanuddin
- Subjects
Computer Science - Computation and Language ,Computer Science - Human-Computer Interaction - Abstract
The growing prominence of large language models (LLMs) necessitates the exploration of their capabilities beyond English. This research investigates the Telugu language proficiency of ChatGPT and Gemini, two leading LLMs. Through a designed set of 20 questions encompassing greetings, grammar, vocabulary, common phrases, task completion, and situational reasoning, the study delves into their strengths and weaknesses in handling Telugu. The analysis aims to identify the LLM that demonstrates a deeper understanding of Telugu grammatical structures, possesses a broader vocabulary, and exhibits superior performance in tasks like writing and reasoning. By comparing their ability to comprehend and use everyday Telugu expressions, the research sheds light on their suitability for real-world language interaction. Furthermore, the evaluation of adaptability and reasoning capabilities provides insights into how each LLM leverages Telugu to respond to dynamic situations. This comparative analysis contributes to the ongoing discussion on multilingual capabilities in AI and paves the way for future research in developing LLMs that can seamlessly integrate with Telugu-speaking communities.
- Published
- 2024
44. Rapid Optical Flare in the Extreme TeV Blazar 1ES 0229+200 on Intraday Timescale with TESS
- Author
-
Kishore, Shubham, Gupta, Alok C., Wiita, Paul J., and Tiwari, S. N.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
The extreme TeV blazar 1ES 0229+200 is a high-frequency-peaked BL Lacertae object. It has not shown intraday variability in extensive optical and X-ray observations. Nor has it shown any significant variability on any measurable timescale in the 1-100 GeV energy range over a 14-year span, but variations in the source flux around its average are present in the energy range above 200 GeV. We searched for intraday optical variability in 1ES 0229+200 as part of an ongoing project to search for variability and quasi-periodic oscillations in the high-cadence, nearly uniformly sampled optical light curves of blazars provided by the Transiting Exoplanet Survey Satellite (TESS). 1ES 0229+200 was monitored by TESS in its Sectors 42, 43 and 44. We analysed the data of all these sectors both with the TESS provided lightkurve software and the eleanor reduction pipeline. We detected a strong, essentially symmetric flare that lasted ~6 hours in Sector 42. We fit the flare's rising and declining phases to exponential functions. We also analysed the light curve of Sector 42 using the Lomb-Scargle periodogram (LSP) and continuous auto-regressive moving average (CARMA) methods. The Sector 42 light curve displayed in the present work provides the first evidence of a strong, rapid, short-lived optical flare on the intraday timescale in 1ES 0229+200. The variability timescale of the flare provides the upper limit for the size of the emission region to be within (3.3\pm0.2 - 8.3\pm0.5)x10^{15}cm. Away from the flare, the slope of the periodogram's power spectrum is fairly typical of many blazars (\alpha<2), but the nominal slopes for the flaring regions are very steep (\alpha~4.3), which may indicate the electron distribution undergoes a sudden change. We discuss possible emission mechanisms that could explain this substantial and rapid flare., Comment: 7 pages, 4 figures, 3 tables, Accepted for publication in A&A
- Published
- 2024
45. Exploring and Improving Drafts in Blockwise Parallel Decoding
- Author
-
Kim, Taehyeon, Suresh, Ananda Theertha, Papineni, Kishore, Riley, Michael, Kumar, Sanjiv, and Benton, Adrian
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Despite the remarkable strides made by autoregressive language models, their potential is often hampered by the slow inference speeds inherent in sequential token generation. Blockwise parallel decoding (BPD) was proposed by Stern et al. as a method to improve inference speed of language models by simultaneously predicting multiple future tokens, termed block drafts, which are subsequently verified and conditionally accepted by the autoregressive model. This paper contributes to the understanding and improvement of block drafts in two ways. First, we analyze the token distributions produced by multiple prediction heads. Secondly, we leverage this analysis to develop algorithms to improve BPD inference speed by refining the block drafts using n-gram and neural language models. Experiments demonstrate that refined block drafts yield a +5-21% increase in block efficiency (i.e., the number of accepted tokens from the block draft) across diverse datasets.
- Published
- 2024
46. Privacy preserving layer partitioning for Deep Neural Network models
- Author
-
Rajasekar, Kishore, Loh, Randolph, Fok, Kar Wai, and Thing, Vrizlynn L. L.
- Subjects
Computer Science - Cryptography and Security - Abstract
MLaaS (Machine Learning as a Service) has become popular in the cloud computing domain, allowing users to leverage cloud resources for running private inference of ML models on their data. However, ensuring user input privacy and secure inference execution is essential. One of the approaches to protect data privacy and integrity is to use Trusted Execution Environments (TEEs) by enabling execution of programs in secure hardware enclave. Using TEEs can introduce significant performance overhead due to the additional layers of encryption, decryption, security and integrity checks. This can lead to slower inference times compared to running on unprotected hardware. In our work, we enhance the runtime performance of ML models by introducing layer partitioning technique and offloading computations to GPU. The technique comprises two distinct partitions: one executed within the TEE, and the other carried out using a GPU accelerator. Layer partitioning exposes intermediate feature maps in the clear which can lead to reconstruction attacks to recover the input. We conduct experiments to demonstrate the effectiveness of our approach in protecting against input reconstruction attacks developed using trained conditional Generative Adversarial Network(c-GAN). The evaluation is performed on widely used models such as VGG-16, ResNet-50, and EfficientNetB0, using two datasets: ImageNet for Image classification and TON IoT dataset for cybersecurity attack detection.
- Published
- 2024
47. STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs
- Author
-
Hari, Kush, Kim, Hansoul, Panitch, Will, Srinivas, Kishore, Schorp, Vincent, Dharmarajan, Karthik, Ganti, Shreya, Sadjadpour, Tara, and Goldberg, Ken
- Subjects
Computer Science - Robotics - Abstract
We present STITCH: an augmented dexterity pipeline that performs Suture Throws Including Thread Coordination and Handoffs. STITCH iteratively performs needle insertion, thread sweeping, needle extraction, suture cinching, needle handover, and needle pose correction with failure recovery policies. We introduce a novel visual 6D needle pose estimation framework using a stereo camera pair and new suturing motion primitives. We compare STITCH to baselines, including a proprioception-only and a policy without visual servoing. In physical experiments across 15 trials, STITCH achieves an average of 2.93 sutures without human intervention and 4.47 sutures with human intervention. See https://sites.google.com/berkeley.edu/stitch for code and supplemental materials.
- Published
- 2024
48. Operators in the Internal Space and Locality
- Author
-
Bohra, Hardik, Das, Sumit R., Mandal, Gautam, Nanda, Kanhu Kishore, Radwan, Mohamed Hany, and Trivedi, Sandip P.
- Subjects
High Energy Physics - Theory - Abstract
Realizations of the holographic correspondence in String/M theory typically involve spacetimes of the form $AdS \times Y$ where $Y$ is some internal space which geometrizes an internal symmetry of the dual field theory, hereafter referred to as an "$R$ symmetry". It has been speculated that areas of Ryu-Takayanagi surfaces anchored on the boundary of a subregion of $Y$, and smeared over the base space of the dual field theory, quantify entanglement of internal degrees of freedom. A natural candidate for the corresponding operators are linear combinations of operators with definite $R$ charge with coefficients given by the "spherical harmonics'' of the internal space: this is natural when the product spaces appear as IR geometries of higher dimensional AdS spaces. We study clustering properties of such operators both for pure $AdS \times Y$ and for flow geometries, where $AdS \times Y$ arises in the IR from a different spacetime in the UV, for example higher dimensional AdS or asymptotically flat spacetime. We show, in complete generality, that the two point functions of such operators separated along the internal space obey clustering properties at scales larger than the $AdS$ scale. For non-compact $Y$, this provides a notion of approximate locality. When $Y$ is compact, clustering happens only when the size of $Y$ is parametrically larger than the $AdS$ scale. This latter situation is realized in flow geometries where the product spaces arise in the IR from an asymptotically AdS geometry at UV, but not typically when they arise near black hole horizons in asymptotically flat spacetimes. We discuss the significance of this result for entanglement and comment on the role of color degrees of freedom., Comment: 24 pages, no figures
- Published
- 2024
49. Human Mobility in the Metaverse
- Author
-
Vasan, Kishore, Karsai, Marton, and Barabasi, Albert-Laszlo
- Subjects
Computer Science - Social and Information Networks ,Computer Science - Computers and Society ,Physics - Physics and Society - Abstract
The metaverse promises a shift in the way humans interact with each other, and with their digital and physical environments. The lack of geographical boundaries and travel costs in the metaverse prompts us to ask if the fundamental laws that govern human mobility in the physical world apply. We collected data on avatar movements, along with their network mobility extracted from NFT purchases. We find that despite the absence of commuting costs, an individuals inclination to explore new locations diminishes over time, limiting movement to a small fraction of the metaverse. We also find a lack of correlation between land prices and visitation, a deviation from the patterns characterizing the physical world. Finally, we identify the scaling laws that characterize meta mobility and show that we need to add preferential selection to the existing models to explain quantitative patterns of metaverse mobility. Our ability to predict the characteristics of the emerging meta mobility network implies that the laws governing human mobility are rooted in fundamental patterns of human dynamics, rather than the nature of space and cost of movement., Comment: 4 figures
- Published
- 2024
50. Unveiling Divergent Inductive Biases of LLMs on Temporal Data
- Author
-
Kishore, Sindhu and He, Hangfeng
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Unraveling the intricate details of events in natural language necessitates a subtle understanding of temporal dynamics. Despite the adeptness of Large Language Models (LLMs) in discerning patterns and relationships from data, their inherent comprehension of temporal dynamics remains a formidable challenge. This research meticulously explores these intrinsic challenges within LLMs, with a specific emphasis on evaluating the performance of GPT-3.5 and GPT-4 models in the analysis of temporal data. Employing two distinct prompt types, namely Question Answering (QA) format and Textual Entailment (TE) format, our analysis probes into both implicit and explicit events. The findings underscore noteworthy trends, revealing disparities in the performance of GPT-3.5 and GPT-4. Notably, biases toward specific temporal relationships come to light, with GPT-3.5 demonstrating a preference for "AFTER'' in the QA format for both implicit and explicit events, while GPT-4 leans towards "BEFORE''. Furthermore, a consistent pattern surfaces wherein GPT-3.5 tends towards "TRUE'', and GPT-4 exhibits a preference for "FALSE'' in the TE format for both implicit and explicit events. This persistent discrepancy between GPT-3.5 and GPT-4 in handling temporal data highlights the intricate nature of inductive bias in LLMs, suggesting that the evolution of these models may not merely mitigate bias but may introduce new layers of complexity.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.