1,996 results on '"Bâli, A"'
Search Results
2. Fake News and Social Media: Indian Perspective
- Author
-
Bali, Aasita and Desai, Prathik
- Published
- 2019
- Full Text
- View/download PDF
3. Sports management in physical education
- Author
-
Gaur, Madhu and Bali, Ashwani
- Published
- 2018
4. Measuring customer satisfaction and service quality in tourism industry
- Author
-
Kachwala, Tohid, Bhadra, Amit, Bali, Aditya, and Dasgupta, Chandan
- Published
- 2018
- Full Text
- View/download PDF
5. Self and environmental problems: A case study of visually, hearing and orthopaedically impaired students of Jammu City
- Author
-
Bali, Ana
- Published
- 2017
6. Song sequences and female representation in V. Ravichandra'select Kannada films
- Author
-
Tejas, NK and Bali, Aasita
- Published
- 2017
7. Effect of different establishment methods and sowing schedules on growth and yield of hybrid rice (Oryza sativa) and their after effects on succeeding wheat (Triticum aestivum) in rice –wheat cropping system
- Author
-
Sharma, Vikas, Bali, A.S., and Kachroo, Dileep
- Published
- 2016
- Full Text
- View/download PDF
8. Mindfulness for New Zealand lawyers : become a switched-on, focused, resilient and joyful lawyer
- Author
-
Bali, Ashika
- Published
- 2019
9. Growth and yield of Ocimum sanctum in response to integrated nutrient management and plant spacing
- Author
-
Raina, N.S., Rafiq, M., Sood, K.K., Bali, A.S., Gupta, S.K., and Sehgal, S.
- Published
- 2013
10. Performance of maize (Zea mays L.) in intercropping systems at different fertility levels
- Author
-
Sharma, Parshotam Kumar, Bali, A. S., and Sharma, B. C.
- Published
- 2012
11. Urea molasses mineral block and Roughage block feeding of livestock in hilly areas of Jammu
- Author
-
Azad, M.S., Singh, Rajeev, and Bali, A.S.
- Published
- 2011
12. Effect of tillage and nutrient management on resource conservation and productivity of wheat (Triticum aestivum)
- Author
-
Gupta, Meenakshi, Bali, Amarjit S., Kour, Sarabdeep, Bharat, Rajeev, and Bazaya, B.R.
- Published
- 2011
13. Effect of tillage and weed control methods on yield and energetics of wheat (Triticum aestvium)
- Author
-
Bali, Amarjit S., Gupta, Meenakshi, Sharma, B. C., and Kumar, Anil
- Published
- 2009
14. Productivity, nutrient uptake and economics of wheat (Triticum aestivum) under various tillage and fertilizer management practices
- Author
-
Gupta, Meenakshi, Bali, Amarjit S., Sharma, B.C., Kachroo, D., and Bharat, Rajeev
- Published
- 2007
15. Integrated plant nutrient supply system in irrigated wheat
- Author
-
Kumar, Anil, Thakur, N.P., Bali, Amarjit S., and Mehta, Suman
- Published
- 2007
16. Responses to Aziz Rana
- Author
-
Iber, Patrick, Getachew, Adom, Wertheim, Stephen, Bâli, Aslı, Linfield, Susie, Kassem, Ramzi, Li, Darryl, and Replies, Aziz Rana
- Published
- 2022
- Full Text
- View/download PDF
17. KAHANI: Culturally-Nuanced Visual Storytelling Pipeline for Non-Western Cultures
- Author
-
Hamna, Sudharsan, Deepthi, Seth, Agrima, Budhiraja, Ritvik, Khullar, Deepika, Jain, Vyshak, Bali, Kalika, Vashistha, Aditya, and Segal, Sameer
- Subjects
Computer Science - Computation and Language - Abstract
Large Language Models (LLMs) and Text-To-Image (T2I) models have demonstrated the ability to generate compelling text and visual stories. However, their outputs are predominantly aligned with the sensibilities of the Global North, often resulting in an outsider's gaze on other cultures. As a result, non-Western communities have to put extra effort into generating culturally specific stories. To address this challenge, we developed a visual storytelling pipeline called KAHANI that generates culturally grounded visual stories for non-Western cultures. Our pipeline leverages off-the-shelf models GPT-4 Turbo and Stable Diffusion XL (SDXL). By using Chain of Thought (CoT) and T2I prompting techniques, we capture the cultural context from user's prompt and generate vivid descriptions of the characters and scene compositions. To evaluate the effectiveness of KAHANI, we conducted a comparative user study with ChatGPT-4 (with DALL-E3) in which participants from different regions of India compared the cultural relevance of stories generated by the two tools. Results from the qualitative and quantitative analysis performed on the user study showed that KAHANI was able to capture and incorporate more Culturally Specific Items (CSIs) compared to ChatGPT-4. In terms of both its cultural competence and visual story generation quality, our pipeline outperformed ChatGPT-4 in 27 out of the 36 comparisons., Comment: Under review
- Published
- 2024
18. Towards Inducing Document-Level Abilities in Standard Multilingual Neural Machine Translation Models
- Author
-
Gumma, Varun, Chitale, Pranjal A., and Bali, Kalika
- Subjects
Computer Science - Computation and Language - Abstract
Neural Machine Translation (NMT) models have traditionally used Sinusoidal Positional Embeddings (PEs), which often struggle to capture long-range dependencies and are less efficient for handling extended context or document-level translation tasks. This work addresses the challenge of transitioning pre-trained NMT models from absolute sinusoidal PEs to relative PEs, such as Rotary Positional Embeddings (ROPE) and Attention with Linear Biases (ALIBI), without compromising performance. We demonstrate that parameter-efficient fine-tuning, using only a small amount of high-quality data, can successfully facilitate this transition. Experimental results indicate that switching from sinusoidal to relative PEs results in competitive translation quality on sentence-level evaluation benchmarks. Additionally, models trained with ROPE consistently outperform those using ALIBI and Sinusoidal PEs on document-level benchmarks across both string-based metrics and qualitative evaluations. Moreover, we find that a small amount of long-context data in a few languages is sufficient for cross-lingual length generalization, thereby inducing long-context capabilities., Comment: Under Review
- Published
- 2024
19. MARLP: Time-series Forecasting Control for Agricultural Managed Aquifer Recharge
- Author
-
Chen, Yuning, Yang, Kang, An, Zhiyu, Holder, Brady, Paloutzian, Luke, Bali, Khaled, and Du, Wan
- Subjects
Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Systems and Control - Abstract
The rapid decline in groundwater around the world poses a significant challenge to sustainable agriculture. To address this issue, agricultural managed aquifer recharge (Ag-MAR) is proposed to recharge the aquifer by artificially flooding agricultural lands using surface water. Ag-MAR requires a carefully selected flooding schedule to avoid affecting the oxygen absorption of crop roots. However, current Ag-MAR scheduling does not take into account complex environmental factors such as weather and soil oxygen, resulting in crop damage and insufficient recharging amounts. This paper proposes MARLP, the first end-to-end data-driven control system for Ag-MAR. We first formulate Ag-MAR as an optimization problem. To that end, we analyze four-year in-field datasets, which reveal the multi-periodicity feature of the soil oxygen level trends and the opportunity to use external weather forecasts and flooding proposals as exogenous clues for soil oxygen prediction. Then, we design a two-stage forecasting framework. In the first stage, it extracts both the cross-variate dependency and the periodic patterns from historical data to conduct preliminary forecasting. In the second stage, it uses weather-soil and flooding-soil causality to facilitate an accurate prediction of soil oxygen levels. Finally, we conduct model predictive control (MPC) for Ag-MAR flooding. To address the challenge of large action spaces, we devise a heuristic planning module to reduce the number of flooding proposals to enable the search for optimal solutions. Real-world experiments show that MARLP reduces the oxygen deficit ratio by 86.8% while improving the recharging amount in unit time by 35.8%, compared with the previous four years., Comment: Accepted by KDD 2024
- Published
- 2024
- Full Text
- View/download PDF
20. Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting
- Author
-
Mukherjee, Sagnik, Adilazuarda, Muhammad Farid, Sitaram, Sunayana, Bali, Kalika, Aji, Alham Fikri, and Choudhury, Monojit
- Subjects
Computer Science - Computation and Language - Abstract
Socio-demographic prompting is a commonly employed approach to study cultural biases in LLMs as well as for aligning models to certain cultures. In this paper, we systematically probe four LLMs (Llama 3, Mistral v0.2, GPT-3.5 Turbo and GPT-4) with prompts that are conditioned on culturally sensitive and non-sensitive cues, on datasets that are supposed to be culturally sensitive (EtiCor and CALI) or neutral (MMLU and ETHICS). We observe that all models except GPT-4 show significant variations in their responses on both kinds of datasets for both kinds of prompts, casting doubt on the robustness of the culturally-conditioned prompting as a method for eliciting cultural bias in models or as an alignment strategy. The work also calls rethinking the control experiment design to tease apart the cultural conditioning of responses from "placebo effect", i.e., random perturbations of model responses due to arbitrary tokens in the prompt.
- Published
- 2024
21. Beyond Metrics: Evaluating LLMs' Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios
- Author
-
Ochieng, Millicent, Gumma, Varun, Sitaram, Sunayana, Wang, Jindong, Chaudhary, Vishrav, Ronen, Keshet, Bali, Kalika, and O'Neill, Jacki
- Subjects
Computer Science - Computation and Language - Abstract
The deployment of Large Language Models (LLMs) in real-world applications presents both opportunities and challenges, particularly in multilingual and code-mixed communication settings. This research evaluates the performance of seven leading LLMs in sentiment analysis on a dataset derived from multilingual and code-mixed WhatsApp chats, including Swahili, English and Sheng. Our evaluation includes both quantitative analysis using metrics like F1 score and qualitative assessment of LLMs' explanations for their predictions. We find that, while Mistral-7b and Mixtral-8x7b achieved high F1 scores, they and other LLMs such as GPT-3.5-Turbo, Llama-2-70b, and Gemma-7b struggled with understanding linguistic and contextual nuances, as well as lack of transparency in their decision-making process as observed from their explanations. In contrast, GPT-4 and GPT-4-Turbo excelled in grasping diverse linguistic inputs and managing various contextual information, demonstrating high consistency with human alignment and transparency in their decision-making process. The LLMs however, encountered difficulties in incorporating cultural nuance especially in non-English settings with GPT-4s doing so inconsistently. The findings emphasize the necessity of continuous improvement of LLMs to effectively tackle the challenges of culturally nuanced, low-resource real-world settings and the need for developing evaluation benchmarks for capturing these issues.
- Published
- 2024
22. Progress on nucleon transition matrix elements with a lattice QCD variational analysis
- Author
-
Barca, Lorenzo, Bali, Gunnar, and Collins, Sara
- Subjects
High Energy Physics - Lattice - Abstract
Nucleon weak matrix elements can be extracted from nucleon correlation functions with lattice QCD simulations. The signal-to-noise ratio prohibits the analysis at large source-sink separations and as a consequence, excited state contamination affects the extraction of the nucleon matrix elements. Chiral perturbation theory (ChPT) suggests that the dominant contamination in some of these channels is due to $N\pi$ states where the pion carries the same momentum of the current. In this talk, we report updates on the variational analysis with $qqq$-operators (nucleon-like) and $(qqq)(\bar{q}q)$-operators (nucleon-pion-like) where we report for the first time some preliminary results of $\langle N\pi| \mathcal{J}| N \rangle $, modulo some kinematic and volume factors, and we compare the results against ChPT. This pilot study is performed on a CLS ensemble with $N_f=3$, $m_\pi \approx 420~\mathrm{MeV}$, $a\approx 0.1~\mathrm{fm}$ and $T=2L\approx 4.8~\mathrm{fm}$., Comment: Proceedings for the EuroPLEx Final Conference, 11-15 September 2023, Humboldt University of Berlin
- Published
- 2024
23. Bridging the Gap: Dynamic Learning Strategies for Improving Multilingual Performance in LLMs
- Author
-
Kumar, Somnath, Balloli, Vaibhav, Ranjit, Mercy, Ahuja, Kabir, Ganu, Tanuja, Sitaram, Sunayana, Bali, Kalika, and Nambi, Akshay
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Large language models (LLMs) are at the forefront of transforming numerous domains globally. However, their inclusivity and effectiveness remain limited for non-Latin scripts and low-resource languages. This paper tackles the imperative challenge of enhancing the multilingual performance of LLMs without extensive training or fine-tuning. Through systematic investigation and evaluation of diverse languages using popular question-answering (QA) datasets, we present novel techniques that unlock the true potential of LLMs in a polyglot landscape. Our approach encompasses three key strategies that yield significant improvements in multilingual proficiency. First, by meticulously optimizing prompts tailored for polyglot LLMs, we unlock their latent capabilities, resulting in substantial performance boosts across languages. Second, we introduce a new hybrid approach that synergizes LLM Retrieval Augmented Generation (RAG) with multilingual embeddings and achieves improved multilingual task performance. Finally, we introduce a novel learning approach that dynamically selects the optimal prompt strategy, LLM model, and embedding model per query at run-time. This dynamic adaptation maximizes the efficacy of LLMs across languages, outperforming best static and random strategies. Additionally, our approach adapts configurations in both offline and online settings, and can seamlessly adapt to new languages and datasets, leading to substantial advancements in multilingual understanding and generation across diverse languages.
- Published
- 2024
24. Enhancing Decision-Making in Optimization through LLM-Assisted Inference: A Neural Networks Perspective
- Author
-
Singh, Gaurav and Bali, Kavitesh Kumar
- Subjects
Computer Science - Neural and Evolutionary Computing ,Computer Science - Artificial Intelligence - Abstract
This paper explores the seamless integration of Generative AI (GenAI) and Evolutionary Algorithms (EAs) within the domain of large-scale multi-objective optimization. Focusing on the transformative role of Large Language Models (LLMs), our study investigates the potential of LLM-Assisted Inference to automate and enhance decision-making processes. Specifically, we highlight its effectiveness in illuminating key decision variables in evolutionarily optimized solutions while articulating contextual trade-offs. Tailored to address the challenges inherent in inferring complex multi-objective optimization solutions at scale, our approach emphasizes the adaptive nature of LLMs, allowing them to provide nuanced explanations and align their language with diverse stakeholder expertise levels and domain preferences. Empirical studies underscore the practical applicability and impact of LLM-Assisted Inference in real-world decision-making scenarios., Comment: Accepted IJCNN
- Published
- 2024
25. Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology
- Author
-
Hada, Rishav, Husain, Safiya, Gumma, Varun, Diddee, Harshita, Yadavalli, Aditya, Seth, Agrima, Kulkarni, Nidhi, Gadiraju, Ujwal, Vashistha, Aditya, Seshadri, Vivek, and Bali, Kalika
- Subjects
Computer Science - Computation and Language - Abstract
Existing research in measuring and mitigating gender bias predominantly centers on English, overlooking the intricate challenges posed by non-English languages and the Global South. This paper presents the first comprehensive study delving into the nuanced landscape of gender bias in Hindi, the third most spoken language globally. Our study employs diverse mining techniques, computational models, field studies and sheds light on the limitations of current methodologies. Given the challenges faced with mining gender biased statements in Hindi using existing methods, we conducted field studies to bootstrap the collection of such sentences. Through field studies involving rural and low-income community women, we uncover diverse perceptions of gender bias, underscoring the necessity for context-specific approaches. This paper advocates for a community-centric research design, amplifying voices often marginalized in previous studies. Our findings not only contribute to the understanding of gender bias in Hindi but also establish a foundation for further exploration of Indic languages. By exploring the intricacies of this understudied context, we call for thoughtful engagement with gender bias, promoting inclusivity and equity in linguistic and cultural contexts beyond the Global North., Comment: Accepted to FAccT 2024
- Published
- 2024
26. Bridging the Gap Between Theory and Practice: Benchmarking Transfer Evolutionary Optimization
- Author
-
Hou, Yaqing, Ma, Wenqiang, Gupta, Abhishek, Bali, Kavitesh Kumar, Ge, Hongwei, Zhang, Qiang, Coello, Carlos A. Coello, and Ong, Yew-Soon
- Subjects
Computer Science - Neural and Evolutionary Computing - Abstract
In recent years, the field of Transfer Evolutionary Optimization (TrEO) has witnessed substantial growth, fueled by the realization of its profound impact on solving complex problems. Numerous algorithms have emerged to address the challenges posed by transferring knowledge between tasks. However, the recently highlighted ``no free lunch theorem'' in transfer optimization clarifies that no single algorithm reigns supreme across diverse problem types. This paper addresses this conundrum by adopting a benchmarking approach to evaluate the performance of various TrEO algorithms in realistic scenarios. Despite the growing methodological focus on transfer optimization, existing benchmark problems often fall short due to inadequate design, predominantly featuring synthetic problems that lack real-world relevance. This paper pioneers a practical TrEO benchmark suite, integrating problems from the literature categorized based on the three essential aspects of Big Source Task-Instances: volume, variety, and velocity. Our primary objective is to provide a comprehensive analysis of existing TrEO algorithms and pave the way for the development of new approaches to tackle practical challenges. By introducing realistic benchmarks that embody the three dimensions of volume, variety, and velocity, we aim to foster a deeper understanding of algorithmic performance in the face of diverse and complex transfer scenarios. This benchmark suite is poised to serve as a valuable resource for researchers, facilitating the refinement and advancement of TrEO algorithms in the pursuit of solving real-world problems., Comment: 17 pages, 18 figures
- Published
- 2024
27. Routing and Spectrum Allocation in Broadband Quantum Entanglement Distribution
- Author
-
Bali, Rohan, Tittelbaugh, Ashley N., Jenkins, Shelbi L., Agrawal, Anuj, Horgan, Jerry, Ruffini, Marco, Kilper, Daniel C., and Bash, Boulat A.
- Subjects
Computer Science - Networking and Internet Architecture ,Computer Science - Emerging Technologies ,Quantum Physics - Abstract
We investigate resource allocation for quantum entanglement distribution over an optical network. We characterize and model a network architecture that employs a single quasi-deterministic time-frequency heralded Einstein-Podolsky-Rosen (EPR) pair source, and develop a routing scheme for distributing entangled photon pairs over such a network. We focus on max-min fairness in entanglement distribution and compare the performance of various spectrum allocation schemes by examining the max-min and median number of EPR-pairs assigned by them, and the Jain index associated with this assignment. Since this presents an NP-hard problem, we identify two approximation algorithms that outperform others in minimum and mean EPR-pair rate distribution and are comparable to others in the Jain index. We also analyze how the network size and connectivity affect these metrics using Watts-Strogatz random graphs. We find that a spectrum allocation approach that achieves high minimum EPR-pair rate can perform significantly worse when the median EPR-pair rate, Jain index, and runtimes are considered., Comment: originally appeared as arXiv:2311.14613v2 in error. arXiv admin note: text overlap with arXiv:2311.14613
- Published
- 2024
28. METAL: Towards Multilingual Meta-Evaluation
- Author
-
Hada, Rishav, Gumma, Varun, Ahmed, Mohamed, Bali, Kalika, and Sitaram, Sunayana
- Subjects
Computer Science - Computation and Language - Abstract
With the rising human-like precision of Large Language Models (LLMs) in numerous tasks, their utilization in a variety of real-world applications is becoming more prevalent. Several studies have shown that LLMs excel on many standard NLP benchmarks. However, it is challenging to evaluate LLMs due to test dataset contamination and the limitations of traditional metrics. Since human evaluations are difficult to collect, there is a growing interest in the community to use LLMs themselves as reference-free evaluators for subjective metrics. However, past work has shown that LLM-based evaluators can exhibit bias and have poor alignment with human judgments. In this study, we propose a framework for an end-to-end assessment of LLMs as evaluators in multilingual scenarios. We create a carefully curated dataset, covering 10 languages containing native speaker judgments for the task of summarization. This dataset is created specifically to evaluate LLM-based evaluators, which we refer to as meta-evaluation (METAL). We compare the performance of LLM-based evaluators created using GPT-3.5-Turbo, GPT-4, and PaLM2. Our results indicate that LLM-based evaluators based on GPT-4 perform the best across languages, while GPT-3.5-Turbo performs poorly. Additionally, we perform an analysis of the reasoning provided by LLM-based evaluators and find that it often does not match the reasoning provided by human judges., Comment: Accepted to NAACL 2024 findings
- Published
- 2024
29. Assistant, parrot, or colonizing loudspeaker?
- Author
-
Gupta, Anuj, Atef, Yasser, Mills, Anna, and Bali, Maha
- Published
- 2024
30. DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures
- Author
-
Seth, Agrima, Ahuja, Sanchit, Bali, Kalika, and Sitaram, Sunayana
- Subjects
Computer Science - Computers and Society ,Computer Science - Computation and Language - Abstract
Generative models are increasingly being used in various applications, such as text generation, commonsense reasoning, and question-answering. To be effective globally, these models must be aware of and account for local socio-cultural contexts, making it necessary to have benchmarks to evaluate the models for their cultural familiarity. Since the training data for LLMs is web-based and the Web is limited in its representation of information, it does not capture knowledge present within communities that are not on the Web. Thus, these models exacerbate the inequities, semantic misalignment, and stereotypes from the Web. There has been a growing call for community-centered participatory research methods in NLP. In this work, we respond to this call by using participatory research methods to introduce $\textit{DOSA}$, the first community-generated $\textbf{D}$ataset $\textbf{o}$f 615 $\textbf{S}$ocial $\textbf{A}$rtifacts, by engaging with 260 participants from 19 different Indian geographic subcultures. We use a gamified framework that relies on collective sensemaking to collect the names and descriptions of these artifacts such that the descriptions semantically align with the shared sensibilities of the individuals from those cultures. Next, we benchmark four popular LLMs and find that they show significant variation across regional sub-cultures in their ability to infer the artifacts.
- Published
- 2024
31. Differentiability in Unrolled Training of Neural Physics Simulators on Transient Dynamics
- Author
-
List, Bjoern, Chen, Li-Wei, Bali, Kartik, and Thuerey, Nils
- Subjects
Physics - Computational Physics ,Computer Science - Machine Learning - Abstract
Unrolling training trajectories over time strongly influences the inference accuracy of neural network-augmented physics simulators. We analyze this in three variants of training neural time-steppers. In addition to one-step setups and fully differentiable unrolling, we include a third, less widely used variant: unrolling without temporal gradients. Comparing networks trained with these three modalities disentangles the two dominant effects of unrolling, training distribution shift and long-term gradients. We present detailed study across physical systems, network sizes and architectures, training setups, and test scenarios. It also encompasses two simulation modes: In prediction setups, we rely solely on neural networks to compute a trajectory. In contrast, correction setups include a numerical solver that is supported by a neural network. Spanning these variations, our study provides the empirical basis for our main findings: Non-differentiable but unrolled training with a numerical solver in a correction setup can yield substantial improvements over a fully differentiable prediction setup not utilizing this solver. The accuracy of models trained in a fully differentiable setup differs compared to their non-differentiable counterparts. Differentiable ones perform best in a comparison among correction networks as well as among prediction setups. For both, the accuracy of non-differentiable unrolling comes close. Furthermore, we show that these behaviors are invariant to the physical system, the network architecture and size, and the numerical scheme. These results motivate integrating non-differentiable numerical simulators into training setups even if full differentiability is unavailable. We show the convergence rate of common architectures to be low compared to numerical algorithms. This motivates correction setups combining neural and numerical parts which utilize benefits of both., Comment: Project Page: https://ge.in.tum.de/publications/how-temporal-unrolling-supports-neural-physics-simulators/ , Github Page: https://github.com/tum-pbs/unrolling
- Published
- 2024
32. RFBES at SemEval-2024 Task 8: Investigating Syntactic and Semantic Features for Distinguishing AI-Generated and Human-Written Texts
- Author
-
Rad, Mohammad Heydari, Farsi, Farhan, Bali, Shayan, Etezadi, Romina, and Shamsfard, Mehrnoush
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Nowadays, the usage of Large Language Models (LLMs) has increased, and LLMs have been used to generate texts in different languages and for different tasks. Additionally, due to the participation of remarkable companies such as Google and OpenAI, LLMs are now more accessible, and people can easily use them. However, an important issue is how we can detect AI-generated texts from human-written ones. In this article, we have investigated the problem of AI-generated text detection from two different aspects: semantics and syntax. Finally, we presented an AI model that can distinguish AI-generated texts from human-written ones with high accuracy on both multilingual and monolingual tasks using the M4 dataset. According to our results, using a semantic approach would be more helpful for detection. However, there is a lot of room for improvement in the syntactic approach, and it would be a good approach for future work., Comment: Mohammad Heydari Rad, Farhan Farsi, and Shayan Bali have made equal contributions to this work
- Published
- 2024
33. MunTTS: A Text-to-Speech System for Mundari
- Author
-
Gumma, Varun, Hada, Rishav, Yadavalli, Aditya, Gogoi, Pamir, Mondal, Ishani, Seshadri, Vivek, and Bali, Kalika
- Subjects
Computer Science - Computation and Language ,Computer Science - Sound ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
We present MunTTS, an end-to-end text-to-speech (TTS) system specifically for Mundari, a low-resource Indian language of the Austo-Asiatic family. Our work addresses the gap in linguistic technology for underrepresented languages by collecting and processing data to build a speech synthesis system. We begin our study by gathering a substantial dataset of Mundari text and speech and train end-to-end speech models. We also delve into the methods used for training our models, ensuring they are efficient and effective despite the data constraints. We evaluate our system with native speakers and objective metrics, demonstrating its potential as a tool for preserving and promoting the Mundari language in the digital age., Comment: Accepted to ComputEL-7
- Published
- 2024
34. Assistant, Parrot, or Colonizing Loudspeaker? ChatGPT Metaphors for Developing Critical AI Literacies
- Author
-
Gupta, Anuj, Atef, Yasser, Mills, Anna, and Bali, Maha
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Artificial Intelligence ,Computer Science - Computers and Society ,I.2.0 ,K.3.0 ,K.3.1 ,K.4.0 ,K.4.2 ,J.4 ,J.5 - Abstract
This study explores how discussing metaphors for AI can help build awareness of the frames that shape our understanding of AI systems, particularly large language models (LLMs) like ChatGPT. Given the pressing need to teach "critical AI literacy", discussion of metaphor provides an opportunity for inquiry and dialogue with space for nuance, playfulness, and critique. Using a collaborative autoethnographic methodology, we analyzed metaphors from a range of sources, and reflected on them individually according to seven questions, then met and discussed our interpretations. We then analyzed how our reflections contributed to the three kinds of literacies delineated in Selber's multiliteracies framework: functional, critical, and rhetorical. These allowed us to analyze questions of ethics, equity, and accessibility in relation to AI. We explored each metaphor along the dimension of whether or not it was promoting anthropomorphizing, and to what extent such metaphors imply that AI is sentient. Our findings highlight the role of metaphor reflection in fostering a nuanced understanding of AI, suggesting that our collaborative autoethnographic approach as well as the heuristic model of plotting AI metaphors on dimensions of anthropomorphism and multiliteracies, might be useful for educators and researchers in the pursuit of advancing critical AI literacy., Comment: This is a preprint (accepted version) of an article that has been accepted for publication at the journal Open Praxis: https://openpraxis.org/
- Published
- 2024
- Full Text
- View/download PDF
35. Object Counting from Images Using Deep Learning Technique
- Author
-
Bali, Arishpreet Kour, Kumar, Amit, Kacprzyk, Janusz, Series Editor, Gomide, Fernando, Advisory Editor, Kaynak, Okyay, Advisory Editor, Liu, Derong, Advisory Editor, Pedrycz, Witold, Advisory Editor, Polycarpou, Marios M., Advisory Editor, Rudas, Imre J., Advisory Editor, Wang, Jun, Advisory Editor, Hassanien, Aboul Ella, editor, Anand, Sameer, editor, Jaiswal, Ajay, editor, and Kumar, Prabhat, editor
- Published
- 2025
- Full Text
- View/download PDF
36. Routing and Spectrum Allocation in Broadband Degenerate EPR-Pair Distribution
- Author
-
Bali, Rohan, Tittelbaugh, Ashley, Jenkins, Shelbi L., Agrawal, Anuj, Horgan, Jerry, Ruffini, Marco, Kilper, Daniel, and Bash, Boulat A.
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
We investigate resource allocation for quantum entanglement distribution over an optical network. We characterize and model a network architecture that employs a single quasideterministic time-frequency heralded EPR-pair source, and develop a routing scheme for distributing entangled photon pairs over such a network. We focus on fairness in entanglement distribution, and compare both the performance of various spectrum allocation schemes as well as their Jain index.
- Published
- 2023
37. MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
- Author
-
Ahuja, Sanchit, Aggarwal, Divyanshu, Gumma, Varun, Watts, Ishaan, Sathe, Ashutosh, Ochieng, Millicent, Hada, Rishav, Jain, Prachi, Axmed, Maxamed, Bali, Kalika, and Sitaram, Sunayana
- Subjects
Computer Science - Computation and Language - Abstract
There has been a surge in LLM evaluation research to understand LLM capabilities and limitations. However, much of this research has been confined to English, leaving LLM building and evaluation for non-English languages relatively unexplored. Several new LLMs have been introduced recently, necessitating their evaluation on non-English languages. This study aims to perform a thorough evaluation of the non-English capabilities of SoTA LLMs (GPT-3.5-Turbo, GPT-4, PaLM2, Gemini-Pro, Mistral, Llama2, and Gemma) by comparing them on the same set of multilingual datasets. Our benchmark comprises 22 datasets covering 83 languages, including low-resource African languages. We also include two multimodal datasets in the benchmark and compare the performance of LLaVA models, GPT-4-Vision and Gemini-Pro-Vision. Our experiments show that larger models such as GPT-4, Gemini-Pro and PaLM2 outperform smaller models on various tasks, notably on low-resource languages, with GPT-4 outperforming PaLM2 and Gemini-Pro on more datasets. We also perform a study on data contamination and find that several models are likely to be contaminated with multilingual evaluation benchmarks, necessitating approaches to detect and handle contamination while assessing the multilingual performance of LLMs., Comment: 40 pages, 35 figures and 34 tables
- Published
- 2023
38. 'Fifty Shades of Bias': Normative Ratings of Gender Bias in GPT Generated English Text
- Author
-
Hada, Rishav, Seth, Agrima, Diddee, Harshita, and Bali, Kalika
- Subjects
Computer Science - Computation and Language - Abstract
Language serves as a powerful tool for the manifestation of societal belief systems. In doing so, it also perpetuates the prevalent biases in our society. Gender bias is one of the most pervasive biases in our society and is seen in online and offline discourses. With LLMs increasingly gaining human-like fluency in text generation, gaining a nuanced understanding of the biases these systems can generate is imperative. Prior work often treats gender bias as a binary classification task. However, acknowledging that bias must be perceived at a relative scale; we investigate the generation and consequent receptivity of manual annotators to bias of varying degrees. Specifically, we create the first dataset of GPT-generated English text with normative ratings of gender bias. Ratings were obtained using Best--Worst Scaling -- an efficient comparative annotation framework. Next, we systematically analyze the variation of themes of gender biases in the observed ranking and show that identity-attack is most closely related to gender bias. Finally, we show the performance of existing automated models trained on related concepts on our dataset., Comment: Camera-ready version in EMNLP 2023
- Published
- 2023
39. Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
- Author
-
Hada, Rishav, Gumma, Varun, de Wynter, Adrian, Diddee, Harshita, Ahmed, Mohamed, Choudhury, Monojit, Bali, Kalika, and Sitaram, Sunayana
- Subjects
Computer Science - Computation and Language - Abstract
Large Language Models (LLMs) excel in various Natural Language Processing (NLP) tasks, yet their evaluation, particularly in languages beyond the top $20$, remains inadequate due to existing benchmarks and metrics limitations. Employing LLMs as evaluators to rank or score other models' outputs emerges as a viable solution, addressing the constraints tied to human annotators and established benchmarks. In this study, we explore the potential of LLM-based evaluators, specifically GPT-4 in enhancing multilingual evaluation by calibrating them against $20$K human judgments across three text-generation tasks, five metrics, and eight languages. Our analysis reveals a bias in GPT4-based evaluators towards higher scores, underscoring the necessity of calibration with native speaker judgments, especially in low-resource and non-Latin script languages, to ensure accurate evaluation of LLM performance across diverse languages., Comment: Accepted to EACL 2024 findings
- Published
- 2023
40. Spatial quasiperiodic driving of a dissipative optical lattice and origin of directed Brillouin modes in a randomly diffusing cold atom cloud
- Author
-
Cubero, David, Jiang, Kefeng, Staron, Alexander, Scoggins, Casey, Wingert, Daniel, Dilyard, Ian, Oliver, Stone, and Bali, Samir
- Subjects
Quantum Physics ,Physics - Atomic Physics - Abstract
Atoms confined in a three-dimensional dissipative optical lattice oscillate inside potential wells, occasionally hopping to adjacent wells, thereby diffusing in all directions. Illumination by a weak probe beam modulates the lattice, yielding propagating atomic density waves, referred to as Brillouin modes which travel perpendicular to the direction of travel of the probe. The probe is made incident at a small angle relative to a lattice symmetry axis, yielding a driving potential perturbation whose spatial period is not a multiple of the period of the underlying optical potential, thus enabling exploration of the regime of space quasiperiodic drive. A theory, based on the Fourier decomposition of the current into its atomic density wave contributions, reveals that unlike the previously studied time quasiperiodic case, wherein a lattice driven by two incommensurate frequencies may exhibit abrupt suppression in directed current as the driving transitions from quasiperiodic to periodic, a spatial-quasiperiodically driven lattice exhibits no such abrupt response. Further, detailed modeling of spatial-quasiperiodically driven lattices reveals that directed propagation occurs not only as a consequence of velocity-matching between the propagating modulation and the average velocity of the atom oscillating inside a well as was previously reported in the literature, but also as a distinct consequence of a new mechanism, namely, frequency-matching between the modulation frequency and the oscillation frequencies. A systematic measurement of the transmitted probe spectra as a function of off-axis probe angle is presented, which is consistent with the velocity- and frequency-matching predictions from the detailed model., Comment: 12 pages, 7 figures
- Published
- 2023
41. Openness in education as a Praxis: From individual testimonials to collective voices
- Author
-
Bozkurt, Aras, Gjelsvik, Torunn, Adam, Taskeen, Asino, Tutaleni I, Atenas, Javiera, Bali, Maha, Blomgren, Constance, Bond, Melissa, Bonk, Curtis J, Brown, Mark, Burgos, Daniel, Conrad, Dianne, Costello, Eamon, Cronin, Catherine, Czerniewicz, Laura, Deepwell, Maren, Deimann, Markus, DeWaard, Helen J, Dousay, Tonia A, Ebner, Martin, Farrow, Robert, Gil-Jaurena, Ines, Havemann, Leo, Inamorato, Andreia, Irvine, Valerie, Karunanayaka, Shironica P, Kerres, Michael, Lambert, Sarah, Lee, Kyungmee, Makoe, Mpine, Marin, Victoria I, Mikroyannidis, Alexander, Mishra, Sanjaya, Naidu, Som, Nascimbeni, Fabio, Nichols, Mark, Olcott, Don, Ossiannilsson, Ebba, Otto, Daniel, Rodriguez, Brenda Cecilia Padilla, Paskevicius, Michael, Roberts, Verena, Saleem, Tooba, Schuwer, Robert, Sharma, Ramesh C, Stewart, Bonnie, Stracke, Christian M, Tait, Alan, Tlili, Ahmed, Ubachs, George, Weidlich, Joshua, Weller, Martin, Xiao, Junhong, and Zawacki-Richter, Olaf
- Published
- 2023
42. X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents
- Author
-
Moradshahi, Mehrad, Shen, Tianhao, Bali, Kalika, Choudhury, Monojit, de Chalendar, Gaël, Goel, Anmol, Kim, Sungkyun, Kodali, Prashant, Kumaraguru, Ponnurangam, Semmar, Nasredine, Semnani, Sina J., Seo, Jiwon, Seshadri, Vivek, Shrivastava, Manish, Sun, Michael, Yadavalli, Aditya, You, Chaobin, Xiong, Deyi, and Lam, Monica S.
- Subjects
Computer Science - Computation and Language - Abstract
Task-oriented dialogue research has mainly focused on a few popular languages like English and Chinese, due to the high dataset creation cost for a new language. To reduce the cost, we apply manual editing to automatically translated data. We create a new multilingual benchmark, X-RiSAWOZ, by translating the Chinese RiSAWOZ to 4 languages: English, French, Hindi, Korean; and a code-mixed English-Hindi language. X-RiSAWOZ has more than 18,000 human-verified dialogue utterances for each language, and unlike most multilingual prior work, is an end-to-end dataset for building fully-functioning agents. The many difficulties we encountered in creating X-RiSAWOZ led us to develop a toolset to accelerate the post-editing of a new language dataset after translation. This toolset improves machine translation with a hybrid entity alignment technique that combines neural with dictionary-based methods, along with many automated and semi-automated validation checks. We establish strong baselines for X-RiSAWOZ by training dialogue agents in the zero- and few-shot settings where limited gold data is available in the target language. Our results suggest that our translation and post-editing methodology and toolset can be used to create new high-quality multilingual dialogue agents cost-effectively. Our dataset, code, and toolkit are released open-source., Comment: Accepted by ACL 2023 Findings
- Published
- 2023
43. Shifting into Reverse: Turkish Constitutionalism under the AKP
- Author
-
Bâli, Aslı
- Published
- 2016
44. Breaking Language Barriers with a LEAP: Learning Strategies for Polyglot LLMs
- Author
-
Nambi, Akshay, Balloli, Vaibhav, Ranjit, Mercy, Ganu, Tanuja, Ahuja, Kabir, Sitaram, Sunayana, and Bali, Kalika
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Large language models (LLMs) are at the forefront of transforming numerous domains globally. However, their inclusivity and effectiveness remain limited for non-Latin scripts and low-resource languages. This paper tackles the imperative challenge of enhancing the multilingual performance of LLMs, specifically focusing on Generative models. Through systematic investigation and evaluation of diverse languages using popular question-answering (QA) datasets, we present novel techniques that unlock the true potential of LLMs in a polyglot landscape. Our approach encompasses three key strategies that yield remarkable improvements in multilingual proficiency. First, by meticulously optimizing prompts tailored for polyglot LLMs, we unlock their latent capabilities, resulting in substantial performance boosts across languages. Second, we introduce a new hybrid approach that synergizes GPT generation with multilingual embeddings and achieves significant multilingual performance improvement on critical tasks like QA and retrieval. Finally, to further propel the performance of polyglot LLMs, we introduce a novel learning algorithm that dynamically selects the optimal prompt strategy, LLM model, and embeddings per query. This dynamic adaptation maximizes the efficacy of LLMs across languages, outperforming best static and random strategies. Our results show substantial advancements in multilingual understanding and generation across a diverse range of languages.
- Published
- 2023
45. Prompt Evolution for Generative AI: A Classifier-Guided Approach
- Author
-
Wong, Melvin, Ong, Yew-Soon, Gupta, Abhishek, Bali, Kavitesh K., and Chen, Caishun
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Neural and Evolutionary Computing ,I.2 - Abstract
Synthesis of digital artifacts conditioned on user prompts has become an important paradigm facilitating an explosion of use cases with generative AI. However, such models often fail to connect the generated outputs and desired target concepts/preferences implied by the prompts. Current research addressing this limitation has largely focused on enhancing the prompts before output generation or improving the model's performance up front. In contrast, this paper conceptualizes prompt evolution, imparting evolutionary selection pressure and variation during the generative process to produce multiple outputs that satisfy the target concepts/preferences better. We propose a multi-objective instantiation of this broader idea that uses a multi-label image classifier-guided approach. The predicted labels from the classifiers serve as multiple objectives to optimize, with the aim of producing diversified images that meet user preferences. A novelty of our evolutionary algorithm is that the pre-trained generative model gives us implicit mutation operations, leveraging the model's stochastic generative capability to automate the creation of Pareto-optimized images more faithful to user preferences., Comment: To appear in Proceedings of the 2023 IEEE Conference on Artificial Intelligence (CAI'23)
- Published
- 2023
- Full Text
- View/download PDF
46. Octet baryon isovector charges from $N_f = 2 + 1$ lattice QCD
- Author
-
Bali, Gunnar S., Collins, Sara, Heybrock, Simon, Löffler, Marius, Rödl, Rudolf, Söldner, Wolfgang, and Weishäupl, Simon
- Subjects
High Energy Physics - Lattice ,High Energy Physics - Phenomenology - Abstract
We determine the axial, scalar and tensor isovector charges of the nucleon, sigma and cascade baryons as well as the difference between the up and down quark masses, $m_u-m_d$. We employ gauge ensembles with $N_f=2+1$ non-perturbatively improved Wilson fermions at six values of the lattice spacing in the range $a\approx (0.039 - 0.098) \,$fm, generated by the Coordinated Lattice Simulations (CLS) effort. The pion mass $M_\pi$ ranges from around $430 \, $MeV down to a near physical value of $130 \, $MeV and the linear spatial lattice extent $L$ varies from $6.5\,M_{\pi}^{-1}$ to $3.0\,M_{\pi}^{-1}$, where $L M_\pi \geq 4$ for the majority of the ensembles. This allows us to perform a controlled interpolation/extrapolation of the charges to the physical mass point in the infinite volume and continuum limit. Investigating SU(3) flavour symmetry, we find moderate symmetry breaking effects for the axial charges at the physical quark mass point, while no significant effects are found for the other charges within current uncertainties., Comment: 47 pages, 47 figures: v2: minor typos corrected, references added, some figures updated; v3: updated Fig. 11, version published in PRD
- Published
- 2023
- Full Text
- View/download PDF
47. MEGA: Multilingual Evaluation of Generative AI
- Author
-
Ahuja, Kabir, Diddee, Harshita, Hada, Rishav, Ochieng, Millicent, Ramesh, Krithika, Jain, Prachi, Nambi, Akshay, Ganu, Tanuja, Segal, Sameer, Axmed, Maxamed, Bali, Kalika, and Sitaram, Sunayana
- Subjects
Computer Science - Computation and Language - Abstract
Generative AI models have shown impressive performance on many Natural Language Processing tasks such as language understanding, reasoning, and language generation. An important question being asked by the AI community today is about the capabilities and limits of these models, and it is clear that evaluating generative AI is very challenging. Most studies on generative LLMs have been restricted to English and it is unclear how capable these models are at understanding and generating text in other languages. We present the first comprehensive benchmarking of generative LLMs - MEGA, which evaluates models on standard NLP benchmarks, covering 16 NLP datasets across 70 typologically diverse languages. We compare the performance of generative LLMs including Chat-GPT and GPT-4 to State of the Art (SOTA) non-autoregressive models on these tasks to determine how well generative models perform compared to the previous generation of LLMs. We present a thorough analysis of the performance of models across languages and tasks and discuss challenges in improving the performance of generative LLMs on low-resource languages. We create a framework for evaluating generative LLMs in the multilingual setting and provide directions for future progress in the field., Comment: EMNLP 2023
- Published
- 2023
48. Provenance for Lattice QCD workflows
- Author
-
Auge, Tanja, Bali, Gunnar, Klettke, Meike, Ludäscher, Bertram, Söldner, Wolfgang, Weishäupl, Simon, and Wettig, Tilo
- Subjects
High Energy Physics - Lattice ,Computer Science - Computational Engineering, Finance, and Science ,Computer Science - Databases - Abstract
We present a provenance model for the generic workflow of numerical Lattice Quantum Chromodynamics (QCD) calculations, which constitute an important component of particle physics research. These calculations are carried out on the largest supercomputers worldwide with data in the multi-PetaByte range being generated and analyzed. In the Lattice QCD community, a custom metadata standard (QCDml) that includes certain provenance information already exists for one part of the workflow, the so-called generation of configurations. In this paper, we follow the W3C PROV standard and formulate a provenance model that includes both the generation part and the so-called measurement part of the Lattice QCD workflow. We demonstrate the applicability of this model and show how the model can be used to answer some provenance-related research questions. However, many important provenance questions in the Lattice QCD community require extensions of this provenance model. To this end, we propose a multi-layered provenance approach that combines prospective and retrospective elements.
- Published
- 2023
- Full Text
- View/download PDF
49. Nooks: Social Spaces to Lower Hesitations in Interacting with New People at Work
- Author
-
Bali, Shreya, Khadpe, Pranav, Kaufman, Geoff, and Kulkarni, Chinmay
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Initiating conversations with new people at work is often intimidating because of uncertainty about their interests. People worry others may reject their attempts to initiate conversation or that others may not enjoy the conversation. We introduce a new system, Nooks, built on Slack, that reduces fear of social evaluation by enabling individuals to initiate any conversation as a nook -- a conversation room that identifies its topic, but not its creator. Automatically convening others interested in the nook, Nooks further reduces fears of social evaluation by guaranteeing individuals in advance that others they are about to interact with are interested in the conversation. In a multi-month deployment with participants in a summer research program, Nooks provided participants with non-threatening and inclusive interaction opportunities, and ambient awareness, leading to new interactions online and offline. Our results demonstrate how intentionally designed social spaces can reduce fears of social evaluation and catalyze new workplace connections., Comment: CHI 2023
- Published
- 2023
50. On Pisot Units and the Fundamental Domain of Galois Extensions of $\mathbb{Q}$
- Author
-
Porter, Christian, Bali, Alexandre, and Leibak, Alar
- Subjects
Mathematics - Number Theory ,Mathematics - Rings and Algebras - Abstract
In this paper, we present two main results. Let $K$ be a number field that is Galois over $\mathbb{Q}$ with degree $r+2s$, where $r$ is the number of real embeddings and $s$ is the number of pairs of complex embeddings. The first result states that the number of facets of the reduction domain (and therefore the fundamental domain) of $K$ is no greater than $O\left(\left(\frac{1}{2}(r+s-1)^\delta(r+s)^{1+\frac{1}{2(r+s-1)}}\right)^{r+s-1}\right) \cdot\left(e^{1+\frac1{2e}}\right)^{r+s}(r+s)!$, where $\delta=1/2$ if $r+s \leq 11$ or $\delta=1$ otherwise. The second result states that there exists a linear time algorithm to reduce a totally positive unary form $axx^*$, such that the new totally positive element $a^\prime$ that is equivalent to $a$ has trace no greater than a constant multiplied by the integer minimum of the trace-form $\trace(axx^*)$, where the constant is determined by the shortest Pisot unit in the number field. This may have applications in ring-based cryptography. Finally, we show that the Weil height of the shortest Pisot unit in the number field can be no greater than $\frac{1}{[K:\mathbb{Q}]}\left(\frac{\gamma}{2}(r+s-1)^{\delta-\frac{1}{2(r+s-1)}}R_K^{\frac{1}{r+s-1}}+(r+s-1)\epsilon\right)$, where $R_K$ denotes the regulator of $K$, $\gamma=1$ if $K$ is totally real or $2$ otherwise, and $\epsilon>0$ is some arbitrarily small constant., Comment: 15 pages, including abstract and bibliography
- Published
- 2023
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.