Author: "Blaise, A." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Blaise, A."' showing total 83,631 results

Start Over Author "Blaise, A."

83,631 results on '"Blaise, A."'

1. Accelerated Training through Iterative Gradient Propagation Along the Residual Path

Author: Fagnou, Erwan, Caillon, Paul, Delattre, Blaise, and Allauzen, Alexandre
Subjects: Computer Science - Machine Learning
Abstract: Despite being the cornerstone of deep learning, backpropagation is criticized for its inherent sequentiality, which can limit the scalability of very deep models. Such models faced convergence issues due to vanishing gradient, later resolved using residual connections. Variants of these are now widely used in modern architecture. However, the computational cost of backpropagation remains a major burden, accounting for most of the training time. Taking advantage of residual-like architectural designs, we introduce Highway backpropagation, a parallelizable iterative algorithm that approximates backpropagation, by alternatively i) accumulating the gradient estimates along the residual path, and ii) backpropagating them through every layer in parallel. This algorithm is naturally derived from a decomposition of the gradient as the sum of gradients flowing through all paths and is adaptable to a diverse set of common architectures, ranging from ResNets and Transformers to recurrent neural networks. Through an extensive empirical study on a large selection of tasks and models, we evaluate Highway-BP and show that major speedups can be achieved with minimal performance degradation., Comment: 20 pages, 6 figures, accepted to ICLR 2025
Published: 2025

2. Grain-size dependence of plastic-brittle transgranular fracture

Author: Scherer, Jean-Michel, Ramesh, Mythreyi, Bourdin, Blaise, and Bhattacharya, Kaushik
Subjects: Condensed Matter - Materials Science
Abstract: The role of grain size in determining fracture toughness in metals is incompletely understood with apparently contradictory experimental observations. We study this grain-size dependence computationally by building a model that combines the phase-field formulation of fracture mechanics with dislocation density-based crystal plasticity. We apply the model to cleavage fracture of body-centered cubic materials in plane strain conditions, and find non-monotonic grain-size dependence of plastic-brittle transgranular fracture. We find two mechanisms at play. The first is the nucleation of failure due to cross-slip in critically located grains within transgranular band of localized deformation, and this follows the classical Hall-Petch law that predicts a higher failure stress for smaller grains. The second is the resistance to the propagation of a mode I crack, where grain boundaries can potentially pin a crack, and this follows an inverse Hall-Petch law with higher toughness for larger grains. The result of the competition between the two mechanisms gives rise to non-monotonic behavior and reconciles the apparently contradictory experimental observations.
Published: 2025

3. Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation

Author: Cruz, Jan Christian Blaise and Aji, Alham Fikri
Subjects: Computer Science - Computation and Language
Abstract: In this paper, we propose the use of simple knowledge distillation to produce smaller and more efficient single-language transformers from Massively Multilingual Transformers (MMTs) to alleviate tradeoffs associated with the use of such in low-resource settings. Using Tagalog as a case study, we show that these smaller single-language models perform on-par with strong baselines in a variety of benchmark tasks in a much more efficient manner. Furthermore, we investigate additional steps during the distillation process that improves the soft-supervision of the target language, and provide a number of analyses and ablations to show the efficacy of the proposed method., Comment: LoResLM Workshop @ COLING 2025
Published: 2025

4. Growing Spines Ad Infinitum

Author: Boissonneau, Blaise, De Mase, Anna, Jahnke, Franziska, and Touchard, Pierre
Subjects: Mathematics - Logic, Mathematics - Group Theory, 03C60, 03C64, 06F20 (Primary), 12J20, 12L12 (Secondary)
Abstract: We show that every non-trivial ordered abelian group $G$ is augmentable by infinite elements, i.e., we have $G\preccurlyeq H\oplus G$ for some non-trivial ordered abelian group $H$. As an application, we show that when $k$ is a field of characteristic 0, then $k$ is not $t$-henselian if and only if all henselian valuations with residue field $k$ are ($\emptyset$-)definable., Comment: 17 pages
Published: 2025

5. Cost-effective time-stretch terahertz recorders using 1550 nm probes

Author: Hanoun, Christelle, Roussel, Eléonore, Szwaj, Christophe, Evain, Clément, Parquier, Marc Le, Brubach, Jean-Blaise, Hubert, Nicolas, Labat, Marie, Roy, Pascale, Tordeux, Marie-Agnès, and Bielawski, Serge
Subjects: Physics - Optics, Physics - Accelerator Physics
Abstract: Time-stretch electro-optic detection allows THz waveforms to be recorded in single-shot, up to Megahertz acquisition rates. This capability is required in accelerator physics, and also opens new applications in table-top THz time-domain spectroscopy. However, the technique has also been notoriously known for the need of high speed -- and high cost -- ADCs or oscilloscopes for the readout. Furthermore, the resulting cost considerably increases with the number of samples that is needed per THz waveform, an issue that has severely limited the widespread of the technique so far. In this article, we show that particularly cost-effective designs can be obtained by using 1550~nm probes. We present the performances of an experimental design, that uses only standard components (including dispersive devices), and a standard commercial probe laser without additional pulse broadening. In these conditions, our time-stretch system could already record coherent THz pulses at the SOLEIL synchrotron radiation facility, over an unprecedented number of samples, using oscilloscopes and ADC boards with only 1-3 GHz bandwidth., Comment: Submitted to Optics Express
Published: 2025

6. Exponential Tethers for Accelerated Space Elevator Deployment

Author: Gassend, Blaise
Subjects: Physics - Applied Physics, Physics - Space Physics
Abstract: An exponential space elevator is a space elevator with a tether cross-section that varies exponentially with altitude. With such an elevator it is possible to reel in tether material at one end of the elevator while reeling out at the other end, without changing the overall taper profile. I show how to use this property to build up or clone a space elevator much more efficiently than with standard climber-based methods., Comment: This paper was first published in Proc. of the 3rd International Space Elevator Conference, June 2004, reprinted on arXiv by permission of Bradley Edwards former director at ISR
Published: 2024

7. Adaptive Hierarchical Graph Cut for Multi-granularity Out-of-distribution Detection

Author: Fang, Xiang, Easwaran, Arvind, Genest, Blaise, and Suganthan, Ponnuthurai Nagaratnam
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper focuses on a significant yet challenging task: out-of-distribution detection (OOD detection), which aims to distinguish and reject test samples with semantic shifts, so as to prevent models trained on in-distribution (ID) data from producing unreliable predictions. Although previous works have made decent success, they are ineffective for real-world challenging applications since these methods simply regard all unlabeled data as OOD data and ignore the case that different datasets have different label granularity. For example, "cat" on CIFAR-10 and "tabby cat" on Tiny-ImageNet share the same semantics but have different labels due to various label granularity. To this end, in this paper, we propose a novel Adaptive Hierarchical Graph Cut network (AHGC) to deeply explore the semantic relationship between different images. Specifically, we construct a hierarchical KNN graph to evaluate the similarities between different images based on the cosine similarity. Based on the linkage and density information of the graph, we cut the graph into multiple subgraphs to integrate these semantics-similar samples. If the labeled percentage in a subgraph is larger than a threshold, we will assign the label with the highest percentage to unlabeled images. To further improve the model generalization, we augment each image into two augmentation versions, and maximize the similarity between the two versions. Finally, we leverage the similarity score for OOD detection. Extensive experiments on two challenging benchmarks (CIFAR- 10 and CIFAR-100) illustrate that in representative cases, AHGC outperforms state-of-the-art OOD detection methods by 81.24% on CIFAR-100 and by 40.47% on CIFAR-10 in terms of "FPR95", which shows the effectiveness of our AHGC.
Published: 2024

8. Your Data Is Not Perfect: Towards Cross-Domain Out-of-Distribution Detection in Class-Imbalanced Data

Author: Fang, Xiang, Easwaran, Arvind, Genest, Blaise, and Suganthan, Ponnuthurai Nagaratnam
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Previous OOD detection systems only focus on the semantic gap between ID and OOD samples. Besides the semantic gap, we are faced with two additional gaps: the domain gap between source and target domains, and the class-imbalance gap between different classes. In fact, similar objects from different domains should belong to the same class. In this paper, we introduce a realistic yet challenging setting: class-imbalanced cross-domain OOD detection (CCOD), which contains a well-labeled (but usually small) source set for training and conducts OOD detection on an unlabeled (but usually larger) target set for testing. We do not assume that the target domain contains only OOD classes or that it is class-balanced: the distribution among classes of the target dataset need not be the same as the source dataset. To tackle this challenging setting with an OOD detection system, we propose a novel uncertainty-aware adaptive semantic alignment (UASA) network based on a prototype-based alignment strategy. Specifically, we first build label-driven prototypes in the source domain and utilize these prototypes for target classification to close the domain gap. Rather than utilizing fixed thresholds for OOD detection, we generate adaptive sample-wise thresholds to handle the semantic gap. Finally, we conduct uncertainty-aware clustering to group semantically similar target samples to relieve the class-imbalance gap. Extensive experiments on three challenging benchmarks demonstrate that our proposed UASA outperforms state-of-the-art methods by a large margin., Comment: Accepted by Expert Systems with Applications
Published: 2024

9. State Frequency Estimation for Anomaly Detection

Author: Cao, Clinton, Blaise, Agathe, Panichella, Annibale, and Verwer, Sicco
Subjects: Computer Science - Machine Learning, Computer Science - Cryptography and Security
Abstract: Many works have studied the efficacy of state machines for detecting anomalies within NetFlows. These works typically learn a model from unlabeled data and compute anomaly scores for arbitrary traces based on their likelihood of occurrence or how well they fit within the model. However, these methods do not dynamically adapt their scores based on the traces seen at test time. This becomes a problem when an adversary produces seemingly common traces in their attack, causing the model to miss the detection by assigning low anomaly scores. We propose SEQUENT, a new approach that uses the state visit frequency to adapt its scoring for anomaly detection dynamically. SEQUENT subsequently uses the scores to generate root causes for anomalies. These allow the grouping of alarms and simplify the analysis of anomalies. Our evaluation of SEQUENT on three NetFlow datasets indicates that our approach outperforms existing methods, demonstrating its effectiveness in detecting anomalies., Comment: 9 pages
Published: 2024

10. Down with the Hierarchy: The 'H' in HNSW Stands for 'Hubs'

Author: Munyampirwa, Blaise, Lakshman, Vihan, and Coleman, Benjamin
Subjects: Computer Science - Machine Learning, Computer Science - Databases, Computer Science - Information Retrieval
Abstract: Driven by recent breakthrough advances in neural representation learning, approximate near-neighbor (ANN) search over vector embeddings has emerged as a critical computational workload. With the introduction of the seminal Hierarchical Navigable Small World (HNSW) algorithm, graph-based indexes have established themseves as the overwhelmingly dominant paradigm for efficient and scalable ANN search. As the name suggests, HNSW searches a layered hierarchical graph to quickly identify neighborhoods of similar points to a given query vector. But is this hierarchy even necessary? A rigorous experimental analysis to answer this question would provide valuable insights into the nature of algorithm design for ANN search and motivate directions for future work in this increasingly crucial domain. To that end, we conduct an extensive benchmarking study covering more large-scale datasets than prior investigations of this question. We ultimately find that a flat graph retains all of the benefits of HNSW on high-dimensional datasets, with latency and recall performance essentially \emph{identical} to the original algorithm but with less memory overhead. Furthermore, we go a step further and study \emph{why} the hierarchy of HNSW provides no benefit in high dimensions, hypothesizing that navigable small world graphs contain a well-connected, frequently traversed ``highway" of hub nodes that maintain the same purported function as the hierarchical layers. We present compelling empirical evidence that the \emph{Hub Highway Hypothesis} holds for real datasets and investigate the mechanisms by which the highway forms. The implications of this hypothesis may also provide future research directions in developing enhancements to graph-based ANN search., Comment: 10 pages
Published: 2024

11. Exploring the Changing Modes of Learning and Teaching in Campus-Based Curricula during and Post-COVID-19

Author: Aisling Keane, Kathyrn McFerran, Blaise Acton, Samantha Taylor, and Declan McLaughlin
Abstract: The rise in technology-rich learning environments is reflective of a global trend in higher education (HE), recently accelerated because of necessary digital teaching and assessment practices embraced during the COVID-19 pandemic. This qualitative study facilitated through focus groups and an interview explores the teaching and learning experiences of tertiary level students in the COVID-19 era. Data from 24 students based within a UK Higher Education Institution highlights how an expanded digital environment can optimise conditions for some students to independently practise and apply what they are learning at their own pace. Digitally enhanced opportunities to interact with teaching staff and learning resources also increased the options for these students to experience themselves as competent members of the HE community. This was particularly relevant for first-year students new to the processes and practices of tertiary education. In contrast, third year students with more experience of HE appeared less reliant on the provision of online learning resources. Participants also identified some potential problems associated with the enhanced flexibility of online teaching and learning resources in relation to students' ability to be self-regulated. This paper rationalises the need for educators and educational and learning developers who teach and undertake scholarship in teaching and learning to consider the sociocultural context of the student and their learning environment when designing teaching activities and curricula. The data presented here highlight the need for a clearly defined framework to underpin the integration of digital technologies with on-campus activities.
Published: 2024

12. Modeling electricity generation and consumption in cameroon

Author: Fombuwing, Blaise and Ersoy, Neyre Tekbiyik
Published: 2024

13. Systematic design of compliant morphing structures: a phase-field approach

Author: Shabani, Jamal, Bhattacharya, Kaushik, and Bourdin, Blaise
Subjects: Mathematics - Numerical Analysis, Mathematics - Optimization and Control, 74P10, 74P15, 49N45
Abstract: We investigate the systematic design of compliant morphing structures composed of materials reacting to an external stimulus. We add a perimeter penalty term to ensure existence of solutions. We propose a phase-field approximation of this sharp interface problem, prove its convergence as the regularization length approaches 0 and present an efficient numerical implementation. We illustrate the strengths of our approach through a series of numerical examples.
Published: 2024

14. Uncovering thermodynamic origin of counterflow and coflow instabilities in miscible binary superfluids

Author: An, Yuping, Gouteraux, Blaise, and Li, Li
Subjects: High Energy Physics - Theory, Condensed Matter - Quantum Gases, General Relativity and Quantum Cosmology
Abstract: In this paper, we explore instabilities in binary superfluids with a nonvanishing relative superflow, particularly focusing on counterflow and coflow instabilities. We extend recent results on the thermodynamic origin of finite superflow instabilities in single-component superfluids to binary systems and derive a criterion for the onset of instability through a hydrodynamic analysis. To verify this result, we utilize both the Gross-Pitaevskii equation (GPE) for weakly interacting Bose-Einstein condensates (BEC) and a holographic binary superfluid model, which naturally incorporates strong coupling, finite temperature, and dissipation. We find that the counterflow and coflow instabilities in binary superfluids are all essentially thermodynamic. Except the one due to order competing via global thermodynamic instability, the others are caused by an eigenvalue of the free energy Hessian diverging and changing sign. We also observe that the critical velocities of these instabilities follow a general scaling law related to the interaction strength between superfluid components. The nonlinear stages of the instabilities are also studied by full time evolution, where vortex dynamics is found to play a significant role, resulting in the reduction of superfluid velocity back to a stable phase., Comment: 26 pages, 10 figures
Published: 2024

15. Can LLMs make trade-offs involving stipulated pain and pleasure states?

Author: Keeling, Geoff, Street, Winnie, Stachaczyk, Martyna, Zakharova, Daria, Comsa, Iulia M., Sakovych, Anastasiya, Logothetis, Isabella, Zhang, Zejia, Arcas, Blaise Agüera y, and Birch, Jonathan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: Pleasure and pain play an important role in human decision making by providing a common currency for resolving motivational conflicts. While Large Language Models (LLMs) can generate detailed descriptions of pleasure and pain experiences, it is an open question whether LLMs can recreate the motivational force of pleasure and pain in choice scenarios - a question which may bear on debates about LLM sentience, understood as the capacity for valenced experiential states. We probed this question using a simple game in which the stated goal is to maximise points, but where either the points-maximising option is said to incur a pain penalty or a non-points-maximising option is said to incur a pleasure reward, providing incentives to deviate from points-maximising behaviour. Varying the intensity of the pain penalties and pleasure rewards, we found that Claude 3.5 Sonnet, Command R+, GPT-4o, and GPT-4o mini each demonstrated at least one trade-off in which the majority of responses switched from points-maximisation to pain-minimisation or pleasure-maximisation after a critical threshold of stipulated pain or pleasure intensity is reached. LLaMa 3.1-405b demonstrated some graded sensitivity to stipulated pleasure rewards and pain penalties. Gemini 1.5 Pro and PaLM 2 prioritised pain-avoidance over points-maximisation regardless of intensity, while tending to prioritise points over pleasure regardless of intensity. We discuss the implications of these findings for debates about the possibility of LLM sentience.
Published: 2024

16. Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Sense

Author: Cahyawijaya, Samuel, Zhang, Ruochen, Lovenia, Holy, Cruz, Jan Christian Blaise, Gilbert, Elisa, Nomoto, Hiroki, and Aji, Alham Fikri
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Multilingual large language models (LLMs) have gained prominence, but concerns arise regarding their reliability beyond English. This study addresses the gap in cross-lingual semantic evaluation by introducing a novel benchmark for cross-lingual sense disambiguation, StingrayBench. In this paper, we demonstrate using false friends -- words that are orthographically similar but have completely different meanings in two languages -- as a possible approach to pinpoint the limitation of cross-lingual sense disambiguation in LLMs. We collect false friends in four language pairs, namely Indonesian-Malay, Indonesian-Tagalog, Chinese-Japanese, and English-German; and challenge LLMs to distinguish the use of them in context. In our analysis of various models, we observe they tend to be biased toward higher-resource languages. We also propose new metrics for quantifying the cross-lingual sense bias and comprehension based on our benchmark. Our work contributes to developing more diverse and inclusive language modeling, promoting fairer access for the wider multilingual community.
Published: 2024

17. Multi-agent cooperation through learning-aware policy gradients

Author: Meulemans, Alexander, Kobayashi, Seijin, von Oswald, Johannes, Scherrer, Nino, Elmoznino, Eric, Richards, Blake, Lajoie, Guillaume, Arcas, Blaise Agüera y, and Sacramento, João
Subjects: Computer Science - Artificial Intelligence
Abstract: Self-interested individuals often fail to cooperate, posing a fundamental challenge for multi-agent learning. How can we achieve cooperation among self-interested, independent learning agents? Promising recent work has shown that in certain tasks cooperation can be established between learning-aware agents who model the learning dynamics of each other. Here, we present the first unbiased, higher-derivative-free policy gradient algorithm for learning-aware reinforcement learning, which takes into account that other agents are themselves learning through trial and error based on multiple noisy trials. We then leverage efficient sequence models to condition behavior on long observation histories that contain traces of the learning dynamics of other agents. Training long-context policies with our algorithm leads to cooperative behavior and high returns on standard social dilemmas, including a challenging environment where temporally-extended action coordination is required. Finally, we derive from the iterated prisoner's dilemma a novel explanation for how and when cooperation arises among self-interested learning-aware agents.
Published: 2024

18. WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Author: Winata, Genta Indra, Hudi, Frederikus, Irawan, Patrick Amadeus, Anugraha, David, Putri, Rifki Afina, Wang, Yutong, Nohejl, Adam, Prathama, Ubaidillah Ariq, Ousidhoum, Nedjma, Amriani, Afifa, Rzayev, Anar, Das, Anirban, Pramodya, Ashmari, Adila, Aulia, Wilie, Bryan, Mawalim, Candy Olivia, Cheng, Ching Lam, Abolade, Daud, Chersoni, Emmanuele, Santus, Enrico, Ikhwantri, Fariz, Kuwanto, Garry, Zhao, Hanyang, Wibowo, Haryo Akbarianto, Lovenia, Holy, Cruz, Jan Christian Blaise, Putra, Jan Wira Gotama, Myung, Junho, Susanto, Lucky, Machin, Maria Angelica Riera, Zhukova, Marina, Anugraha, Michael, Adilazuarda, Muhammad Farid, Santosa, Natasha, Limkonchotiwat, Peerat, Dabre, Raj, Audino, Rio Alexander, Cahyawijaya, Samuel, Zhang, Shi-Xiong, Salim, Stephanie Yulia, Zhou, Yi, Gui, Yinxuan, Adelani, David Ifeoluwa, Lee, En-Shiun Annie, Okada, Shogo, Purwarianti, Ayu, Aji, Alham Fikri, Watanabe, Taro, Wijaya, Derry Tanti, Oh, Alice, and Ngo, Chong-Wah
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Vision Language Models (VLMs) often struggle with culture-specific knowledge, particularly in languages other than English and in underrepresented cultural contexts. To evaluate their understanding of such knowledge, we introduce WorldCuisines, a massive-scale benchmark for multilingual and multicultural, visually grounded language understanding. This benchmark includes a visual question answering (VQA) dataset with text-image pairs across 30 languages and dialects, spanning 9 language families and featuring over 1 million data points, making it the largest multicultural VQA benchmark to date. It includes tasks for identifying dish names and their origins. We provide evaluation datasets in two sizes (12k and 60k instances) alongside a training dataset (1 million instances). Our findings show that while VLMs perform better with correct location context, they struggle with adversarial contexts and predicting specific regional cuisines and languages. To support future research, we release a knowledge base with annotated food entries and images along with the VQA data., Comment: Preprint
Published: 2024

19. Chain and Causal Attention for Efficient Entity Tracking

Author: Fagnou, Erwan, Caillon, Paul, Delattre, Blaise, and Allauzen, Alexandre
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, I.2.7
Abstract: This paper investigates the limitations of transformers for entity-tracking tasks in large language models. We identify a theoretical constraint, showing that transformers require at least $\log_2 (n+1)$ layers to handle entity tracking with $n$ state changes. To address this issue, we propose an efficient and frugal enhancement to the standard attention mechanism, enabling it to manage long-term dependencies more efficiently. By considering attention as an adjacency matrix, our model can track entity states with a single layer. Empirical results demonstrate significant improvements in entity tracking datasets while keeping competitive performance on standard natural language modeling. Our modified attention allows us to achieve the same performance with drastically fewer layers. Additionally, our enhanced mechanism reveals structured internal representations of attention. Extensive experiments on both toy and complex datasets validate our approach. Our contributions include theoretical insights, an improved attention mechanism, and empirical validation., Comment: 15 pages, 5 figures, EMNLP 2024 Main
Published: 2024
Full Text: View/download PDF

20. Methods for Mitigating Uncertainty in Real-Time Operations of a Connected Microgrid

Author: Panda, Subrat Prasad, Genest, Blaise, Easwaran, Arvind, Rigo-Mariani, Rémy, and Lin, PengFeng
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: In this paper, we compare the effectiveness of a two-stage control strategy for the energy management system (EMS) of a grid-connected microgrid under uncertain solar irradiance and load demand using a real-world dataset from an island in Southeast Asia (SEA). The first stage computes a day-ahead commitment for power profile exchanged with the main grid, while the second stage focuses on real-time controls to minimize the system operating cost. Given the challenges in accurately forecasting solar irradiance for a long time horizon, scenario-based stochastic programming (SP) is considered for the first stage. For the second stage, as the most recent weather conditions can be used, several methodologies to handle the uncertainties are investigated, including: (1) the rule-based method historically deployed on EMS, (2) model predictive controller (MPC) using either an explicit forecast or scenario-based stochastic forecast, and (3) Deep Reinforcement Learning (DRL) computing its own implicit forecast through a distribution of costs. Performances of these methodologies are compared in terms of precision with a reference control assuming perfect forecast -- i.e. representing the minimal achievable operation cost in theory. Obtained results show that MPC with a stochastic forecast outperforms MPC with a simple deterministic prediction. This suggests that using an explicit forecast, even within a short time window, is challenging. Using weather conditions can, however, be more efficient, as demonstrated by DRL (with implicit forecast), outperforming MPC with stochastic forecast by 1.3\%., Comment: Published in Sustainable Energy, Grids and Networks 2024
Published: 2024
Full Text: View/download PDF

21. Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action Detection

Author: Fang, Xiang, Easwaran, Arvind, and Genest, Blaise
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Out-of-distribution (OOD) detection targets to detect and reject test samples with semantic shifts, to prevent models trained on in-distribution (ID) dataset from producing unreliable predictions. Existing works only extract the appearance features on image datasets, and cannot handle dynamic multimedia scenarios with much motion information. Therefore, we target a more realistic and challenging OOD detection task: OOD action detection (ODAD). Given an untrimmed video, ODAD first classifies the ID actions and recognizes the OOD actions, and then localizes ID and OOD actions. To this end, in this paper, we propose a novel Uncertainty-Guided Appearance-Motion Association Network (UAAN), which explores both appearance features and motion contexts to reason spatial-temporal inter-object interaction for ODAD.Firstly, we design separate appearance and motion branches to extract corresponding appearance-oriented and motion-aspect object representations. In each branch, we construct a spatial-temporal graph to reason appearance-guided and motion-driven inter-object interaction. Then, we design an appearance-motion attention module to fuse the appearance and motion features for final action detection. Experimental results on two challenging datasets show that UAAN beats state-of-the-art methods by a significant margin, illustrating its effectiveness., Comment: Accepted by MIPR 2024
Published: 2024

22. Sexisms and Un/welcome Diversity in Australian Universities

Author: Emily M. Gray, Pasley, Mindy Blaise, Jacqueline Ullman, and Emma Fishwick
Abstract: This paper offers an analysis of data from the second phase of a project entitled "Understanding and Addressing Everyday Sexisms in Australian Universities," which involved interviewing key stakeholders with an understanding of and/or experiences of 'Everyday Sexisms' within the academy. The paper demonstrates how women understand themselves as inherently unwelcome within higher education in Australia, and illustrates how this manifests through experiences, complaints procedures and seemingly banal everyday gendered and racilaised interactions. The authors show how complaints procedures often operate to further harass women who have experienced sexist harassment at work. The paper concludes by considering how the shared experiences of minoritised people within universities can pave the way for new ways of understanding diversity and working together to co-create a more equitable Higher Education.
Published: 2024
Full Text: View/download PDF

23. Evaluating the Environmental Impact of Chemistry Education: A Pilot Extracurricular Activity for Undergraduate Students

Author: Ida Helena de Raad, Michel Iltes, Olga Kosjakova, Anni Meerholz, Andrea Portocarrero Gamarra, Jeanne Tilquin, Stijn Helsloot, Renaud Blaise Jolivet, Gavin Phillips, Jurica Bauer, and Katarzyna Maria Dziubinska-Kuehn
Abstract: Nowadays, discussing global environmental issues has become regularly included in STEM-based higher education, emphasizing the importance of the youth in finding "ad hoc" solutions for the climate crisis. One of the most commonly applied strategies is promoting sustainable choices among students, especially linked to their lifestyle choices, or implementing small changes to aim for large-scale cumulative effects. Herein, a learning and research activity designed for undergraduate students is presented, aimed at raising their awareness of a less discussed type of global warming contributors, namely greenhouse gases (GHGs), being a fundamental example of gaseous waste produced at universities as part of STEM education programs. In this extracurricular project, a multidisciplinary group of students performed a series of real-time measurements of exemplary GHGs emissions during the chemistry-oriented practical courses taken by their peers. As a result, qualitative and quantitative information about the gaseous waste produced in the student laboratories was obtained and linked to various laboratory activities, raising student consciousness about the frequently neglected gases produced in the laboratory that also contribute to the environmental crisis. This research activity enables students to apply analytical chemistry to evaluate their own chemical footprint in the laboratory. Furthermore, this project aims to illustrate the importance of engaging students in extracurricular learning and research activities.
Published: 2024
Full Text: View/download PDF

24. Vanilla Gradient Descent for Oblique Decision Trees

Author: Panda, Subrat Prasad, Genest, Blaise, Easwaran, Arvind, and Suganthan, Ponnuthurai Nagaratnam
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Decision Trees (DTs) constitute one of the major highly non-linear AI models, valued, e.g., for their efficiency on tabular data. Learning accurate DTs is, however, complicated, especially for oblique DTs, and does take a significant training time. Further, DTs suffer from overfitting, e.g., they proverbially "do not generalize" in regression tasks. Recently, some works proposed ways to make (oblique) DTs differentiable. This enables highly efficient gradient-descent algorithms to be used to learn DTs. It also enables generalizing capabilities by learning regressors at the leaves simultaneously with the decisions in the tree. Prior approaches to making DTs differentiable rely either on probabilistic approximations at the tree's internal nodes (soft DTs) or on approximations in gradient computation at the internal node (quantized gradient descent). In this work, we propose DTSemNet, a novel semantically equivalent and invertible encoding for (hard, oblique) DTs as Neural Networks (NNs), that uses standard vanilla gradient descent. Experiments across various classification and regression benchmarks show that oblique DTs learned using DTSemNet are more accurate than oblique DTs of similar size learned using state-of-the-art techniques. Further, DT training time is significantly reduced. We also experimentally demonstrate that DTSemNet can learn DT policies as efficiently as NN policies in the Reinforcement Learning (RL) setup with physical inputs (dimensions $\leq32$). The code is available at https://github.com/CPS-research-group/dtsemnet., Comment: Published in European Conference on Artificial Intelligence (ECAI), 2024. Full version (includes supplementary material)
Published: 2024
Full Text: View/download PDF

25. A frugal Spiking Neural Network for unsupervised classification of continuous multivariate temporal data

Author: Pokala, Sai Deepesh, Bernert, Marie, Nanami, Takuya, Kohno, Takashi, Lévi, Timothée, and Yvert, Blaise
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: As neural interfaces become more advanced, there has been an increase in the volume and complexity of neural data recordings. These interfaces capture rich information about neural dynamics that call for efficient, real-time processing algorithms to spontaneously extract and interpret patterns of neural dynamics. Moreover, being able to do so in a fully unsupervised manner is critical as patterns in vast streams of neural data might not be easily identifiable by the human eye. Formal Deep Neural Networks (DNNs) have come a long way in performing pattern recognition tasks for various static and sequential pattern recognition applications. However, these networks usually require large labeled datasets for training and have high power consumption preventing their future embedding in active brain implants. An alternative aimed at addressing these issues are Spiking Neural Networks (SNNs) which are neuromorphic and use more biologically plausible neurons with evolving membrane potentials. In this context, we introduce here a frugal single-layer SNN designed for fully unsupervised identification and classification of multivariate temporal patterns in continuous data with a sequential approach. We show that, with only a handful number of neurons, this strategy is efficient to recognize highly overlapping multivariate temporal patterns, first on simulated data, and then on Mel Cepstral representations of speech sounds and finally on multichannel neural data. This approach relies on several biologically inspired plasticity rules, including Spike-timing-dependent plasticity (STDP), Short-term plasticity (STP) and intrinsic plasticity (IP). These results pave the way towards highly frugal SNNs for fully unsupervised and online-compatible learning of complex multivariate temporal patterns for future embedding in dedicated very-low power hardware.
Published: 2024

26. BraTS-PEDs: Results of the Multi-Consortium International Pediatric Brain Tumor Segmentation Challenge 2023

Author: Kazerooni, Anahita Fathi, Khalili, Nastaran, Liu, Xinyang, Haldar, Debanjan, Jiang, Zhifan, Zapaishchykova, Anna, Pavaine, Julija, Shah, Lubdha M., Jones, Blaise V., Sheth, Nakul, Prabhu, Sanjay P., McAllister, Aaron S., Tu, Wenxin, Nandolia, Khanak K., Rodriguez, Andres F., Shaikh, Ibraheem Salman, Montano, Mariana Sanchez, Lai, Hollie Anne, Adewole, Maruf, Albrecht, Jake, Anazodo, Udunna, Anderson, Hannah, Anwar, Syed Muhammed, Aristizabal, Alejandro, Bagheri, Sina, Baid, Ujjwal, Bergquist, Timothy, Borja, Austin J., Calabrese, Evan, Chung, Verena, Conte, Gian-Marco, Eddy, James, Ezhov, Ivan, Familiar, Ariana M., Farahani, Keyvan, Gandhi, Deep, Gottipati, Anurag, Haldar, Shuvanjan, Iglesias, Juan Eugenio, Janas, Anastasia, Elaine, Elaine, Karargyris, Alexandros, Kassem, Hasan, Khalili, Neda, Kofler, Florian, LaBella, Dominic, Van Leemput, Koen, Li, Hongwei B., Maleki, Nazanin, Meier, Zeke, Menze, Bjoern, Moawad, Ahmed W., Pati, Sarthak, Piraud, Marie, Poussaint, Tina, Reitman, Zachary J., Rudie, Jeffrey D., Saluja, Rachit, Sheller, MIcah, Shinohara, Russell Takeshi, Viswanathan, Karthik, Wang, Chunhao, Wiestler, Benedikt, Wiggins, Walter F., Davatzikos, Christos, Storm, Phillip B., Bornhorst, Miriam, Packer, Roger, Hummel, Trent, de Blank, Peter, Hoffman, Lindsey, Aboian, Mariam, Nabavizadeh, Ali, Ware, Jeffrey B., Kann, Benjamin H., Rood, Brian, Resnick, Adam, Bakas, Spyridon, Vossough, Arastoo, and Linguraru, Marius George
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Pediatric central nervous system tumors are the leading cause of cancer-related deaths in children. The five-year survival rate for high-grade glioma in children is less than 20%. The development of new treatments is dependent upon multi-institutional collaborative clinical trials requiring reproducible and accurate centralized response assessment. We present the results of the BraTS-PEDs 2023 challenge, the first Brain Tumor Segmentation (BraTS) challenge focused on pediatric brain tumors. This challenge utilized data acquired from multiple international consortia dedicated to pediatric neuro-oncology and clinical trials. BraTS-PEDs 2023 aimed to evaluate volumetric segmentation algorithms for pediatric brain gliomas from magnetic resonance imaging using standardized quantitative performance evaluation metrics employed across the BraTS 2023 challenges. The top-performing AI approaches for pediatric tumor analysis included ensembles of nnU-Net and Swin UNETR, Auto3DSeg, or nnU-Net with a self-supervised framework. The BraTSPEDs 2023 challenge fostered collaboration between clinicians (neuro-oncologists, neuroradiologists) and AI/imaging scientists, promoting faster data sharing and the development of automated volumetric analysis techniques. These advancements could significantly benefit clinical trials and improve the care of children with brain tumors.
Published: 2024

27. Linear dynamical stability and the laws of thermodynamics

Author: Goutéraux, Blaise and Mefford, Eric
Subjects: High Energy Physics - Theory, Condensed Matter - Quantum Gases, Condensed Matter - Statistical Mechanics, Condensed Matter - Strongly Correlated Electrons, High Energy Physics - Phenomenology
Abstract: We show that the dynamical stability under linear perturbations of interacting systems in the hydrodynamic regime follows from the first and the second laws of thermodynamics. Our argument extends to systems with spontaneously or softly broken symmetries and in the presence of magnetic fields., Comment: 20 pages, 3 figures
Published: 2024

28. Function+Data Flow: A Framework to Specify Machine Learning Pipelines for Digital Twinning

Author: de Conto, Eduardo, Genest, Blaise, and Easwaran, Arvind
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The development of digital twins (DTs) for physical systems increasingly leverages artificial intelligence (AI), particularly for combining data from different sources or for creating computationally efficient, reduced-dimension models. Indeed, even in very different application domains, twinning employs common techniques such as model order reduction and modelization with hybrid data (that is, data sourced from both physics-based models and sensors). Despite this apparent generality, current development practices are ad-hoc, making the design of AI pipelines for digital twinning complex and time-consuming. Here we propose Function+Data Flow (FDF), a domain-specific language (DSL) to describe AI pipelines within DTs. FDF aims to facilitate the design and validation of digital twins. Specifically, FDF treats functions as first-class citizens, enabling effective manipulation of models learned with AI. We illustrate the benefits of FDF on two concrete use cases from different domains: predicting the plastic strain of a structure and modeling the electromagnetic behavior of a bearing., Comment: 9 pages, 10 figures, to be published in AIware'24
Published: 2024
Full Text: View/download PDF

29. Computational Life: How Well-formed, Self-replicating Programs Emerge from Simple Interaction

Author: Arcas, Blaise Agüera y, Alakuijala, Jyrki, Evans, James, Laurie, Ben, Mordvintsev, Alexander, Niklasson, Eyvind, Randazzo, Ettore, and Versari, Luca
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Artificial Intelligence, F.2.2, I.2.11
Abstract: The fields of Origin of Life and Artificial Life both question what life is and how it emerges from a distinct set of "pre-life" dynamics. One common feature of most substrates where life emerges is a marked shift in dynamics when self-replication appears. While there are some hypotheses regarding how self-replicators arose in nature, we know very little about the general dynamics, computational principles, and necessary conditions for self-replicators to emerge. This is especially true on "computational substrates" where interactions involve logical, mathematical, or programming rules. In this paper we take a step towards understanding how self-replicators arise by studying several computational substrates based on various simple programming languages and machine instruction sets. We show that when random, non self-replicating programs are placed in an environment lacking any explicit fitness landscape, self-replicators tend to arise. We demonstrate how this occurs due to random interactions and self-modification, and can happen with and without background random mutations. We also show how increasingly complex dynamics continue to emerge following the rise of self-replicators. Finally, we show a counterexample of a minimalistic programming language where self-replicators are possible, but so far have not been observed to arise., Comment: 20 pages; updated introduction with further related work
Published: 2024

30. SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Author: Lovenia, Holy, Mahendra, Rahmad, Akbar, Salsabil Maulana, Miranda, Lester James V., Santoso, Jennifer, Aco, Elyanah, Fadhilah, Akhdan, Mansurov, Jonibek, Imperial, Joseph Marvin, Kampman, Onno P., Moniz, Joel Ruben Antony, Habibi, Muhammad Ravi Shulthan, Hudi, Frederikus, Montalan, Railey, Ignatius, Ryan, Lopo, Joanito Agili, Nixon, William, Karlsson, Börje F., Jaya, James, Diandaru, Ryandito, Gao, Yuze, Amadeus, Patrick, Wang, Bin, Cruz, Jan Christian Blaise, Whitehouse, Chenxi, Parmonangan, Ivan Halim, Khelli, Maria, Zhang, Wenyu, Susanto, Lucky, Ryanda, Reynard Adha, Hermawan, Sonny Lazuardi, Velasco, Dan John, Kautsar, Muhammad Dehan Al, Hendria, Willy Fitra, Moslem, Yasmin, Flynn, Noah, Adilazuarda, Muhammad Farid, Li, Haochen, Lee, Johanes, Damanhuri, R., Sun, Shuo, Qorib, Muhammad Reza, Djanibekov, Amirbek, Leong, Wei Qi, Do, Quyet V., Muennighoff, Niklas, Pansuwan, Tanrada, Putra, Ilham Firdausi, Xu, Yan, Tai, Ngee Chia, Purwarianti, Ayu, Ruder, Sebastian, Tjhi, William, Limkonchotiwat, Peerat, Aji, Alham Fikri, Keh, Sedrick, Winata, Genta Indra, Zhang, Ruochen, Koto, Fajri, Yong, Zheng-Xin, and Cahyawijaya, Samuel
Subjects: Computer Science - Computation and Language
Abstract: Southeast Asia (SEA) is a region rich in linguistic diversity and cultural variety, with over 1,300 indigenous languages and a population of 671 million people. However, prevailing AI models suffer from a significant lack of representation of texts, images, and audio datasets from SEA, compromising the quality of AI models for SEA languages. Evaluating models for SEA languages is challenging due to the scarcity of high-quality datasets, compounded by the dominance of English training data, raising concerns about potential cultural misrepresentation. To address these challenges, we introduce SEACrowd, a collaborative initiative that consolidates a comprehensive resource hub that fills the resource gap by providing standardized corpora in nearly 1,000 SEA languages across three modalities. Through our SEACrowd benchmarks, we assess the quality of AI models on 36 indigenous languages across 13 tasks, offering valuable insights into the current AI landscape in SEA. Furthermore, we propose strategies to facilitate greater AI advancements, maximizing potential utility and resource equity for the future of AI in SEA., Comment: https://seacrowd.github.io/ Accepted in EMNLP 2024
Published: 2024

31. CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

Author: Romero, David, Lyu, Chenyang, Wibowo, Haryo Akbarianto, Lynn, Teresa, Hamed, Injy, Kishore, Aditya Nanda, Mandal, Aishik, Dragonetti, Alina, Abzaliev, Artem, Tonja, Atnafu Lambebo, Balcha, Bontu Fufa, Whitehouse, Chenxi, Salamea, Christian, Velasco, Dan John, Adelani, David Ifeoluwa, Meur, David Le, Villa-Cueva, Emilio, Koto, Fajri, Farooqui, Fauzan, Belcavello, Frederico, Batnasan, Ganzorig, Vallejo, Gisela, Caulfield, Grainne, Ivetta, Guido, Song, Haiyue, Ademtew, Henok Biadglign, Maina, Hernán, Lovenia, Holy, Azime, Israel Abebe, Cruz, Jan Christian Blaise, Gala, Jay, Geng, Jiahui, Ortiz-Barajas, Jesus-German, Baek, Jinheon, Dunstan, Jocelyn, Alemany, Laura Alonso, Nagasinghe, Kumaranage Ravindu Yasas, Benotti, Luciana, D'Haro, Luis Fernando, Viridiano, Marcelo, Estecha-Garitagoitia, Marcos, Cabrera, Maria Camila Buitrago, Rodríguez-Cantelar, Mario, Jouitteau, Mélanie, Mihaylov, Mihail, Imam, Mohamed Fazli Mohamed, Adilazuarda, Muhammad Farid, Gochoo, Munkhjargal, Otgonbold, Munkh-Erdene, Etori, Naome, Niyomugisha, Olivier, Silva, Paula Mónica, Chitale, Pranjal, Dabre, Raj, Chevi, Rendi, Zhang, Ruochen, Diandaru, Ryandito, Cahyawijaya, Samuel, Góngora, Santiago, Jeong, Soyeong, Purkayastha, Sukannya, Kuribayashi, Tatsuki, Clifford, Teresa, Jayakumar, Thanmay, Torrent, Tiago Timponi, Ehsan, Toqeer, Araujo, Vladimir, Kementchedjhieva, Yova, Burzo, Zara, Lim, Zheng Wei, Yong, Zheng Xin, Ignat, Oana, Nwatu, Joan, Mihalcea, Rada, Solorio, Thamar, and Aji, Alham Fikri
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recent efforts have tried to increase the number of languages covered on VQA datasets, they still lack diversity in low-resource languages. More importantly, although these datasets often extend their linguistic range via translation or some other approaches, they usually keep images the same, resulting in narrow cultural representation. To address these limitations, we construct CVQA, a new Culturally-diverse multilingual Visual Question Answering benchmark, designed to cover a rich set of languages and cultures, where we engage native speakers and cultural experts in the data collection process. As a result, CVQA includes culturally-driven images and questions from across 30 countries on four continents, covering 31 languages with 13 scripts, providing a total of 10k questions. We then benchmark several Multimodal Large Language Models (MLLMs) on CVQA, and show that the dataset is challenging for the current state-of-the-art models. This benchmark can serve as a probing evaluation suite for assessing the cultural capability and bias of multimodal models and hopefully encourage more research efforts toward increasing cultural awareness and linguistic diversity in this field., Comment: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks
Published: 2024

32. On the Path to Accelerating Industrialisation in Africa: The Role of Fiscal Decentralisation

Author: Eloundou, Georges Ngnouwal, Beyene, Blaise Ondoua, and Nkoa, Bruno Emmanuel Ongo
Published: 2025
Full Text: View/download PDF

33. Skeletal muscle elastic modulus in marathon distance runners

Author: Winn, Brad J., Haight, Derek J., Williams, 3rd, D. S. Blaise, and Kirby, Brett S.
Published: 2025
Full Text: View/download PDF

34. Toward expert-level medical question answering with large language models

Author: Singhal, Karan, Tu, Tao, Gottweis, Juraj, Sayres, Rory, Wulczyn, Ellery, Amin, Mohamed, Hou, Le, Clark, Kevin, Pfohl, Stephen R., Cole-Lewis, Heather, Neal, Darlene, Rashid, Qazi Mamunur, Schaekermann, Mike, Wang, Amy, Dash, Dev, Chen, Jonathan H., Shah, Nigam H., Lachgar, Sami, Mansfield, Philip Andrew, Prakash, Sushant, Green, Bradley, Dominowska, Ewa, Agüera y Arcas, Blaise, Tomašev, Nenad, Liu, Yun, Wong, Renee, Semturs, Christopher, Mahdavi, S. Sara, Barral, Joelle K., Webster, Dale R., Corrado, Greg S., Matias, Yossi, Azizi, Shekoofeh, Karthikesalingam, Alan, and Natarajan, Vivek
Published: 2025
Full Text: View/download PDF

35. Post-transplant cyclophosphamide with post-engraftment anti-thymocyte globulin reduce moderate to severe chronic graft-versus-host disease in peripheral stem cell transplantation from HLA-matched unrelated and haploidentical donors

Author: Wang, Ying, Gao, Wen-Hui, Wang, Li-ning, Wang, Ling, Jiang, Jie-ling, Wan, Ming, Liang, Ai-Bin, Blaise, Didier, and Hu, Jiong
Published: 2025
Full Text: View/download PDF

36. Correlation between the chemical composition of fresh and dried Cymbopogon citratus essential oil fractions and their antifungal effects against the causal agents of brown spot and bakanae diseases of rice

Author: Fouelefack, François Romain, Tapan, Kumar Pal, Dongmo, Lekagne Joseph Blaise, Ndonkeu, Mangoumou Ghislaine, Mekam, Pascal Noel, and Nguefack, Julienne
Published: 2025
Full Text: View/download PDF

37. Reshaping of Distorted Signal with Low-Pass Type Negative Group Delay RL-network Circuit

Author: Zhang, Yi, Su, Xun, Wieser, Robert, Sanchez Galan, Raul, and Ravelo, Blaise
Published: 2024
Full Text: View/download PDF

38. Intercropping Legumes with High-Density Cotton to Improve the Land Use Efficiencies of Rainfed Vertisols of India

Author: Manikandan, Angamuthu, Blaise, Desouza, Nalayini, Periyakaruppan, Nagrare, Vishlesh Shankar, and Prasad, Yenumula Gerard
Published: 2024
Full Text: View/download PDF

39. Global potential for natural regeneration in deforested tropical regions

Author: Williams, Brooke A., Beyer, Hawthorne L., Fagan, Matthew E., Chazdon, Robin L., Schmoeller, Marina, Sprenkle-Hyppolite, Starry, Griscom, Bronson W., Watson, James E. M., Tedesco, Anazélia M., Gonzalez-Roglich, Mariano, Daldegan, Gabriel A., Bodin, Blaise, Celentano, Danielle, Wilson, Sarah Jane, Rhodes, Jonathan R., Alexandre, Nikola S., Kim, Do-Hyung, Bastos, Diego, and Crouzeilles, Renato
Published: 2024
Full Text: View/download PDF

40. The mental association between subjective vitality, energy conservation motivation, and cognitive effort motivation according to the schema model of self-control

Author: Blaise, Max and Bertrams, Alex
Published: 2024
Full Text: View/download PDF

41. Weightbearing versus non-weight bearing in geriatric distal femoral fractures: a systematic review and meta-analysis: Weightbearing versus non-weight bearing in geriatric distal femoral fractures: a systematic review and meta-analysis

Author: Wardle, Blaise, Lynch, Joseph T., Staniforth, Thomas, Ward, Thomas, and Smith, Paul
Published: 2024
Full Text: View/download PDF

42. Review on the amelioration of ZnO and its composites: synthesis and applications

Author: Singh, Amitender, Yadav, Kavita, Thakur, Preeti, Wan, Fayu, Ravelo, Blaise, and Thakur, Atul
Published: 2024
Full Text: View/download PDF

43. ECGrecover: a Deep Learning Approach for Electrocardiogram Signal Completion

Author: Lence, Alex, Granese, Federica, Fall, Ahmad, Hanczar, Blaise, Salem, Joe-Elie, Zucker, Jean-Daniel, and Prifti, Edi
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: In this work, we address the challenge of reconstructing the complete 12-lead ECG signal from its incomplete parts. We focus on two main scenarios: (i) reconstructing missing signal segments within an ECG lead and (ii) recovering entire leads from signal in another unique lead. Two emerging clinical applications emphasize the relevance of our work. The first is the increasing need to digitize paper-stored ECGs for utilization in AI-based applications, often limited to digital 12 lead 10s ECGs. The second is the widespread use of wearable devices that record ECGs but typically capture only one or a few leads. In both cases, a non-negligible amount of information is lost or not recorded. Our approach aims to recover this missing signal. We propose ECGrecover, a U-Net neural network model trained on a novel composite objective function to address the reconstruction problem. This function incorporates both spatial and temporal features of the ECG by combining the distance in amplitude and sycnhronization through time between the reconstructed and the real digital signals. We used real-life ECG datasets and through comprehensive assessments compared ECGrecover with three state-of-the-art methods based on generative adversarial networks (EKGAN, Pix2Pix) as well as the CopyPaste strategy. The results demonstrated that ECGrecover consistently outperformed state-of-the-art methods in standard distortion metrics as well as in preserving critical ECG characteristics, particularly the P, QRS, and T wave coordinates., Comment: 31 pages, 14 figures, 29 tables, conference paper
Published: 2024

44. LLMs achieve adult human performance on higher-order theory of mind tasks

Author: Street, Winnie, Siy, John Oliver, Keeling, Geoff, Baranes, Adrien, Barnett, Benjamin, McKibben, Michael, Kanyere, Tatenda, Lentz, Alison, Arcas, Blaise Aguera y, and Dunbar, Robin I. M.
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Human-Computer Interaction, I.2.7, H.1.2
Abstract: This paper examines the extent to which large language models (LLMs) have developed higher-order theory of mind (ToM); the human ability to reason about multiple mental and emotional states in a recursive manner (e.g. I think that you believe that she knows). This paper builds on prior work by introducing a handwritten test suite -- Multi-Order Theory of Mind Q&A -- and using it to compare the performance of five LLMs to a newly gathered adult human benchmark. We find that GPT-4 and Flan-PaLM reach adult-level and near adult-level performance on ToM tasks overall, and that GPT-4 exceeds adult performance on 6th order inferences. Our results suggest that there is an interplay between model size and finetuning for the realisation of ToM abilities, and that the best-performing LLMs have developed a generalised capacity for ToM. Given the role that higher-order ToM plays in a wide range of cooperative and competitive human behaviours, these findings have significant implications for user-facing LLM applications.
Published: 2024

45. The Ethics of Advanced AI Assistants

Author: Gabriel, Iason, Manzini, Arianna, Keeling, Geoff, Hendricks, Lisa Anne, Rieser, Verena, Iqbal, Hasan, Tomašev, Nenad, Ktena, Ira, Kenton, Zachary, Rodriguez, Mikel, El-Sayed, Seliem, Brown, Sasha, Akbulut, Canfer, Trask, Andrew, Hughes, Edward, Bergman, A. Stevie, Shelby, Renee, Marchal, Nahema, Griffin, Conor, Mateos-Garcia, Juan, Weidinger, Laura, Street, Winnie, Lange, Benjamin, Ingerman, Alex, Lentz, Alison, Enger, Reed, Barakat, Andrew, Krakovna, Victoria, Siy, John Oliver, Kurth-Nelson, Zeb, McCroskery, Amanda, Bolina, Vijay, Law, Harry, Shanahan, Murray, Alberts, Lize, Balle, Borja, de Haas, Sarah, Ibitoye, Yetunde, Dafoe, Allan, Goldberg, Beth, Krier, Sébastien, Reese, Alexander, Witherspoon, Sims, Hawkins, Will, Rauh, Maribeth, Wallace, Don, Franklin, Matija, Goldstein, Josh A., Lehman, Joel, Klenk, Michael, Vallor, Shannon, Biles, Courtney, Morris, Meredith Ringel, King, Helen, Arcas, Blaise Agüera y, Isaac, William, and Manyika, James
Subjects: Computer Science - Computers and Society
Abstract: This paper focuses on the opportunities and the ethical and societal risks posed by advanced AI assistants. We define advanced AI assistants as artificial agents with natural language interfaces, whose function is to plan and execute sequences of actions on behalf of a user, across one or more domains, in line with the user's expectations. The paper starts by considering the technology itself, providing an overview of AI assistants, their technical foundations and potential range of applications. It then explores questions around AI value alignment, well-being, safety and malicious uses. Extending the circle of inquiry further, we next consider the relationship between advanced AI assistants and individual users in more detail, exploring topics such as manipulation and persuasion, anthropomorphism, appropriate relationships, trust and privacy. With this analysis in place, we consider the deployment of advanced assistants at a societal scale, focusing on cooperation, equity and access, misinformation, economic impact, the environment and how best to evaluate advanced AI assistants. Finally, we conclude by providing a range of recommendations for researchers, developers, policymakers and public stakeholders.
Published: 2024

46. The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

Author: Kazerooni, Anahita Fathi, Khalili, Nastaran, Liu, Xinyang, Gandhi, Deep, Jiang, Zhifan, Anwar, Syed Muhammed, Albrecht, Jake, Adewole, Maruf, Anazodo, Udunna, Anderson, Hannah, Baid, Ujjwal, Bergquist, Timothy, Borja, Austin J., Calabrese, Evan, Chung, Verena, Conte, Gian-Marco, Dako, Farouk, Eddy, James, Ezhov, Ivan, Familiar, Ariana, Farahani, Keyvan, Franson, Andrea, Gottipati, Anurag, Haldar, Shuvanjan, Iglesias, Juan Eugenio, Janas, Anastasia, Johansen, Elaine, Jones, Blaise V, Khalili, Neda, Kofler, Florian, LaBella, Dominic, Lai, Hollie Anne, Van Leemput, Koen, Li, Hongwei Bran, Maleki, Nazanin, McAllister, Aaron S, Meier, Zeke, Menze, Bjoern, Moawad, Ahmed W, Nandolia, Khanak K, Pavaine, Julija, Piraud, Marie, Poussaint, Tina, Prabhu, Sanjay P, Reitman, Zachary, Rudie, Jeffrey D, Sanchez-Montano, Mariana, Shaikh, Ibraheem Salman, Sheth, Nakul, Tu, Wenxin, Wang, Chunhao, Ware, Jeffrey B, Wiestler, Benedikt, Zapaishchykova, Anna, Bornhorst, Miriam, Deutsch, Michelle, Fouladi, Maryam, Lazow, Margot, Mikael, Leonie, Hummel, Trent, Kann, Benjamin, de Blank, Peter, Hoffman, Lindsey, Aboian, Mariam, Nabavizadeh, Ali, Packer, Roger, Bakas, Spyridon, Resnick, Adam, Rood, Brian, Vossough, Arastoo, and Linguraru, Marius George
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we present the CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs challenge, focused on pediatric brain tumors with data acquired across multiple international consortia dedicated to pediatric neuro-oncology and clinical trials. The CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs challenge brings together clinicians and AI/imaging scientists to lead to faster development of automated segmentation techniques that could benefit clinical trials, and ultimately the care of children with brain tumors., Comment: arXiv admin note: substantial text overlap with arXiv:2305.17033
Published: 2024

47. Can LLMs get help from other LLMs without revealing private information?

Author: Hartmann, Florian, Tran, Duc-Hieu, Kairouz, Peter, Cărbune, Victor, and Arcas, Blaise Aguera y
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security, Computer Science - Multiagent Systems
Abstract: Cascades are a common type of machine learning systems in which a large, remote model can be queried if a local model is not able to accurately label a user's data by itself. Serving stacks for large language models (LLMs) increasingly use cascades due to their ability to preserve task performance while dramatically reducing inference costs. However, applying cascade systems in situations where the local model has access to sensitive data constitutes a significant privacy risk for users since such data could be forwarded to the remote model. In this work, we show the feasibility of applying cascade systems in such setups by equipping the local model with privacy-preserving techniques that reduce the risk of leaking private information when querying the remote model. To quantify information leakage in such setups, we introduce two privacy measures. We then propose a system that leverages the recently introduced social learning paradigm in which LLMs collaboratively learn from each other by exchanging natural language. Using this paradigm, we demonstrate on several datasets that our methods minimize the privacy loss while at the same time improving task performance compared to a non-cascade baseline.
Published: 2024

48. Mekler's Construction and Murphy's Law for 2-Nilpotent Groups

Author: Boissonneau, Blaise, Papadopoulos, Aris, and Touchard, Pierre
Subjects: Mathematics - Logic, Mathematics - Combinatorics, Mathematics - Group Theory
Abstract: Mekler's construction is a powerful technique for building purely algebraic structures from combinatorial ones. Its power lies in the fact that it allows various model-theoretic tameness properties of the combinatorial structure to transfer to the algebraic one. In this paper, we push this ideology much further, describing a broad class of properties that transfer through Mekler's construction. This technique subsumes many well-known results and opens avenues for many more. As a straightforward application of our methods, we (1) obtain transfer principles for stably embedded pairs of Mekler groups and (2) construct the first examples of strictly $\mathsf{NFOP}_k$ pure groups for all $k\in\mathbb{N}_{>2}$. We also answer a question of Chernikov and Hempel on the transfer of burden., Comment: 45 pages. Preliminary version, comments welcome!
Published: 2024

49. A scalable method to model large suspensions of colloidal phoretic particles with arbitrary shapes

Author: Delmotte, Blaise and Usabiaga, Florencio Balboa
Subjects: Condensed Matter - Soft Condensed Matter, Physics - Computational Physics, Physics - Fluid Dynamics
Abstract: Phoretic colloids self-propel thanks to surface flows generated in response to surface gradients (thermal, electrical, or chemical), that are self-induced and/or generated by other particles. Here we present a scalable and versatile framework to model chemical and hydrodynamic interactions in large suspensions of arbitrarily shaped phoretic particles, accounting for thermal fluctuations at all Damkholer numbers. Our approach, inspired by the Boundary Element Method (BEM), employs second-layer formulations, regularised kernels and a grid optimisation strategy to solve the coupled Laplace-Stokes equations with reasonable accuracy at a fraction of the computational cost associated with BEM. As demonstrated by our large-scale simulations, the capabilities of our method enable the exploration of new physical phenomena that, to our knowledge, have not been previously addressed by numerical simulations., Comment: 43 pages, 14 figures
Published: 2024

50. An asymmetric heuristic for trained ternary quantization based on the statistics of the weights: An application to medical signal classification.

Author: Yamil Vindas, Emmanuel Roux, Blaise Kévin Guépié, Marilys Almar, and Philippe Delachartre
Published: 2025
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

83,631 results on '"Blaise, A."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources