17,745 results on '"Moreira ÂN"'
Search Results
2. Tuning Electric Polarization via Exchange Striction Interaction in CaMn$_7$O$_{12}$ by Sr-Doping
- Author
-
Nonato, A., Villar, S. Y., Mira, J., Señarís-Rodríguez, María A., andújar, Manuel Sánchez, Moreira, J. Agostinho, Almeida, A., Silva, R. X., and Paschoal, C. W. A.
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
Magnetoelectric materials displaying strong magnetically induced polarization have attracted considerable interest due to their potential applications in spintronics and various fast electrically controlled magnetic devices. CaMn$_7$O$_{12}$ (CMO) stands out for its giant spin-induced ferroelectric polarization. However, the origin of the induced electric polarization in CMO remains highly controversial and continues to be a subject of ongoing debate. In this paper, through room temperature X-ray powder diffraction (XRPD), temperature-dependent magnetic susceptibility, and thermally stimulated depolarizing current (TSDC) measurements, we provide experimental evidence for a route to tune the magnetically induced polarization by modifying the exchange-striction in CMO via Sr-doping. Our findings demonstrate that the large and broad current peaks observed near the first magnetic phase transition ($T_N1 \sim 90$ K) indicate polarization contributions from both thermally stimulated depolarization current (TSDC) and intrinsic magnetically induced electric polarization. We suggest that this reduction in induced electric polarization in CMO originates from the increase in the Mn$^{3+}$ -- O -- Mn$^{4+}$ bond angle due to Sr$^{2+}$ doping, weakening the exchange-striction interaction. Meanwhile, the Dzyaloshinskii-Moriya (DM) effect determines the direction of the induced electric polarization. Our result sheds light on understanding the intriguing giant-induced polarization in CMO and similar compounds with complex magnetic structures.
- Published
- 2024
3. An Intelligent Native Network Slicing Security Architecture Empowered by Federated Learning
- Author
-
Moreira, Rodrigo, Villaca, Rodolfo S., Ribeiro, Moises R. N., Martins, Joberto S. B., Correa, Joao Henrique, Carvalho, Tereza C., and Silva, Flavio de Oliveira
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Artificial Intelligence ,Computer Science - Emerging Technologies ,Computer Science - Machine Learning ,Computer Science - Networking and Internet Architecture ,I.2 ,I.6 ,F.2.2 - Abstract
Network Slicing (NS) has transformed the landscape of resource sharing in networks, offering flexibility to support services and applications with highly variable requirements in areas such as the next-generation 5G/6G mobile networks (NGMN), vehicular networks, industrial Internet of Things (IoT), and verticals. Although significant research and experimentation have driven the development of network slicing, existing architectures often fall short in intrinsic architectural intelligent security capabilities. This paper proposes an architecture-intelligent security mechanism to improve the NS solutions. We idealized a security-native architecture that deploys intelligent microservices as federated agents based on machine learning, providing intra-slice and architectural operation security for the Slicing Future Internet Infrastructures (SFI2) reference architecture. It is noteworthy that federated learning approaches match the highly distributed modern microservice-based architectures, thus providing a unifying and scalable design choice for NS platforms addressing both service and security. Using ML-Agents and Security Agents, our approach identified Distributed Denial-of-Service (DDoS) and intrusion attacks within the slice using generic and non-intrusive telemetry records, achieving an average accuracy of approximately $95.60\%$ in the network slicing architecture and $99.99\%$ for the deployed slice -- intra-slice. This result demonstrates the potential for leveraging architectural operational security and introduces a promising new research direction for network slicing architectures., Comment: 18 pages, 12 figures, Future Generation Computer Systems (FGCS)
- Published
- 2024
- Full Text
- View/download PDF
4. Load Balancing-based Topology Adaptation for Integrated Access and Backhaul Networks
- Author
-
Paiva, Raul Victor de O., Carvalho, Fco. Italo G., Lima, Fco. Rafael M., Monteiro, Victor F., Sousa, Diego A., Moreira, Darlan C., Maciel, Tarcisio F., and Makki, Behrooz
- Subjects
Computer Science - Networking and Internet Architecture ,Electrical Engineering and Systems Science - Systems and Control - Abstract
Integrated access and backhaul (IAB) technology is a flexible solution for network densification. IAB nodes can also be deployed in moving nodes such as buses and trains, i.e., mobile IAB (mIAB). As mIAB nodes can move around the coverage area, the connection between mIAB nodes and their parent macro base stations (BSs), IAB donor, is sometimes required to change in order to keep an acceptable backhaul link, the so called topology adaptation (TA). The change from one IAB donor to another may strongly impact the system load distribution, possibly causing unsatisfactory backhaul service due to the lack of radio resources. Based on this, TA should consider both backhaul link quality and traffic load. In this work, we propose a load balancing algorithm based on TA for IAB networks, and compare it with an approach in which TA is triggered based on reference signal received power (RSRP) only. The results show that our proposed algorithm improves the passengers worst connections throughput in uplink (UL) and, more modestly, also in downlink (DL), without impairing the pedestrian quality of service (QoS) significantly., Comment: Paper submitted to Journal of Communication and Information Systems (JCIS)
- Published
- 2024
5. Cellular Network Densification: a System-level Analysis with IAB, NCR and RIS
- Author
-
da Silva, Gabriel C. M., Monteiro, Victor F., Sousa, Diego A., Moreira, Darlan C., Maciel, Tarcisio F., Lima, Fco. Rafael M., and Makki, Behrooz
- Subjects
Electrical Engineering and Systems Science - Systems and Control ,Computer Science - Networking and Internet Architecture - Abstract
As the number of user equipments increases in fifth generation (5G) and beyond, it is desired to densify the cellular network with auxiliary nodes assisting the base stations. Examples of these nodes are integrated access and backhaul (IAB) nodes, network-controlled repeaters (NCRs) and reconfigurable intelligent surfaces (RISs). In this context, this work presents a system level overview of these three nodes. Moreover, this work evaluates through simulations the impact of network planning aiming at enhancing the performance of a network used to cover an outdoor sport event. We show that, in the considered scenario, in general, IAB nodes provide an improved signal to interference-plus-noise ratio and throughput, compared to NCRs and RISs. However, there are situations where NCR outperforms IAB due to higher level of interference caused by the latter. Finally, we show that the deployment of these nodes in unmanned aerial vehicles (UAVs) also achieves performance gains due to their aerial mobility. However, UAV constraints related to aerial deployment may prevent these nodes from reaching results as good as the ones achieved by their stationary deployment., Comment: Paper submitted to IEEE Systems Journal
- Published
- 2024
6. Developing Guidelines for Functionally-Grounded Evaluation of Explainable Artificial Intelligence using Tabular Data
- Author
-
Velmurugan, Mythreyi, Ouyang, Chun, Xu, Yue, Sindhgatta, Renuka, Wickramanayake, Bemali, and Moreira, Catarina
- Subjects
Computer Science - Computers and Society ,Computer Science - Machine Learning - Abstract
Explainable Artificial Intelligence (XAI) techniques are used to provide transparency to complex, opaque predictive models. However, these techniques are often designed for image and text data, and it is unclear how fit-for-purpose they are when applied to tabular data. As XAI techniques are rarely evaluated in settings with tabular data, the applicability of existing evaluation criteria and methods are also unclear and needs (re-)examination. For example, some works suggest that evaluation methods may unduly influence the evaluation results when using tabular data. This lack of clarity on evaluation procedures can lead to reduced transparency and ineffective use of XAI techniques in real world settings. In this study, we examine literature on XAI evaluation to derive guidelines on functionally-grounded assessment of local, post hoc XAI techniques. We identify 20 evaluation criteria and associated evaluation methods, and derive guidelines on when and how each criterion should be evaluated. We also identify key research gaps to be addressed by future work. Our study contributes to the body of knowledge on XAI evaluation through in-depth examination of functionally-grounded XAI evaluation protocols, and has laid the groundwork for future research on XAI evaluation.
- Published
- 2024
7. Coastlines violate the Schramm-Loewner Evolution
- Author
-
Abril, Leidy M. L., Oliveira, Erneson A., Moreira, André A., Andrade Jr., José S., and Herrmann, Hans J.
- Subjects
Condensed Matter - Statistical Mechanics - Abstract
Mandelbrot's empirical observation that the coast of Britain is fractal has been confirmed by many authors, but it can be described by the Schramm--Loewner Evolution? Since the self-affine surface of our planet has a positive Hurst exponent, one would not expect a priori any critical behavior. Here, we investigate numerically the roughness and fractal dimension of the isoheight lines of real and artificial landscapes. Using a novel algorithm to take into account overhangs, we find that the roughness exponent of isoheight lines is consistent with unity regardless of the Hurst exponent of the rough surface. Moreover, the effective fractal dimension of the iso-height lines decays linearly with the Hurst exponent of the surface. We perform several tests to verify if the complete and accessible perimeters would follow the Schramm--Loewner Evolution and find that the left passage probability test is clearly violated, implying that coastlines violate SLE., Comment: 17 pages, 11 figures
- Published
- 2024
- Full Text
- View/download PDF
8. Standing waves for nonlinear Hartree type equations: existence and qualitative properties
- Author
-
Böer, Eduardo de Souza and Santos, Ederson Moreira dos
- Subjects
Mathematics - Analysis of PDEs ,35B06, 35B40, 35J47, 35J50, 35J60, 35Q40, 35Q92 - Abstract
We consider systems of the form \[ \left\{ \begin{array}{l} -\Delta u + u = \frac{2p}{p+q}(I_\alpha \ast |v|^{q})|u|^{p-2}u \ \ \textrm{ in } \mathbb{R}^N, \\ -\Delta v + v = \frac{2q}{p+q}(I_\alpha \ast |u|^{p})|v|^{q-2}v \ \ \textrm{ in } \mathbb{R}^N, \end{array} \right. \] for $\alpha\in (0, N)$, $\max\left\{\frac{2\alpha}{N}, 1\right\} < p, q < 2^*$ and $\frac{2(N+\alpha)}{N} < p+ q < 2^{*}_{\alpha}$, where $I_\alpha$ denotes the Riesz potential, \[ 2^* = \left\{ \begin{array}{l}\frac{2N}{N-2} \ \ \text{for} \ \ N\geq 3,\\ +\infty \ \ \text{for} \ \ N =1,2, \end{array}\right. \quad \text{and} \quad 2^*_{\alpha} = \left\{ \begin{array}{l}\frac{2(N+\alpha)}{N-2} \ \ \text{for} \ \ N\geq 3,\\ +\infty \ \ \text{for} \ \ N =1,2. \end{array} \right. \] This type of systems arises in the study of standing wave solutions for a certain approximation of the Hartree theory for a two-component attractive interaction. We prove existence and some qualitative properties for ground state solutions, such as definite sign for each component, radial symmetry and sharp asymptotic decay at infinity, and a regularity/integrability result for the (weak) solutions. Moreover, we show that the straight lines $p+q=\frac{2(N+\alpha)}{N}$ and $ p+ q = 2^{*}_{\alpha}$ are critical for the existence of solutions., Comment: 35 pages, 7 figures, few typos were fixed
- Published
- 2024
9. FairPIVARA: Reducing and Assessing Biases in CLIP-Based Multimodal Models
- Author
-
Moreira, Diego A. B., Ferreira, Alef Iury, Silva, Jhessica, Santos, Gabriel Oliveira dos, Pereira, Luiz, Gondim, João Medrado, Bonil, Gustavo, Maia, Helena, da Silva, Nádia, Hashiguti, Simone Tiemi, Santos, Jefersson A. dos, Pedrini, Helio, and Avila, Sandra
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Despite significant advancements and pervasive use of vision-language models, a paucity of studies has addressed their ethical implications. These models typically require extensive training data, often from hastily reviewed text and image datasets, leading to highly imbalanced datasets and ethical concerns. Additionally, models initially trained in English are frequently fine-tuned for other languages, such as the CLIP model, which can be expanded with more data to enhance capabilities but can add new biases. The CAPIVARA, a CLIP-based model adapted to Portuguese, has shown strong performance in zero-shot tasks. In this paper, we evaluate four different types of discriminatory practices within visual-language models and introduce FairPIVARA, a method to reduce them by removing the most affected dimensions of feature embeddings. The application of FairPIVARA has led to a significant reduction of up to 98% in observed biases while promoting a more balanced word distribution within the model. Our model and code are available at: https://github.com/hiaac-nlp/FairPIVARA., Comment: 14 pages, 10 figures. Accepted to 35th British Machine Vision Conference (BMVC 2024), Workshop on Privacy, Fairness, Accountability and Transparency in Computer Vision
- Published
- 2024
10. Explainable Artifacts for Synthetic Western Blot Source Attribution
- Author
-
Cardenuto, João Phillipe, Mandelli, Sara, Moreira, Daniel, Bestagini, Paolo, Delp, Edward, and Rocha, Anderson
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent advancements in artificial intelligence have enabled generative models to produce synthetic scientific images that are indistinguishable from pristine ones, posing a challenge even for expert scientists habituated to working with such content. When exploited by organizations known as paper mills, which systematically generate fraudulent articles, these technologies can significantly contribute to the spread of misinformation about ungrounded science, potentially undermining trust in scientific research. While previous studies have explored black-box solutions, such as Convolutional Neural Networks, for identifying synthetic content, only some have addressed the challenge of generalizing across different models and providing insight into the artifacts in synthetic images that inform the detection process. This study aims to identify explainable artifacts generated by state-of-the-art generative models (e.g., Generative Adversarial Networks and Diffusion Models) and leverage them for open-set identification and source attribution (i.e., pointing to the model that created the image)., Comment: Accepted in IEEE International Workshop on Information Forensics and Security - WIFS 2024, Rome, Italy
- Published
- 2024
11. Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG
- Author
-
Moreira, Gabriel de Souza P., Ak, Ronay, Schifferer, Benedikt, Xu, Mengyao, Osmulski, Radek, and Oldridge, Even
- Subjects
Computer Science - Information Retrieval ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Ranking models play a crucial role in enhancing overall accuracy of text retrieval systems. These multi-stage systems typically utilize either dense embedding models or sparse lexical indices to retrieve relevant passages based on a given query, followed by ranking models that refine the ordering of the candidate passages by its relevance to the query. This paper benchmarks various publicly available ranking models and examines their impact on ranking accuracy. We focus on text retrieval for question-answering tasks, a common use case for Retrieval-Augmented Generation systems. Our evaluation benchmarks include models some of which are commercially viable for industrial applications. We introduce a state-of-the-art ranking model, NV-RerankQA-Mistral-4B-v3, which achieves a significant accuracy increase of ~14% compared to pipelines with other rerankers. We also provide an ablation study comparing the fine-tuning of ranking models with different sizes, losses and self-attention mechanisms. Finally, we discuss challenges of text retrieval pipelines with ranking models in real-world industry applications, in particular the trade-offs among model size, ranking accuracy and system requirements like indexing and serving latency / throughput., Comment: Accepted for the 1st Workshop on GenAI and RAG Systems for Enterprise @ CIKM 2024
- Published
- 2024
12. Operational State Complexity of Block Languages
- Author
-
Duarte, Guilherme, Moreira, Nelma, Prigioniero, Luca, and Reis, Rogério
- Subjects
Computer Science - Formal Languages and Automata Theory - Abstract
In this paper we consider block languages, namely sets of words having the same length, and study the deterministic and nondeterministic state complexity of several operations on these languages. Being a subclass of finite languages, the upper bounds of operational state complexity known for finite languages apply for block languages as well. However, in several cases, smaller values were found. Block languages can be represented as bitmaps, which are a good tool to study their minimal finite automata and their operations, as we illustrate here., Comment: In Proceedings NCMA 2024, arXiv:2409.06120
- Published
- 2024
- Full Text
- View/download PDF
13. Fractal geometry of continued fractions with large coefficients and dimension drop problems
- Author
-
Fang, Lulu, Moreira, Carlos Gustavo, and Zhang, Yiwei
- Subjects
Mathematics - Number Theory ,11K50, 37D35, 28A80 - Abstract
In 1928, Jarn\'{\i}k \cite{Jar} obtained that the set of continued fractions with bounded coefficients has Hausdorff dimension one. Good \cite{Goo} observed a dimension drop phenomenon by proving that the Hausdorff dimension of the set of continued fractions whose coefficients tend to infinity is one-half. For the set of continued fractions whose coefficients tend to infinity rapidly, Luczak \cite{Luc} and Feng et al. \cite{FWLT} showed that its Hausdorff dimension decreases even further. Recently, Liao and Rams \cite{LR16} also observed an analogous dimension drop phenomenon when they studied the subexponential growth rate of the sum of coefficients. In this paper, we consolidate and considerably extend the studies of the abovementioned problem into a general dimension drop problem on the distribution of continued fractions with large coefficients. As applications, we use a different approach to reprove a result of Wang and Wu on the dimensions of the Borel-Bernstein sets \cite{WW}, fulfil the dimension gap proposed by Liao and Rams \cite{LR16}, and establish several new results concerning the dimension theory of liminf and limsup sets related to the maximum of coefficients., Comment: 5 Figures, 56 pages
- Published
- 2024
14. Benchmarking ML Approaches to UWB-Based Range-Only Posture Recognition for Human Robot-Interaction
- Author
-
Salimi, Salma, Salimpour, Sahar, Queralta, Jorge Peña, Bessa, Wallace Moreira, and Westerlund, Tomi
- Subjects
Computer Science - Robotics - Abstract
Human pose estimation involves detecting and tracking the positions of various body parts using input data from sources such as images, videos, or motion and inertial sensors. This paper presents a novel approach to human pose estimation using machine learning algorithms to predict human posture and translate them into robot motion commands using ultra-wideband (UWB) nodes, as an alternative to motion sensors. The study utilizes five UWB sensors implemented on the human body to enable the classification of still poses and more robust posture recognition. This approach ensures effective posture recognition across a variety of subjects. These range measurements serve as input features for posture prediction models, which are implemented and compared for accuracy. For this purpose, machine learning algorithms including K-Nearest Neighbors (KNN), Support Vector Machine (SVM), and deep Multi-Layer Perceptron (MLP) neural network are employed and compared in predicting corresponding postures. We demonstrate the proposed approach for real-time control of different mobile/aerial robots with inference implemented in a ROS 2 node. Experimental results demonstrate the efficacy of the approach, showcasing successful prediction of human posture and corresponding robot movements with high accuracy.
- Published
- 2024
15. Localization of Synthetic Manipulations in Western Blot Images
- Author
-
Manjunath, Anmol, Negroni, Viola, Mandelli, Sara, Moreira, Daniel, and Bestagini, Paolo
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Multimedia - Abstract
Recent breakthroughs in deep learning and generative systems have significantly fostered the creation of synthetic media, as well as the local alteration of real content via the insertion of highly realistic synthetic manipulations. Local image manipulation, in particular, poses serious challenges to the integrity of digital content and societal trust. This problem is not only confined to multimedia data, but also extends to biological images included in scientific publications, like images depicting Western blots. In this work, we address the task of localizing synthetic manipulations in Western blot images. To discriminate between pristine and synthetic pixels of an analyzed image, we propose a synthetic detector that operates on small patches extracted from the image. We aggregate patch contributions to estimate a tampering heatmap, highlighting synthetic pixels out of pristine ones. Our methodology proves effective when tested over two manipulated Western blot image datasets, one altered automatically and the other manually by exploiting advanced AI-based image manipulation tools that are unknown at our training stage. We also explore the robustness of our method over an external dataset of other scientific images depicting different semantics, manipulated through unseen generation techniques.
- Published
- 2024
16. Online ML-based Joint Channel Estimation and MIMO Decoding for Dynamic Channels
- Author
-
Teixeira, Luiz Fernando Moreira, Luiz, Vinicius Henrique, Soares, Jonathan Aguiar, Mayer, Kayol Soares, and Arantes, Dalton Soares
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
This paper presents an online method for joint channel estimation and decoding in massive MIMO-OFDM systems using complex-valued neural networks (CVNNs). The study evaluates the performance of various CVNNs, such as the complex-valued feedforward neural network (CVFNN), split-complex feedforward neural network (SCFNN), complex radial basis function (C-RBF), fully-complex radial basis function (FC-RBF) and phase-transmittance radial basis function (PT-RBF), in realistic 5G communication scenarios. Results demonstrate improvements in mean squared error (MSE), convergence, and bit error rate (BER) accuracy. The C-RBF and PT-RBF architectures show the most promising outcomes, suggesting that RBF-based CVNNs provide a reliable and efficient solution for complex and noisy communication environments. These findings have potential implications for applying advanced neural network techniques in next-generation wireless systems., Comment: XLII Simp\'osio Brasileiro de Telecomunica\c{c}\~oes e Processamento de Sinais (SBrT 2024)
- Published
- 2024
- Full Text
- View/download PDF
17. A Modeling Framework for Equitable Deployment of Energy Storage in Disadvantaged Communities
- Author
-
Heleno, Miguel, Lesur, Paul, and Moreira, Alexandre
- Subjects
Mathematics - Optimization and Control - Abstract
This paper provides an analytical framework to incorporate the deployment of behind-the-meter energy storage coupled with rooftop solar, and their associated revenue streams, in the context of equitable energy policy interventions. We propose an extension to the Justice40 optimization model by adding storage and incorporating more realistic solar compensation mechanisms, such as net-billing, which allows for temporal revenue differentiation and the economic viability of behind-the-meter energy storage devices. The extended model includes household-level PV plus storage co-deployment alongside existing interventions, such as weatherization, rooftop PV only, community solar, and community wind. From a modeling perspective, we propose a novel approximation method to represent storage operations and revenue streams without expanding the temporal dimension of model, thus maintaining its computational efficiency. The proposed model is validated using a case study in Wayne County, Michigan, involving 3,651 energy insecure households.
- Published
- 2024
18. Curio: A Dataflow-Based Framework for Collaborative Urban Visual Analytics
- Author
-
Moreira, Gustavo, Hosseini, Maryam, Veiga, Carolina, Alexandre, Lucas, Colaninno, Nicola, de Oliveira, Daniel, Ferreira, Nivan, Lage, Marcos, and Miranda, Fabio
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Computers and Society - Abstract
Over the past decade, several urban visual analytics systems and tools have been proposed to tackle a host of challenges faced by cities, in areas as diverse as transportation, weather, and real estate. Many of these tools have been designed through collaborations with urban experts, aiming to distill intricate urban analysis workflows into interactive visualizations and interfaces. However, the design, implementation, and practical use of these tools still rely on siloed approaches, resulting in bespoke applications that are difficult to reproduce and extend. At the design level, these tools undervalue rich data workflows from urban experts, typically treating them only as data providers and evaluators. At the implementation level, they lack interoperability with other technical frameworks. At the practical use level, they tend to be narrowly focused on specific fields, inadvertently creating barriers to cross-domain collaboration. To address these gaps, we present Curio, a framework for collaborative urban visual analytics. Curio uses a dataflow model with multiple abstraction levels (code, grammar, GUI elements) to facilitate collaboration across the design and implementation of visual analytics components. The framework allows experts to intertwine data preprocessing, management, and visualization stages while tracking the provenance of code and visualizations. In collaboration with urban experts, we evaluate Curio through a diverse set of usage scenarios targeting urban accessibility, urban microclimate, and sunlight access. These scenarios use different types of data and domain methodologies to illustrate Curio's flexibility in tackling pressing societal challenges. Curio is available at https://urbantk.org/curio., Comment: Accepted at IEEE VIS 2024. Source code available at https://urbantk.org/curio
- Published
- 2024
19. On the complexity of subshifts and infinite words
- Author
-
Greenfeld, Be'eri, Moreira, Carlos Gustavo, and Zelmanov, Efim
- Subjects
Mathematics - Dynamical Systems ,Mathematics - Combinatorics ,Mathematics - Rings and Algebras - Abstract
We characterize the complexity functions of subshifts up to asymptotic equivalence. The complexity function of every aperiodic function is non-decreasing, submultiplicative and grows at least linearly. We prove that conversely, every function satisfying these conditions is asymptotically equivalent to the complexity function of a recurrent subshift, equivalently, a recurrent infinite word. Our construction is explicit, algorithmic in nature and is philosophically based on constructing certain 'Cantor sets of integers', whose 'gaps' correspond to blocks of zeros. We also prove that every non-decreasing submultiplicative function is asymptotically equivalent, up a linear error term, to the complexity function of a minimal subshift.
- Published
- 2024
20. Gemma 2: Improving Open Language Models at a Practical Size
- Author
-
Gemma Team, Riviere, Morgane, Pathak, Shreya, Sessa, Pier Giuseppe, Hardin, Cassidy, Bhupatiraju, Surya, Hussenot, Léonard, Mesnard, Thomas, Shahriari, Bobak, Ramé, Alexandre, Ferret, Johan, Liu, Peter, Tafti, Pouya, Friesen, Abe, Casbon, Michelle, Ramos, Sabela, Kumar, Ravin, Lan, Charline Le, Jerome, Sammy, Tsitsulin, Anton, Vieillard, Nino, Stanczyk, Piotr, Girgin, Sertan, Momchev, Nikola, Hoffman, Matt, Thakoor, Shantanu, Grill, Jean-Bastien, Neyshabur, Behnam, Bachem, Olivier, Walton, Alanna, Severyn, Aliaksei, Parrish, Alicia, Ahmad, Aliya, Hutchison, Allen, Abdagic, Alvin, Carl, Amanda, Shen, Amy, Brock, Andy, Coenen, Andy, Laforge, Anthony, Paterson, Antonia, Bastian, Ben, Piot, Bilal, Wu, Bo, Royal, Brandon, Chen, Charlie, Kumar, Chintu, Perry, Chris, Welty, Chris, Choquette-Choo, Christopher A., Sinopalnikov, Danila, Weinberger, David, Vijaykumar, Dimple, Rogozińska, Dominika, Herbison, Dustin, Bandy, Elisa, Wang, Emma, Noland, Eric, Moreira, Erica, Senter, Evan, Eltyshev, Evgenii, Visin, Francesco, Rasskin, Gabriel, Wei, Gary, Cameron, Glenn, Martins, Gus, Hashemi, Hadi, Klimczak-Plucińska, Hanna, Batra, Harleen, Dhand, Harsh, Nardini, Ivan, Mein, Jacinda, Zhou, Jack, Svensson, James, Stanway, Jeff, Chan, Jetha, Zhou, Jin Peng, Carrasqueira, Joana, Iljazi, Joana, Becker, Jocelyn, Fernandez, Joe, van Amersfoort, Joost, Gordon, Josh, Lipschultz, Josh, Newlan, Josh, Ji, Ju-yeong, Mohamed, Kareem, Badola, Kartikeya, Black, Kat, Millican, Katie, McDonell, Keelin, Nguyen, Kelvin, Sodhia, Kiranbir, Greene, Kish, Sjoesund, Lars Lowe, Usui, Lauren, Sifre, Laurent, Heuermann, Lena, Lago, Leticia, McNealus, Lilly, Soares, Livio Baldini, Kilpatrick, Logan, Dixon, Lucas, Martins, Luciano, Reid, Machel, Singh, Manvinder, Iverson, Mark, Görner, Martin, Velloso, Mat, Wirth, Mateo, Davidow, Matt, Miller, Matt, Rahtz, Matthew, Watson, Matthew, Risdal, Meg, Kazemi, Mehran, Moynihan, Michael, Zhang, Ming, Kahng, Minsuk, Park, Minwoo, Rahman, Mofi, Khatwani, Mohit, Dao, Natalie, Bardoliwalla, Nenshad, Devanathan, Nesh, Dumai, Neta, Chauhan, Nilay, Wahltinez, Oscar, Botarda, Pankil, Barnes, Parker, Barham, Paul, Michel, Paul, Jin, Pengchong, Georgiev, Petko, Culliton, Phil, Kuppala, Pradeep, Comanescu, Ramona, Merhej, Ramona, Jana, Reena, Rokni, Reza Ardeshir, Agarwal, Rishabh, Mullins, Ryan, Saadat, Samaneh, Carthy, Sara Mc, Cogan, Sarah, Perrin, Sarah, Arnold, Sébastien M. R., Krause, Sebastian, Dai, Shengyang, Garg, Shruti, Sheth, Shruti, Ronstrom, Sue, Chan, Susan, Jordan, Timothy, Yu, Ting, Eccles, Tom, Hennigan, Tom, Kocisky, Tomas, Doshi, Tulsee, Jain, Vihan, Yadav, Vikas, Meshram, Vilobh, Dharmadhikari, Vishal, Barkley, Warren, Wei, Wei, Ye, Wenming, Han, Woohyun, Kwon, Woosuk, Xu, Xiang, Shen, Zhe, Gong, Zhitao, Wei, Zichuan, Cotruta, Victor, Kirk, Phoebe, Rao, Anand, Giang, Minh, Peran, Ludovic, Warkentin, Tris, Collins, Eli, Barral, Joelle, Ghahramani, Zoubin, Hadsell, Raia, Sculley, D., Banks, Jeanine, Dragan, Anca, Petrov, Slav, Vinyals, Oriol, Dean, Jeff, Hassabis, Demis, Kavukcuoglu, Koray, Farabet, Clement, Buchatskaya, Elena, Borgeaud, Sebastian, Fiedel, Noah, Joulin, Armand, Kenealy, Kathleen, Dadashi, Robert, and Andreev, Alek
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We also train the 2B and 9B models with knowledge distillation (Hinton et al., 2015) instead of next token prediction. The resulting models deliver the best performance for their size, and even offer competitive alternatives to models that are 2-3 times bigger. We release all our models to the community.
- Published
- 2024
21. Ge-based Clinopyroxene series: first principles and experimental local probe study
- Author
-
Moreira, Ricardo P., da Silva, E. Lora, Oliveira, Gonçalo N. P., Rocha-Rodrigues, Pedro, Stroppa, Alessandro, Colin, Claire V., Darie, Céline, Correia, João G., Assali, Lucy V. C., Petrilli, Helena M., Lopes, Armandina M. L., and Araújo, João P.
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Strongly Correlated Electrons - Abstract
The structural and electronic properties of the CaMnGe$_2$O$_6$ and SrMnGe$_2$O$_6$ clinopyroxene systems have been investigated by means of perturbed angular correlation (PAC) measurements, performed at ISOLDE, combined with $ab-initio$ electronic structure calculations within the density functional theory (DFT) framework. The partial density of states (PDOS) of the CaMnGe$_2$O$_6$ and SrMnGe$_2$O$_6$ stable compounds has been determined, and it has been observed that the requirement of including an on-site Hubbard-$U$ potential was necessary in order to describe the highly correlated Mn $3d$-states. By considering $U_{eff}$=4 eV, we obtained a band gap width of 1.82 eV and 1.70 eV, for the CaMnGe$_2$O$_6$ and SrMnGe$_2$O$_6$, respectively. Combining electric field gradient (EFG) first principles calculations, using a supercell scheme, with experimental PAC results, we were able to infer that the Cd probe can replace either the $A$ (Ca, Sr) or the Mn sites in the crystalline structures. We also showed that Cd substitution is expected to lead to a reduction in the width of the band gap in these systems, evidencing opportunities for potential band-gap engineering.
- Published
- 2024
22. Entropy production due to spacetime fluctuations
- Author
-
Moreira, Thiago H and Céleri, Lucas C.
- Subjects
General Relativity and Quantum Cosmology ,Quantum Physics - Abstract
Understanding the quantum nature of the gravitational field is undoubtedly one of the greatest challenges in theoretical physics. Despite significant progress, a complete and consistent theory remains elusive. However, in the weak field approximation -- where curvature effects are small -- we can explore some expected properties of such a theory. Particularly relevant to this study is the quantum nature of gravitational waves, which are represented as small perturbations in flat spacetime. In this framework, a quantum description of these perturbations, as a quantum field, is feasible, leading to the emergence of the graviton. Here we consider a non-relativistic quantum system interacting with such a field. We employ the consistent histories approach to quantum mechanics, which allows us to frame classical questions in a quantum context, to define a fluctuation relation for this system. As a result, thermodynamic entropy must be produced in the system due to its unavoidable interaction with the quantum fluctuations of spacetime., Comment: Comments are welcome
- Published
- 2024
23. NV-Retriever: Improving text embedding models with effective hard-negative mining
- Author
-
Moreira, Gabriel de Souza P., Osmulski, Radek, Xu, Mengyao, Ak, Ronay, Schifferer, Benedikt, and Oldridge, Even
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence - Abstract
Text embedding models have been popular for information retrieval applications such as semantic search and Question-Answering systems based on Retrieval-Augmented Generation (RAG). Those models are typically Transformer models that are fine-tuned with contrastive learning objectives. Many papers introduced new embedding model architectures and training approaches, however, one of the key ingredients, the process of mining negative passages, remains poorly explored or described. One of the challenging aspects of fine-tuning embedding models is the selection of high quality hard-negative passages for contrastive learning. In this paper we propose a family of positive-aware mining methods that leverage the positive relevance score for more effective false negatives removal. We also provide a comprehensive ablation study on hard-negative mining methods over their configurations, exploring different teacher and base models. We demonstrate the efficacy of our proposed methods by introducing the NV-Retriever-v1 model, which scores 60.9 on MTEB Retrieval (BEIR) benchmark and 0.65 points higher than previous methods. The model placed 1st when it was published to MTEB Retrieval on July 07, 2024.
- Published
- 2024
24. Hierarchical Homogeneity-Based Superpixel Segmentation: Application to Hyperspectral Image Analysis
- Author
-
Ayres, Luciano Carvalho, de Almeida, Sérgio José Melo, Bermudez, José Carlos Moreira, and Borsoi, Ricardo Augusto
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Hyperspectral image (HI) analysis approaches have recently become increasingly complex and sophisticated. Recently, the combination of spectral-spatial information and superpixel techniques have addressed some hyperspectral data issues, such as the higher spatial variability of spectral signatures and dimensionality of the data. However, most existing superpixel approaches do not account for specific HI characteristics resulting from its high spectral dimension. In this work, we propose a multiscale superpixel method that is computationally efficient for processing hyperspectral data. The Simple Linear Iterative Clustering (SLIC) oversegmentation algorithm, on which the technique is based, has been extended hierarchically. Using a novel robust homogeneity testing, the proposed hierarchical approach leads to superpixels of variable sizes but with higher spectral homogeneity when compared to the classical SLIC segmentation. For validation, the proposed homogeneity-based hierarchical method was applied as a preprocessing step in the spectral unmixing and classification tasks carried out using, respectively, the Multiscale sparse Unmixing Algorithm (MUA) and the CNN-Enhanced Graph Convolutional Network (CEGCN) methods. Simulation results with both synthetic and real data show that the technique is competitive with state-of-the-art solutions.
- Published
- 2024
- Full Text
- View/download PDF
25. Information measures for fermion localization in $f(T, B)$ gravity with non-minimal couplings
- Author
-
Moreira, Allan R. P., Dong, Shi-Hai, and Saridakis, Emmanuel N.
- Subjects
General Relativity and Quantum Cosmology - Abstract
We investigate the dynamics of fermion localization within the framework of $f(T, B)$ gravity featuring non-minimal couplings. Starting from the Dirac action for a spin-$1/2$ fermion in a five-dimensional spacetime governed by torsional $f(T, B)$ gravity, we derive the Dirac equation and we explore its solutions under various non-minimal coupling functions. We examine two realistic forms of the torsional non-minimal coupling and reveal distinct behaviors that impact the localization of both massless and massive fermionic modes on the brane. Additionally, we employ probabilistic measurements, including Shannon entropy theory, Fisher information theory, and relative probability, to analyze the localization of these fermionic modes. The observed effects offer potential insights into probing torsional modifications.
- Published
- 2024
26. Fighting Sampling Bias: A Framework for Training and Evaluating Credit Scoring Models
- Author
-
Kozodoi, Nikita, Lessmann, Stefan, Alamgir, Morteza, Moreira-Matias, Luis, and Papakonstantinou, Konstantinos
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
Scoring models support decision-making in financial institutions. Their estimation and evaluation are based on the data of previously accepted applicants with known repayment behavior. This creates sampling bias: the available labeled data offers a partial picture of the distribution of candidate borrowers, which the model is supposed to score. The paper addresses the adverse effect of sampling bias on model training and evaluation. To improve scorecard training, we propose bias-aware self-learning - a reject inference framework that augments the biased training data by inferring labels for selected rejected applications. For scorecard evaluation, we propose a Bayesian framework that extends standard accuracy measures to the biased setting and provides a reliable estimate of future scorecard performance. Extensive experiments on synthetic and real-world data confirm the superiority of our propositions over various benchmarks in predictive performance and profitability. By sensitivity analysis, we also identify boundary conditions affecting their performance. Notably, we leverage real-world data from a randomized controlled trial to assess the novel methodologies on holdout data that represent the true borrower population. Our findings confirm that reject inference is a difficult problem with modest potential to improve scorecard performance. Addressing sampling bias during scorecard evaluation is a much more promising route to improve scoring practices. For example, our results suggest a profit improvement of about eight percent, when using Bayesian evaluation to decide on acceptance rates.
- Published
- 2024
27. Investigating Imperceptibility of Adversarial Attacks on Tabular Data: An Empirical Analysis
- Author
-
He, Zhipeng, Ouyang, Chun, Alzubaidi, Laith, Barros, Alistair, and Moreira, Catarina
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Cryptography and Security - Abstract
Adversarial attacks are a potential threat to machine learning models by causing incorrect predictions through imperceptible perturbations to the input data. While these attacks have been extensively studied in unstructured data like images, applying them to tabular data, poses new challenges. These challenges arise from the inherent heterogeneity and complex feature interdependencies in tabular data, which differ from the image data. To account for this distinction, it is necessary to establish tailored imperceptibility criteria specific to tabular data. However, there is currently a lack of standardised metrics for assessing the imperceptibility of adversarial attacks on tabular data. To address this gap, we propose a set of key properties and corresponding metrics designed to comprehensively characterise imperceptible adversarial attacks on tabular data. These are: proximity to the original input, sparsity of altered features, deviation from the original data distribution, sensitivity in perturbing features with narrow distribution, immutability of certain features that should remain unchanged, feasibility of specific feature values that should not go beyond valid practical ranges, and feature interdependencies capturing complex relationships between data attributes. We evaluate the imperceptibility of five adversarial attacks, including both bounded attacks and unbounded attacks, on tabular data using the proposed imperceptibility metrics. The results reveal a trade-off between the imperceptibility and effectiveness of these attacks. The study also identifies limitations in current attack algorithms, offering insights that can guide future research in the area. The findings gained from this empirical analysis provide valuable direction for enhancing the design of adversarial attack algorithms, thereby advancing adversarial machine learning on tabular data., Comment: 36 pages
- Published
- 2024
28. Partition regularity of generalized Pythagorean pairs
- Author
-
Frantzikinakis, Nikos, Klurman, Oleksiy, and Moreira, Joel
- Subjects
Mathematics - Combinatorics ,Mathematics - Number Theory ,Primary: 05D10, Secondary:11N37, 11B30, 37A44 - Abstract
We address partition regularity problems for homogeneous quadratic equations. A consequence of our main results is that, under natural conditions on the coefficients $a,b,c$, for any finite coloring of the positive integers, there exists a solution to $ax^2+by^2=cz^2$ where $x$ and $y$ have the same color (and similar results for $x,z$ and $y,z$). For certain choices of $(a,b,c)$, our result is conditional on an Elliott-type conjecture. Our proofs build on and extend previous arguments of the authors dealing with the Pythagorean equation. We make use of new uniformity properties of aperiodic multiplicative functions and concentration estimates for multiplicative functions along arbitrary binary quadratic forms., Comment: 49 pages. Small changes made
- Published
- 2024
29. DALL-M: Context-Aware Clinical Data Augmentation with LLMs
- Author
-
Hsieh, Chihcheng, Moreira, Catarina, Nobre, Isabel Blanco, Sousa, Sandra Costa, Ouyang, Chun, Brereton, Margot, Jorge, Joaquim, and Nascimento, Jacinto C.
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Information Retrieval ,Computer Science - Machine Learning ,I.5.1 ,J.3 ,H.3.3 ,I.2.7 - Abstract
X-ray images are vital in medical diagnostics, but their effectiveness is limited without clinical context. Radiologists often find chest X-rays insufficient for diagnosing underlying diseases, necessitating comprehensive clinical features and data integration. We present a novel framework to enhance the clinical context through augmentation techniques with clinical tabular data, thereby improving its applicability and reliability in AI medical diagnostics. We introduce a pioneering approach to clinical data augmentation that employs large language models to generate patient contextual synthetic data. This methodology is crucial for training more robust deep learning models in healthcare. It preserves the integrity of real patient data while enriching the dataset with contextually relevant synthetic features, significantly enhancing model performance. Our methodology, termed DALL-M, uses a three-phase feature generation process: (i)clinical context storage, (ii)expert query generation, and (iii)context-aware feature augmentation. DALL-M generates new, clinically relevant features by synthesizing chest X-ray images and reports. Applied to 799 cases using nine features from the MIMIC-IV dataset, it created an augmented set of 91 features. This is the first work to generate contextual values for patients' X-ray reports. Specifically, we provide (i)the capacity of LLMs to generate contextual synthetic values for existing clinical features and (ii)their ability to create entirely new clinically relevant features. Empirical validation with machine learning models showed significant performance improvements. Incorporating augmented features increased the F1 score by 16.5% and Precision and Recall by approximately 25%. DALL-M addresses a critical gap in clinical data augmentation, offering a robust framework for generating contextually enriched datasets., Comment: we introduce a pioneering approach to clinical data augmentation that employs large language models (LLMs) to generate patient contextual synthetic data. It preserves the integrity of real patient data while enriching the dataset with contextually relevant synthetic features, significantly enhancing model performance
- Published
- 2024
30. Thermoelectric cooling of a finite reservoir coupled to a quantum dot
- Author
-
Matern, Stephanie, Moreira, Saulo V., Samuelsson, Peter, and Leijnse, Martin
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
We investigate non-equilibrium transport of charge and heat through an interacting quantum dot coupled to a finite electron reservoir. Both the quantum dot and the finite reservoir are coupled to conventional electric contacts, i.e., infinite electron reservoirs, between which a bias voltage can be applied. We develop a phenomenological description of the system, combining a rate equation for transport through the quantum dot with standard linear response expressions for transport between the finite and infinite reservoirs. The finite reservoir is assumed to be in a quasi-equilibrium state with time-dependent chemical potential and temperature which we solve for self-consistently. We show that the finite reservoir can have a large impact on the stationary state transport properties, including a shift and broadening of the Coulomb diamond edges. We also demonstrate that there is a region around the conductance lines where a heat current flows out of the finite reservoir. Our results reveal the dependence of the temperature that can be reached by this thermoelectric cooling on the system parameters, in particular the coupling between the finite and infinite reservoirs and additional heat currents induced by electron-phonon couplings, and can thus serve as a guide to experiments on quantum dot-enabled thermoelectric cooling of finite electron reservoirs. Finally, we study the full dynamics of the system, with a particular focus on the timescales involved in the thermoelectric cooling., Comment: 6+2 pages, 4 figures
- Published
- 2024
31. Bayesian weighted time-lapse full-waveform inversion using a receiver-extension strategy
- Author
-
da Silva, Sergio Luiz E. F., Karsou, Ammir, Moreira, Roger M., and Cetale, Marco
- Subjects
Physics - Geophysics ,Mathematics - Probability ,Statistics - Applications - Abstract
Time-lapse full-waveform inversion (FWI) has become a powerful tool for characterizing and monitoring subsurface changes in various geophysical applications. However, non-repeatability (NR) issues caused, for instance, by GPS inaccuracies, often make it difficult to obtain unbiased time-lapse models. In this work we explore the portability of combining a receiver-extension FWI approach and Bayesian analysis to mitigate time-lapse noises arising from NR issues. The receiver-extension scheme introduces an artificial degree of freedom in positioning receivers, intending to minimize kinematic mismatches between modeled and observed data. Bayesian analysis systematically explores several potential solutions to mitigate time-lapse changes not associated with reservoir responses, assigning probabilities to each scenario based on prior information and available evidence. We consider two different subsurface models to demonstrate the potential of proposed approaches. First, using the Marmousi model, we investigate two NR scenarios associated with background noise in seismic data. Second, using a challenging deep-water Brazilian pre-salt setting, we investigate several NR scenarios to simulate real-world challenges. Our results demonstrate that combining Bayesian analysis with the receiver-extension FWI strategy can mitigate adverse NR effects successfully, producing cleaner and more reliable time-lapse models than conventional approaches. The results also reveal that the proposed Bayesian weighted procedure is a valuable tool for determining time-lapse estimates through statistical analysis of pre-existing models, allowing its application in ongoing time-lapse (4D) projects., Comment: 27 pages, 16 figures. Submitted as a Journal Paper to IEEE Transactions on Geoscience and Remote Sensing
- Published
- 2024
32. Impact of Network Deployment on the Performance of NCR-assisted Networks
- Author
-
da Silva, Gabriel C. M., Sousa, Diego A., Monteiro, Victor F., Moreira, Darlan C., Maciel, Tarcisio F., Lima, Fco. Rafael M., and Makki, Behrooz
- Subjects
Computer Science - Networking and Internet Architecture ,Electrical Engineering and Systems Science - Systems and Control - Abstract
To address the need of coverage enhancement in the fifth generation (5G) of wireless cellular telecommunications, while taking into account possible bottlenecks related to deploying fiber based backhaul (e.g., required cost and time), the 3rd generation partnership project (3GPP) proposed in Release 18 the concept of network-controlled repeaters (NCRs). NCRs enhance previous radio frequency (RF) repeaters by exploring beamforming transmissions controlled by the network through side control information. In this context, this paper introduces the concept of NCR. Furthermore, we present a system level model that allows the performance evaluation of an NCR-assisted network. Finally, we evaluate the network deployment impact on the performance of NCR-assisted networks. As we show, with proper network planning, NCRs can boost the signal to interference-plus-noise ratio (SINR) of the user equipments (UEs) in a poor coverage of a macro base station. Furthermore, celledge UEs and uplink (UL) communications are the ones that benefit the most from the presence of NCRs., Comment: Paper accepted for publication in the conference proceedings of "19th International Symposium on Wireless Communication Systems" (ISWCS)
- Published
- 2024
33. A Road Less Travelled and Beyond: Towards a Roadmap for Integrating Sustainability into Computing Education
- Author
-
Moreira, Ana, Leifler, Ola, Betz, Stefanie, Brooks, Ian, Capilla, Rafael, Coroama, Vlad Constantin, Duboc, Leticia, Fernandes, Joao Paulo, Heldal, Rogardt, Lago, Patricia, Nguyen, Ngoc-Thanh, Oyedeji, Shola, Penzenstadler, Birgit, Peters, Anne Kathrin, Porras, Jari, and Venters, Colin C.
- Subjects
Computer Science - Software Engineering - Abstract
Education for sustainable development has evolved to include more constructive approaches and a better understanding of what is needed to align education with the cultural, societal, and pedagogical changes required to avoid the risks posed by an unsustainable society. This evolution aims to lead us toward viable, equitable, and sustainable futures. However, computing education, including software engineering, is not fully aligned with the current understanding of what is needed for transformational learning in light of our current challenges. This is partly because computing is primarily seen as a technical field, focused on industry needs. Until recently, sustainability was not a high priority for most businesses, including the digital sector, nor was it a prominent focus for higher education institutions and society. Given these challenges, we aim to propose a research roadmap to integrate sustainability principles and essential skills into the crowded computing curriculum, nurturing future software engineering professionals with a sustainability mindset. We conducted two extensive studies: a systematic review of academic literature on sustainability in computing education and a survey of industry professionals on their interest in sustainability and desired skills for graduates. Using insights from these studies, we identified key topics for teaching sustainability, including core sustainability principles, values and ethics, systems thinking, impact measurement, soft skills, business value, legal standards, and advocacy. Based on these findings, we will develop recommendations for future computing education programs that emphasise sustainability. The paper is accepted at the 2030 Software Engineering workshop, which is co-located with the FSE'24 conference.
- Published
- 2024
34. Multivariate extreme values for dynamical systems
- Author
-
Aimino, Romain, Freitas, Ana Cristina Moreira, Freitas, Jorge Milhazes, and Todd, Mike
- Subjects
Mathematics - Dynamical Systems ,Mathematics - Probability ,37A50, 37A25, 37B20, 60G70, 62H05 - Abstract
We establish a theory for multivariate extreme value analysis of dynamical systems. Namely, we provide conditions adapted to the dynamical setting which enable the study of dependence between extreme values of the components of $\R^d$-valued observables evaluated along the orbits of the systems. We study this cross-sectional dependence, which results from the combination of a spatial and a temporal dependence structures. We give several illustrative applications, where concrete systems and dependence sources are introduced and analysed., Comment: Some rewording of the introduction
- Published
- 2024
35. Modelling the evolution of the Galactic disc scale height traced by open clusters
- Author
-
Moreira, Sandro, Moitinho, André, Silva, André, and Almeida, Duarte
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
Context. The scale height of the spatial distribution of open clusters (OCs) in the Milky Way exhibits a well known increase with age which is usually interpreted as evidence for dynamical heating of the disc or of the disc having been thicker in the past. Aims. We address the increase of the scale height with age of the OC population from a different angle. We propose that the apparent thickening of the disc can be largely explained as a consequence of a stronger disruption of OCs near the Galactic plane by disc phenomena, namely encounters with giant molecular clouds (GMCs). Methods. We present a computational model that forms OCs with different initial masses and follows their orbits while subjecting them to different disruption mechanisms. To setup the model and infer its parameters, we use and analyse a Gaia-based OC catalogue (Dias et al. 2021). We investigate both the spatial and age distributions of the OC population and discuss the completeness of the sample. The simulation results are then compared to the observations. Results. Consistent with previous studies, the observations reveal that the SH of the spatial distribution of OCs increases with age. We find that it is very likely that the OC sample is incomplete even for the solar neighbourhood. The model simulations successfully reproduce the SH increase with age and the total number of OCs that survive with age up to 1 Gyr. For older OCs, the predicted SH from the model starts deviating from the observations, although remaining within the uncertainties of the observations. This can be related with effects of incompleteness and/or simplifications in the model. Conclusions. A selective disruption of OCs near the galactic plane through GMC encounters is able to explain the SH evolution of the OC population., Comment: This paper has been submitted to Astronomy and Astrophysics (A&A) on 15/04/2024
- Published
- 2024
36. Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks
- Author
-
Gallifant, Jack, Chen, Shan, Moreira, Pedro, Munch, Nikolaj, Gao, Mingye, Pond, Jackson, Celi, Leo Anthony, Aerts, Hugo, Hartvigsen, Thomas, and Bitterman, Danielle
- Subjects
Computer Science - Computation and Language - Abstract
Medical knowledge is context-dependent and requires consistent reasoning across various natural language expressions of semantically equivalent phrases. This is particularly crucial for drug names, where patients often use brand names like Advil or Tylenol instead of their generic equivalents. To study this, we create a new robustness dataset, RABBITS, to evaluate performance differences on medical benchmarks after swapping brand and generic drug names using physician expert annotations. We assess both open-source and API-based LLMs on MedQA and MedMCQA, revealing a consistent performance drop ranging from 1-10\%. Furthermore, we identify a potential source of this fragility as the contamination of test data in widely used pre-training datasets. All code is accessible at https://github.com/BittermanLab/RABBITS, and a HuggingFace leaderboard is available at https://huggingface.co/spaces/AIM-Harvard/rabbits-leaderboard., Comment: submitted for review, total 15 pages
- Published
- 2024
37. SPARC: Shared Perspective with Avatar Distortion for Remote Collaboration in VR
- Author
-
Simões, João, Maciel, Anderson, Moreira, Catarina, and Jorge, Joaquim
- Subjects
Computer Science - Human-Computer Interaction - Abstract
Telepresence VR systems allow for face-to-face communication, promoting the feeling of presence and understanding of nonverbal cues. However, when discussing virtual 3D objects, limitations to presence and communication cause deictic gestures to lose meaning due to disparities in orientation. Current approaches use shared perspective, and avatar overlap to restore these references, which cause occlusions and discomfort that worsen when multiple users participate. We introduce a new approach to shared perspective in multi-user collaboration where the avatars are not co-located. Each person sees the others' avatars at their positions around the workspace while having a first-person view of the workspace. Whenever a user manipulates an object, others will see his/her arms stretching to reach that object in their perspective. SPARC combines a shared orientation and supports nonverbal communication, minimizing occlusions. We conducted a user study (n=18) to understand how the novel approach impacts task performance and workspace awareness. We found evidence that SPARC is more efficient and less mentally demanding than life-like settings., Comment: 14 pages 8 figures
- Published
- 2024
38. SelfReDepth: Self-Supervised Real-Time Depth Restoration for Consumer-Grade Sensors
- Author
-
Duarte, Alexandre, Fernandes, Francisco, Pereira, João M., Moreira, Catarina, Nascimento, Jacinto C., and Jorge, Joaquim
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Human-Computer Interaction - Abstract
Depth maps produced by consumer-grade sensors suffer from inaccurate measurements and missing data from either system or scene-specific sources. Data-driven denoising algorithms can mitigate such problems. However, they require vast amounts of ground truth depth data. Recent research has tackled this limitation using self-supervised learning techniques, but it requires multiple RGB-D sensors. Moreover, most existing approaches focus on denoising single isolated depth maps or specific subjects of interest, highlighting a need for methods to effectively denoise depth maps in real-time dynamic environments. This paper extends state-of-the-art approaches for depth-denoising commodity depth devices, proposing SelfReDepth, a self-supervised deep learning technique for depth restoration, via denoising and hole-filling by inpainting full-depth maps captured with RGB-D sensors. The algorithm targets depth data in video streams, utilizing multiple sequential depth frames coupled with color data to achieve high-quality depth videos with temporal coherence. Finally, SelfReDepth is designed to be compatible with various RGB-D sensors and usable in real-time scenarios as a pre-processing step before applying other depth-dependent algorithms. Our results demonstrate our approach's real-time performance on real-world datasets. They show that it outperforms state-of-the-art denoising and restoration performance at over 30fps on Commercial Depth Cameras, with potential benefits for augmented and mixed-reality applications., Comment: 13pp, 5 figures, 1 table
- Published
- 2024
- Full Text
- View/download PDF
39. BVE + EKF: A viewpoint estimator for the estimation of the object's position in the 3D task space using Extended Kalman Filters
- Author
-
Magalhães, Sandro Costa, Moreira, António Paulo, Santos, Filipe Neves dos, and Dias, Jorge
- Subjects
Computer Science - Robotics ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Systems and Control - Abstract
RGB-D sensors face multiple challenges operating under open-field environments because of their sensitivity to external perturbations such as radiation or rain. Multiple works are approaching the challenge of perceiving the 3D position of objects using monocular cameras. However, most of these works focus mainly on deep learning-based solutions, which are complex, data-driven, and difficult to predict. So, we aim to approach the problem of predicting the 3D objects' position using a Gaussian viewpoint estimator named best viewpoint estimator (BVE) powered by an extended Kalman filter (EKF). The algorithm proved efficient on the tasks and reached a maximum average Euclidean error of about 32 mm. The experiments were deployed and evaluated in MATLAB using artificial Gaussian noise. Future work aims to implement the system in a robotic system., Comment: Accepted to ICINCO - 21st International Conference on Informatics in Control, Automation and Robotics
- Published
- 2024
40. Bayesian Adaptive Trials for Social Policy
- Author
-
Cripps, Sally, Lopatnikova, Anna, Afshar, Hadi Mohasel, Gales, Ben, Marchant, Roman, Francis, Gilad, Moreira, Catarina, and Fischer, Alex
- Subjects
Statistics - Applications - Abstract
This paper proposes Bayesian Adaptive Trials (BAT) as both an efficient method to conduct trials and a unifying framework for evaluation social policy interventions, addressing limitations inherent in traditional methods such as Randomized Controlled Trials (RCT). Recognizing the crucial need for evidence-based approaches in public policy, the proposal aims to lower barriers to the adoption of evidence-based methods and align evaluation processes more closely with the dynamic nature of policy cycles. BATs, grounded in decision theory, offer a dynamic, ``learning as we go'' approach, enabling the integration of diverse information types and facilitating a continuous, iterative process of policy evaluation. BATs' adaptive nature is particularly advantageous in policy settings, allowing for more timely and context-sensitive decisions. Moreover, BATs' ability to value potential future information sources positions it as an optimal strategy for sequential data acquisition during policy implementation. While acknowledging the assumptions and models intrinsic to BATs, such as prior distributions and likelihood functions, the paper argues that these are advantageous for decision-makers in social policy, effectively merging the best features of various methodologies.
- Published
- 2024
41. On the classical Lagrange and Markov spectra: new results on the local dimension and the geometry of the difference set
- Author
-
Erazo, Harold, Jeffreys, Luke, and Moreira, Carlos Gustavo
- Subjects
Mathematics - Number Theory ,Mathematics - Dynamical Systems ,11J06, 28A78 (Primary) 11A55, 37B10 (Secondary) - Abstract
Let $L$ and $M$ denote the classical Lagrange and Markov spectra, respectively. It is known that $L\subset M$ and that $M\setminus L\neq\varnothing$. Inspired by three questions asked by the third author in previous work investigating the fractal geometric properties of the Lagrange and Markov spectra, we investigate the function $d_{loc}(t)$ that gives the local Hausdorff dimension at a point $t$ of $L'$. Specifically, we construct several intervals (having non-trivial intersection with $L'$) on which $d_{loc}$ is non-decreasing. We also prove that the respective intersections of $M'$ and $M''$ with these intervals coincide. Furthermore, we completely characterize the local dimension of both spectra when restricted to those intervals. Finally, we demonstrate the largest known elements of the difference set $M\setminus L$ and describe two new maximal gaps of $M$ nearby., Comment: 51 pages, 3 figures
- Published
- 2024
42. Oblique spin injection to graphene via geometry controlled magnetic nanowires
- Author
-
Toscano-Figueroa, Jesus C., Burrow, Daniel, Guarochico-Moreira, Victor H., Xie, Chengkun, Thomson, Thomas, Grigorieva, Irina V., and Vera-Marun, Ivan J.
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
We exploit the geometry of magnetic nanowires, which define 1D contacts to an encapsulated graphene channel, to introduce an out-of-plane component in the polarisation of spin carriers. By design, the magnetic nanowires traverse the angled sides of the 2D material heterostructure. Consequently, the easy axis of the nanowires is inclined, and so the local magnetisation is oblique at the injection point. As a result, when performing non-local spin valve measurements we simultaneously observe both switching and spin precession phenomena, implying the spin population possesses both in-plane and out-of-plane polarisation components. By comparing the relative magnitudes of these components, we quantify the angle of the total spin polarisation vector. The extracted angle is consistent with the angle of the nanowire at the graphene interface, evidencing that the effect is a consequence of the device geometry. This simple method of spin-based vector magnetometry provides an alternative technique to define the spin polarisation in 2D spintronic devices.
- Published
- 2024
- Full Text
- View/download PDF
43. Rotation Averaging: A Primal-Dual Method and Closed-Forms in Cycle Graphs
- Author
-
Moreira, Gabriel, Marques, Manuel, and Costeira, João Paulo
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Robotics - Abstract
A cornerstone of geometric reconstruction, rotation averaging seeks the set of absolute rotations that optimally explains a set of measured relative orientations between them. In addition to being an integral part of bundle adjustment and structure-from-motion, the problem of synchronizing rotations also finds applications in visual simultaneous localization and mapping, where it is used as an initialization for iterative solvers, and camera network calibration. Nevertheless, this optimization problem is both non-convex and high-dimensional. In this paper, we address it from a maximum likelihood estimation standpoint and make a twofold contribution. Firstly, we set forth a novel primal-dual method, motivated by the widely accepted spectral initialization. Further, we characterize stationary points of rotation averaging in cycle graphs topologies and contextualize this result within spectral graph theory. We benchmark the proposed method in multiple settings and certify our solution via duality theory, achieving a significant gain in precision and performance., Comment: arXiv admin note: text overlap with arXiv:2109.08046
- Published
- 2024
44. Refraction FWI of a circular shot OBN acquisition in the Brazilian pre-salt region
- Author
-
da Silva, Sérgio Luiz E. F., Costa, Felipe T., Karsou, Ammir, de Souza, Adriano, Capuzzo, Felipe, Moreira, Roger M., Lopez, Jorge, and Cetale, Marco
- Subjects
Physics - Geophysics - Abstract
We develop a workflow based on full-waveform inversion (FWI) to estimate P-wave velocities in a deepwater Brazilian pre-salt field using the recently introduced circular shot ocean bottom node (OBN) acquisition geometry. Such a geometry comprises a source vessel sailing in large radius concentric circular trajectories and seismic signals are recorded by OBN arrays. The circular shot OBN survey provides mostly refracted waves separately from reflected waves, so the FWI process is mainly driven by diving waves. We introduce a new FWI workflow to analyze non-preprocessed OBN refraction data, which includes automated steps such as data selection solving an Eikonal equation, estimation of a source signature that accounts for ghost and bubble effects, and gradient preconditioning using a non-stationary filter and seismic illumination. We consider two objective functions based on the $L^1$ and $L^2$ norms. The FWI results demonstrated that using our proposed workflow with the $L^1$ norm objective function and the circular OBN survey can lead to an improvement in pre-salt velocity models. Furthermore, using these improved models we construct reverse-time migration (RTM) images of the conventional OBN dataset, showing significant improvements in the salt stratification, the base of salt, and the lateral resolution of the pre-salt area. The Brazilian pre-salt case study demonstrated that the circular shot OBN acquisition maximizes the illumination of deep reservoirs through the ultra-long offset and full-azimuth coverage that prioritizes the recording of diving waves., Comment: 25 pages, 18 figures. Submitted as a Journal Paper to IEEE Transactions on Geoscience and Remote Sensing
- Published
- 2024
- Full Text
- View/download PDF
45. Entanglement signature in quantum work statistics in the slow-driving regime
- Author
-
Li, Jian, Mitchison, Mark T., and Moreira, Saulo V.
- Subjects
Quantum Physics ,Condensed Matter - Statistical Mechanics - Abstract
In slowly driven classical systems, work is a stochastic quantity and its probability distribution is known to satisfy the work fluctuation-dissipation relation, which states that the mean and variance of the dissipated work are linearly related. Recently, it was shown that generation of quantum coherence in the instantaneous energy eigenbasis leads to a correction to this linear relation in the slow-driving regime. Here, we go even further by investigating nonclassical features of work fluctuations in setups with more than one system. To do this, we first generalize slow control protocols to encompass multipartite systems, allowing for the generation of quantum correlations during the driving process. Then, focussing on two-qubit systems, we show that entanglement generation leads to a positive contribution to the dissipated work, which is distinct from the quantum correction due to local coherence generation known from previous work. Our results show that entanglement generated during slow control protocols, e.g. as an unavoidable consequence of qubit crosstalk, comes at the cost of increased dissipation.
- Published
- 2024
46. Effects of external field and potential on non-relativistic quantum particles in disclinations background
- Author
-
Ahmed, Faizuddin and Moreira, Allan R. P.
- Subjects
Quantum Physics ,General Relativity and Quantum Cosmology - Abstract
In this work, we investigate the behavior of non-relativistic quantum particles immersed in a cosmic string space-time background. Our study involves the examination of these particles as they interact with a range of influences, including potential, magnetic, and quantum flux fields. We employ analytical methods to solve the associated wave equation, leading to the derivation of eigenvalue solutions for this quantum system. Subsequently, we leverage these eigenvalue solutions to scrutinize several potential models. For each model, we present and engage in a thorough discussion of the corresponding eigenvalue solutions. In an extension of our investigation, we explore the thermodynamic and magnetic properties of the quantum system when it is exposed to non-zero temperature conditions, denoted by $T \neq 0$. Our analysis encompasses the calculation of essential parameters such as the partition function for the system and other pertinent functions. Following these calculations, we meticulously examine and interpret the outcomes, shedding light on the system's behavior and characteristics in the presence of temperature variations. Furthermore, we calculate entropic information to investigate the location of particles in the system., Comment: 20 pages, 6 figures, 3 tables, accepted in IJGMMP (https://doi.org/10.1142/S0219887824502128)
- Published
- 2024
- Full Text
- View/download PDF
47. Learning Visual-Semantic Subspace Representations for Propositional Reasoning
- Author
-
Moreira, Gabriel, Hauptmann, Alexander, Marques, Manuel, and Costeira, João Paulo
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Learning representations that capture rich semantic relationships and accommodate propositional calculus poses a significant challenge. Existing approaches are either contrastive, lacking theoretical guarantees, or fall short in effectively representing the partial orders inherent to rich visual-semantic hierarchies. In this paper, we propose a novel approach for learning visual representations that not only conform to a specified semantic structure but also facilitate probabilistic propositional reasoning. Our approach is based on a new nuclear norm-based loss. We show that its minimum encodes the spectral geometry of the semantics in a subspace lattice, where logical propositions can be represented by projection operators.
- Published
- 2024
48. Most likely configurations for fermion localization in a Braneworld-$f(Q,B_Q)$
- Author
-
Moreira, A. R. P., Dong, Shi-Hai, and Rodrigues, M. E.
- Subjects
General Relativity and Quantum Cosmology - Abstract
This study delves deeply into braneworld scenarios within modified gravity models, investigating their impact on particle localization and the structure of branes. Through a comprehensive blend of numerical analyses and theoretical inquiries, we unravel a nuanced correlation between deviations from standard General Relativity (GR) and the emergence of split branes. By employing probabilistic measurements, we pinpoint stable configurations that align with brane division intervals, thus challenging prevailing assumptions regarding the gravitational framework of our universe. Furthermore, our investigation extends to the localization of fermions within the brane, exposing intricate dynamics shaped by scalar field characteristics and modifications to gravitational models. By harnessing quantum information measurements, notably Shannon entropy, we discern heightened probabilities of fermion localization within the brane as gravitational models diverge from standard paradigms. This underscores the limitations of General Relativity in comprehensively describing the complexities inherent in our universe. Lastly, our exploration of massive fermions unveils their potential to breach the confines of the brane, hinting at promising avenues for future experimental endeavors aimed at probing the nature of extra dimensions and gravitational interactions. This suggests exciting prospects for advancing our understanding of fundamental physics beyond conventional boundaries.
- Published
- 2024
49. Channeling Skyrmions: suppressing the skyrmion Hall effect in ferrimagnetic nanostripes
- Author
-
Silva, R. C., Silva, R. L., Moreira, J. C., Moura-Melo, W. A., and Pereira, A. R.
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Physics - Applied Physics - Abstract
The Skyrmion Hall Effect (SkHE) observed in ferromagnetic (FM) and ferrimagnetic (FI) skyrmions traveling due to a spin-polarized current can be a problematic issue when it comes to technological applications. By investigating the properties of FI skyrmions in racetracks through computational simulations, we have described the nature of their movement based on the relative values of the exchange, Dzyaloshinskii-Moriya, and anisotropy coupling constants. Beyond that, using a design strategy, a magnetic channel-like nano-device is proposed in which a spin-polarized current protocol is created to successfully control the channel on which the skyrmion will travel without the adverse SkHE. Additionally, a simple adjustment in the current strength can modify the skyrmion position sideways between different parallel channels in the nanostripe., Comment: 10 pages, 9 figures
- Published
- 2024
- Full Text
- View/download PDF
50. Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias
- Author
-
Chen, Shan, Gallifant, Jack, Gao, Mingye, Moreira, Pedro, Munch, Nikolaj, Muthukkumar, Ajay, Rajan, Arvind, Kolluri, Jaya, Fiske, Amelia, Hastings, Janna, Aerts, Hugo, Anthony, Brian, Celi, Leo Anthony, La Cava, William G., and Bitterman, Danielle S.
- Subjects
Computer Science - Computation and Language - Abstract
Large language models (LLMs) are increasingly essential in processing natural languages, yet their application is frequently compromised by biases and inaccuracies originating in their training data. In this study, we introduce Cross-Care, the first benchmark framework dedicated to assessing biases and real world knowledge in LLMs, specifically focusing on the representation of disease prevalence across diverse demographic groups. We systematically evaluate how demographic biases embedded in pre-training corpora like $ThePile$ influence the outputs of LLMs. We expose and quantify discrepancies by juxtaposing these biases against actual disease prevalences in various U.S. demographic groups. Our results highlight substantial misalignment between LLM representation of disease prevalence and real disease prevalence rates across demographic subgroups, indicating a pronounced risk of bias propagation and a lack of real-world grounding for medical applications of LLMs. Furthermore, we observe that various alignment methods minimally resolve inconsistencies in the models' representation of disease prevalence across different languages. For further exploration and analysis, we make all data and a data visualization tool available at: www.crosscare.net., Comment: Submitted for review, data visualization tool available at: www.crosscare.net
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.