Author: "A. Román" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"A. Román"' showing total 160,025 results

Start Over Author "A. Román"

160,025 results on '"A. Román"'

201. Schr\'odinger Bridge for Generative Speech Enhancement

Author: Jukić, Ante, Korostik, Roman, Balam, Jagadeesh, and Ginsburg, Boris
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: This paper proposes a generative speech enhancement model based on Schr\"odinger bridge (SB). The proposed model is employing a tractable SB to formulate a data-to-data process between the clean speech distribution and the observed noisy speech distribution. The model is trained with a data prediction loss, aiming to recover the complex-valued clean speech coefficients, and an auxiliary time-domain loss is used to improve training of the model. The effectiveness of the proposed SB-based model is evaluated in two different speech enhancement tasks: speech denoising and speech dereverberation. The experimental results demonstrate that the proposed SB-based outperforms diffusion-based models in terms of speech quality metrics and ASR performance, e.g., resulting in relative word error rate reduction of 20% for denoising and 6% for dereverberation compared to the best baseline model. The proposed model also demonstrates improved efficiency, achieving better quality than the baselines for the same number of sampling steps and with a reduced computational cost.
Published: 2024

202. Benchmarking of quantum fidelity kernels for Gaussian process regression

Author: Guo, Xuyang, Dai, Jun, and Krems, Roman V.
Subjects: Quantum Physics, Physics - Chemical Physics, Physics - Computational Physics
Abstract: Quantum computing algorithms have been shown to produce performant quantum kernels for machine-learning classification problems. Here, we examine the performance of quantum kernels for regression problems of practical interest. For an unbiased benchmarking of quantum kernels, it is necessary to construct the most optimal functional form of the classical kernels and the most optimal quantum kernels for each given data set. We develop an algorithm that uses an analog of the Bayesian information criterion to optimize the sequence of quantum gates used to estimate quantum kernels for Gaussian process models. The algorithm increases the complexity of the quantum circuits incrementally, while improving the performance of the resulting kernels, and is shown to yield much higher model accuracy with fewer quantum gates than a fixed quantum circuit ansatz. We demonstrate that quantum kernels thus obtained can be used to build accurate models of global potential energy surfaces (PES) for polyatomic molecules. The average interpolation error of the six-dimensional PES obtained with a random distribution of 2000 energy points is 16 cm$^{-1}$ for H$_3$O$^+$, 15 cm$^{-1}$ for H$_2$CO and 88 cm$^{-1}$ for HNO$_2$. We show that a compositional optimization of classical kernels for Gaussian process regression converges to the same errors. This indicates that quantum kernels can achieve the same, though not better, expressivity as classical kernels for regression problems.
Published: 2024
Full Text: View/download PDF

203. Lepton-Flavor-Violating ALP Signals with TeV-Scale Muon Beams

Author: Batell, Brian, Davoudiasl, Hooman, Marcarelli, Roman, Neil, Ethan T., and Trojanowski, Sebastian
Subjects: High Energy Physics - Phenomenology, High Energy Physics - Experiment
Abstract: We explore the feasibility of using TeV-energy muons to probe lepton-flavor-violating (LFV) processes mediated by an axion-like particle (ALP) $a$ with mass $\mathcal{O}(10~\textrm{GeV})$. We focus on $\mu\tau$ LFV interactions and assume that the ALP is coupled to a dark state $\chi$, which can be either less or more massive than $a$. Such a setup is demonstrated to be consistent with $\chi$ being a candidate for dark matter, in the experimentally relevant regime of parameters. We consider the currently operating NA64-$\mu$ experiment and proposed FASER$\nu$2 detector as both the target and the detector for the process $\mu A \to \tau A\, a$, where $A$ is the target nucleus. We also show that a possible future active muon fixed-target experiment operating at a 3 TeV muon collider or in its preparatory phase can provide an impressive reach for the LFV process considered, with future FASER$\nu$2 data providing a pilot study towards that goal. The implications of the muon anomalous magnetic moment $(g-2)_\mu$ measurements for the underlying model, in case of a positive signal, are also examined, and a sample UV completion is outlined., Comment: 16 pages, 7 figures, matches published version
Published: 2024
Full Text: View/download PDF

204. High strength self-healable supercapacitor based on supramolecular polymer hydrogel with upper critical solubility temperature

Author: Elashnikov, Roman, Khrystonko, Olena, Jilková, Tereza, Rimpelová, Silvie, Kolská, Zdenka, Švorčík, Václav, and Lyutakov, Oleksiy
Subjects: Physics - Applied Physics, Condensed Matter - Materials Science
Abstract: Here, we report poly(N-acryloylglycinamide-co-vinyltriazole) p(NAGA-co-VTZ) supramolecular polymer hydrogel doped with activated polypyrrole nanotubes (acPPyNTs) as a high-strength self-healable material for supercapacitors. First, the p(NAGA-co-VTZ) hydrogel films were synthesized by photopolymerization of N-acryloylglycinamide and 1-vinyl-1,2,4-triazole without any cross-linkers. Scanning electron microscopy and mechanical tests showed that initial monomer concentration strongly affects both hydrogel microstructure and resulted mechanical properties. The hydrogels demonstrated self-healing ability through hydrogen bonding at the temperatures above upper critical solubility temperature, excellent mechanical properties (0.9 MPa), large stretchability (1300 %) and cut resistance. Next, as active material for electrochemical double layer capacitors (EDLC) carbonized and ethanol/KOH activated polypyrrole nanotubes (acPPyNTs) were prepared. Symmetric self-healable supercapacitor employing p(NAGA-co-VTZ) hydrogel, acPPyNTs and aqueous 3M KCl solution was assembled. Cyclic voltammetry, galvanostatic charge-discharge measurements showed that the prepared device gave a specific capacitance of 282.62 F g-1 at 0.2 A g-1 and high areal capacitance of 316.86 mF cm-2 at scan rate of 10 mV s-1. The supercapacitor operates over wide voltage window (0-1.2 V) and provides excellent cyclic performance with capacitance retention of 97 % after 10 000 cycles and 94 % after self-healing. Overall, the prepared self-healable supercapacitor appears to have considerable potential as high-performance energy storage device., Comment: 17 pages, 5 figures, 1 table
Published: 2024

205. Derandomized Truncated D-vine Copula Knockoffs with e-values to control the false discovery rate

Author: Vásquez, Alejandro Román, Urbina, José Ulises Márquez, Farías, Graciela González, and Escarela, Gabriel
Subjects: Statistics - Methodology, 62-08, 62Gxx, 62H05, 62E17, G.3
Abstract: The Model-X knockoffs is a practical methodology for variable selection, which stands out from other selection strategies since it allows for the control of the false discovery rate (FDR), relying on finite-sample guarantees. In this article, we propose a Truncated D-vine Copula Knockoffs (TDCK) algorithm for sampling approximate knockoffs from complex multivariate distributions. Our algorithm enhances and improves features of previous attempts to sample knockoffs under the multivariate setting, with the three main contributions being: 1) the truncation of the D-vine copula, which reduces the dependence between the original variables and their corresponding knockoffs, improving the statistical power; 2) the employment of a straightforward non-parametric formulation for marginal transformations, eliminating the need for a specific parametric family or a kernel density estimator; 3) the use of the "rvinecopulib'' R package offers better flexibility than the existing fitting vine copula knockoff methods. To eliminate the randomness in distinct realizations resulting in different sets of selected variables, we wrap the TDCK method with an existing derandomizing procedure for knockoffs, leading to a Derandomized Truncated D-vine Copula Knockoffs with e-values (DTDCKe) procedure. We demonstrate the robustness of the DTDCKe procedure under various scenarios with extensive simulation studies. We further illustrate its efficacy using a gene expression dataset, showing it achieves a more reliable gene selection than other competing methods, when the findings are compared with those of a meta-analysis. The results indicate that our Truncated D-vine copula approach is robust and has superior power, representing an appealing approach for variable selection in different multivariate applications, particularly in gene expression analysis., Comment: 31 pages, 9 figures, 1 table
Published: 2024

206. Phi-3 Safety Post-Training: Aligning Language Models with a 'Break-Fix' Cycle

Author: Haider, Emman, Perez-Becker, Daniel, Portet, Thomas, Madan, Piyush, Garg, Amit, Ashfaq, Atabak, Majercak, David, Wen, Wen, Kim, Dongwoo, Yang, Ziyi, Zhang, Jianwen, Sharma, Hiteshi, Bullwinkel, Blake, Pouliot, Martin, Minnich, Amanda, Chawla, Shiven, Herrera, Solianna, Warreth, Shahed, Engler, Maggie, Lopez, Gary, Chikanov, Nina, Dheekonda, Raja Sekhar Rao, Jagdagdorj, Bolor-Erdene, Lutz, Roman, Lundeen, Richard, Westerhoff, Tori, Bryan, Pete, Seifert, Christian, Kumar, Ram Shankar Siva, Berkley, Andrew, and Kessler, Alex
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Recent innovations in language model training have demonstrated that it is possible to create highly performant models that are small enough to run on a smartphone. As these models are deployed in an increasing number of domains, it is critical to ensure that they are aligned with human preferences and safety considerations. In this report, we present our methodology for safety aligning the Phi-3 series of language models. We utilized a "break-fix" cycle, performing multiple rounds of dataset curation, safety post-training, benchmarking, red teaming, and vulnerability identification to cover a variety of harm areas in both single and multi-turn scenarios. Our results indicate that this approach iteratively improved the performance of the Phi-3 models across a wide range of responsible AI benchmarks. Finally, we include additional red teaming strategies and evaluations that were used to test the safety behavior of Phi-3.5-mini and Phi-3.5-MoE, which were optimized for multilingual capabilities.
Published: 2024

207. GRBAlpha and VZLUSAT-2: GRB observations with CubeSats after 3 years of operations

Author: Münz, Filip, Řípa, Jakub, Pál, András, Dafčíková, Marianna, Werner, Norbert, Ohno, Masanori, Meszáros, László, Dániel, Vladimír, Hanák, Peter, Hudec, Ján, Frajt, Marcel, Kapuš, Jakub, Svoboda, Petr, Dudáš, Juraj, Kasal, Miroslav, Vítek, Tomáš, Kolář, Martin, Szakszonová, Lea, Lipovský, Pavol, Ďuríšková, Michaela, Veřtát, Ivo, Sabol, Martin, Junas, Milan, Maroš, Roman, Kosík, Pavel, Frei, Zsolt, Takahashi, Hiromitsu, Fukazawa, Yasushi, Galgóczi, Gábor, Csák, Balázs, László, Robert, Mizuno, Tsunefumi, Husáriková, Nikola, and Nakazawa, Kazuhiro
Subjects: Astrophysics - High Energy Astrophysical Phenomena
Abstract: GRBAlpha is a 1U CubeSat launched in March 2021 to a sun-synchronous LEO at an altitude of 550 km to perform an in-orbit demonstration of a novel gamma-ray burst detector developed for CubeSats. VZLUSAT-2 followed ten months later in a similar orbit carrying as a secondary payload a pair of identical detectors as used on the first mission. These instruments detecting gamma-rays in the range of 30-900 keV consist of a 56 cm2 5 mm thin CsI(Tl) scintillator read-out by a row of multi-pixel photon counters (MPPC or SiPM). The scientific motivation is to detect gamma-ray bursts and other HE transient events and serve as a pathfinder for a larger constellation of nanosatellites that could localize these events via triangulation. At the beginning of July 2024, GRBAlpha detected 140 such transients, while VZLUSAT-2 had 83 positive detections, confirmed by larger GRB missions. Almost a hundred of them are identified as gamma-ray bursts, including extremely bright GRB 221009A and GRB 230307A, detected by both satellites. We were able to characterize the degradation of SiPMs in polar orbit and optimize the duty cycle of the detector system also by using SatNOGS radio network for downlink., Comment: Presented at SPIE.Astronomical Telescopes and Instrumentations,2024
Published: 2024
Full Text: View/download PDF

208. Policies Grow on Trees: Model Checking Families of MDPs

Author: Andriushchenko, Roman, Češka, Milan, Junges, Sebastian, and Macák, Filip
Subjects: Computer Science - Logic in Computer Science
Abstract: Markov decision processes (MDPs) provide a fundamental model for sequential decision making under process uncertainty. A classical synthesis task is to compute for a given MDP a winning policy that achieves a desired specification. However, at design time, one typically needs to consider a family of MDPs modelling various system variations. For a given family, we study synthesising (1) the subset of MDPs where a winning policy exists and (2) a preferably small number of winning policies that together cover this subset. We introduce policy trees that concisely capture the synthesis result. The key ingredient for synthesising policy trees is a recursive application of a game-based abstraction. We combine this abstraction with an efficient refinement procedure and a post-processing step. An extensive empirical evaluation demonstrates superior scalability of our approach compared to naive baselines. For one of the benchmarks, we find 246 winning policies covering 94 million MDPs. Our algorithm requires less than 30 minutes, whereas the naive baseline only covers 3.7% of MDPs in 24 hours., Comment: to be published at ATVA 2024
Published: 2024

209. Polylogarithmic functions with prescribed branching locus and linear relations between them

Author: Lee, Roman N.
Subjects: High Energy Physics - Theory, High Energy Physics - Phenomenology
Abstract: We consider the problem of finding the set of classical polylogarithmic functions $\text{Li}_n$ with branching locus determined by the solution of $p_1\cdot p_2\cdot \ldots \cdot p_n=0$, where $p_1,\ldots, p_n$ are irreducible polynomials of several variables. We present an algorithm of constructing a complete set of possible arguments of $\text{Li}_n$ functions. The corresponding Mathematica code is included as ancillary file. Using this algorithm and the symbol map, we provide some examples of polylogarithmic identities., Comment: 7 pages
Published: 2024

210. Distributions and correlation properties of offshore wind speeds and wind speed increments

Author: Sim, So-Kumneth, Maass, Philipp, and Roman, H. Eduardo
Subjects: Physics - Atmospheric and Oceanic Physics
Abstract: We determine distributions and correlation properties of offshore wind speeds and wind speed increments by analyzing wind data sampled with a resolution of one second for 20 months at different heights above sea level in the North Sea. Distributions of horizontal wind speeds can be fitted to Weibull distributions with shape and scale parameters varying weakly with the vertical height separation. Kullback-Leibler divergences between distributions at different heights change with the squared logarithm of the height ratio. Cross-correlations between time derivatives of wind speeds are long-term anticorrelated, and the even parts of their correlation functions satisfy sum rules. Distributions of horizontal wind speed increments change from a tent-like shape to a Gaussian with rising increment lag. A surprising peak occurs in the left tail of the increment distributions for lags in a range $10-200\,{\rm km}$ after applying the Taylor's hypothesis locally to transform time lags into distances. The peak is decisive in order to obtain an expected and observed linear scaling of third-order structure functions with distance. This suggests that it is an intrinsic feature of atmospheric turbulence., Comment: 20 pages, 14 figures
Published: 2024
Full Text: View/download PDF

211. The role of precursor coverage in the synthesis and substrate transfer of graphene nanoribbons

Author: Darawish, Rimah, Braun, Oliver, Muellen, Klaus, Calame, Michel, Ruffieux, Pascal, Fasel, Roman, and Barin, Gabriela Borin
Subjects: Condensed Matter - Materials Science
Abstract: Graphene nanoribbons (GNRs) with atomically precise widths and edge topologies have well-defined band gaps that depend on ribbon dimensions, making them ideal for room-temperature switching applications like field-effect transistors (FETs). For efficient device integration, it is crucial to optimize growth conditions to maximize GNR length and, consequently, device yield. Here, we investigate the growth and alignment of 9-atom-wide armchair graphene nanoribbons (9-AGNRs) on a vicinal gold substrate, Au(788), with varying molecular precursor doses (PD) and, therefore, different resulting GNR coverages. Our investigation reveals that GNR growth location on the Au(788) substrate is coverage-dependent. Furthermore, scanning tunneling microscopy shows a strong correlation between the GNR length evolution and both the PD and the GNR growth location on the substrate. Employing Raman spectroscopy, we analyze samples with eight different PDs on Au(788). We find that GNR alignment improves with length, achieving near-perfect alignment with an average GNR length of ~40 nm for GNRs growing solely at Au(788) step edges. To fully exploit GNR properties in device architectures, GNRs need to be transferred from the gold to semiconducting or insulating substrates. Upon substrate transfer, samples with higher PD present systematically better alignment preservation and less surface disorder, which we attribute to reduced GNR mobility during the transfer process. PD also affects the substrate transfer success rate, with higher success rates observed for samples with higher GNR coverages (77%) compared to those with lower GNR coverages (53%). Our findings characterize the important relationship between precursor dose, GNR length, alignment quality, and surface disorder during GNR growth and upon substrate transfer, offering crucial insights for the further development of GNR-based nanoelectronic devices.
Published: 2024

212. Conformational tuning of magnetic interactions in coupled nanographenes

Author: Catarina, Gonçalo, Turco, Elia, Krane, Nils, Bommert, Max, Ortega-Guerrero, Andres, Gröning, Oliver, Ruffieux, Pascal, Fasel, Roman, and Pignedoli, Carlo A.
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Phenalenyl (C$_{13}$H$_9$) is an open-shell spin-$1/2$ nanographene. Using scanning tunneling microscopy (STM) inelastic electron tunneling spectroscopy (IETS), covalently-bonded phenalenyl dimers have been shown to feature conductance steps associated with singlet-triplet excitations of a spin-$1/2$ dimer with antiferromagnetic exchange. Here, we address the possibility of tuning the magnitude of the exchange interactions by varying the dihedral angle between the two molecules within a dimer. Theoretical methods, ranging from density functional theory calculations to many-body model Hamiltonians solved within different levels of approximation, are used to explain STM-IETS measurements of twisted phenalenyl dimers on a h-BN/Rh(111) surface. By means of first-principles calculations, we also propose strategies to induce sizable twist angles in surface-adsorbed phenalenyl dimers via functional groups, including a photoswitchable scheme. This work paves the way toward tuning magnetic couplings in carbon-based spin chains and two-dimensional lattices.
Published: 2024
Full Text: View/download PDF

213. HuBar: A Visual Analytics Tool to Explore Human Behaviour based on fNIRS in AR guidance systems

Author: Castelo, Sonia, Rulff, Joao, Solunke, Parikshit, McGowan, Erin, Wu, Guande, Roman, Iran, Lopez, Roque, Steers, Bea, Sun, Qi, Bello, Juan, Feest, Bradley, Middleton, Michael, Mckendrick, Ryan, and Silva, Claudio
Subjects: Computer Science - Human-Computer Interaction
Abstract: The concept of an intelligent augmented reality (AR) assistant has significant, wide-ranging applications, with potential uses in medicine, military, and mechanics domains. Such an assistant must be able to perceive the environment and actions, reason about the environment state in relation to a given task, and seamlessly interact with the task performer. These interactions typically involve an AR headset equipped with sensors which capture video, audio, and haptic feedback. Previous works have sought to facilitate the development of intelligent AR assistants by visualizing these sensor data streams in conjunction with the assistant's perception and reasoning model outputs. However, existing visual analytics systems do not focus on user modeling or include biometric data, and are only capable of visualizing a single task session for a single performer at a time. Moreover, they typically assume a task involves linear progression from one step to the next. We propose a visual analytics system that allows users to compare performance during multiple task sessions, focusing on non-linear tasks where different step sequences can lead to success. In particular, we design visualizations for understanding user behavior through functional near-infrared spectroscopy (fNIRS) data as a proxy for perception, attention, and memory as well as corresponding motion data (acceleration, angular velocity, and gaze). We distill these insights into embedding representations that allow users to easily select groups of sessions with similar behaviors. We provide two case studies that demonstrate how to use these visualizations to gain insights about task performance using data collected during helicopter copilot training tasks. Finally, we evaluate our approach by conducting an in-depth examination of a think-aloud experiment with five domain experts., Comment: 11 pages, 6 figures. This is the author's version of the article that has been accepted for publication in IEEE Transactions on Visualization and Computer Graphics (TVCG)
Published: 2024

214. On a possible $^{3}_{\phi}$H hypernucleus with HAL QCD interaction

Author: Filikhin, Igor, Kezerashvili, Roman Ya., and Vlahovic, Branislav
Subjects: Nuclear Theory, High Energy Physics - Phenomenology
Abstract: Within the framework of the Faddeev formalism in configuration space, we investigate bound states in the $\phi NN$ system with total isospin $T=0$ and $T=1$. The recently proposed lattice HAL QCD $\phi N$ potential in the $^{4}S_{3/2}$ channel does not support either $\phi N$ or $\phi NN$ bound states. The HAL QCD $\phi N$ potential in the $^{2}S_{1/2}$ channel suggests the bound states for $\phi N$ and $\phi NN (S=0)$ systems. However, the binding energies are highly sensitive to variations of the enhancement factor $\beta$, and the $\phi NN$ system is extremely strongly bound in the state $S=0$. Considering a spin-averaged potential %$(\frac{1}{3}V_{\phi N}^{1/2}+\frac{2}{3}V_{\phi N}^{3/2})$ for the state $S=1$ yields a bound state for $^3_\phi$H $(S=1)$ hypernucleus with the binding energy (BE) 14.9 MeV when $\beta = 6.9$. The evaluation of the BE for the $S=1$, $T=1$ three-body state results in 5.47 MeV. %Also, We evaluated the BE for the $S=1$, $T=1$ three-body state as 5.47 MeV. Additionally, calculations using our approach confirm the bound states for the $\phi NN$ ($S=2,T=0$ and $S=1, T=1$) system previously predicted with the Yukawa-type potential motivated by the QCD van der Waals attractive force, mediated by multi-gluon exchanges., Comment: 6 Pages, 1 figure
Published: 2024
Full Text: View/download PDF

215. An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments

Author: Cabrera, J. J., Román, V., Gil, A., Reinoso, O., and Payá, L.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: The objective of this paper is to address the localization problem using omnidirectional images captured by a catadioptric vision system mounted on the robot. For this purpose, we explore the potential of Siamese Neural Networks for modeling indoor environments using panoramic images as the unique source of information. Siamese Neural Networks are characterized by their ability to generate a similarity function between two input data, in this case, between two panoramic images. In this study, Siamese Neural Networks composed of two Convolutional Neural Networks (CNNs) are used. The output of each CNN is a descriptor which is used to characterize each image. The dissimilarity of the images is computed by measuring the distance between these descriptors. This fact makes Siamese Neural Networks particularly suitable to perform image retrieval tasks. First, we evaluate an initial task strongly related to localization that consists in detecting whether two images have been captured in the same or in different rooms. Next, we assess Siamese Neural Networks in the context of a global localization problem. The results outperform previous techniques for solving the localization task using the COLD-Freiburg dataset, in a variety of lighting conditions, specially when using images captured in cloudy and night conditions., Comment: Published: 08 July 2024 Paper link: https://link.springer.com/content/pdf/10.1007/s10462-024-10840-0.pdf
Published: 2024
Full Text: View/download PDF

216. Surprising symmetry properties and exact solutions of Kolmogorov backward equations with power diffusivity

Author: Koval, Serhii D., Cardoso-Bihlo, Elsa Dos Santos, and Popovych, Roman O.
Subjects: Mathematical Physics, Mathematics - Analysis of PDEs, 35B06, 35A30, 35C05, 35K10, 35K70
Abstract: Using the original advanced version of the direct method, we efficiently compute the equivalence groupoids and equivalence groups of two peculiar classes of Kolmogorov backward equations with power diffusivity and solve the problems of their complete group classifications. The results on the equivalence groups are double-checked with the algebraic method. Within these classes, the remarkable Fokker-Planck and the fine Kolmogorov backward equations are distinguished by their exceptional symmetry properties. We extend the known results on these two equations to their counterparts with respect to a distinguished discrete equivalence transformation. Additionally, we carry out Lie reductions of the equations under consideration up to the point equivalence, exhaustively study their hidden Lie symmetries and generate wider families of their new exact solutions via acting by their recursion operators on constructed Lie-invariant solutions. This analysis reveals eight powers of the space variable with exponents -1, 0, 1, 2, 3, 4, 5 and 6 as values of the diffusion coefficient that are prominent due to symmetry properties of the corresponding equations., Comment: 38 pages
Published: 2024

217. NinjaLLM: Fast, Scalable and Cost-effective RAG using Amazon SageMaker and AWS Trainium and Inferentia2

Author: Xue, Tengfei, Li, Xuefeng, Smirnov, Roman, Azim, Tahir, Sadrieh, Arash, and Pahlavan, Babak
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, I.2.7
Abstract: Retrieval-augmented generation (RAG) techniques are widely used today to retrieve and present information in a conversational format. This paper presents a set of enhancements to traditional RAG techniques, focusing on large language models (LLMs) fine-tuned and hosted on AWS Trainium and Inferentia2 AI chips via SageMaker. These chips are characterized by their elasticity, affordability, and efficient performance for AI compute tasks. Besides enabling deployment on these chips, this work aims to improve tool usage, add citation capabilities, and mitigate the risks of hallucinations and unsafe responses due to context bias. We benchmark our RAG system's performance on the Natural Questions and HotPotQA datasets, achieving an accuracy of 62% and 59% respectively, exceeding other models such as DBRX and Mixtral Instruct.
Published: 2024

218. Discovery of a Hypervelocity L Subdwarf at the Star/Brown Dwarf Mass Limit

Author: Burgasser, Adam J., Gerasimov, Roman, Kremer, Kyle, Brooks, Hunter, Alvarado III, Efrain, Schneider, Adam C., Meisner, Aaron M., Theissen, Christopher A., Softich, Emma, Karpoor, Preethi, Bickle, Thomas P., Kabatnik, Martin, Rothermich, Austin, Caselden, Dan, Kirkpatrick, J. Davy, Faherty, Jacqueline K., Casewell, Sarah L., Kuchner, Marc J., Worlds, the Backyard, and Collaboration, Planet 9
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: We report the discovery of a high velocity, very low-mass star or brown dwarf whose kinematics suggest it is unbound to the Milky Way. CWISE J124909.08+362116.0 was identified by citizen scientists in the Backyard Worlds: Planet 9 program as a high proper motion ($\mu$ $=$ 0''9/yr) faint red source. Moderate resolution spectroscopy with Keck/NIRES reveals it to be a metal-poor early L subdwarf with a large radial velocity ($-$103$\pm$10 km/s), and its estimated distance of 125$\pm$8 pc yields a speed of 456$\pm$27 km/s in the Galactic rest frame, near the local escape velocity for the Milky Way. We explore several potential scenarios for the origin of this source, including ejection from the Galactic center $\gtrsim$3 Gyr in the past, survival as the mass donor companion to an exploded white dwarf. acceleration through a three-body interaction with a black hole binary in a globular cluster, and accretion from a Milky Way satellite system. CWISE J1249+3621 is the first hypervelocity very low mass star or brown dwarf to be found, and the nearest of all such systems. It may represent a broader population of very high velocity, low-mass objects that have undergone extreme accelerations., Comment: 17 pages, 3 figures, accepted for publication in Astrophysical Journal Letters
Published: 2024

219. Verificarlo CI: continuous integration for numerical optimization and debugging

Author: Delval, Aurélien, Coppens, François, Petit, Eric, Iakymchuk, Roman, and Castro, Pablo de Oliveira
Subjects: Computer Science - Software Engineering
Abstract: Floating-point accuracy is an important concern when developing numerical simulations or other compute-intensive codes. Tracking the introduction of numerical regression is often delayed until it provokes unexpected bug for the end-user. In this paper, we introduce Verificarlo CI, a continuous integration workflow for the numerical optimization and debugging of a code over the course of its development. We demonstrate applicability of Verificarlo CI on two test-case applications.
Published: 2024

220. Scaling Exponents Across Parameterizations and Optimizers

Author: Everett, Katie, Xiao, Lechao, Wortsman, Mitchell, Alemi, Alexander A., Novak, Roman, Liu, Peter J., Gur, Izzeddin, Sohl-Dickstein, Jascha, Kaelbling, Leslie Pack, Lee, Jaehoon, and Pennington, Jeffrey
Subjects: Computer Science - Machine Learning
Abstract: Robust and effective scaling of models from small to large width typically requires the precise adjustment of many algorithmic and architectural details, such as parameterization and optimizer choices. In this work, we propose a new perspective on parameterization by investigating a key assumption in prior work about the alignment between parameters and data and derive new theoretical results under weaker assumptions and a broader set of optimizers. Our extensive empirical investigation includes tens of thousands of models trained with all combinations of three optimizers, four parameterizations, several alignment assumptions, more than a dozen learning rates, and fourteen model sizes up to 26.8B parameters. We find that the best learning rate scaling prescription would often have been excluded by the assumptions in prior work. Our results show that all parameterizations, not just maximal update parameterization (muP), can achieve hyperparameter transfer; moreover, our novel per-layer learning rate prescription for standard parameterization outperforms muP. Finally, we demonstrate that an overlooked aspect of parameterization, the epsilon parameter in Adam, must be scaled correctly to avoid gradient underflow and propose Adam-atan2, a new numerically stable, scale-invariant version of Adam that eliminates the epsilon hyperparameter entirely., Comment: 63 pages, International Conference on Machine Learning 2024
Published: 2024

221. The ALMA-ALPAKA survey II. Evolution of turbulence in galaxy disks across cosmic time: difference between cold and warm gas

Author: Rizzo, F., Bacchini, C., Kohandel, M., Di Mascolo, L., Fraternali, F., Roman-Oliveira, F., Zanella, A., Popping, G., Valentino, F., Magdis, G., and Whitaker, K.
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Cosmology and Nongalactic Astrophysics
Abstract: The gas in the interstellar medium (ISM) of galaxies is supersonically turbulent. Measurements of turbulence typically rely on cold gas emission lines for low-z galaxies and warm ionized gas observations for z>0 galaxies. Studies of warm gas kinematics at z>0 conclude that the turbulence strongly evolves as a function of redshift, due to the increasing impact of gas accretion and mergers in the early Universe. However, recent findings suggest potential biases in turbulence measurements derived from ionized gas at high-z, impacting our understanding of turbulence origin, ISM physics and disk formation. We investigate the evolution of turbulence using velocity dispersion ($\sigma$) measurements from cold gas tracers (i.e., CO, [CI], [CII]) derived from a sample of 57 galaxy disks spanning the redshift range z=0-5. This sample consists of main-sequence and starburst galaxies with stellar masses $\gtrsim 10^{10} M_{\odot}$. The comparison with current H$\alpha$ kinematic observations and existing models demonstrates that the velocity dispersion inferred from cold gas tracers differ by a factor of $\approx 3$ from those obtained using emission lines tracing warm gas. We show that stellar feedback is the main driver of turbulence measured from cold gas tracers. This is fundamentally different from the conclusions of studies based on warm gas, which had to consider additional turbulence drivers to explain the high values of $\sigma$. We present a model predicting the redshift evolution of turbulence in galaxy disks, attributing the increase of $\sigma$ with redshift to the higher energy injected by supernovae due to the elevated star-formation rate in high-z galaxies. This supernova-driven model suggests that turbulence is lower in galaxies with lower stellar mass compared to those with higher stellar mass. Additionally, it forecasts the evolution of $\sigma$ in Milky-Way like progenitors., Comment: Accepted for publication in A&A. The abstract has been modified to comply with arXiv's character limit
Published: 2024
Full Text: View/download PDF

222. HyperKAN: Kolmogorov-Arnold Networks make Hyperspectral Image Classificators Smarter

Author: Lobanov, Valeriy, Firsov, Nikita, Myasnikov, Evgeny, Khabibullin, Roman, and Nikonorov, Artem
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In traditional neural network architectures, a multilayer perceptron (MLP) is typically employed as a classification block following the feature extraction stage. However, the Kolmogorov-Arnold Network (KAN) presents a promising alternative to MLP, offering the potential to enhance prediction accuracy. In this paper, we propose the replacement of linear and convolutional layers of traditional networks with KAN-based counterparts. These modifications allowed us to significantly increase the per-pixel classification accuracy for hyperspectral remote-sensing images. We modified seven different neural network architectures for hyperspectral image classification and observed a substantial improvement in the classification accuracy across all the networks. The architectures considered in the paper include baseline MLP, state-of-the-art 1D (1DCNN) and 3D convolutional (two different 3DCNN, NM3DCNN), and transformer (SSFTT) architectures, as well as newly proposed M1DCNN. The greatest effect was achieved for convolutional networks working exclusively on spectral data, and the best classification quality was achieved using a KAN-based transformer architecture. All the experiments were conducted using seven openly available hyperspectral datasets. Our code is available at https://github.com/f-neumann77/HyperKAN.
Published: 2024

223. Ultra-low-energy defibrillation through adjoint optimization

Author: Garzon, Alejandro and Grigoriev, Roman O.
Subjects: Quantitative Biology - Tissues and Organs
Abstract: This study investigates ultra-low-energy defibrillation protocols using a simple two-dimensional model of cardiac tissue. We find that, rather counter-intuitively, a single, properly timed, biphasic pulse can be more effective in defibrillating the tissue than low energy antitachycardia pacing (LEAP) which employs a sequence of such pulses, succeeding where the latter approach fails. Furthermore, we show that, with the help of adjoint optimization, it is possible to reduce the energy required for defibrillation even further, making it three orders of magnitude lower than that required by LEAP. Finally, we establish that this dramatic reduction is achieved through exploiting the sensitivity of the dynamics in vulnerable windows to promote annihilation of pairs of nearby phase singularities., Comment: Submitted to Chaos
Published: 2024
Full Text: View/download PDF

224. On non-uniqueness of phase retrieval in multidimensions

Author: Novikov, Roman and Xu, Tianli
Subjects: Mathematical Physics, 42A38, 35R30
Abstract: We give a large class of examples of non-uniqueness for the phase retrieval problem in multidimensions. Our examples include the case of functions with strongly disconnected compact support., Comment: We substantially revised the first version
Published: 2024

225. Idiographic Personality Gaussian Process for Psychological Assessment

Author: Chen, Yehu, Xi, Muchen, Montgomery, Jacob, Jackson, Joshua, and Garnett, Roman
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We develop a novel measurement framework based on a Gaussian process coregionalization model to address a long-lasting debate in psychometrics: whether psychological features like personality share a common structure across the population, vary uniquely for individuals, or some combination. We propose the idiographic personality Gaussian process (IPGP) framework, an intermediate model that accommodates both shared trait structure across a population and "idiographic" deviations for individuals. IPGP leverages the Gaussian process coregionalization model to handle the grouped nature of battery responses, but adjusted to non-Gaussian ordinal data. We further exploit stochastic variational inference for efficient latent factor estimation required for idiographic modeling at scale. Using synthetic and real data, we show that IPGP improves both prediction of actual responses and estimation of individualized factor structures relative to existing benchmarks. In a third study, we show that IPGP also identifies unique clusters of personality taxonomies in real-world data, displaying great potential in advancing individualized approaches to psychological diagnosis and treatment., Comment: 9 pages, 4 figures
Published: 2024

226. Systematic Evaluation of Online Speaker Diarization Systems Regarding their Latency

Author: Aperdannier, Roman, Schacht, Sigurd, and Piazza, Alexander
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: In this paper, different online speaker diarization systems are evaluated on the same hardware with the same test data with regard to their latency. The latency is the time span from audio input to the output of the corresponding speaker label. As part of the evaluation, various model combinations within the DIART framework, a diarization system based on the online clustering algorithm UIS-RNN-SML, and the end-to-end online diarization system FS-EEND are compared. The lowest latency is achieved for the DIART-pipeline with the embedding model pyannote/embedding and the segmentation model pyannote/segmentation. The FS-EEND system shows a similarly good latency. In general there is currently no published research that compares several online diarization systems in terms of their latency. This makes this work even more relevant., Comment: 6 pages
Published: 2024

227. Entity-Level Sentiment: More than the Sum of Its Parts

Author: Rønningstad, Egil, Klinger, Roman, Øvrelid, Lilja, and Velldal, Erik
Subjects: Computer Science - Computation and Language
Abstract: In sentiment analysis of longer texts, there may be a variety of topics discussed, of entities mentioned, and of sentiments expressed regarding each entity. We find a lack of studies exploring how such texts express their sentiment towards each entity of interest, and how these sentiments can be modelled. In order to better understand how sentiment regarding persons and organizations (each entity in our scope) is expressed in longer texts, we have collected a dataset of expert annotations where the overall sentiment regarding each entity is identified, together with the sentence-level sentiment for these entities separately. We show that the reader's perceived sentiment regarding an entity often differs from an arithmetic aggregation of sentiments at the sentence level. Only 70\% of the positive and 55\% of the negative entities receive a correct overall sentiment label when we aggregate the (human-annotated) sentiment labels for the sentences where the entity is mentioned. Our dataset reveals the complexity of entity-specific sentiment in longer texts, and allows for more precise modelling and evaluation of such sentiment expressions., Comment: 14th Workshop on Computational Approaches to Subjectivity, Sentiment & Social Media Analysis (WASSA 2024)
Published: 2024

228. HHH Whitepaper

Author: Brigljevic, Vuko, Ferencek, Dinko, Landsberg, Greg, Robens, Tania, Stamenkovic, Marko, Susa, Tatjana, Abouabid, Hamza, Arhrib, Abdesslam, Arnold, Hannah, Azevedo, Duarte, Diaz, Daniel, Duarte, Javier, Pree, Tristan du, Falaki, Jaouad El, Ferreira, Pedro. M., Fuks, Benjamin, Ganguly, Sanmay, Kolosova, Marina, Konigsberg, Jacobo, Liu, Bingxuan, Moser, Brian, Muehlleitner, Margarete, Papaefstathiou, Andreas, Pasechnik, Roman, Santos, Rui, Sheldon, Brian, Soyez, Gregory, Stylianou, Panagiotis, Tetlalmatzi-Xolocotzi, Gilberto, Weiglein, Georg, Zanderighi, Giulia, and Zhang, Rui
Subjects: High Energy Physics - Phenomenology
Abstract: We here report on the progress of the HHH Workshop, that took place in Dubrovnik in July 2023. After the discovery of a particle that complies with the properties of the Higgs boson of the Standard Model, all SM parameters are in principle determined. However, in order to verify or falsify the model, the full form of the potential has to be determined. This includes the measurement of the triple and quartic scalar couplings. We here report on ongoing progress of measurements for multi scalar final states, with an emphasis on three SM-like scalar bosons at 125 GeV, but also mentioning other options. We discuss both experimental progress and challenges as well as theoretical studies and models that can enhance such rates with respect to the SM predictions., Comment: 117 pages, 56 figures; Whitepaper resulting from HHH Workshop in Dubrovnik 2023, https://indico.cern.ch/event/1232581/; v2: small typos corrected
Published: 2024

229. Disentangling heterogeneity and disorder during ultrafast surface melting of orbital order

Author: Monti, Maurizio, Siddiqui, Khalid M., Perez-Salinas, Daniel, Agarwal, Naman, Bremholm, Martin, Li, Xiang, Prabhakaran, Dharmalingam, Liu, Xin, Babich, Danylo, Sander, Mathias, Deng, Yunpei, Lemke, Henrik T., Mankowsky, Roman, Liu, Xuerong, and Wall, Simon E.
Subjects: Condensed Matter - Strongly Correlated Electrons, Condensed Matter - Materials Science
Abstract: Understanding how light modifies long-range order is key to improve our ability to control material functionality on an ultrafast timescale. Transient spatial heterogeneity has been proposed in many materials, but isolating the dynamics of different regions experimentally has been challenging. Here we address this issue and measure the dynamics of orbital order melting in the layered manganite, La0.5Sr1.5MnO4, and isolate the surface dynamics from the bulk for the first time. Bulk measurements show orbital order is rapidly suppressed, but the correlation length surprisingly increases. However, the surface dynamics, show a stronger suppression and a significant decrease in correlation length. By isolating the surface changes, we find that light preferentially melts a less ordered surface and the loss of long-range order is likely driven by the formation of local and disordered polarons. Melting the disordered surface effectively increases the average correlation of the bulk probed volume, resolving the contradictory response. These results show that surface scattering methods are necessary to understand both surface and bulk dynamics in heterogeneous materials., Comment: 22 pages, 8 figures
Published: 2024

230. Meta 3D Gen

Author: Bensadoun, Raphael, Monnier, Tom, Kleiman, Yanir, Kokkinos, Filippos, Siddiqui, Yawar, Kariya, Mahendra, Harosh, Omri, Shapovalov, Roman, Graham, Benjamin, Garreau, Emilien, Karnewar, Animesh, Cao, Ang, Azuri, Idan, Makarov, Iurii, Le, Eric-Tuan, Toisoul, Antoine, Novotny, David, Gafni, Oran, Neverova, Natalia, and Vedaldi, Andrea
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Graphics, Computer Science - Machine Learning
Abstract: We introduce Meta 3D Gen (3DGen), a new state-of-the-art, fast pipeline for text-to-3D asset generation. 3DGen offers 3D asset creation with high prompt fidelity and high-quality 3D shapes and textures in under a minute. It supports physically-based rendering (PBR), necessary for 3D asset relighting in real-world applications. Additionally, 3DGen supports generative retexturing of previously generated (or artist-created) 3D shapes using additional textual inputs provided by the user. 3DGen integrates key technical components, Meta 3D AssetGen and Meta 3D TextureGen, that we developed for text-to-3D and text-to-texture generation, respectively. By combining their strengths, 3DGen represents 3D objects simultaneously in three ways: in view space, in volumetric space, and in UV (or texture) space. The integration of these two techniques achieves a win rate of 68% with respect to the single-stage model. We compare 3DGen to numerous industry baselines, and show that it outperforms them in terms of prompt fidelity and visual quality for complex textual prompts, while being significantly faster.
Published: 2024

231. Generalized Ridge Regression: Biased Estimation for Multiple Linear Regression Models

Author: Gómez, Román Salmerón, García, Catalina García, and Reina, Guillermo Hortal
Subjects: Statistics - Methodology, 62J05
Abstract: When the regressors of a econometric linear model are nonorthogonal, it is well known that their estimation by ordinary least squares can present various problems that discourage the use of this model. The ridge regression is the most commonly used alternative; however, its generalized version has hardly been analyzed. The present work addresses the estimation of this generalized version, as well as the calculation of its mean squared error, goodness of fit and bootstrap inference., Comment: 23 pages, 5 tables, 7 figures, working paper
Published: 2024

232. Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials

Author: Siddiqui, Yawar, Monnier, Tom, Kokkinos, Filippos, Kariya, Mahendra, Kleiman, Yanir, Garreau, Emilien, Gafni, Oran, Neverova, Natalia, Vedaldi, Andrea, Shapovalov, Roman, and Novotny, David
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Graphics
Abstract: We present Meta 3D AssetGen (AssetGen), a significant advancement in text-to-3D generation which produces faithful, high-quality meshes with texture and material control. Compared to works that bake shading in the 3D object's appearance, AssetGen outputs physically-based rendering (PBR) materials, supporting realistic relighting. AssetGen generates first several views of the object with factored shaded and albedo appearance channels, and then reconstructs colours, metalness and roughness in 3D, using a deferred shading loss for efficient supervision. It also uses a sign-distance function to represent 3D shape more reliably and introduces a corresponding loss for direct shape supervision. This is implemented using fused kernels for high memory efficiency. After mesh extraction, a texture refinement transformer operating in UV space significantly improves sharpness and details. AssetGen achieves 17% improvement in Chamfer Distance and 40% in LPIPS over the best concurrent work for few-view reconstruction, and a human preference of 72% over the best industry competitors of comparable speed, including those that support PBR. Project page with generated assets: https://assetgen.github.io, Comment: Project Page: https://assetgen.github.io
Published: 2024

233. Enlarging of the sample to address multicollinearity

Author: Gómez, Román Salmerón, García, Catalina García, and Sánchez, Ainara Rodríguez
Subjects: Statistics - Applications, 62J05
Abstract: The paper analyzes how the enlarging of the sample affects to the mitigation of collinearity concluding that it may mitigate the consequences of collinearity related to statistical analysis but not necessarily the numerical instability. The problem that is addressed is of importance in the teaching of social sciences since it discusses one of the solutions proposed almost unanimously to solve the problem of multicollinearity. For a better understanding and illustration of the contribution of this paper, two empirical examples are presented and not highly technical developments are used., Comment: 11 pages, 2 tables, working paper
Published: 2024

234. Vortex confinement through an unquantized magnetic flux

Author: Kim, Geunyong, Yun, Jinyoung, Yang, Jinho, Yang, Ilkyu, Wulferding, Dirk, Movshovich, Roman, Cho, Gil Young, Kim, Ki-Seok, Hahn, Garam, and Kim, Jeehoon
Subjects: Condensed Matter - Superconductivity
Abstract: Geometrically confined superconductors often experience a breakdown in the quantization of magnetic flux owing to the incomplete screening of the supercurrent against the field penetration. In this study, we report that the confinement of a magnetic field occurs regardless of the dimensionality of the system, extending even to 1D linear potential systems. By utilizing a vector-field magnetic force microscope, we successfully create a vortex-antivortex pair connected by a 1D unquantized magnetic flux in ultra-thin superconducting films. Through an investigation of the manipulation and thermal behavior of the vortex pair, we uncover a long-range interaction mediated by the unquantized magnetic flux. These findings suggest a universal phenomenon of unquantized magnetic flux formation, independent of the geometry of the system. Our results present an experimental route for probing the impact of confinement on superconducting properties and order parameters in unconventional superconductors characterized by extremely low dimensionality.
Published: 2024

235. Early stages of gap opening by planets in protoplanetary discs

Author: Cordwell, Amelia J. and Rafikov, Roman R.
Subjects: Astrophysics - Earth and Planetary Astrophysics
Abstract: Annular substructures in protoplanetary discs, ubiquitous in sub-mm observations, can be caused by gravitational coupling between a disc and its embedded planets. Planetary density waves inject angular momentum into the disc leading to gap opening only after travelling some distance and steepening into shocks (in the absence of linear damping); no angular momentum is deposited in the planetary coorbital region, where the wave has not shocked yet. Despite that, simulations show mass evacuation from the coorbital region even in inviscid discs, leading to smooth, double-trough gap profiles. Here we consider the early, time-dependent stages of planetary gap opening in inviscid discs. We find that an often-overlooked contribution to the angular momentum balance caused by the time-variability of the specific angular momentum of the disc fluid (caused, in turn, by the time-variability of the radial pressure support) plays a key role in gap opening. Focusing on the regime of shallow gaps with depths of $\lesssim 20\%$, we demonstrate analytically that early gap opening is a self-similar process, with the amplitude of the planet-driven perturbation growing linearly in time and the radial gap profile that can be computed semi-analytically. We show that mass indeed gets evacuated from the coorbital region even in inviscid discs. This evolution pattern holds even in viscous discs over a limited period of time. These results are found to be in excellent agreement with 2D numerical simulations. Our simple gap evolution solutions can be used in studies of dust dynamics near planets and for interpreting protoplanetary disc observations., Comment: 20 pages, 16 figures, Accepted by MNRAS, updated version with additional discussion of viscous steady state and corrected discussion of Muto et al. (2010)
Published: 2024

236. AtLAST Science Overview Report

Author: Booth, Mark, Klaassen, Pamela, Cicone, Claudia, Mroczkowski, Tony, Cordiner, Martin A., Di Mascolo, Luca, Johnstone, Doug, van Kampen, Eelco, Lee, Minju M., Liu, Daizhong, Orlowski-Scherer, John, Saintonge, Amélie, Smith, Matthew W. L., Thelen, Alexander, Wedemeyer, Sven, Akiyama, Kazunori, Andreon, Stefano, Arzoumanian, Doris, Bakx, Tom J. L. C., Bot, Caroline, Bower, Geoffrey, Brajša, Roman, Chen, Chian-Chou, da Cunha, Elisabete, Eden, David, Ettori, Stefano, Gaches, Brandt, Hatziminaoglou, Evanthia, Luppe, Patricia, Magnelli, Benjamin, Marshall, Jonathan P., Montenegro-Montes, Francisco Miguel, Niemack, Michael, Nixon, Conor, de Pater, Imke, Perrott, Yvette, Raimundo, Sandra I., Redaelli, Elena, Richards, Anita, Rybak, Matus, Šarčević, Nikolina, Semenov, Dmitry, Spezzano, Silvia, Srinivasan, Sundar, Stanke, Thomas, Andreani, Paola, Beltrán, Maria T., Butler, Bryan J., Cantalupo, Sebastiano, Dagostino, Miguel Chavez, Duarte-Cabral, Ana, Emonts, Bjorn, Fletcher, Leigh, Gary, Dale E., Gunar, Stanislav, Hacar, Alvaro, Hagedorn, Bendix, Kaminski, Tomek, Kirton, Fiona, de Kleer, Katherine, Kontar, Eduard, Kuan, Yi-Jehng, Lightfoot, John, Lopez-Rodriguez, Enrique, Lundgren, Andreas, Milam, Stefanie N., Mohan, Atul, Moreno, Raphael, Motorina, Galina G., Moullet, Arielle, Pattle, Kate, Pellizzoni, Alberto, Peretto, Nicolas, Ramasawmy, Joanna, Ricci, Claudio, Rigby, Andrew J., Sánchez-Monge, Álvaro, Saberi, Maryam, Shimojo, Masumi, Simionescu, Aurora, Thompson, Mark, Traficante, Alessio, Vignali, Cristian, and White, Stephen M.
Subjects: Astrophysics - Instrumentation and Methods for Astrophysics, Astrophysics - Cosmology and Nongalactic Astrophysics, Astrophysics - Earth and Planetary Astrophysics, Astrophysics - Astrophysics of Galaxies, Astrophysics - Solar and Stellar Astrophysics
Abstract: Submillimeter and millimeter wavelengths provide a unique view of the Universe, from the gas and dust that fills and surrounds galaxies to the chromosphere of our own Sun. Current single-dish facilities have presented a tantalising view of the brightest (sub-)mm sources, and interferometers have provided the exquisite resolution necessary to analyse the details in small fields, but there are still many open questions that cannot be answered with current facilities. In this report we summarise the science that is guiding the design of the Atacama Large Aperture Submillimeter Telescope (AtLAST). We demonstrate how tranformational advances in topics including star formation in high redshift galaxies, the diffuse circumgalactic medium, Galactic ecology, cometary compositions and solar flares motivate the need for a 50m, single-dish telescope with a 1-2 degree field of view and a new generation of highly multiplexed continuum and spectral cameras. AtLAST will have the resolution to drastically lower the confusion limit compared to current single-dish facilities, whilst also being able to rapidly map large areas of the sky and detect extended, diffuse structures. Its high sensitivity and large field of view will open up the field of submillimeter transient science by increasing the probability of serendipitous detections. Finally, the science cases listed here motivate the need for a highly flexible operations model capable of short observations of individual targets, large surveys, monitoring programmes, target of opportunity observations and coordinated observations with other observatories. AtLAST aims to be a sustainable, upgradeable, multipurpose facility that will deliver orders of magnitude increases in sensitivity and mapping speeds over current and planned submillimeter observatories., Comment: 47 pages, 12 figures. For further details on AtLAST see https://atlast.uio.no
Published: 2024

237. Towards shutdownable agents via stochastic choice

Author: Thornley, Elliott, Roman, Alexander, Ziakas, Christos, Ho, Leyton, and Thomson, Louis
Subjects: Computer Science - Artificial Intelligence
Abstract: Some worry that advanced artificial agents may resist being shut down. The Incomplete Preferences Proposal (IPP) is an idea for ensuring that doesn't happen. A key part of the IPP is using a novel 'Discounted REward for Same-Length Trajectories (DREST)' reward function to train agents to (1) pursue goals effectively conditional on each trajectory-length (be 'USEFUL'), and (2) choose stochastically between different trajectory-lengths (be 'NEUTRAL' about trajectory-lengths). In this paper, we propose evaluation metrics for USEFULNESS and NEUTRALITY. We use a DREST reward function to train simple agents to navigate gridworlds, and we find that these agents learn to be USEFUL and NEUTRAL. Our results thus suggest that DREST reward functions could also train advanced agents to be USEFUL and NEUTRAL, and thereby make these advanced agents useful and shutdownable.
Published: 2024

238. Magnetic Excitations in Ferromagnetically Coupled Spin-1 Nanographenes

Author: Turco, Elia, Wu, Fupeng, Catarina, Gonçalo, Krane, Nils, Ma, Ji, Fasel, Roman, Feng, Xinliang, and Ruffieux, Pascal
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: In the quest for high-spin building blocks to form covalently bonded 1D or 2D materials with controlled magnetic interactions, $\pi$-electron magnetism provides an ideal framework to engineer large ferromagnetic interactions between nanographenes. As a first step in this direction, we investigate the spin properties of ferromagnetically coupled triangulenes, triangular nanographenes with spin $S = 1$. Combining in-solution synthesis of rationally designed molecular precursors and on-surface synthesis, we achieve covalently bonded $S = 2$ triangulene dimers and $S = 3$ trimers on Au(111). Starting from the triangulene dimer, we thoroughly characterize its low-energy magnetic excitations using inelastic electron tunneling spectroscopy (IETS). IETS reveals conductance steps identified as a quintet to triplet excitation, and a zero-bias peak stemming from higher-order spin-spin scattering of the 5-fold degenerate ferromagnetic ground state. The Heisenberg picture captures the relevant parameters of inter-triangulene ferromagnetic exchange, and its successful extension to the larger $S = 3$ system confirms the model's accuracy. We expect that the addition of ferromagnetically coupled building blocks to the toolbox of magnetic nanographenes opens new opportunities to design carbon materials with complex magnetic ground states., Comment: 38 pages, 5 Figures
Published: 2024

239. On gaps in the spectra of quasiperiodic Schr\'odinger operators with discontinuous monotone potentials

Author: Kachkovskiy, Ilya, Parnovski, Leonid, and Shterenberg, Roman
Subjects: Mathematics - Spectral Theory, Mathematical Physics
Abstract: We show that, for one-dimensional discrete Schr\"odinger operators, stability of Anderson localization under a class of rank one perturbations implies absence of intervals in spectra. The argument is based on well-known result of Gordon and del Rio--Makarov--Simon, combined with a way to consider perturbations whose ranges are not necessarily cyclic. The main application of the results is showing that a class of quasiperiodic operators with sawtooth-like potentials, for which such a version of stable localization is known, has Cantor spectra. We also obtain several results on gap filling under rank one perturbations for some general (not necessarily monotone) classes of quasiperiodic operators with discontinuous potentials., Comment: 22 pages
Published: 2024

240. CMOS-fabricated ultraviolet light modulators using low-loss alumina piezo-optomechanical photonic circuits

Author: Shugayev, Roman, Dominguez, Daniel, Leenheer, Andrew, Little, Bethany, Chow, Matthew N. H., Karl, Nicholas, Koppa, Matt, Gehl, Michael, Jau, Yuan-Yu, and Eichenfield, Matt
Subjects: Physics - Optics, Physics - Applied Physics, Quantum Physics
Abstract: We demonstrate a CMOS-foundry-fabricated piezo-optomechanical photonic integrated circuit platform for ultraviolet and blue wavelengths, using alumina waveguides that are strongly mechanically coupled to monolithically integrated aluminum nitride piezoelectric actuators. Low waveguide losses are measured down to at least 320 nm, where we achieve 1.6 dB/cm. This allows us to demonstrate broadband amplitude modulators based on piezoelectrically actuated MEMS cantilever phase-shifters down to 320 nm, with a high extinction ratio of 30 dB. We further demonstrate the versatility of the platform by designing and demonstrating a modulator that can work with high extinction and low loss at 320 nm and 420 nm, simultaneously, demonstrating control of multiple, disparate wavelengths in one device. We also demonstrate narrow-band resonant racetrack modulators with quality factors of 4.7E5 and a tuning rate of 27.5 MHz/V. These results should open doors for a range of novel applications in UV photonics, quantum science, sensing and spectroscopy.
Published: 2024

241. String Diagrams for Physical Duoidal Categories

Author: Román, Mario
Subjects: Mathematics - Category Theory, 18M50
Abstract: We introduce string diagrams for physical duoidal categories (normal $\otimes$-symmetric duoidal categories): they consist of string diagrams with wires forming a zigzag-free partial order and order-preserving nodes whose inputs and outputs form intervals., Comment: 26 pages, 11 figures. The author thanks Nayan Rajesh for pointing out a mistake on a previous version of Definition 8.4
Published: 2024

242. KAGNNs: Kolmogorov-Arnold Networks meet Graph Learning

Author: Bresson, Roman, Nikolentzos, Giannis, Panagopoulos, George, Chatzianastasis, Michail, Pang, Jun, and Vazirgiannis, Michalis
Subjects: Computer Science - Machine Learning
Abstract: In recent years, Graph Neural Networks (GNNs) have become the de facto tool for learning node and graph representations. Most GNNs typically consist of a sequence of neighborhood aggregation (a.k.a., message passing) layers. Within each of these layers, the representation of each node is updated from an aggregation and transformation of its neighbours representations at the previous layer. The upper bound for the expressive power of message passing GNNs was reached through the use of MLPs as a transformation, due to their universal approximation capabilities. However, MLPs suffer from well-known limitations, which recently motivated the introduction of Kolmogorov-Arnold Networks (KANs). KANs rely on the Kolmogorov-Arnold representation theorem, rendering them a promising alternative to MLPs. In this work, we compare the performance of KANs against that of MLPs in graph learning tasks. We perform extensive experiments on node classification, graph classification and graph regression datasets. Our preliminary results indicate that while KANs are on-par with MLPs in classification tasks, they seem to have a clear advantage in the graph regression tasks. Code is available at https: //github.com/RomanBresson/KAGNN.
Published: 2024

243. Evaluating Quality of Answers for Retrieval-Augmented Generation: A Strong LLM Is All You Need

Author: Wang, Yang, Hernandez, Alberto Garcia, Kyslyi, Roman, and Kersting, Nicholas
Subjects: Computer Science - Computation and Language
Abstract: We present a comprehensive study of answer quality evaluation in Retrieval-Augmented Generation (RAG) applications using vRAG-Eval, a novel grading system that is designed to assess correctness, completeness, and honesty. We further map the grading of quality aspects aforementioned into a binary score, indicating an accept or reject decision, mirroring the intuitive "thumbs-up" or "thumbs-down" gesture commonly used in chat applications. This approach suits factual business contexts where a clear decision opinion is essential. Our assessment applies vRAG-Eval to two Large Language Models (LLMs), evaluating the quality of answers generated by a vanilla RAG application. We compare these evaluations with human expert judgments and find a substantial alignment between GPT-4's assessments and those of human experts, reaching 83% agreement on accept or reject decisions. This study highlights the potential of LLMs as reliable evaluators in closed-domain, closed-ended settings, particularly when human evaluations require significant resources., Comment: 13 pages, 8 figures, 12 tables
Published: 2024

244. Highly Constrained Coded Aperture Imaging Systems Design Via a Knowledge Distillation Approach

Author: Suarez-Rodriguez, Leon, Jacome, Roman, and Arguello, Henry
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Computational optical imaging (COI) systems have enabled the acquisition of high-dimensional signals through optical coding elements (OCEs). OCEs encode the high-dimensional signal in one or more snapshots, which are subsequently decoded using computational algorithms. Currently, COI systems are optimized through an end-to-end (E2E) approach, where the OCEs are modeled as a layer of a neural network and the remaining layers perform a specific imaging task. However, the performance of COI systems optimized through E2E is limited by the physical constraints imposed by these systems. This paper proposes a knowledge distillation (KD) framework for the design of highly physically constrained COI systems. This approach employs the KD methodology, which consists of a teacher-student relationship, where a high-performance, unconstrained COI system (the teacher), guides the optimization of a physically constrained system (the student) characterized by a limited number of snapshots. We validate the proposed approach, using a binary coded apertures single pixel camera for monochromatic and multispectral image reconstruction. Simulation results demonstrate the superiority of the KD scheme over traditional E2E optimization for the designing of highly physically constrained COI systems., Comment: 7 pages, 3 figures. Accepted at ICIP 2024
Published: 2024

245. Domain Adaptation of Echocardiography Segmentation Via Reinforcement Learning

Author: Judge, Arnaud, Judge, Thierry, Duchateau, Nicolas, Sandler, Roman A., Sokol, Joseph Z., Bernard, Olivier, and Jodoin, Pierre-Marc
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Performance of deep learning segmentation models is significantly challenged in its transferability across different medical imaging domains, particularly when aiming to adapt these models to a target domain with insufficient annotated data for effective fine-tuning. While existing domain adaptation (DA) methods propose strategies to alleviate this problem, these methods do not explicitly incorporate human-verified segmentation priors, compromising the potential of a model to produce anatomically plausible segmentations. We introduce RL4Seg, an innovative reinforcement learning framework that reduces the need to otherwise incorporate large expertly annotated datasets in the target domain, and eliminates the need for lengthy manual human review. Using a target dataset of 10,000 unannotated 2D echocardiographic images, RL4Seg not only outperforms existing state-of-the-art DA methods in accuracy but also achieves 99% anatomical validity on a subset of 220 expert-validated subjects from the target domain. Furthermore, our framework's reward network offers uncertainty estimates comparable with dedicated state-of-the-art uncertainty methods, demonstrating the utility and effectiveness of RL4Seg in overcoming domain adaptation challenges in medical image segmentation., Comment: 9 pages
Published: 2024

246. Mask-Guided Attention U-Net for Enhanced Neonatal Brain Extraction and Image Preprocessing

Author: Jafrasteh, Bahram, Lubian-Lopez, Simon Pedro, Trimarco, Emiliano, Ruiz, Macarena Roman, Barrios, Carmen Rodriguez, Almagro, Yolanda Marin, and Benavente-Fernandez, Isabel
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Statistics - Computation
Abstract: In this study, we introduce MGA-Net, a novel mask-guided attention neural network, which extends the U-net model for precision neonatal brain imaging. MGA-Net is designed to extract the brain from other structures and reconstruct high-quality brain images. The network employs a common encoder and two decoders: one for brain mask extraction and the other for brain region reconstruction. A key feature of MGA-Net is its high-level mask-guided attention module, which leverages features from the brain mask decoder to enhance image reconstruction. To enable the same encoder and decoder to process both MRI and ultrasound (US) images, MGA-Net integrates sinusoidal positional encoding. This encoding assigns distinct positional values to MRI and US images, allowing the model to effectively learn from both modalities. Consequently, features learned from a single modality can aid in learning a modality with less available data, such as US. We extensively validated the proposed MGA-Net on diverse datasets from varied clinical settings and neonatal age groups. The metrics used for assessment included the DICE similarity coefficient, recall, and accuracy for image segmentation; structural similarity for image reconstruction; and root mean squared error for total brain volume estimation from 3D ultrasound images. Our results demonstrate that MGA-Net significantly outperforms traditional methods, offering superior performance in brain extraction and segmentation while achieving high precision in image reconstruction and volumetric analysis. Thus, MGA-Net represents a robust and effective preprocessing tool for MRI and 3D ultrasound images, marking a significant advance in neuroimaging that enhances both research and clinical diagnostics in the neonatal period and beyond.
Published: 2024

247. Robust NLoS Localization in 5G mmWave Networks: Data-based Methods and Performance

Author: Klus, Roman, Talvitie, Jukka, Vinogradova, Julia, Fodor, Gabor, Torsner, Johan, and Valkama, Mikko
Subjects: Electrical Engineering and Systems Science - Signal Processing
Abstract: Ensuring smooth mobility management while employing directional beamformed transmissions in 5G millimeter-wave networks calls for robust and accurate user equipment (UE) localization and tracking. In this article, we develop neural network-based positioning models with time- and frequency-domain channel state information (CSI) data in harsh non-line-of-sight (NLoS) conditions. We propose a novel frequency-domain feature extraction, which combines relative phase differences and received powers across resource blocks, and offers robust performance and reliability. Additionally, we exploit the multipath components and propose an aggregate time-domain feature combining time-of-flight, angle-of-arrival and received path-wise powers. Importantly, the temporal correlations are also harnessed in the form of sequence processing neural networks, which prove to be of particular benefit for vehicular UEs. Realistic numerical evaluations in large-scale line-of-sight (LoS)-obstructed urban environment with moving vehicles are provided, building on full ray-tracing based propagation modeling. The results show the robustness of the proposed CSI features in terms of positioning accuracy, and that the proposed models reliably localize UEs even in the absence of a LoS path, clearly outperforming the state-of-the-art with similar or even reduced processing complexity. The proposed sequence-based neural network model is capable of tracking the UE position, speed and heading simultaneously despite the strong uncertainties in the CSI measurements. Finally, it is shown that differences between the training and online inference environments can be efficiently addressed and alleviated through transfer learning., Comment: 16 pages, 13 figures, manuscript currently in review with IEEE
Published: 2024

248. The Infinite-Dimensional Quantum Entropy: the Unified Entropy Case

Author: Gielerak, Roman, Wiśniewska, Joanna, and Sawerwain, Marek
Subjects: Quantum Physics, Computer Science - Information Theory
Abstract: By a use of the Fredholm determinant theory, the unified quantum entropy notion has been extended to a case of infinite-dimensional systems. Some of the known (in the finite-dimensional case) basic properties of the introduced unified entropies have been extended to the case study. Certain numerical approaches for computing the proposed finite and infinite-dimensional entropies are being outlined as well., Comment: 12 pages, 2 figures
Published: 2024

249. Bisimulation for Impure Simplicial Complexes

Author: Bílková, Marta, van Ditmarsch, Hans, Kuznets, Roman, and Randrianomentsoa, Rojo
Subjects: Computer Science - Logic in Computer Science, Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: As an alternative to Kripke models, simplicial complexes are a versatile semantic primitive on which to interpret epistemic logic. Given a set of vertices, a simplicial complex is a downward closed set of subsets, called simplexes, of the vertex set. A maximal simplex is called a facet. Impure simplicial complexes represent that some agents (processes) are dead. It is known that impure simplicial complexes categorically correspond to so-called partial epistemic (Kripke) models. In this contribution, we define a notion of bisimulation to compare impure simplicial complexes and show that it has the Hennessy-Milner property. These results are for a logical language including atoms that express whether agents are alive or dead. Without these atoms no reasonable standard notion of bisimulation exists, as we amply justify by counterexamples, because such a restricted language is insufficiently expressive., Comment: Proceedings of Advances in Modal Logic 2024
Published: 2024

250. The VISTA Variables in the V\'ia L\'actea eXtended (VVVX) ESO public survey: Completion of the observations and legacy

Author: Saito, R. K., Hempel, M., Alonso-García, J., Lucas, P. W., Minniti, D., Alonso, S., Baravalle, L., Borissova, J., Caceres, C., Chené, A. N., Cross, N. J. G., Duplancic, F., Garro, E. R., Gómez, M., Ivanov, V. D., Kurtev, R., Luna, A., Majaess, D., Navarro, M. G., Pullen, J. B., Rejkuba, M., Sanders, J. L., Smith, L. C., Albino, P. H. C., Alonso, M. V., Amôres, E. B., Angeloni, R., Arias, J. I., Arnaboldi, M., Barbuy, B., Bayo, A., Beamin, J. C., Bedin, L. R., Bellini, A., Benjamin, R. A., Bica, E., Bonatto, C. J., Botan, E., Braga, V. F., Brown, D. A., Cabral, J. B., Camargo, D., Garatti, A. Caratti o, Carballo-Bello, J. A., Catelan, M., Chavero, C., Chijani, M. A., Clariá, J. J., Coldwell, G. V., Peña, C. Contreras, Ramos, R. Contreras, Corral-Santana, J. M., Cortés, C. C., Cortés-Contreras, M., Cruz, P., Daza-Perilla, I. V., Debattista, V. P., Dias, B., Donoso, L., D'Souza, R., Emerson, J. P., Federle, S., Fermiano, V., Fernandez, J., Fernández-Trincado, J. G., Ferreira, T., Lopes, C. E. Ferreira, Firpo, V., Flores-Quintana, C., Fraga, L., Froebrich, D., Galdeano, D., Gavignaud, I., Geisler, D., Gerhard, O. E., Gieren, W., Gonzalez, O. A., Gramajo, L. V., Gran, F., Granitto, P. M., Griggio, M., Guo, Z., Gurovich, S., Hilker, M., Jones, H. R. A., Kammers, R., Kuhn, M. A., Kumar, M. S . N., Kundu, R., Lares, M., Libralato, M., Lima, E., Maccarone, T. J., Cortés, P. Marchant, Martin, E. L., Masetti, N., Matsunaga, N., Mauro, F., McDonald, I., Mejías, A., Mesa, V., Milla-Castro, F. P., Minniti, J. H., Bidin, C. Moni, Montenegro, K., Morris, C., Motta, V., Navarete, F., Molina, C. Navarro, Nikzat, F., Castellón, J. L. Nilo, Obasi, C., Ortigoza-Urdaneta, M., Palma, T., Parisi, C., Ramírez, K. Pena, Pereyra, L., Perez, N., Petralia, I., Pichel, A., Pignata, G., Alegría, S. Ramírez, Rojas, A. F., Rojas, D., Roman-Lopes, A., Rovero, A. C., Saroon, S., Schmidt, E. O., Schröder, A. C., Schultheis, M., Sgró, M. A., Solano, E., Soto, M., Stecklum, B., Steeghs, D., Tamura, M., Tissera, P., Valcarce, A. A. R., Valotto, C. A., Vasquez, S., Villalon, C., Villanova, S., Cádiz, F. Vivanco, Bacigalupo, R. Zelada, Zijlstra, A., and Zoccali, M.
Subjects: Astrophysics - Astrophysics of Galaxies, Astrophysics - Solar and Stellar Astrophysics
Abstract: The ESO public survey VISTA Variables in the V\'ia L\'actea (VVV) surveyed the inner Galactic bulge and the adjacent southern Galactic disk from $2009-2015$. Upon its conclusion, the complementary VVV eXtended (VVVX) survey has expanded both the temporal as well as spatial coverage of the original VVV area, widening it from $562$ to $1700$ sq. deg., as well as providing additional epochs in $JHK_{\rm s}$ filters from $2016-2023$. With the completion of VVVX observations during the first semester of 2023, we present here the observing strategy, a description of data quality and access, and the legacy of VVVX. VVVX took $\sim 2000$ hours, covering about 4% of the sky in the bulge and southern disk. VVVX covered most of the gaps left between the VVV and the VISTA Hemisphere Survey (VHS) areas and extended the VVV time baseline in the obscured regions affected by high extinction and hence hidden from optical observations. VVVX provides a deep $JHK_{\rm s}$ catalogue of $\gtrsim 1.5\times10^9$ point sources, as well as a $K_{\rm s}$ band catalogue of $\sim 10^7$ variable sources. Within the existing VVV area, we produced a $5D$ map of the surveyed region by combining positions, distances, and proper motions of well-understood distance indicators such as red clump stars, RR Lyrae, and Cepheid variables. In March 2023 we successfully finished the VVVX survey observations that started in 2016, an accomplishment for ESO Paranal Observatory upon 4200 hours of observations for VVV+VVVX. The VVV+VVVX catalogues complement those from the Gaia mission at low Galactic latitudes and provide spectroscopic targets for the forthcoming ESO high-multiplex spectrographs MOONS and 4MOST., Comment: 17 pages, 11 figures (+ appendix). Accepted for publication in Astronomy and Astrophysics in section 14: Catalogs and data
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

160,025 results on '"A. Román"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources