2,611,918 results
Search Results
2. Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability
- Author
-
Lin, Zicheng, Liang, Tian, Xu, Jiahao, Wang, Xing, Luo, Ruilin, Shi, Chufan, Li, Siheng, Yang, Yujiu, and Tu, Zhaopeng
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Large Language Models (LLMs) have exhibited remarkable performance on reasoning tasks. They utilize autoregressive token generation to construct reasoning trajectories, enabling the development of a coherent chain of thought. In this work, we explore the impact of individual tokens on the final outcomes of reasoning tasks. We identify the existence of ``critical tokens'' that lead to incorrect reasoning trajectories in LLMs. Specifically, we find that LLMs tend to produce positive outcomes when forced to decode other tokens instead of critical tokens. Motivated by this observation, we propose a novel approach - cDPO - designed to automatically recognize and conduct token-level rewards for the critical tokens during the alignment process. Specifically, we develop a contrastive estimation approach to automatically identify critical tokens. It is achieved by comparing the generation likelihood of positive and negative models. To achieve this, we separately fine-tune the positive and negative models on various reasoning trajectories, consequently, they are capable of identifying identify critical tokens within incorrect trajectories that contribute to erroneous outcomes. Moreover, to further align the model with the critical token information during the alignment process, we extend the conventional DPO algorithms to token-level DPO and utilize the differential likelihood from the aforementioned positive and negative model as important weight for token-level DPO learning.Experimental results on GSM8K and MATH500 benchmarks with two-widely used models Llama-3 (8B and 70B) and deepseek-math (7B) demonstrate the effectiveness of the propsoed approach cDPO., Comment: Work in progress
- Published
- 2024
3. T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs
- Author
-
Yin, Shukang, Fu, Chaoyou, Zhao, Sirui, Shen, Yunhang, Ge, Chunjiang, Yang, Yan, Long, Zuwei, Dai, Yuhan, Xu, Tong, Sun, Xing, He, Ran, Shan, Caifeng, and Chen, Enhong
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
The success of Multimodal Large Language Models (MLLMs) in the image domain has garnered wide attention from the research community. Drawing on previous successful experiences, researchers have recently explored extending the success to the video understanding realms. Apart from training from scratch, an efficient way is to utilize the pre-trained image-LLMs, leading to two mainstream approaches, i.e. zero-shot inference and further fine-tuning with video data. In this work, our study of these approaches harvests an effective data augmentation method. We first make a deeper inspection of the zero-shot inference way and identify two limitations, i.e. limited generalization and lack of temporal understanding capabilities. Thus, we further investigate the fine-tuning approach and find a low learning efficiency when simply using all the video data samples, which can be attributed to a lack of instruction diversity. Aiming at this issue, we develop a method called T2Vid to synthesize video-like samples to enrich the instruction diversity in the training corpus. Integrating these data enables a simple and efficient training scheme, which achieves performance comparable to or even superior to using full video datasets by training with just 15% the sample size. Meanwhile, we find that the proposed scheme can boost the performance of long video understanding without training with long video samples. We hope our study will spark more thinking about using MLLMs for video understanding and curation of high-quality data. The code is released at https://github.com/xjtupanda/T2Vid., Comment: 13 pages, 9 figures, 5 tables. Project page: https://github.com/xjtupanda/T2Vid
- Published
- 2024
4. AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos
- Author
-
He, Yuze, Zhao, Wang, Liu, Shaohui, Hu, Yubin, Bai, Yushi, Wen, Yu-Hui, and Liu, Yong-Jin
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
We introduce AlphaTablets, a novel and generic representation of 3D planes that features continuous 3D surface and precise boundary delineation. By representing 3D planes as rectangles with alpha channels, AlphaTablets combine the advantages of current 2D and 3D plane representations, enabling accurate, consistent and flexible modeling of 3D planes. We derive differentiable rasterization on top of AlphaTablets to efficiently render 3D planes into images, and propose a novel bottom-up pipeline for 3D planar reconstruction from monocular videos. Starting with 2D superpixels and geometric cues from pre-trained models, we initialize 3D planes as AlphaTablets and optimize them via differentiable rendering. An effective merging scheme is introduced to facilitate the growth and refinement of AlphaTablets. Through iterative optimization and merging, we reconstruct complete and accurate 3D planes with solid surfaces and clear boundaries. Extensive experiments on the ScanNet dataset demonstrate state-of-the-art performance in 3D planar reconstruction, underscoring the great potential of AlphaTablets as a generic 3D plane representation for various applications. Project page is available at: https://hyzcluster.github.io/alphatablets, Comment: NeurIPS 2024
- Published
- 2024
5. Efficient short-wave infrared upconversion by self-sensitized holmium-doped nanoparticles
- Author
-
Arul, Rakesh, Jiang, Zhao, Li, Xinjuan, Bell, Fiona M., Tew, Alasdair, Ducati, Caterina, Rao, Akshay, and Yu, Zhongzheng
- Subjects
Physics - Optics ,Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Materials Science - Abstract
Photon upconversion, combining several low-energy photons to generate one high-energy photon is of wide interest for biomedical, catalytic and photonic applications. Lanthanide-doped nanoparticles (LnNP) are a unique type of upconversion nanoconverter, which can realize ultralarge anti-Stokes shift (>1000 nm) and high photostability, without photo-bleaching and photo-blinking. The excitation wavelength of LnNPs has been limited to the second near-infrared window (1000-1700 nm), mainly sensitized by erbium ions with absorption centered around 1.5 $\mu$m. Here, we demonstrate novel self-sensitized holmium (Ho)-doped nanoconverters to further expand the sensitization range to the short-wave infrared at 2 $\mu$m and achieve efficient upconversion to 640 nm. We show that this upconversion is a 4-photon conversion process with an underlying energy transfer upconversion mechanism. Via careful control of dopant concentration and shelling we achieve a relative upconversion-to-downconversion efficiency up to 15.2%, more than half the theoretical maximum. The placement of the Ho doped LnNPs into a plasmonic nanocavity device enables large gains in emission intensity (up to 32-fold), due to the dramatic shortening of the emission lifetime of Ho from 29 $\mu$s to <1 ns, indicating a high Purcell-enhancement factor of 3x10$^4$. These results open new possibilities at the frontier of short-wave infrared upconversion and the nanoplasmonic enhancement of LnNP emission, with potential applications in detection, theranostics, photonics and optoelectronics.
- Published
- 2024
6. Scaling Laws Governing the Collapse of a Bose-Einstein Condensate
- Author
-
Morris, Sebastian J., Ho, Christopher J., Fischer, Simon M., Etrych, Jiří, Martirosyan, Gevorg, Hadzibabic, Zoran, and Eigen, Christoph
- Subjects
Condensed Matter - Quantum Gases ,Mathematical Physics ,Nonlinear Sciences - Pattern Formation and Solitons ,Physics - Plasma Physics ,Quantum Physics - Abstract
We study the collapse of an attractive Bose-Einstein condensate, where an unstable system evolves towards a singularity, by numerically solving the underlying cubic-quintic nonlinear Schr\"odinger equation. We find good agreement between our simulations and the atom-loss measurements with a $^{39}$K condensate. Our simulations reveal an interplay of weak collapse and the propensity of the system to form a hotspot, and we uncover new scaling laws that govern this behavior. We also identify promising signatures of the theoretically predicted, but so far experimentally elusive, elastic three-body interactions., Comment: Main text (6 pages, 4 figures), Supplemental Material (2 pages, 4 figures)
- Published
- 2024
7. Operator Valued Flow Equation Approach to the Bosonic Lattice Polaron: Dispersion Renormalization Beyond the Fr\'ohlich Paradigm
- Author
-
Christ, Jan-Philipp, Bermes, Pit, and Grusdt, Fabian
- Subjects
Condensed Matter - Quantum Gases ,Quantum Physics - Abstract
We consider the ground state properties of a lattice Bose polaron, a quasiparticle arising from the interaction between an impurity confined to an optical lattice and a surrounding homogeneous Bose-Einstein condensate hosting phononic modes. We present an extension of Wegner's and Wilson's flow equation approach, the operator valued flow equation approach, which allows us to calculate the renormalized dispersion of the polaron and assess the role of two-phonon scattering processes on the dispersion. The results obtained in this way are compared to a variational mean-field approach. We find that in certain impurity phonon interaction regimes the shape of the dispersion is significantly altered by the inclusion of two-phonon scattering events as opposed to only single-phonon scattering events. Moreover, our results predict that a polaronic bound state may emerge, which is not present in Fr\"ohlich-type models that only consider single-phonon scattering events., Comment: 12 pages, 10 figures
- Published
- 2024
8. DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation
- Author
-
Shen, Zhiqiang, Sherif, Ammar, Yin, Zeyuan, and Shao, Shitong
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Recent advances in dataset distillation have led to solutions in two main directions. The conventional batch-to-batch matching mechanism is ideal for small-scale datasets and includes bi-level optimization methods on models and syntheses, such as FRePo, RCIG, and RaT-BPTT, as well as other methods like distribution matching, gradient matching, and weight trajectory matching. Conversely, batch-to-global matching typifies decoupled methods, which are particularly advantageous for large-scale datasets. This approach has garnered substantial interest within the community, as seen in SRe$^2$L, G-VBSM, WMDD, and CDA. A primary challenge with the second approach is the lack of diversity among syntheses within each class since samples are optimized independently and the same global supervision signals are reused across different synthetic images. In this study, we propose a new Diversity-driven EarlyLate Training (DELT) scheme to enhance the diversity of images in batch-to-global matching with less computation. Our approach is conceptually simple yet effective, it partitions predefined IPC samples into smaller subtasks and employs local optimizations to distill each subset into distributions from distinct phases, reducing the uniformity induced by the unified optimization process. These distilled images from the subtasks demonstrate effective generalization when applied to the entire task. We conduct extensive experiments on CIFAR, Tiny-ImageNet, ImageNet-1K, and its sub-datasets. Our approach outperforms the previous state-of-the-art by 2$\sim$5% on average across different datasets and IPCs (images per class), increasing diversity per class by more than 5% while reducing synthesis time by up to 39.3% for enhancing the training efficiency. Code is available at: https://github.com/VILA-Lab/DELT.
- Published
- 2024
9. The hot circumgalactic medium in the eROSITA All-Sky Survey III. Star-forming and quiescent galaxies
- Author
-
Zhang, Yi, Comparat, Johan, Ponti, Gabriele, Merloni, Andrea, Nandra, Kirpal, Haberl, Frank, Truong, Nhut, Pillepich, Annalisa, Popesso, Paola, Locatelli, Nicola, Zhang, Xiaoyuan, Sanders, Jeremy, Zheng, Xueying, Liu, Ang, Liu, Teng, Predehl, Peter, Salvato, Mara, Bruggen, Marcus, Shreeram, Soumya, and Yeung, Michael C. H.
- Subjects
Astrophysics - Astrophysics of Galaxies ,High Energy Physics - Phenomenology - Abstract
The circumgalactic medium (CGM), as the gas repository for star formation, might contain the answer to the mysterious galaxy quenching and bimodal galaxy population origin. We measured the X-ray emission of the hot CGM around star-forming and quiescent galaxies. We detect extended X-ray emission from the hot CGM around star-forming galaxies with $\log(M_*/M_\odot)>11.0$ and quiescent galaxies with $\log(M_*/M_\odot)>10.5$, extending out to $R_{\rm 500c}$. $L_{\rm X, CGM}$ of star-forming galaxies with median stellar masses $\log(M_{\rm *,med}/M_\odot) = 10.7, 11.1, 11.3$ are approximately $0.8\,, 2.3\,, 4.0 \times 10^{40}\,\rm erg/s$, while for quiescent galaxies with $\log(M_{\rm *,med}/M_\odot) = 10.8, 11.1, 11.4$, they are $1.1\,, 6.2\,, 30 \times 10^{40}\,\rm erg/s$. Notably, quiescent galaxies with $\log(M_{\rm *,med}/M_\odot) > 11.0$ exhibit brighter hot CGM than their star-forming counterparts. In halo mass bins, we detect similar X-ray emission around star-forming and quiescent galaxies with $\log(M_{\rm 200m}/M_\odot) > 12.5$, suggesting that galaxies in the same mass dark matter halos host equally bright hot CGM. We emphasize the observed $L_{\rm X, CGM} - M_{\rm 500c}$ relations of star-forming and quiescent galaxies are sensitive to the stellar-to-halo mass relation (SHMR). A comparison with cosmological hydrodynamical simulations (EAGLE, TNG100, and SIMBA) reveals varying degrees of agreement, contingent on the simulation and the specific stellar or halo mass ranges considered. Either selected in stellar mass or halo mass, the star-forming galaxies do not host brighter stacked X-ray emission from the hot CGM than their quiescent counterparts at the same mass range. The result provides useful constraints on the extent of feedback's impacts as a mechanism for quenching star formation as implemented in current cosmological simulations., Comment: 16 pages, 12 figures, accepted for publication in A&A
- Published
- 2024
10. Free-form Generation Enhances Challenging Clothed Human Modeling
- Author
-
Ye, Hang, Ma, Xiaoxuan, Ci, Hai, Zhu, Wentao, and Wang, Yizhou
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Graphics ,Computer Science - Machine Learning - Abstract
Achieving realistic animated human avatars requires accurate modeling of pose-dependent clothing deformations. Existing learning-based methods heavily rely on the Linear Blend Skinning (LBS) of minimally-clothed human models like SMPL to model deformation. However, these methods struggle to handle loose clothing, such as long dresses, where the canonicalization process becomes ill-defined when the clothing is far from the body, leading to disjointed and fragmented results. To overcome this limitation, we propose a novel hybrid framework to model challenging clothed humans. Our core idea is to use dedicated strategies to model different regions, depending on whether they are close to or distant from the body. Specifically, we segment the human body into three categories: unclothed, deformed, and generated. We simply replicate unclothed regions that require no deformation. For deformed regions close to the body, we leverage LBS to handle the deformation. As for the generated regions, which correspond to loose clothing areas, we introduce a novel free-form, part-aware generator to model them, as they are less affected by movements. This free-form generation paradigm brings enhanced flexibility and expressiveness to our hybrid framework, enabling it to capture the intricate geometric details of challenging loose clothing, such as skirts and dresses. Experimental results on the benchmark dataset featuring loose clothing demonstrate that our method achieves state-of-the-art performance with superior visual fidelity and realism, particularly in the most challenging cases., Comment: 23 pages, 25 figures
- Published
- 2024
11. Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark
- Author
-
Heyward, Joseph, Carreira, João, Damen, Dima, Zisserman, Andrew, and Pătrăucean, Viorica
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Following the successful 2023 edition, we organised the Second Perception Test challenge as a half-day workshop alongside the IEEE/CVF European Conference on Computer Vision (ECCV) 2024, with the goal of benchmarking state-of-the-art video models and measuring the progress since last year using the Perception Test benchmark. This year, the challenge had seven tracks (up from six last year) and covered low-level and high-level tasks, with language and non-language interfaces, across video, audio, and text modalities; the additional track covered hour-long video understanding and introduced a novel video QA benchmark 1h-walk VQA. Overall, the tasks in the different tracks were: object tracking, point tracking, temporal action localisation, temporal sound localisation, multiple-choice video question-answering, grounded video question-answering, and hour-long video question-answering. We summarise in this report the challenge tasks and results, and introduce in detail the novel hour-long video QA benchmark 1h-walk VQA., Comment: arXiv admin note: substantial text overlap with arXiv:2312.13090
- Published
- 2024
12. Direct local parametrization of nuclear state densities using the back-shifted Bethe formula
- Author
-
Özen, C. and Alhassid, Y.
- Subjects
Nuclear Theory - Abstract
Level densities are often parametrized using the back-shifted Bethe formula (BBF) for nuclei that possess experimental data for s-wave neutron resonance average spacings and a complete discrete level sequence at low excitation energies. However, these parametrizations require the additional modeling of the dependence of the spin-cutoff parameter on excitation energy. Here we avoid the need to model the spin distribution of level densities by using the experimental data to parametrize directly the state densities, for which the BBF does not depend on the spin-cutoff parameter. This approach allows for a local parameterization of state densities that is independent of the spin-cutoff parameter. We provide these parameters in a tabulated form for applications in nuclear reaction calculations and for testing microscopic approaches to state densities., Comment: 12 pages, 3 figures, 1 table
- Published
- 2024
13. VLSBench: Unveiling Visual Leakage in Multimodal Safety
- Author
-
Hu, Xuhao, Liu, Dongrui, Li, Hao, Huang, Xuanjing, and Shao, Jing
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Safety concerns of Multimodal large language models (MLLMs) have gradually become an important problem in various applications. Surprisingly, previous works indicate a counter-intuitive phenomenon that using textual unlearning to align MLLMs achieves comparable safety performances with MLLMs trained with image-text pairs. To explain such a counter-intuitive phenomenon, we discover a visual safety information leakage (VSIL) problem in existing multimodal safety benchmarks, i.e., the potentially risky and sensitive content in the image has been revealed in the textual query. In this way, MLLMs can easily refuse these sensitive text-image queries according to textual queries. However, image-text pairs without VSIL are common in real-world scenarios and are overlooked by existing multimodal safety benchmarks. To this end, we construct multimodal visual leakless safety benchmark (VLSBench) preventing visual safety leakage from image to textual query with 2.4k image-text pairs. Experimental results indicate that VLSBench poses a significant challenge to both open-source and close-source MLLMs, including LLaVA, Qwen2-VL, Llama3.2-Vision, and GPT-4o. This study demonstrates that textual alignment is enough for multimodal safety scenarios with VSIL, while multimodal alignment is a more promising solution for multimodal safety scenarios without VSIL. Please see our code and data at: http://hxhcreate.github.io/VLSBench
- Published
- 2024
14. Probing quantum critical phase from neural network wavefunction
- Author
-
Chen, Haoxiang, Ren, Weiluo, Li, Xiang, and Chen, Ji
- Subjects
Condensed Matter - Strongly Correlated Electrons ,Condensed Matter - Disordered Systems and Neural Networks ,Physics - Computational Physics - Abstract
One-dimensional (1D) systems and models provide a versatile platform for emergent phenomena induced by strong electron correlation. In this work, we extend the newly developed real space neural network quantum Monte Carlo methods to study the quantum phase transition of electronic and magnetic properties. Hydrogen chains of different interatomic distances are explored systematically with both open and periodic boundary conditions, and fully correlated ground state many-body wavefunction is achieved via unsupervised training of neural networks. We demonstrate for the first time that neural networks are capable of capturing the quantum critical behavior of Tomonaga- Luttinger liquid (TLL), which is known to dominate 1D quantum systems. Moreover, we reveal the breakdown of TLL phase and the emergence of a Fermi liquid behavior, evidenced by abrupt changes in the spin structure and the momentum distribution. Such behavior is absent in commonly studied 1D lattice models and is likely due to the involvement of high-energy orbitals of hydrogen atoms. Our work highlights the powerfulness of neural networks for representing complex quantum phases.
- Published
- 2024
15. Multi-Epoch Observations of the Nearby Spiral Galaxy NGC 3938 with the Chandra X-ray Observatory
- Author
-
Raut, Siddhi, Schlegel, Eric M., Pannuti, Thomas G., Jones, Brannon W., and Matallana, Jacobo
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We present an analysis of two epochs of ACIS observations of the SA(s)c spiral galaxy NGC 3938 with the Chandra X-ray Observatory. The total exposure time of the observations was 95 ksec with a limiting unabsorbed luminosity of approximately 10^{38}$ ergs/sec assuming a distance of 22 Mpc. A total of 47 discrete merged sources from both epochs were detected at the 3sigma level or greater with the D25 radius. We demonstrate that at the time of the Chandra observations, the nucleus was not detected. We connect the detected sources to counterparts in other wavebands to the degree possible. Based on the two epochs, we identify three variable sources and an additional two that may have varied between the two observations. We do not formally detect any of the five historical supernovae that have occurred in NGC 3938. The luminosity function of NGC 3938 is compared to a recent compilation of 38 galaxies and we identify a potentially significant problem with the `known' distance to NGC 3938. Star formation rate and metallicity values are also computed; the star formation rate is highly dependent upon the adopted distance. The metallicity appears to lie in the range of 8.2-9.2, consistent with values from other work. We include in an appendix a short discussion of the sources that lie in Chandra's field-of-view but lie outside of NGC 3938., Comment: accepted by AJ November 2024
- Published
- 2024
16. It's Quick to be Square: Fast Quadratisation for Quantum Toolchains
- Author
-
Schmidbauer, Lukas, Lobe, Elisabeth, Schaefer, Ina, and Mauerer, Wolfgang
- Subjects
Quantum Physics - Abstract
Many of the envisioned use-cases for quantum computers involve optimisation processes. While there are many algorithmic primitives to perform the required calculations, all eventually lead to quantum gates operating on quantum bits, with an order as determined by the structure of the objective function and the properties of target hardware. When the structure of the problem representation is not aligned with structure and boundary conditions of the executing hardware, various overheads to degrade the computation may arise, possibly negating any possible quantum advantage. Therefore, automatic transformations of problem representations play an important role in quantum computing when descriptions (semi-)targeted at humans must be cast into forms that can be executed on quantum computers. Mathematically equivalent formulations are known to result in substantially different non-functional properties depending on hardware, algorithm and detail properties of the problem. Given the current state of noisy-intermediate scale quantum hardware (NISQ), these effects are considerably more pronounced than in classical computing. Likewise, efficiency of the transformation itself is relevant because possible quantum advantage may easily be eradicated by the overhead of transforming between representations. In this paper we consider a specific class of higher-level representations (polynomial unbiased binary optimisation problems), and devise novel automatic transformation mechanisms into widely used quadratic unconstrained binary optimisation problems that substantially improve efficiency and versatility over the state of the art. We also identify what influence factors of lower-level details can be abstracted away in the transformation process, and which details must be made available to higher-level abstractions.
- Published
- 2024
17. Wonderful Compactification of a Cartan Subalgebra of a Semisimple Lie Algebra
- Author
-
Evens, Sam and Li, Yu
- Subjects
Mathematics - Representation Theory ,Mathematics - Algebraic Geometry ,Mathematics - Combinatorics ,Mathematics - Symplectic Geometry - Abstract
Let $\mathfrak h$ be a Cartan subalgebra of a complex semisimple Lie algebra $\mathfrak g.$ We define a compactification $\bar{\mathfrak h}$ of $\mathfrak h$, which is analogous to the closure $\bar H$ of the corresponding maximal torus $H$ in the adjoint group of $\mathfrak g$ in its wonderful compactification, which was introduced and studied by De Concini and Procesi \cite{DCP}. We determine the irreducible components of the boundary $\bar{\mathfrak h} - \mathfrak h$ of $\mathfrak h$ in terms of certain maximal root subsystems described by Borel-de Siebenthal theory. We prove that $\bar{\mathfrak h} - \mathfrak h$ is equidimensional, and we prove that $\bar{\mathfrak h}$ is a normal variety. As a consequence, we find an affine paving of $\bar{\mathfrak h}$, and when $\mathfrak g$ is classical, we determine the number of strata in each dimension in terms of Stirling numbers and their variants, thereby computing the Betti numbers of $\bar{\mathfrak h}.$ In the general case, we relate the order relation on strata given by closures of strata to the poset of hyperplane arrangements studied by Orlik and Solomon, and determine the cup product in the cohomology of $\bar{\mathfrak h}.$ Our work has similarities to results of Braden et al \cite{BHMPW} on matroid Schubert varieties, but the connection to root systems facilitates greater precision in some of our results.
- Published
- 2024
18. Linearization (in)stabilities and crossed products
- Author
-
De Vuyst, Julian, Eccles, Stefan, Hoehn, Philipp A., and Kirklin, Josh
- Subjects
High Energy Physics - Theory ,General Relativity and Quantum Cosmology ,Quantum Physics - Abstract
Modular crossed product algebras have recently assumed an important role in perturbative quantum gravity as they lead to an intrinsic regularization of entanglement entropies by introducing quantum reference frames (QRFs) in place of explicit regulators. This is achieved by imposing certain boost constraints on gravitons, QRFs and other fields. Here, we revisit the question of how these constraints should be understood through the lens of perturbation theory and particularly the study of linearization (in)stabilities, exploring when linearized solutions can be integrated to exact ones. Our aim is to provide some clarity about the status of justification, under various conditions, for imposing such constraints on the linearized theory in the $G_N\to0$ limit as they turn out to be of second-order. While for spatially compact spacetimes there is an essentially unambiguous justification, in the presence of boundaries or the absence of isometries this depends on whether one is also interested in second-order observables. Linearization (in)stabilities occur in any gauge-covariant field theory with non-linear equations and to address this in a unified framework, we translate the subject from the usual canonical formulation into a systematic covariant phase space language. This overcomes theory-specific arguments, exhibiting the universal structure behind (in)stabilities, and permits us to cover arbitrary generally covariant theories. We comment on the relation to modular flow and illustrate our findings in several gravity and gauge theory examples., Comment: 42 + 16 pages, 5 figures, comments welcome
- Published
- 2024
19. Transfer Learning for High-dimensional Quantile Regression with Distribution Shift
- Author
-
Bai, Ruiqi, Zhang, Yijiao, Yang, Hanbo, and Zhu, Zhongyi
- Subjects
Statistics - Methodology ,Mathematics - Statistics Theory ,Statistics - Machine Learning - Abstract
Information from related source studies can often enhance the findings of a target study. However, the distribution shift between target and source studies can severely impact the efficiency of knowledge transfer. In the high-dimensional regression setting, existing transfer approaches mainly focus on the parameter shift. In this paper, we focus on the high-dimensional quantile regression with knowledge transfer under three types of distribution shift: parameter shift, covariate shift, and residual shift. We propose a novel transferable set and a new transfer framework to address the above three discrepancies. Non-asymptotic estimation error bounds and source detection consistency are established to validate the availability and superiority of our method in the presence of distribution shift. Additionally, an orthogonal debiased approach is proposed for statistical inference with knowledge transfer, leading to sharper asymptotic results. Extensive simulation results as well as real data applications further demonstrate the effectiveness of our proposed procedure., Comment: 53 pages
- Published
- 2024
20. Silicon Isotopic Composition of Mainstream Presolar SiC Grains Revisited: The Impact of Nuclear Reaction Rate Uncertainties
- Author
-
Fok, Hung Kwan, Pignatari, Marco, Côté, Benoît, and Trappitsch, Reto
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Earth and Planetary Astrophysics ,Astrophysics - Astrophysics of Galaxies - Abstract
Presolar grains are stardust particles that condensed in the ejecta or in the outflows of dying stars and can today be extracted from meteorites. They recorded the nucleosynthetic fingerprint of their parent stars and thus serve as valuable probes of these astrophysical sites. The most common types of presolar silicon carbide grains (called mainstream SiC grains) condensed in the outflows of asymptotic giant branch stars. Their measured silicon isotopic abundances are not significantly influenced by nucleosynthesis within the parent star, but rather represents the pristine stellar composition. Silicon isotopes can thus be used as a proxy for galactic chemical evolution. However, the measured correlation of $^{29}$Si/$^{28}$Si versus $^{30}$Si/$^{28}$Si does not agree with any current chemical evolution model. Here, we use a Monte Carlo model to vary nuclear reaction rates within their theoretical or experimental uncertainties and process them through stellar nucleosynthesis and galactic chemical evolution models to study the variation of silicon isotope abundances based on these nuclear reaction rate uncertainties. We find that these uncertainties can indeed be responsible for the discrepancy between measurements and models and that the slope of the silicon isotope correlation line measured in mainstream SiC grains agrees with chemical evolution models within the nuclear reaction rate uncertainties. Our result highlights the importance of future precision reaction rate measurements for resolving the apparent data-model discrepancy.
- Published
- 2024
- Full Text
- View/download PDF
21. Enhancement of the superconducting transition temperature due to multiband effect in the topological nodal-line semimetal Pb$_{1-x}$Sn$_{x}$TaSe$_{2}$
- Author
-
Kumarasinghe, K., Rahman, A., Tomlinson, M., and Nakajima, Y.
- Subjects
Condensed Matter - Superconductivity - Abstract
We report a systematic study of the normal-state and superconducting properties of single crystal Pb$_{1-x}$Sn$_{x}$TaSe$_{2}$ $(0\leq x \leq 0.23)$. Sn doping enhances the superconducting temperature $T_{c}$ up to 5.1 K, while also significantly increasing impurity scattering in the crystals. For $x=0$, the specific heat jump at $T_{c}$ exceeds the Bardeen-Cooper-Schrieffer (BCS) weak-coupling value of 1.43, indicating the realization of strong-coupling superconductivity in PbTaSe$_{2}$. In contrast, substituting Pb with Sn lowers the specific heat jump at $T_{c}$ below the BSC value of 1.43, which cannot be explained by a single-gap model. Rather, the observed specific heat of Sn-doped PbTaSe$_{2}$ is reproduced by a two-gap model. Our observations suggest that additional Fermi pockets appear due to a reduction of the spin-orbit gap with Sn doping, and the multiband effect arising from these emergent Fermi pockets enhances the effective electron-phonon coupling strength, leading to the increase in $T_{c}$., Comment: 6 pages, 4 figures
- Published
- 2024
22. On Domain-Specific Post-Training for Multimodal Large Language Models
- Author
-
Cheng, Daixuan, Huang, Shaohan, Zhu, Ziyu, Zhang, Xintong, Zhao, Wayne Xin, Luan, Zhongzhi, Dai, Bo, and Zhang, Zhenliang
- Subjects
Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Recent years have witnessed the rapid development of general multimodal large language models (MLLMs). However, adapting general MLLMs to specific domains, such as scientific fields and industrial applications, remains less explored. This paper systematically investigates domain adaptation of MLLMs through post-training, focusing on data synthesis, training pipelines, and task evaluation. (1) Data Synthesis: Using open-source models, we develop a visual instruction synthesizer that effectively generates diverse visual instruction tasks from domain-specific image-caption pairs. Our synthetic tasks surpass those generated by manual rules, GPT-4, and GPT-4V in enhancing the domain-specific performance of MLLMs. (2) Training Pipeline: While the two-stage training--initially on image-caption pairs followed by visual instruction tasks--is commonly adopted for developing general MLLMs, we apply a single-stage training pipeline to enhance task diversity for domain-specific post-training. (3) Task Evaluation: We conduct experiments in two domains, biomedicine and food, by post-training MLLMs of different sources and scales (e.g., Qwen2-VL-2B, LLaVA-v1.6-8B, Llama-3.2-11B), and then evaluating MLLM performance on various domain-specific tasks. To support further research in MLLM domain adaptation, we will open-source our implementations.
- Published
- 2024
23. Cyclotomic synthetic spectra
- Author
-
Antieau, Benjamin and Riggenbach, Noah
- Subjects
Mathematics - K-Theory and Homology ,Mathematics - Algebraic Geometry ,Mathematics - Algebraic Topology - Abstract
We define an $\infty$-category $\mathrm{CycSyn}$ of $p$-typical cyclotomic synthetic spectra and prove that the motivic filtration on $\mathrm{THH}(R;\mathbf{Z}_p)$, defined by Bhatt, Morrow, and Scholze when $R$ is quasisyntomic and by Hahn, Raksit, and Wilson in the chromatically quasisyntomic case, naturally admits the structure of a $p$-typical cyclotomic synthetic spectrum. As a consequence, we obtain new bounds on the syntomic cohomology of connective chromatically quasisyntomic $\mathbf{E}_\infty$-ring spectra., Comment: 54 pages, comments welcome!
- Published
- 2024
24. Altermagnetic multiferroics and altermagnetoelectric effect
- Author
-
Šmejkal, Libor
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
Magnetoelectric multiferroics are highly sought after for applications in low-power electronics and for advancing fundamental research, including axion insulators and dark matter detection. However, achieving a combination of ferroic spin and electric orders, along with their controllable switching, remains a significant challenge in conventional ferromagnets and antiferromagnets. Here, we present first-principles evidence that time-reversal symmetry-breaking altermagnetic spin polarization with relatively high critical temperatures can emerge in ferroelectrics BaCuF$_4$ (T$_N$ $\sim$ 275K) and Ca$_3$Mn$_2$O$_7$ (T$_N$ $\sim$ 110K). Furthermore, we classify all possible altermagnetic polar spin groups, revealing altermagnetism in a collinear phase of BiFeO$_3$. We also propose an altermagnetoelectric effect, a nonrelativistic cross-coupling between altermagnetic spin polarization and ferroelectric polarization, mediated by a rotation of nonmagnetic polyhedra in the lattice structure. Our findings suggest an alternative pathway towards high-temperature magnetoelectric multiferroicity and the electric field control of altermagnetic order parameters., Comment: 6 pages, 4 figures, 1 table
- Published
- 2024
25. Spatio-Temporal Energy Cascade in Three-Dimensional Magnetohydrodynamic Turbulence
- Author
-
Arrò, Giuseppe, Li, Hui, and Matthaeus, William H.
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Physics - Plasma Physics ,Physics - Space Physics - Abstract
We present a new scale decomposition method to investigate turbulence in wavenumber-frequency space. Using 3D magnetohydrodynamic turbulence simulations, we show that magnetic fluctuations with time scales longer than the nonlinear time exhibit an inverse cascade toward even smaller frequencies. Low frequency magnetic fluctuations support turbulence, acting as an energy reservoir that is converted into plasma kinetic energy, the latter cascading toward large wavenumbers and frequencies, where it is dissipated. Our results shed new light on the spatio-temporal properties of turbulence, potentially explaining the origin and role of low frequency turbulent fluctuations in the solar wind.
- Published
- 2024
26. Sparse Pseudospectral Shattering
- Author
-
Shah, Rikhav, Srivastava, Nikhil, and Zeng, Edward
- Subjects
Mathematics - Probability ,Mathematics - Numerical Analysis ,60B20, 65F22, 65F50, 68Q87 - Abstract
The eigenvalues and eigenvectors of nonnormal matrices can be unstable under perturbations of their entries. This renders an obstacle to the analysis of numerical algorithms for non-Hermitian eigenvalue problems. A recent technique to handle this issue is pseudospectral shattering [BGVKS23], showing that adding a random perturbation to any matrix has a regularizing effect on the stability of the eigenvalues and eigenvectors. Prior work has analyzed the regularizing effect of dense Gaussian perturbations, where independent noise is added to every entry of a given matrix [BVKS20, BGVKS23, BKMS21, JSS21]. We show that the same effect can be achieved by adding a sparse random perturbation. In particular, we show that given any $n\times n$ matrix $M$ of polynomially bounded norm: (a) perturbing $O(n\log^2(n))$ random entries of $M$ by adding i.i.d. complex Gaussians yields $\log\kappa_V(A)=O(\text{poly}\log(n))$ and $\log (1/\eta(A))=O(\text{poly}\log(n))$ with high probability; (b) perturbing $O(n^{1+\alpha})$ random entries of $M$ for any constant $\alpha>0$ yields $\log\kappa_V(A)=O_\alpha(\log(n))$ and $\log(1/\eta(A))=O_\alpha(\log(n))$ with high probability. Here, $\kappa_V(A)$ denotes the condition number of the eigenvectors of the perturbed matrix $A$ and $\eta(A)$ denotes its minimum eigenvalue gap. A key mechanism of the proof is to reduce the study of $\kappa_V(A)$ to control of the pseudospectral area and minimum eigenvalue gap of $A$, which are further reduced to estimates on the least two singular values of shifts of $A$. We obtain the required least singular value estimates via a streamlining of an argument of Tao and Vu [TV07] specialized to the case of sparse complex Gaussian perturbations.
- Published
- 2024
27. Efficiency Enhancement of c-Si/TiO$_2$ Heterojunction Thin Film Solar Cell Using Hybrid Metal-Dielectric Nanostructures
- Author
-
Sarkar, Soikot and Choudhury, Sajid Muhaimin
- Subjects
Physics - Optics ,Physics - Applied Physics - Abstract
The hybrid metal-dielectric nanostructures (HMDN) are promising candidates to address the ohmic loss by conventional nanostructures in photovoltaic applications by strong confinement and high scattering directivity. In this study, we present a c-Si/TiO$_2$ heterojunction thin film solar cell (TFSC) where a pair of triangular HMDN comprised of Ag and AZO was utilized to enhance the longer wavelength light absorption. The presence of the TiO$_2$ inverted pyramid layer, in combination with the ITO and SiO$_2$-based pyramid layers at the front, enhanced the shorter wavelength light absorption by increasing the optical path and facilitating the coupling of incoming light in photonic mode. Consequently, the average absorption by 1000 nm thick photoactive layer reached 83.32 % for AM 1.5G within the wavelength range of 300 - 1100 nm which was investigated by employing the finite-difference time-domain (FDTD) method. The electric field profile and current density profile demonstrated the respective contributions of each layer in the absorption of light at shorter and longer wavelengths. The structure exhibited a short circuit current density ($J_{sc}$) of 37.96 mA/cm$^2$ and a power conversion efficiency ($PCE$) of 17.42 %. The efficiency of our proposed structure experienced a maximum relative change of 0.34 % when a polarized light was exposed with an angle of 0$^\circ$ to 90$^\circ$. The incorporation of self-heating in non-isothermal conditions reduced $PCE$ by $13.77 \%$. In addition, the comparative analysis to assess the impact of HMDN on our structure revealed a $4.54 \%$ increase in $PCE$ of the structure with metallic nanostructures, paving the way for the utilization of HMDN to enhance the performance of TFSC., Comment: 46 page 10 figures
- Published
- 2024
28. Scalable Out-of-distribution Robustness in the Presence of Unobserved Confounders
- Author
-
Prashant, Parjanya, Khatami, Seyedeh Baharan, Ribeiro, Bruno, and Salimi, Babak
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
We consider the task of out-of-distribution (OOD) generalization, where the distribution shift is due to an unobserved confounder ($Z$) affecting both the covariates ($X$) and the labels ($Y$). In this setting, traditional assumptions of covariate and label shift are unsuitable due to the confounding, which introduces heterogeneity in the predictor, i.e., $\hat{Y} = f_Z(X)$. OOD generalization differs from traditional domain adaptation by not assuming access to the covariate distribution ($X^\text{te}$) of the test samples during training. These conditions create a challenging scenario for OOD robustness: (a) $Z^\text{tr}$ is an unobserved confounder during training, (b) $P^\text{te}{Z} \neq P^\text{tr}{Z}$, (c) $X^\text{te}$ is unavailable during training, and (d) the posterior predictive distribution depends on $P^\text{te}(Z)$, i.e., $\hat{Y} = E_{P^\text{te}(Z)}[f_Z(X)]$. In general, accurate predictions are unattainable in this scenario, and existing literature has proposed complex predictors based on identifiability assumptions that require multiple additional variables. Our work investigates a set of identifiability assumptions that tremendously simplify the predictor, whose resulting elegant simplicity outperforms existing approaches., Comment: 24 pages, 3 figures
- Published
- 2024
29. New bulk cone singularities in Vaidya-like spacetimes from large $c$ conformal blocks
- Author
-
Leung, Henry
- Subjects
High Energy Physics - Theory ,General Relativity and Quantum Cosmology - Abstract
Bulk cone singularities are singularities in boundary two-point functions at points separated by a null geodesic in the bulk, but not in the boundary. In this work, we describe a new type of bulk cone singularities in a family of Vaidya-like spacetimes that are labeled by the radius $r_+$ of the resulting black hole. We find a sharp transition in the causal structure within this family of spacetimes at $r_+=l$, the AdS length. In particular, there are bulk cone singularities that do not exist in the $r_+>l$ case, but appear for $r_+
- Published
- 2024
30. Dynamic EEG-fMRI mapping: Revealing the relationship between brain connectivity and cognitive state
- Author
-
Liu, Guiran and Zhu, Binrong
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
This study investigated the dynamic connectivity patterns between EEG and fMRI modalities, contributing to our understanding of brain network interactions. By employing a comprehensive approach that integrated static and dynamic analyses of EEG-fMRI data, we were able to uncover distinct connectivity states and characterize their temporal fluctuations. The results revealed modular organization within the intrinsic connectivity networks (ICNs) of the brain, highlighting the significant roles of sensory systems and the default mode network. The use of a sliding window technique allowed us to assess how functional connectivity varies over time, further elucidating the transient nature of brain connectivity. Additionally, our findings align with previous literature, reinforcing the notion that cognitive states can be effectively identified through short-duration data, specifically within the 30-60 second timeframe. The established relationships between connectivity strength and cognitive processes, particularly during different visual states, underscore the relevance of our approach for future research into brain dynamics. Overall, this study not only enhances our understanding of the interplay between EEG and fMRI signals but also paves the way for further exploration into the neural correlates of cognitive functions and their implications in clinical settings. Future research should focus on refining these methodologies and exploring their applications in various cognitive and clinical contexts., Comment: 15 pages, Subjects: Machine Learning (cs.LG); Human-Computer Interaction (cs.HC); Signal Processing (eess.SP)
- Published
- 2024
31. The scalar angular Teukolsky equation and its solution for the Taub-NUT spacetime
- Author
-
Willenborg, Felix, Philipp, Dennis, and Lämmerzahl, Claus
- Subjects
General Relativity and Quantum Cosmology - Abstract
The Taub-NUT spacetime offers many curious insights into the solutions of Einstein's electrovacuum equation. In the Bonnor interpretation, this spacetime possesses so-called Misner strings, which induce phenomena strikingly analogous to Dirac strings in the context of magnetic monopoles. The study of scattering in the latter case leads to a quantization of the product of electric charge and magnetic moment, sometimes called the Dirac condition. To enable a thorough discussion of scattering on the Taub-NUT spacetime, linear perturbations are considered in the Newman-Penrose formalism and separated into angular and radial equations. The angular Teukolsky equation is discussed in detail, and eigenvalues are derived to subsequently solve the differential equation in terms of solutions to the confluent Heun equation. In the Bonnor interpretation of the Taub-NUT spacetime, there is no analog property to the Dirac condition. The choice of spacetime parameters remains unconstrained. However, for a particular parameter choice, one can rederive the well-known "Misner" condition, in which a product of frequency and NUT charge is of integer value, as well as another product additionally including the Manko-Ruiz parameter. The results of this work will allow us to solve analytically for wave-optical scattering in order to, e.g., examine the wave-optical image of Taub-NUT black holes., Comment: 18 pages, 3 figures
- Published
- 2024
32. SIMS: Simulating Human-Scene Interactions with Real World Script Planning
- Author
-
Wang, Wenjia, Pan, Liang, Dou, Zhiyang, Liao, Zhouyingcheng, Lou, Yuke, Yang, Lei, Wang, Jingbo, and Komura, Taku
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Graphics - Abstract
Simulating long-term human-scene interaction is a challenging yet fascinating task. Previous works have not effectively addressed the generation of long-term human scene interactions with detailed narratives for physics-based animation. This paper introduces a novel framework for the planning and controlling of long-horizon physical plausible human-scene interaction. On the one hand, films and shows with stylish human locomotions or interactions with scenes are abundantly available on the internet, providing a rich source of data for script planning. On the other hand, Large Language Models (LLMs) can understand and generate logical storylines. This motivates us to marry the two by using an LLM-based pipeline to extract scripts from videos, and then employ LLMs to imitate and create new scripts, capturing complex, time-series human behaviors and interactions with environments. By leveraging this, we utilize a dual-aware policy that achieves both language comprehension and scene understanding to guide character motions within contextual and spatial constraints. To facilitate training and evaluation, we contribute a comprehensive planning dataset containing diverse motion sequences extracted from real-world videos and expand them with large language models. We also collect and re-annotate motion clips from existing kinematic datasets to enable our policy learn diverse skills. Extensive experiments demonstrate the effectiveness of our framework in versatile task execution and its generalization ability to various scenarios, showing remarkably enhanced performance compared with existing methods. Our code and data will be publicly available soon.
- Published
- 2024
33. Geometry of fibers of the multiplication map of deep linear neural networks
- Author
-
Lehalleur, SImon Pepin and Rimányi, Richárd
- Subjects
Mathematics - Algebraic Geometry ,Mathematics - Representation Theory ,Statistics - Machine Learning ,16G20 05E14 62F15 62R01 - Abstract
We study the geometry of the algebraic set of tuples of composable matrices which multiply to a fixed matrix, using tools from the theory of quiver representations. In particular, we determine its codimension $C$ and the number $\theta$ of its top-dimensional irreducible components. Our solution is presented in three forms: a Poincar\'e series in equivariant cohomology, a quadratic integer program, and an explicit formula. In the course of the proof, we establish a surprising property: $C$ and $\theta$ are invariant under arbitrary permutations of the dimension vector. We also show that the real log-canonical threshold of the function taking a tuple to the square Frobenius norm of its product is $C/2$. These results are motivated by the study of deep linear neural networks in machine learning and Bayesian statistics (singular learning theory) and show that deep linear networks are in a certain sense ``mildly singular"., Comment: 28 pages, 2 figures. Comments welcome!
- Published
- 2024
34. Handling irresolvable conflicts in the Semantic Web: an RDF-based conflict-tolerant version of the Deontic Traditional Scheme
- Author
-
Robaldo, Livio and Pozzato, Gianluca
- Subjects
Computer Science - Artificial Intelligence - Abstract
This paper presents a new ontology that implements the well-known Deontic Traditional Scheme in RDFs and SPARQL, fit to handle irresolvable conflicts, i.e., situations in which two or more statements prescribe conflicting obligations, prohibitions, or permissions, with none of them being "stronger" than the other one(s). In our view, this paper marks a significant advancement in standard theoretical research in formal Deontic Logic. Most contemporary approaches in this field are confined to the propositional level, mainly focus on the notion of obligation, and lack implementations. The proposed framework is encoded in RDF, which is not only a first-order language but also the most widely used knowledge representation language, as it forms the foundation of the Semantic Web. Moreover, the proposed computational ontology formalizes all deontic modalities defined in the Deontic Traditional Scheme, without specifically focusing on obligations, and offers constructs to model and reason with various types of irresolvable conflicts, violations, and the interaction between deontic modalities and contextual constraints in a given state of affairs. To the best of our knowledge, no existing approach in the literature addresses all these aspects within a unified integrated framework. All examples presented and discussed in this paper, together with Java code and clear instructions to re-execute them locally, are available at https://github.com/liviorobaldo/conflict-tolerantDeonticTraditionalScheme
- Published
- 2024
35. Traction force microscopy for linear and nonlinear elastic materials as a parameter identification inverse problem
- Author
-
Sarnighausen, Gesa, Nguyen, Tram Thi Ngoc, Hohage, Thorsten, Sinha, Mangalika, Koester, Sarah, Betz, Timo, Schwarz, Ulrich Sebastian, and Wald, Anne
- Subjects
Mathematics - Numerical Analysis ,92-08, 35Q92, 35R30 - Abstract
Traction force microscopy is a method widely used in biophysics and cell biology to determine forces that biological cells apply to their environment. In the experiment, the cells adhere to a soft elastic substrate, which is then deformed in response to cellular traction forces. The inverse problem consists in computing the traction stress applied by the cell from microscopy measurements of the substrate deformations. In this work, we consider a linear model, in which 3D forces are applied at a 2D interface, called 2.5D traction force microscopy, and a nonlinear pure 2D model, from which we directly obtain a linear pure 2D model. All models lead to a linear resp. nonlinear parameter identification problem for a boundary value problem of elasticity. We analyze the respective forward operators and conclude with some numerical experiments for simulated and experimental data., Comment: 28 pages, 9 figures
- Published
- 2024
36. A gravitational wave detectable candidate Type Ia supernova progenitor
- Author
-
Chickles, Emma T., Burdge, Kevin B., Chakraborty, Joheen, Dhillon, Vik S., Draghis, Paul, Hughes, Scott A., Munday, James, Rappaport, Saul A., Tonry, John, Bauer, Evan, Brown, Alex, Castro, Noel, Chakrabarty, Deepto, Dyer, Martin, El-Badry, Kareem, Frebel, Anna, Furesz, Gabor, Garbutt, James, Green, Matthew J., Householder, Aaron, Jarvis, Daniel, Kara, Erin, Kennedy, Mark R., Kerry, Paul, Littlefair, Stuart P, McCormac, James, Mo, Geoffrey, Ng, Mason, Parsons, Steven, Pelisoli, Ingrid, Pike, Eleanor, Prince, Thomas A., Ricker, George R., van Roestel, Jan, Sahman, David, Shen, Ken J., Simcoe, Robert A., Vanderburg, Andrew, and Wong, Tin Long Sunny
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
Type Ia supernovae, critical for studying cosmic expansion, arise from thermonuclear explosions of white dwarfs, but their precise progenitor pathways remain unclear. Growing evidence supports the ``double-degenerate'' scenario, where two white dwarfs interact. The absence of other companion types capable of explaining the observed Ia rate, along with observations of hyper-velocity white dwarfs interpreted as surviving companions of such systems provide compelling evidence in favor of this scenario. Upcoming millihertz gravitational wave observatories like the Laser Interferometer Space Antenna (LISA) are expected to detect thousands of double-degenerate systems, though the most compact known candidate Ia progenitors produce only marginally detectable gravitational wave signals. Here, we report observations of ATLAS J1138-5139, a binary white dwarf system with an orbital period of 28 minutes. Our analysis reveals a 1 solar mass carbon-oxygen white dwarf accreting from a helium-core white dwarf. Given its mass, the accreting carbon-oxygen white dwarf is poised to trigger a typical-luminosity Type Ia supernova within a few million years, or to evolve into a stably mass-transferring AM CVn system. ATLAS J1138-5139 provides a rare opportunity to calibrate binary evolution models by directly comparing observed orbital parameters and mass transfer rates closer to merger than any previously identified candidate Type Ia progenitor. Its compact orbit ensures detectability by LISA, demonstrating the potential of millihertz gravitational wave observatories to reveal a population of Type Ia progenitors on a Galactic scale, paving the way for multi-messenger studies offering insights into the origins of these cosmologically significant explosions., Comment: 40 pages, 7 figures, 2 tables
- Published
- 2024
37. Sparse Partitions of Graphs with Bounded Clique Number
- Author
-
Girão, António and Insley, Toby
- Subjects
Mathematics - Combinatorics - Abstract
We prove that for each integer $r\geq 2$, there exists a constant $C_r>0$ with the following property: for any $0<\varepsilon \leq 1/2$ and any graph $G$ with clique number at most $r,$ there is a partition of $V(G)$ into at most $(1/\varepsilon)^{C_r}$ sets $S_1, \dots, S_t,$ such that $G[S_i]$ has maximum degree at most $\varepsilon |S_i|$ for each $1 \leq i \leq t.$ This answers a question of Fox, Nguyen, Scott and Seymour, who proved a similar result for graphs with no induced $P_4.$, Comment: 8 pp
- Published
- 2024
38. Learning Feedback Mechanisms for Measurement-Based Variational Quantum State Preparation
- Author
-
Puente, Daniel Alcalde and Rizzi, Matteo
- Subjects
Quantum Physics - Abstract
This work introduces a self-learning protocol that incorporates measurement and feedback into variational quantum circuits for efficient quantum state preparation. By combining projective measurements with conditional feedback, the protocol learns state preparation strategies that extend beyond unitary-only methods, leveraging measurement-based shortcuts to reduce circuit depth. Using the spin-1 Affleck-Kennedy-Lieb-Tasaki state as a benchmark, the protocol learns high-fidelity state preparation by overcoming a family of measurement induced local minima through adjustments of parameter update frequencies and ancilla regularization. Despite these efforts, optimization remains challenging due to the highly non-convex landscapes inherent to variational circuits. The approach is extended to larger systems using translationally invariant ans\"atze and recurrent neural networks for feedback, demonstrating scalability. Additionally, the successful preparation of a specific AKLT state with desired edge modes highlights the potential to discover new state preparation protocols where none currently exist. These results indicate that integrating measurement and feedback into variational quantum algorithms provides a promising framework for quantum state preparation.
- Published
- 2024
39. Theory of the photonic Joule effect in superconducting circuits
- Author
-
Cailleaux, Samuel, Ficheux, Quentin, Roch, Nicolas, and Basko, Denis M.
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Superconductivity ,Quantum Physics - Abstract
When a small system is coupled to a bath, it is generally assumed that the state of the bath remains unaffected by the system due to the bath's large number of degrees of freedom. Here we show theoretically that this assumption can be easily violated for photonic baths typically used in experiments involving superconducting circuits. We analyze the dynamics of a voltage-biased Josephson junction coupled to a photonic bath, represented as a long Josephson junction chain. Our findings show that the system can reach a non-equilibrium steady state where the photonic degrees of freedom become significantly overheated, leading to a qualitative change in the current-voltage $I-V$ curve. This phenomenon is analogous to the Joule effect observed in electrical conductors, where flowing current can substantially heat up electrons. Recognizing this effect is crucial for the many applications of high-impedance environments in quantum technologies.
- Published
- 2024
40. Quantifying the synthetic and real domain gap in aerial scene understanding
- Author
-
Marcu, Alina
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Quantifying the gap between synthetic and real-world imagery is essential for improving both transformer-based models - that rely on large volumes of data - and datasets, especially in underexplored domains like aerial scene understanding where the potential impact is significant. This paper introduces a novel methodology for scene complexity assessment using Multi-Model Consensus Metric (MMCM) and depth-based structural metrics, enabling a robust evaluation of perceptual and structural disparities between domains. Our experimental analysis, utilizing real-world (Dronescapes) and synthetic (Skyscenes) datasets, demonstrates that real-world scenes generally exhibit higher consensus among state-of-the-art vision transformers, while synthetic scenes show greater variability and challenge model adaptability. The results underline the inherent complexities and domain gaps, emphasizing the need for enhanced simulation fidelity and model generalization. This work provides critical insights into the interplay between domain characteristics and model performance, offering a pathway for improved domain adaptation strategies in aerial scene understanding., Comment: 17 pages (including references), 5 figures, 2 tables. Accepted for publication in the "Scientific Bulletin", Series C, Electrical Engineering and Computer Science, ISSN 2286-3540
- Published
- 2024
41. Interacting Dark Sector (ETHOS $n=0$): Cosmological Constraints from SPT Cluster Abundance with DES and HST Weak Lensing Data
- Author
-
Mazoun, Asmaa, Bocquet, Sebastian, Mohr, Joseph J., Garny, Mathias, Rubira, Henrique, Klein, Matthias, Bleem, Lindsey, Grandis, Sebastian, and Schrabback, Tim
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
We use galaxy cluster abundance measurements from the South Pole Telescope (SPT) enhanced by Multi-Component Matched Filter (MCMF) confirmation and complemented with mass information obtained using weak-lensing data from Dark Energy Survey Year~3 (DES Y3) and targeted Hubble Space Telescope (HST) observations for probing deviations from the cold dark matter paradigm. Concretely, we consider a class of dark sector models featuring interactions between dark matter (DM) and a dark radiation (DR) component within the framework of the Effective Theory of Structure Formation (ETHOS). We focus on scenarios that lead to power suppression over a wide range of scales, and thus can be tested with data sensitive to large scales, as realized for example for DM$-$DR interactions following from an unbroken non-Abelian $SU(N)$ gauge theory (interaction rate with power-law index $n=0$ within the ETHOS parameterization). Cluster abundance measurements are mostly sensitive to the amount of DR interacting with DM, parameterized by the ratio of DR temperature to the cosmic microwave background (CMB) temperature, $\xi_{\rm DR}=T_{\rm DR}/T_{\rm CMB}$. We find an upper limit $\xi_{\rm DR}<17\%$ at $95\%$ credibility. When the cluster data are combined with Planck 2018 CMB data along with baryon acoustic oscillation (BAO) measurements we find $\xi_{\rm DR}<10\%$, corresponding to a limit on the abundance of interacting DR that is around three times tighter than that from CMB+BAO data alone. We also discuss the complementarity of weak lensing informed cluster abundance studies with probes sensitive to smaller scales, explore the impact on our analysis of massive neutrinos, and comment on a slight preference for the presence of a non-zero interacting DR abundance, which enables a physical solution to the $S_8$ tension., Comment: 18 pages, 7 figures
- Published
- 2024
42. Coulomb Gauges and Regularity for Stationary Weak Yang$-$Mills Connections in Supercritical Dimension
- Author
-
Caniato, Riccardo and Rivière, Tristan
- Subjects
Mathematics - Differential Geometry ,Mathematics - Analysis of PDEs ,58E15 (primary), 49Q15, 49Q20, 53C65, 81T13 (secondary) - Abstract
We prove that stationary Yang$-$Mills fields in dimensions 5 belonging to the variational class of weak connections are smooth away from a closed singular set $S$ of vanishing 1-dimensional Hausdorff measure. Our proof is based on an $\varepsilon$-regularity theorem, which generalizes to this class of weak connections the existing previous $\varepsilon$-regularity results by G. Tian for smooth connections, by Y. Meyer and the second author for Sobolev and approximable connections, and by T. Tao and G. Tian for admissible connections (which are weak limits of smooth Yang$-$Mills fields). On the path towards establishing $\varepsilon$-regularity, a pivotal step is the construction of controlled Coulomb gauges for general weak connections under small Morrey norm assumptions., Comment: 56 pages. Any comments are welcome
- Published
- 2024
43. Classical and Quantum Algorithms for the Deterministic L-system Inductive Inference Problem
- Author
-
Lotfi, Ali, McQuillan, Ian, and Rayan, Steven
- Subjects
Quantum Physics ,Computer Science - Computation and Language ,Computer Science - Data Structures and Algorithms ,Computer Science - Formal Languages and Automata Theory ,Computer Science - Machine Learning - Abstract
L-systems can be made to model and create simulations of many biological processes, such as plant development. Finding an L-system for a given process is typically solved by hand, by experts, in a hugely time-consuming process. It would be significant if this could be done automatically from data, such as from sequences of images. In this paper, we are interested in inferring a particular type of L-system, deterministic context-free L-system (D0L-system) from a sequence of strings. We introduce the characteristic graph of a sequence of strings, which we then utilize to translate our problem (inferring D0L-system) in polynomial time into the maximum independent set problem (MIS) and the SAT problem. After that, we offer a classical exact algorithm and an approximate quantum algorithm for the problem., Comment: 16 pages, 1 figure
- Published
- 2024
44. Normed modules, integral sequences, and integrals with variable upper limits
- Author
-
Liu, Miantao, Liu, Yu-Zhe, and Liu, Shengda
- Subjects
Mathematics - Category Theory ,Mathematics - Classical Analysis and ODEs ,16D10, 16G10, 46H25 - Abstract
This paper introduces a novel framework for categorizing the Lebesgue integral with variable upper limits, utilizing normed modules over finite-dimensional $k$-algebras $\mathit{\Lambda}$ and the category $\mathscr{A}^p$ of $\mathit{\Lambda}$. We redefine the integration process by incorporating integral partial ordered set, which provide a categorification of integral with variable upper limits. Moreover, we provide two applications for this categorification in Chapters 5 and 6., Comment: 32 pages
- Published
- 2024
45. Gravitational form factors of the deuteron
- Author
-
Panteleeva, J. Yu., Epelbaum, E., Gasparyan, A. M., and Gegelia, J.
- Subjects
Nuclear Theory ,High Energy Physics - Phenomenology - Abstract
The gravitational form factors of the deuteron are calculated in the framework of non-relativistic chiral effective field theory. Non-relativistic reduction of the matrix element of the energy-momentum tensor operator for spin-one systems is worked out, and the gravitational form factors of the deuteron are extracted from the three-point function of the energy-momentum tensor using the LSZ reduction formula. The obtained form factors are compared to results of model calculations available in the literature.
- Published
- 2024
46. Another look at inference after prediction
- Author
-
Gronsbell, Jessica, Gao, Jianhui, Shi, Yaqi, McCaw, Zachary R., and Cheng, David
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
Prediction-based (PB) inference is increasingly used in applications where the outcome of interest is difficult to obtain, but its predictors are readily available. Unlike traditional inference, PB inference performs statistical inference using a partially observed outcome and a set of covariates by leveraging a prediction of the outcome generated from a machine learning (ML) model. Motwani and Witten (2023) recently revisited two innovative PB inference approaches for ordinary least squares. They found that the method proposed by Wang et al. (2020) yields a consistent estimator for the association of interest when the ML model perfectly captures the underlying regression function. Conversely, the prediction-powered inference (PPI) method proposed by Angelopoulos et al. (2023) yields valid inference regardless of the model's accuracy. In this paper, we study the statistical efficiency of the PPI estimator. Our analysis reveals that a more efficient estimator, proposed 25 years ago by Chen and Chen (2000), can be obtained by simply adding a weight to the PPI estimator. We also contextualize PB inference with methods from the economics and statistics literature dating back to the 1960s. Our extensive theoretical and numerical analyses indicate that the Chen and Chen (CC) estimator offers a balance between robustness to ML model specification and statistical efficiency, making it the preferred choice for use in practice.
- Published
- 2024
47. Memory Efficient GPU-based Label Propagation Algorithm (LPA) for Community Detection on Large Graphs
- Author
-
Sahu, Subhajit
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Social and Information Networks ,G.2.2 ,I.5.3 - Abstract
Community detection involves grouping nodes in a graph with dense connections within groups, than between them. We previously proposed efficient multicore (GVE-LPA) and GPU-based ($\nu$-LPA) implementations of Label Propagation Algorithm (LPA) for community detection. However, these methods incur high memory overhead due to their per-thread/per-vertex hashtables. This makes it challenging to process large graphs on shared memory systems. In this report, we introduce memory-efficient GPU-based LPA implementations, using weighted Boyer-Moore (BM) and Misra-Gries (MG) sketches. Our new implementation, $\nu$MG8-LPA, using an 8-slot MG sketch, reduces memory usage by 98x and 44x compared to GVE-LPA and $\nu$-LPA, respectively. It is also 2.4x faster than GVE-LPA and only 1.1x slower than $\nu$-LPA, with minimal quality loss (4.7%/2.9% drop compared to GVE-LPA/$\nu$-LPA)., Comment: 18 pages, 7 figures, 1 table
- Published
- 2024
48. Universal non-Hermitian transport in disordered systems
- Author
-
Li, Bo, Chen, Chuan, and Wang, Zhong
- Subjects
Quantum Physics ,Condensed Matter - Disordered Systems and Neural Networks ,Condensed Matter - Statistical Mechanics ,Physics - Optics - Abstract
In disordered Hermitian systems, localization of energy eigenstates prohibits wave propagation. In non-Hermitian systems, however, wave propagation is possible even when the eigenstates of Hamiltonian are exponentially localized by disorders. We find in this regime that non-Hermitian wave propagation exhibits novel universal scaling behaviors without Hermitian counterpart. Furthermore, our theory demonstrates how the tail of imaginary-part density of states dictates wave propagation in the long-time limit. Specifically, for the three typical classes, namely the Gaussian, the uniform, and the linear imaginary-part density of states, we obtain logarithmically suppressed sub-ballistic transport, and two types of subdiffusion with exponents that depend only on spatial dimensions, respectively. Our work highlights the fundamental differences between Hermitian and non-Hermitian Anderson localization, and uncovers unique universality in non-Hermitian wave propagation., Comment: 5+10 pages,3+2 figures
- Published
- 2024
49. Choice and independence of premise rules in intuitionistic set theory
- Author
-
Frittaion, Emanuele, Nemoto, Takako, and Rathjen, Michael
- Subjects
Mathematics - Logic - Abstract
Choice and independence of premise principles play an important role in characterizing Kreisel's modified realizability and G\"odel's Dialectica interpretation. In this paper we show that a great many intuitionistic set theories are closed under the corresponding rules for finite types over $\mathbb{N}$. It is also shown that the existence property (or existential definability property) holds for statements of the form $\exists y^{\sigma}\, \varphi(y)$, where the variable $y$ ranges over objects of finite type $\sigma$. This applies in particular to ${\sf CZF}$ (Constructive Zermelo-Fraenkel set theory) and ${\sf IZF}$ (Intuitionistic Zermelo-Fraenkel set theory), two systems known not to have the general existence property. On the technical side, the paper uses a method that amalgamates generic realizability for set theory with truth, whereby the underlying partial combinatory algebra is required to contain all objects of finite type.
- Published
- 2024
50. Noncommutative Model Selection for Data Clustering and Dimension Reduction Using Relative von Neumann Entropy
- Author
-
Guzmán-Tristán, Araceli and Rieser, Antonio
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning ,Statistics - Other Statistics - Abstract
We propose a pair of completely data-driven algorithms for unsupervised classification and dimension reduction, and we empirically study their performance on a number of data sets, both simulated data in three-dimensions and images from the COIL-20 data set. The algorithms take as input a set of points sampled from a uniform distribution supported on a metric space, the latter embedded in an ambient metric space, and they output a clustering or reduction of dimension of the data. They work by constructing a natural family of graphs from the data and selecting the graph which maximizes the relative von Neumann entropy of certain normalized heat operators constructed from the graphs. Once the appropriate graph is selected, the eigenvectors of the graph Laplacian may be used to reduce the dimension of the data, and clusters in the data may be identified with the kernel of the associated graph Laplacian. Notably, these algorithms do not require information about the size of a neighborhood or the desired number of clusters as input, in contrast to popular algorithms such as $k$-means, and even more modern spectral methods such as Laplacian eigenmaps, among others. In our computational experiments, our clustering algorithm outperforms $k$-means clustering on data sets with non-trivial geometry and topology, in particular data whose clusters are not concentrated around a specific point, and our dimension reduction algorithm is shown to work well in several simple examples., Comment: 20 pages
- Published
- 2024
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.