Author: "A. A. Long" - Searchworks@Jio Institute Digital Library Search Results

1,106,377 results on '"A. A. Long"'

1. T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts

Author: Huang, Ziwei, He, Wanggui, Long, Quanyu, Wang, Yandi, Li, Haoyuan, Yu, Zhelun, Shu, Fangxun, Chan, Long, Jiang, Hao, Gan, Leilei, and Wu, Fei
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Evaluating the quality of synthesized images remains a significant challenge in the development of text-to-image (T2I) generation. Most existing studies in this area primarily focus on evaluating text-image alignment, image quality, and object composition capabilities, with comparatively fewer studies addressing the evaluation of the factuality of T2I models, particularly when the concepts involved are knowledge-intensive. To mitigate this gap, we present T2I-FactualBench in this work - the largest benchmark to date in terms of the number of concepts and prompts specifically designed to evaluate the factuality of knowledge-intensive concept generation. T2I-FactualBench consists of a three-tiered knowledge-intensive text-to-image generation framework, ranging from the basic memorization of individual knowledge concepts to the more complex composition of multiple knowledge concepts. We further introduce a multi-round visual question answering (VQA) based evaluation framework to assess the factuality of three-tiered knowledge-intensive text-to-image generation tasks. Experiments on T2I-FactualBench indicate that current state-of-the-art (SOTA) T2I models still leave significant room for improvement.
Published: 2024

2. DarkSHINE Baseline Design Report: Physics Prospects and Detector Technologies

Author: Chen, Jing, Chen, Ji-Yuan, Chen, Jun-Feng, Chen, Xiang, Fu, Chang-Bo, Guo, Jun, Guo, Yi-Han, Khaw, Kim Siang, Li, Jia-Lin, Li, Liang, Li, Shu, Lin, Yu-ming, Liu, Dan-Ning, Liu, Kang, Liu, Kun, Liu, Qi-Bin, Liu, Zhi, Lu, Ze-Jia, Lv, Meng, Song, Si-Yuan, Sun, Tong, Tang, Jian-Nan, Wan, Wei-Shi, Wang, Dong, Wang, Xiao-Long, Wang, Yu-Feng, Wang, Zhen, Wang, Zi-Rui, Wu, Wei-Hao, Yang, Hai-Jun, Yang, Lin, Yang, Yong, Yu, Dian, Yuan, Rui, Zhang, Jun-Hua, Zhang, Yu-Lei, Zhang, Yun-Long, Zhao, Zhi-Yu, Zhou, Bai-Hong, Zhu, Chun-Xiang, Zhu, Xu-Liang, and Zhu, Yi-Fan
Subjects: Physics - Instrumentation and Detectors, High Energy Physics - Experiment
Abstract: DarkSHINE is a newly proposed fixed-target experiment initiative to search for the invisible decay of Dark Photon via missing energy/momentum signatures, based on the high repetition rate electron beam to be deployed/delivered by the Shanghai High repetition rate XFEL and Extreme light facility (SHINE). This report elaborates the baseline design of the DarkSHINE experiment by introducing the physics goals, experimental setups, details of each sub-detector system's technical design, signal and background modeling, expected search sensitivities and future prospects, which mark an important step towards further prototyping and technical demonstrations.
Published: 2024

3. STEM Pushout and Redirection of HMoob American College Students at a Predominantly White Institution. WCER Working Paper No. 2024-4

Author: University of Wisconsin-Madison, Wisconsin Center for Education Research (WCER), Bailey B. Smolarek, Matthew Wolfgram, Chundou Her, Lena Lee, Stacey J. Lee, Geboli Long, Payeng Moua, Kong Pheng Pha, Ariana Thao, Mai See Thao, Mai Neng Vang, Susan Vang, Chee Meng Xiong, Choua Xiong, Edward Xiong, Odyssey Xiong, Pa Kou Xiong, Ying Yang Youa Xiong, Kayeng Yang, Lisa Yang, Mai Chong Yang, Scy Yang, and Steven Yang
Abstract: Asian Americans as a group are overrepresented among STEM college graduates and have the highest average college enrollment rate of any racial or ethnic category. Thus, Asian Americans are typically excluded from educational interventions directed at improving STEM education for Students of Color because they are not considered to be underrepresented minorities. However, statistics obscure the individual needs of the more than 20 ethnic subgroups that fall under the umbrella term Asian Americans. Using a participatory action research approach, this paper documents the institutional and sociocultural factors that push out HMoob (or Hmong) American college students from STEM programs at one large, predominantly White university; and the coordinate processes of gatekeeping and transactional advising that either redirect those students toward non-STEM programs or force them out of the university completely.
Published: 2024

4. Timing and spectral studies of the Be/X-ray binary EXO 2030+375 using Insight-HXMT observations

Author: Du, Yu-Jia, Ducci, Lorenzo, Ji, Long, Bu, Qing-Cui, Kong, Ling-Da, Wang, Peng-Ju, Tuo, Youli, and Santangelo, Andrea
Subjects: Astrophysics - High Energy Astrophysical Phenomena
Abstract: We report the X-ray spectral and timing analysis of the high mass X-ray binary EXO 2030+375 during the 2021 type-II outburst based on the Insight-HXMT observations. Pulsations can be detected in the energy band of 1-150 keV. The pulse profile shows energy and luminosity dependence and variability. We observed transitions in the pulse profile shape during the rising and the decaying phase of the outburst. The pulse fraction exhibits an anti-correlation with luminosity and a non-monotonic energy dependence, with a possible dip near 30 keV during the outburst peak. The hardness-intensity diagrams (7-10 keV/4-7 keV) suggest state transitions during the early and late phases of the outburst. These transitions are consistent with the luminosity at which the pulse profile shape changes occur, revealing the source reaching the critical luminosity and transitioning between super-critical and sub-critical accretion regimes. We performed the average and phase-resolved spectral analysis, where the flux-resolved average spectra show a stable spectral evolution with luminosity. The phase-resolved spectral analysis reveals that the dependence of spectral parameters on the pulse phase varies with different luminosities.
Published: 2025

5. ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark

Author: Dang, Ronghao, Yuan, Yuqian, Zhang, Wenqi, Xin, Yifei, Zhang, Boqiang, Li, Long, Wang, Liuyi, Zeng, Qinyang, Li, Xin, and Bing, Lidong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Robotics
Abstract: The enhancement of generalization in robots by large vision-language models (LVLMs) is increasingly evident. Therefore, the embodied cognitive abilities of LVLMs based on egocentric videos are of great interest. However, current datasets for embodied video question answering lack comprehensive and systematic evaluation frameworks. Critical embodied cognitive issues, such as robotic self-cognition, dynamic scene perception, and hallucination, are rarely addressed. To tackle these challenges, we propose ECBench, a high-quality benchmark designed to systematically evaluate the embodied cognitive abilities of LVLMs. ECBench features a diverse range of scene video sources, open and varied question formats, and 30 dimensions of embodied cognition. To ensure quality, balance, and high visual dependence, ECBench uses class-independent meticulous human annotation and multi-round question screening strategies. Additionally, we introduce ECEval, a comprehensive evaluation system that ensures the fairness and rationality of the indicators. Utilizing ECBench, we conduct extensive evaluations of proprietary, open-source, and task-specific LVLMs. ECBench is pivotal in advancing the embodied cognitive capabilities of LVLMs, laying a solid foundation for developing reliable core models for embodied agents. All data and code are available at https://github.com/Rh-Dang/ECBench.
Published: 2025

6. EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model

Author: He, Yi, Dang, Shengqi, Ling, Long, Qian, Ziqing, Zhao, Nanxuan, and Cao, Nan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent research shows that emotions can enhance users' cognition and influence information communication. While research on visual emotion analysis is extensive, limited work has been done on helping users generate emotionally rich image content. Existing work on emotional image generation relies on discrete emotion categories, making it challenging to capture complex and subtle emotional nuances accurately. Additionally, these methods struggle to control the specific content of generated images based on text prompts. In this work, we introduce the new task of continuous emotional image content generation (C-EICG) and present EmotiCrafter, an emotional image generation model that generates images based on text prompts and Valence-Arousal values. Specifically, we propose a novel emotion-embedding mapping network that embeds Valence-Arousal values into textual features, enabling the capture of specific emotions in alignment with intended input prompts. Additionally, we introduce a loss function to enhance emotion expression. The experimental results show that our method effectively generates images representing specific emotions with the desired content and outperforms existing techniques.
Comment: 11 pages, 8 figures
Published: 2025

7. Improving Zero-Shot Object-Level Change Detection by Incorporating Visual Correspondence

Author: Nguyen, Hung Huy, Rahmanzadehgervi, Pooyan, Mai, Long, and Nguyen, Anh Totti
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Detecting object-level changes between two images across possibly different views is a core task in many applications that involve visual inspection or camera surveillance. Existing change-detection approaches suffer from three major limitations: (1) lack of evaluation on image pairs that contain no changes, leading to unreported false positive rates; (2) lack of correspondences (i.e., localizing the regions before and after a change); and (3) poor zero-shot generalization across different domains. To address these issues, we introduce a novel method that leverages change correspondences (a) during training to improve change detection accuracy, and (b) at test time, to minimize false positives. That is, we harness the supervision labels of where an object is added or removed to supervise change detectors, improving their accuracy over previous work by a large margin. Our work is also the first to predict correspondences between pairs of detected changes using estimated homography and the Hungarian algorithm. Our model demonstrates superior performance over existing methods, achieving state-of-the-art results in change detection and change correspondence accuracy across both in-distribution and zero-shot benchmarks.
Published: 2025

8. Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces

Author: Mahapatra, Aniruddha, Mai, Long, Zhang, Yitian, Bourgin, David, and Liu, Feng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Video tokenizers are essential for latent video diffusion models, converting raw video data into spatiotemporally compressed latent spaces for efficient training. However, extending state-of-the-art video tokenizers to achieve a temporal compression ratio beyond 4x without increasing channel capacity poses significant challenges. In this work, we propose an alternative approach to enhance temporal compression. We find that the reconstruction quality of temporally subsampled videos from a low-compression encoder surpasses that of high-compression encoders applied to original videos. This indicates that high-compression models can leverage representations from lower-compression models. Building on this insight, we develop a bootstrapped high-temporal-compression model that progressively trains high-compression blocks atop well-trained lower-compression models. Our method includes a cross-level feature-mixing module to retain information from the pretrained low-compression model and guide higher-compression blocks to capture the remaining details from the full video sequence. Evaluation of video benchmarks shows that our method significantly improves reconstruction quality while increasing temporal compression compared to direct extensions of existing video tokenizers. Furthermore, the resulting compact latent space effectively trains a video diffusion model for high-quality video generation with a reduced token budget.
Comment: Project website: https://progressive-video-tokenizer.github.io/Pro-MAG/
Published: 2025

9. Virtual-Work Based Shape-Force Sensing for Continuum Instruments with Tension-Feedback Actuation

Author: Zhang, Guoqing, Chen, Zihan, and Wang, Long
Subjects: Computer Science - Robotics
Abstract: Continuum instruments are integral to robot-assisted minimally invasive surgery (MIS), with tendon-driven mechanisms being the most common. Real-time tension feedback is crucial for precise articulation but remains a challenge in compact actuation unit designs. Additionally, accurate shape and external force sensing of continuum instruments are essential for advanced control and manipulation. This paper presents a compact and modular actuation unit that integrates a torque cell directly into the pulley module to provide real-time tension feedback. Building on this unit, we propose a novel shape-force sensing framework that incorporates polynomial curvature kinematics to accurately model non-constant curvature. The framework combines pose sensor measurements at the instrument tip and actuation tension feedback at the developed actuation unit. Experimental results demonstrate the improved performance of the proposed shape-force sensing framework in terms of shape reconstruction accuracy and force estimation reliability compared to conventional constant-curvature methods.
Published: 2025

10. A Thermodynamic Theory of Proximity Ferroelectricity

Author: Eliseev, Eugene A., Morozovska, Anna N., Maria, Jon-Paul, Chen, Long-Qing, and Gopalan, Venkatraman
Subjects: Condensed Matter - Materials Science, Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Proximity ferroelectricity has recently been reported as a new design paradigm for inducing ferroelectricity, where a non-ferroelectric polar material becomes a ferroelectric by interfacing with a thin ferroelectric layer. Strongly polar materials, such as AlN and ZnO, which were previously unswitchable with an external field below their dielectric breakdown fields, can now be switched with practical coercive fields when they are in intimate proximity to a switchable ferroelectric. Here, we develop a general Landau-Ginzburg theory of proximity ferroelectricity in multilayers of non-ferroelectrics and ferroelectrics to analyze their switchability and coercive fields. The theory predicts regimes of both "proximity switching" where the multilayers collectively switch, as well as "proximity suppression" where they collectively do not switch. The mechanism of the proximity ferroelectricity is an internal electric field determined by the polarization of the layers and their relative thickness in a self-consistent manner that renormalizes the double-well ferroelectric potential to lower the steepness of the switching barrier. Further reduction in the coercive field emerges from charged defects in the bulk that act as nucleation centers. The application of the theory to proximity ferroelectricity in Al1-xScxN/AlN and Zn1-xMgxO/ZnO bilayers is demonstrated. The theory further predicts that multilayers of dielectric/ferroelectric and paraelectric/ferroelectric layers can potentially result in induced ferroelectricity in the dielectric or paraelectric layers, resulting in the entire stack being switched, an exciting avenue for new discoveries. This thawing of "frozen ferroelectrics", paraelectrics and potentially dielectrics, promises a large class of new ferroelectrics with exciting prospects for previously unrealizable domain-patterned optoelectronic and memory technologies.
Comment: 42 pages including 7 figures and 4 Appendices. To be submitted to Physical Review
Published: 2025

11. Real-Time Textless Dialogue Generation

Author: Mai, Long and Carson-Berndsen, Julie
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Recent advancements in large language models (LLMs) have led to significant progress in text-based dialogue systems. These systems can now generate high-quality responses that are accurate and coherent across a wide range of topics and tasks. However, spoken dialogue systems still lag behind in terms of naturalness. They tend to produce robotic interactions, with issues such as slow response times, overly generic or cautious replies, and a lack of natural rhythm and fluid turn-taking. This shortcoming is largely due to the over-reliance on the traditional cascaded design, which involves separate, sequential components, as well as the use of text as an intermediate representation. This paper proposes a real-time, textless spoken dialogue generation model (RTTL-DG) that aims to overcome these challenges. Our system enables fluid turn-taking and generates responses with minimal delay by processing streaming spoken conversation directly. Additionally, our model incorporates backchannels, filters, laughter, and other paralinguistic signals, which are often absent in cascaded dialogue systems, to create more natural and human-like interactions. The implementations and generated samples are available in our repository: https://github.com/mailong25/rts2s-dg
Published: 2025

12. GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

Author: Bond, Andrew, Wang, Jui-Hsien, Mai, Long, Erdem, Erkut, and Erdem, Aykut
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Efficient neural representations for dynamic video scenes are critical for applications ranging from video compression to interactive simulations. Yet, existing methods often face challenges related to high memory usage, lengthy training times, and temporal consistency. To address these issues, we introduce a novel neural video representation that combines 3D Gaussian splatting with continuous camera motion modeling. By leveraging Neural ODEs, our approach learns smooth camera trajectories while maintaining an explicit 3D scene representation through Gaussians. Additionally, we introduce a spatiotemporal hierarchical learning strategy, progressively refining spatial and temporal features to enhance reconstruction quality and accelerate convergence. This memory-efficient approach achieves high-quality rendering at impressive speeds. Experimental results show that our hierarchical learning, combined with robust camera motion modeling, captures complex dynamic scenes with strong temporal consistency, achieving state-of-the-art performance across diverse video datasets in both high- and low-motion scenarios.
Comment: 10 pages, 10 figures
Published: 2025

13. A Plug-and-Play Bregman ADMM Module for Inferring Event Branches in Temporal Point Processes

Author: Wang, Qingmei, Wu, Yuxin, Long, Yujie, Huang, Jing, Ran, Fengyuan, Su, Bing, and Xu, Hongteng
Subjects: Computer Science - Machine Learning, 60G55, 62M10
Abstract: An event sequence generated by a temporal point process is often associated with a hidden and structured event branching process that captures the triggering relations between its historical and current events. In this study, we design a new plug-and-play module based on the Bregman ADMM (BADMM) algorithm, which infers event branches associated with event sequences in the maximum likelihood estimation framework of temporal point processes (TPPs). Specifically, we formulate the inference of event branches as an optimization problem for the event transition matrix under sparse and low-rank constraints, which is embedded in existing TPP models or their learning paradigms. We can implement this optimization problem based on subspace clustering and sparse group-lasso, respectively, and solve it using the Bregman ADMM algorithm, whose unrolling leads to the proposed BADMM module. When learning a classic TPP (e.g., Hawkes process) by the expectation-maximization algorithm, the BADMM module helps derive structured responsibility matrices in the E-step. Similarly, the BADMM module helps derive low-rank and sparse attention maps for the neural TPPs with self-attention layers. The structured responsibility matrices and attention maps, which work as learned event transition matrices, indicate event branches, e.g., inferring isolated events and those key events triggering many subsequent events. Experiments on both synthetic and real-world data show that plugging our BADMM module into existing TPP models and learning paradigms can improve model performance and provide us with interpretable structured event branches. The code is available at https://github.com/qingmeiwangdaily/BADMM_TPP.
Comment: Accepted at AAAI 2025
Published: 2025

14. LAMOST Reveals Long-lived Protoplanetary Disks

Author: Wang, Xiao-Long, Fang, Min, Liu, Yao, Zhang, Miao-Miao, and Cui, Wen-Yuan
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: While both observations and theories demonstrate that protoplanetary disks are not expected to live much longer than ~10 Myr, several examples of prolonged disks have been observed in the past. In this work, we perform a systematic search for aged YSOs still surrounded by protoplanetary disks in the M star catalog from the LAMOST archive. We identify 14 sources older than 10 Myr, still surrounded by protoplanetary disks and with ongoing accretion activities, significantly improving the census of the category known as the Peter Pan disks. The stellar parameters, variability and accretion properties of these objects, as well as their spatial distribution, are investigated. Nearly all of these objects are distributed far away from nearby associations and star forming regions, but show evidence of being members of open clusters. Investigating the correlation between mass accretion rates and stellar masses, we find these long-lived disks accrete at systematically lower levels, compared to their younger counterparts with similar stellar masses. Studying the evolution of mass accretion rates with stellar ages, we find these aged disks follow a similar trend to young ones.
Comment: 24 pages, 12 figures, 2 tables, Accepted for publication in AJ
Published: 2025

15. The Cosmic Evolution Early Release Science Survey (CEERS)

Author: Finkelstein, Steven L., Bagley, Micaela B., Haro, Pablo Arrabal, Dickinson, Mark, Ferguson, Henry C., Kartaltepe, Jeyhan S., Kocevski, Dale D., Koekemoer, Anton M., Lotz, Jennifer M., Papovich, Casey, Perez-Gonzalez, Pablo G., Pirzkal, Nor, Somerville, Rachel S., Trump, Jonathan R., Yang, Guang, Yung, L. Y. Aaron, Fontana, Adriano, Grazian, Andrea, Grogin, Norman A., Kewley, Lisa J., Kirkpatrick, Allison, Larson, Rebecca L., Pentericci, Laura, Ravindranath, Swara, Wilkins, Stephen M., Almaini, Omar, Amorin, Ricardo O., Barro, Guillermo, Bhatawdekar, Rachana, Bisigello, Laura, Brooks, Madisyn, Buitrago, Fernando, Calabro, Antonello, Castellano, Marco, Cheng, Yingjie, Cleri, Nikko J., Cole, Justin W., Cooper, M. C., Cooper, Olivia R., Costantin, Luca, Cox, Isa G., Croton, Darren, Daddi, Emanuele, Davis, Kelcey, Dekel, Avishai, Elbaz, David, Fernandez, Vital, Fujimoto, Seiji, Gandolfi, Giovanni, Gardner, Jonathan P., Gawiser, Eric, Giavalisco, Mauro, Gomez-Guijarro, Carlos, Guo, Yuchen, Gupta, Ansh R., Hathi, Nimish P., Harish, Santosh, Henry, Aurelien, Hirschmann, Michaela, Hu, Weida, Hutchison, Taylor A., Iyer, Kartheik G., Jaskot, Anne E., Jha, Saurabh W., Jung, Intae, Kokorev, Vasily, Kurczynski, Peter, Leung, Gene C. K., Llerena, Mario, Long, Arianna S., Lucas, Ray A., Lu, Shiying, McGrath, Elizabeth J., McIntosh, Daniel H., Merlin, Emiliano, Morales, Alexa M., Napolitano, Lorenzo, Pacucci, Fabio, Pandya, Viraj, Rafelski, Marc, Rodighiero, Giulia, Rose, Caitlin, Santini, Paola, Seille, Lise-Marie, Simons, Raymond C., Shen, Lu, Straughn, Amber N., Tacchella, Sandro, Vanderhoof, Brittany N., Vega-Ferrero, Jesus, Weiner, Benjamin J., Willmer, Christopher N. A., Zhu, Peixin, Bell, Eric F., Wuyts, Stijn, Holwerda, Benne W., Wang, Xin, Wang, Weichen, and Zavala, Jorge A.
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We present the Cosmic Evolution Early Release Science (CEERS) Survey, a 77.2 hour Director's Discretionary Early Release Science Program. CEERS demonstrates, tests, and validates efficient extragalactic surveys using coordinated, overlapping parallel observations with the JWST instrument suite, including NIRCam and MIRI imaging, NIRSpec low (R~100) and medium (R~1000) resolution spectroscopy, and NIRCam slitless grism (R~1500) spectroscopy. CEERS targets the Hubble Space Telescope-observed region of the Extended Groth Strip (EGS) field, supported by a rich set of multiwavelength data. CEERS facilitated immediate community science in both of the extragalactic core JWST science drivers "First Light" and "Galaxy Assembly," including: 1) The discovery and characterization of large samples of galaxies at z >~ 10 from ~90 arcmin^2 of NIRCam imaging, constraining their abundance and physical nature; 2) Deep spectra of >1000 galaxies, including dozens of galaxies at 63; and 4) Characterizing galaxy mid-IR emission with MIRI to study dust-obscured star-formation and supermassive black hole growth at z~1-3. As a legacy product for the community, the CEERS team has provided several data releases, accompanied by detailed notes on the data reduction procedures and notebooks to aid in reproducibility. In addition to an overview of the survey and quality of the data, we provide science highlights from the first two years with CEERS data.
Comment: 38 pages, 13 figures, 6 tables
Published: 2025

16. Multi-armed Bandit and Backbone boost Lin-Kernighan-Helsgaun Algorithm for the Traveling Salesman Problems

Author: Wang, Long, Zheng, Jiongzhi, Xiong, Zhengda, and He, Kun
Subjects: Computer Science - Data Structures and Algorithms, Computer Science - Artificial Intelligence
Abstract: The Lin-Kernighan-Helsgaun (LKH) heuristic is a classic local search algorithm for the Traveling Salesman Problem (TSP). LKH introduces an α-value to replace the traditional distance metric for evaluating the edge quality, which leads to a significant improvement. However, we observe that the α-value does not make full use of the historical information during the search, and single guiding information often makes LKH hard to escape from some local optima. To address the above issues, we propose a novel way to extract backbone information during the TSP local search process, which is dynamic and can be updated once a local optimal solution is found. We further propose to combine backbone information, α-value, and distance to evaluate the edge quality so as to guide the search. Moreover, we abstract their different combinations to arms in a multi-armed bandit (MAB) and use an MAB model to help the algorithm select an appropriate evaluation metric dynamically. Both the backbone information and MAB can provide diverse guiding information and learn from the search history to suggest the best metric. We apply our methods to LKH and LKH-3, which is an extended version of LKH that can be used to solve about 40 variant problems of TSP and Vehicle Routing Problem (VRP). Extensive experiments show the excellent performance and generalization capability of our proposed method, significantly improving LKH for TSP and LKH-3 for two representative TSP and VRP variants, the Colored TSP (CTSP) and Capacitated VRP with Time Windows (CVRPTW).
Published: 2025

17. Critical properties in the non-Hermitian Aubry-Andre-Stark model

Author: Dong, Ji-Long, Liang, En-Wen, Liu, Shi-Yang, Zhang, Guo-Qing, Tang, Ling-Zhi, and Zhang, Dan-Wei
Subjects: Quantum Physics, Condensed Matter - Disordered Systems and Neural Networks, Condensed Matter - Mesoscale and Nanoscale Physics, Condensed Matter - Statistical Mechanics
Abstract: We explore the critical properties of the localization transition in the non-Hermitian Aubry-Andre-Stark (AAS) model with quasiperiodic and Stark potentials, where the non-Hermiticity comes from the nonreciprocal hopping. The localization length, the inverse participation ratio and the energy gap are adopted as the characteristic quantities. We perform the scaling analysis to derive the scaling functions of the three quantities with critical exponents in several critical regions, with respect to the quasiperiodic and Stark potentials and the nonreciprocal strength. We numerically verify the finite-size scaling forms and extract the critical exponents in different situations. Two groups of new critical exponents for the non-Hermitian AAS model and its pure Stark limit are obtained, which are distinct from those for the non-Hermitian Aubry-Andre model and their Hermitian counterparts. Our results indicate that the Hermitian and non-Hermitian AAS, Aubry-Andre, and Stark models belong to different universality classes. We demonstrate that these critical exponents are independent of the nonreciprocal strength, and remain the same in different critical regions and boundary conditions. Furthermore, we establish a hybrid scaling function with a hybrid exponent in the overlap region between the critical regions for the non-Hermitian AAS and Stark models.
Comment: 10 pages, 8 figures
Published: 2025

18. Rethinking domain generalization in medical image segmentation: One image as one domain

Author: Hong, Jin, Liu, Bo, and Long, Guoli
Subjects: Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Domain shifts in medical image segmentation, particularly when data comes from different centers, pose significant challenges. Intra-center variability, such as differences in scanner models or imaging protocols, can cause domain shifts as large as, or even larger than, those between centers. To address this, we propose the "one image as one domain" (OIOD) hypothesis, which treats each image as a unique domain, enabling flexible and robust domain generalization. Based on this hypothesis, we develop a unified disentanglement-based domain generalization (UniDDG) framework, which simultaneously handles both multi-source and single-source domain generalization without requiring explicit domain labels. This approach simplifies training with a fixed architecture, independent of the number of source domains, reducing complexity and enhancing scalability. We decouple each input image into content representation and style code, then exchange and combine these within the batch for segmentation, reconstruction, and further disentanglement. By maintaining distinct style codes for each image, our model ensures thorough decoupling of content representations and style codes, improving domain invariance of the content representations. Additionally, we enhance generalization with expansion mask attention (EMA) for boundary preservation and style augmentation (SA) to simulate diverse image styles, improving robustness to domain shifts. Extensive experiments show that our method achieves Dice scores of 84.43% and 88.91% for multi-source to single-center and single-center generalization in optic disc and optic cup segmentation, respectively, and 86.96% and 88.56% for prostate segmentation, outperforming current state-of-the-art domain generalization methods, offering superior performance and adaptability across clinical settings.
Published: 2025

19. Integrated Offline and Online Learning to Solve a Large Class of Scheduling Problems

Author: Liu, Anbang, Chen, Zhi-Long, Jiang, Jinyang, and Chen, Xi
Subjects: Mathematics - Optimization and Control, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: In this paper, we develop a unified machine learning (ML) approach to predict high-quality solutions for single-machine scheduling problems with a non-decreasing min-sum objective function with or without release times. Our ML approach is novel in three major aspects. First, our approach is developed for the entire class of the aforementioned problems. To achieve this, we exploit the fact that the entire class of the problems considered can be formulated as a time-indexed formulation in a unified manner. We develop a deep neural network (DNN) which uses the cost parameters in the time-indexed formulation as the inputs to effectively predict a continuous solution to this formulation, based on which a feasible discrete solution is easily constructed. The second novel aspect of our approach lies in how the DNN model is trained. In view of the NP-hard nature of the problems, labels (i.e., optimal solutions) are hard to generate for training. To overcome this difficulty, we generate and utilize a set of special instances, for which optimal solutions can be found with little computational effort, to train the ML model offline. The third novel idea we employ in our approach is that we develop an online single-instance learning approach to fine tune the parameters in the DNN for a given online instance, with the goal of generating an improved solution for the given instance. To this end, we develop a feasibility surrogate that approximates the objective value of a given instance as a continuous function of the outputs of the DNN, which then enables us to derive gradients and update the learnable parameters in the DNN. Numerical results show that our approach can efficiently generate high-quality solutions for a variety of single-machine scheduling min-sum problems with up to 1000 jobs.
Published: 2025

20. Gearing of nitrate ions in ammonium nitrate

Author: Du, Na, Wang, Xintian, Zhu, Yu Ying, Long, Chanreingam, Ren, Peng, and Yen, Fei
Subjects: Physics - Chemical Physics, Condensed Matter - Other Condensed Matter
Abstract: Reorienting polyatomic ions such as NH4+ and NO3- exhibit weak magnetic fields because the ions at the extremities trace out current loops; if the periodic reorientations become long-range ordered (i.e. gearing of neighboring NO3-), then the magnetic susceptibility should exhibit a unique signature along the different crystallographic axes. For the case of ammonium nitrate NH4NO3, we report the presence of two successive sharp steps in the molar magnetic susceptibility along the a- and b-axes upon crossing its order-disorder phase transition (from phase IV to phase II). We suggest the first step pertains to the NO3- planes shifting away from facing only along the b-axis and onto the a-axis by 45°. The second step is attributed to the disordering (ungearing) of the NH4+ and NO3-. In contrast, only one step was observed in the magnetic susceptibility along the c-axis and its large magnitude suggests the NO3- remain weakly correlated even in phase I at 400 K. We also find evidence that the NH4+ become magnetically ordered (geared) along the c-axis only until phase V. The approach employed in this work can be extended to experimentally study the lattice dynamics of other solids possessing planar ions such as amphidynamic crystals., Comment: 13 pages (single column), 4 figures
Published: 2025

21. Hermitian and Non-Hermitian Topological Transitions Characterized by Manifold Distance

Author: Fang, ZhaoXiang, Gong, Ming, Guo, Guang-Can, Fu, Yongxu, and Xiong, Long
Subjects: Condensed Matter - Strongly Correlated Electrons, Quantum Physics
Abstract: Topological phases are generally characterized by topological invariants denoted by integer numbers. However, different topological systems often require different topological invariants to measure, and these definitions usually fail at critical points. Therefore, it's challenging to predict what would occur during the transformation between two different topological phases. To address these issues, we propose a general definition based on fidelity and trace distance from quantum information theory: manifold distance (MD). This definition does not rely on the Berry connection but rather on the information of the two manifolds - their ground state wave functions. Thus, it can measure different topological systems (including traditional band topology models, non-Hermitian systems, and gapless systems, etc.) and exhibit some universal laws during the transformation between two topological phases. Our research demonstrates that for different topological manifolds, the change rate (first-order derivative) or susceptibility (second-order derivative) of MD exhibit various divergent behaviors near the critical points. Compared to the strange correlator, which could be used as a diagnosis for short-range entangled states in 1D and 2D, MD is more universal and could be applied to non-Hermitian systems and long-range entangled states. For subsequent studies, we expect the method to be generalized to real-space or non-lattice models, in order to facilitate the study of a wider range of physical platforms such as open systems and many-body localization., Comment: arXiv admin note: substantial text overlap with arXiv:2405.03323
Published: 2025

22. Proteomic Learning of Gamma-Aminobutyric Acid (GABA) Receptor-Mediated Anesthesia

Author: Jiang, Jian, Chen, Long, Zhu, Yueying, Shi, Yazhou, Qiu, Huahai, Zhang, Bengong, Zhou, Tianshou, and Wei, Guo-Wei
Subjects: Quantitative Biology - Biomolecules, Computer Science - Machine Learning
Abstract: Anesthetics are crucial in surgical procedures and therapeutic interventions, but they come with side effects and varying levels of effectiveness, calling for novel anesthetic agents that offer more precise and controllable effects. Targeting Gamma-aminobutyric acid (GABA) receptors, the primary inhibitory receptors in the central nervous system, could enhance their inhibitory action, potentially reducing side effects while improving the potency of anesthetics. In this study, we introduce a proteomic learning of GABA receptor-mediated anesthesia based on 24 GABA receptor subtypes by considering over 4000 proteins in protein-protein interaction (PPI) networks and over 1.5 million known binding compounds. We develop a corresponding drug-target interaction network to identify potential lead compounds for novel anesthetic design. To ensure robust proteomic learning predictions, we curated a dataset comprising 136 targets from a pool of 980 targets within the PPI networks. We employed three machine learning algorithms, integrating advanced natural language processing (NLP) models such as pretrained transformer and autoencoder embeddings. Through a comprehensive screening process, we evaluated the side effects and repurposing potential of over 180,000 drug candidates targeting the GABRA5 receptor. Additionally, we assessed the ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties of these candidates to identify those with near-optimal characteristics. This approach also involved optimizing the structures of existing anesthetics. Our work presents an innovative strategy for the development of new anesthetic drugs, optimization of anesthetic use, and deeper understanding of potential anesthesia-related side effects.
Published: 2025

23. Ultra-fast, high-power MUTC Photodiodes with bandwidth-efficiency product over 130 GHz * 100%

Author: Li, Linze, Long, Tianyu, Yang, Xiongwei, Zhang, Zhouze, Wang, Luyu, Wang, Jingyi, Wang, Mingxu, Lu, Juanjuan, Yu, Jianjun, and Chen, Baile
Subjects: Physics - Applied Physics, Physics - Optics
Abstract: The accelerating demand for wireless communication necessitates wideband, energy-efficient photonic sub-terahertz (sub-THz) sources to enable ultra-fast data transfer. However, as critical components for THz photonic mixing, photodiodes (PDs) face a fundamental trade-off between quantum efficiency and bandwidth, presenting a major obstacle to achieving high-speed performance with high optoelectronic conversion efficiency. Here, we overcome this challenge by demonstrating an InP-based, waveguide-integrated modified uni-traveling carrier photodiode (MUTC-PD) with a terahertz bandwidth exceeding 200 GHz and a bandwidth-efficiency product (BEP) surpassing 130 GHz * 100%. Through the integration of a spot-size converter (SSC) to enhance external responsivity, alongside optimized electric field distribution, balanced carrier transport, and minimized parasitic capacitance, the device achieves a 3-dB bandwidth of 206 GHz and an external responsivity of 0.8 A/W, setting a new benchmark for BEP. Packaged with WR-5.1 waveguide output, it delivers radio-frequency (RF) power exceeding -5 dBm across the 127-185 GHz frequency range. As a proof of concept, we achieved a wireless transmission of 54 meters with a single-line rate of up to 120 Gbps, leveraging photonics-aided technology without requiring a low-noise amplifier (LNA). This work establishes a pathway to significantly enhance optical power budgets and reduce energy consumption, presenting a transformative step toward high-bandwidth, high-efficiency sub-THz communication systems and next-generation wireless networks.
Published: 2025

24. A multi-wavelength investigation of spiral structures in $z > 1$ galaxies with JWST

Author: Kalita, Boris S., Yu, Si-Yue, Silverman, John D., Daddi, Emanuele, Ho, Luis C., Faisst, Andreas L., Dessauges-Zavadsky, Miroslava, Puglisi, Annagrazia, Birrer, Simon, Kashino, Daichi, Ding, Xuheng, Kartaltepe, Jeyhan S., Kakkad, Darshan, Valentino, Francesco, Ilbert, Olivier, Magdis, Georgios, Long, Arianna S., Jin, Shuowen, Koekemoer, Anton M., and Massey, Richard
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: Recent JWST observations have revealed the prevalence of spiral structures at $z > 1$. Unlike in the local Universe, the origin and the consequence of spirals at this epoch remain unexplored. We use public JWST/NIRCam data from the COSMOS-Web survey to map spiral structures in eight massive ($> 10^{10.5}\,\rm M_{\odot}$) star-forming galaxies at $z_{\rm spec} \sim 1.5$. We present a method for systematically quantifying spiral arms at $z>1$, enabling direct measurements of flux distributions. Using rest-frame near-IR images, we construct morphological models accurately tracing spiral arms. We detect offsets ($\sim 0.2 - 0.8\,\rm kpc$) between the rest-frame optical and near-IR flux distributions across most arms. Drawing parallels to the local Universe, we conclude that these offsets reflect the presence of density waves. For nine out of eighteen arms, the offsets indicate spiral shocks triggered by density waves. Five arms have offsets in the opposite direction and are likely associated with tidal interactions. For the remaining cases with no detected offsets, we suggest that stochastic 'clumpy' star formation is the primary driver of their formation. In conclusion, we find a multi-faceted nature of spiral arms at $z > 1$, similar to that in the local Universe., Comment: Accepted for publication in ApJ Letters; 12 pages, 6 figures
Published: 2025

25. Footprint in fitting $B\to D$ vector form factor and determination for $D$-meson leading-twist LCDA

Author: Wu, Sheng-Bo, Tian, Hai-Jiang, Yang, Yin-Long, Cheng, Wei, Fu, Hai-Bing, and Zhong, Tao
Subjects: High Energy Physics - Phenomenology
Abstract: In this paper, we fit the $B\to D$ vector transition form factor (TFF) by using the data measured by BABAR and Belle Collaborations within Monte Carlo (MC) method. Meanwhile, the $B\to D$ TFF is also calculated by using the QCD light-cone sum rules approach (LCSRs) within right-handed chiral current correlation function. In which, the $D$-meson leading-twist light-cone distribution amplitude (LCDA) serves as crucial input parameter is reconstructed with light-cone harmonic oscillator model where its longitudinal behavior primarily determined by the model-free parameter $B_{2;D}$. After matching the TFF with two scenarios from MC and LCSRs, we have $B_{2;D}=0.17$. Then, we present the curve of $D$-meson leading-twist LCDA in comparison with other theoretical approaches. Subsequently, the $B\to D$ TFF $f_{+}^{BD}(q^2)$ at the large recoil region is $f_{+}^{BD}(0)=0.625^{+0.087}_{-0.113}$, which is compared in detail with theoretical estimates and experimental measurements. Furthermore, we calculate the decay width and branching ratio of the Cabibbo-favored semileptonic decays $B\to D\ell \bar{\nu}_{\ell}$, which lead to the results $\mathcal{B}(B^0\to D^-\ell ^+\nu _{\ell}) =(1.96_{-0.55}^{+0.51})\times 10^{-2}$ and $\mathcal{B}(B^+\to \bar{D}^0\ell ^+\nu _{\ell}) =(2.12_{-0.59}^{+0.55})\times 10^{-2}$. Finally, we predict the CKM matrix element with two scenarios $|V_{cb}|_{\rm SR}=42.97_{-2.57}^{+2.42}\times 10^{-3}$ and $|V_{cb} |_{\rm MC}=42.82_{-1.29}^{+1.07}\times 10^{-3}$ from $B^0\to D^-\ell^+\nu_{\ell}$, $|V_{cb}|_{\rm SR}=41.93_{-1.05}^{+1.03}\times 10^{-3}$ and $|V_{cb} |_{\rm MC}=41.82_{-0.25}^{+0.23}\times 10^{-3}$ from $B^+\to \bar{D}^0\ell^+\nu_{\ell}$ which are in good agreement with theoretical and experimental predictions., Comment: 12 pages, 7 figures, comments welcome
Published: 2025

26. Hybridizable Symmetric Stress Elements on the Barycentric Refinement in Arbitrary Dimensions

Author: Chen, Long and Huang, Xuehai
Subjects: Mathematics - Numerical Analysis, 65N30, 65N12, 65N15, 15A72
Abstract: Hybridizable $H(\textrm{div})$-conforming finite elements for symmetric tensors on simplices with barycentric refinement are developed in this work for arbitrary dimensions and any polynomial order. By employing barycentric refinement and an intrinsic tangential-normal ($t$-$n$) decomposition, novel basis functions are constructed to redistribute degrees of freedom while preserving $H(\textrm{div})$-conformity and symmetry, and ensuring inf-sup stability. These hybridizable elements enhance computational flexibility and efficiency, with applications to mixed finite element methods for linear elasticity., Comment: 25 pages, 2 figures
Published: 2025

27. The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit

Author: Zhou, Huixue, Gu, Hengrui, Liu, Xi, Zhou, Kaixiong, Liang, Mingfu, Xiao, Yongkang, Govindan, Srinivas, Chawla, Piyush, Yang, Jiyan, Meng, Xiangfei, Li, Huayu, Zhang, Buyun, Luo, Liang, Chen, Wen-Yen, Han, Yiping, Long, Bo, Zhang, Rui, and Chen, Tianlong
Subjects: Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: The deployment of Large Language Models (LLMs) in recommender systems for predicting Click-Through Rates (CTR) necessitates a delicate balance between computational efficiency and predictive accuracy. This paper presents an optimization framework that combines Retrieval-Augmented Generation (RAG) with an innovative multi-head early exit architecture to concurrently enhance both aspects. By integrating Graph Convolutional Networks (GCNs) as efficient retrieval mechanisms, we are able to significantly reduce data retrieval times while maintaining high model performance. The early exit strategy employed allows for dynamic termination of model inference, utilizing real-time predictive confidence assessments across multiple heads. This not only quickens the responsiveness of LLMs but also upholds or improves their accuracy, making it ideal for real-time application scenarios. Our experiments demonstrate how this architecture effectively decreases computation time without sacrificing the accuracy needed for reliable recommendation delivery, establishing a new standard for efficient, real-time LLM deployment in commercial systems.
Published: 2025

28. Enhanced Phonon-Phonon Interactions and Weakened Electron-Phonon Coupling in Charge-Density-Wave Topological Semimetal EuAl4 with a Possible Intermediate Electronic State

Author: Cao, Shize, Jin, Feng, Long, Yun-Ze, Luo, Jianlin, Zhang, Qingming, and Chen, Zhi-Guo
Subjects: Condensed Matter - Materials Science, Condensed Matter - Superconductivity
Abstract: The origin of charge density wave (CDW) is a long-term open issue. Furthermore, the evolution of phonon-phonon interactions (PPI) across CDW transitions has rarely been investigated. Besides, whether electron-phonon coupling (EPC) would be weakened or enhanced after CDW transitions is still under debate. Additionally, CDW provides a fertile ground for uncovering intriguing intermediate electronic states. Here, we report a Raman spectroscopy study of the PPI and EPC in topological semimetal EuAl4 exhibiting a CDW phase below temperature Tc ~ 145 K. The free-charge-carrier-density (nc) and temperature dependences of the Fano asymmetric factors (1/|q|) of the two phonon modes A1g and B1g indicate that below Tc, the EPC becomes weakened probably due to the reduction of the nc. Interestingly, in the temperature range from 50 to 145 K, the steep growth of the 1/|q| leading to the significant deviation from the linear dependence on the nc, together with the shoulder-like features in the temperature evolutions of the 1/|q| and the nc around 50 K, implies the possible existence of an intermediate electronic state with the EPC distinctly larger than the CDW ground state in EuAl4. Furthermore, below Tc, the faster decrease in the full width at half maxima of the B1g phonon mode representing the collective vibrations of the CDW-modulated Al1 atoms suggests a remarkable growth of the PPI for the B1g phonon mode after the CDW phase transition, which is in contrast to the weakening of the EPC and thus may mainly arise from the strengthening of lattice anharmonicity in EuAl4. Our results not only highlight the significance of the enhanced PPI and the weakened EPC in completely understanding the formation of the CDW phase but also initiate the exploration of novel intermediate electronic states in EuAl4.
Published: 2025

29. VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Author: Fu, Chaoyou, Lin, Haojia, Wang, Xiong, Zhang, Yi-Fan, Shen, Yunhang, Liu, Xiaoyu, Li, Yangze, Long, Zuwei, Gao, Heting, Li, Ke, Zheng, Xiawu, Ji, Rongrong, Sun, Xing, Shan, Caifeng, and He, Ran
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Recent Multimodal Large Language Models (MLLMs) have typically focused on integrating visual and textual modalities, with less emphasis placed on the role of speech in enhancing interaction. However, speech plays a crucial role in multimodal dialogue systems, and implementing high-performance in both vision and speech tasks remains a significant challenge due to the fundamental modality differences. In this paper, we propose a carefully designed multi-stage training methodology that progressively trains LLM to understand both visual and speech information, ultimately enabling fluent vision and speech interaction. Our approach not only preserves strong vision-language capacity, but also enables efficient speech-to-speech dialogue capabilities without separate ASR and TTS modules, significantly accelerating multimodal end-to-end response speed. By comparing our method against state-of-the-art counterparts across benchmarks for image, video, and speech tasks, we demonstrate that our model is equipped with both strong visual and speech capabilities, making near real-time vision and speech interaction., Comment: https://github.com/VITA-MLLM/VITA
Published: 2025

30. Knudsen boundary layer equations with incoming boundary condition: full range of cutoff collision kernels and Mach numbers of the far field

Author: Jiang, Ning, Luo, Yi-Long, Wu, Yulong, and Yang, Tong
Subjects: Mathematics - Analysis of PDEs, 35Q20, 76P05, 35F30, 35B45, 35A01, 35A02
Abstract: This paper establishes the existence and uniqueness of the nonlinear Knudsen layer equation with incoming boundary conditions. It is well-known that the solvability conditions of the problem vary with the Mach number of the far Maxwellian $\mathcal{M}^\infty$. We consider full ranges of cutoff collision kernels (i.e., $- 3 < \gamma \leq 1$) and all the Mach numbers of the far field in the $L^\infty_{x,v}$ framework. Additionally, the solution exhibits exponential decay $\exp \{- c x^\frac{2}{3 - \gamma} - c |v|^2 \}$ for some $c > 0$. To address the general angular cutoff collision kernel, we introduce a $(x,v)$-mixed weight $\sigma$. The proof is essentially based on adding an artificial damping term., Comment: arXiv admin note: substantial text overlap with arXiv:2407.02852
Published: 2025

31. Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review

Author: Gu, Yan, Liu, Zhaoze, Dai, Shuhong, Liu, Cong, Wang, Ying, Wang, Shen, Theodoropoulos, Georgios, and Cheng, Long
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Artificial Intelligence
Abstract: Cloud computing has revolutionized the provisioning of computing resources, offering scalable, flexible, and on-demand services to meet the diverse requirements of modern applications. At the heart of efficient cloud operations are job scheduling and resource management, which are critical for optimizing system performance and ensuring timely and cost-effective service delivery. However, the dynamic and heterogeneous nature of cloud environments presents significant challenges for these tasks, as workloads and resource availability can fluctuate unpredictably. Traditional approaches, including heuristic and meta-heuristic algorithms, often struggle to adapt to these real-time changes due to their reliance on static models or predefined rules. Deep Reinforcement Learning (DRL) has emerged as a promising solution to these challenges by enabling systems to learn and adapt policies based on continuous observations of the environment, facilitating intelligent and responsive decision-making. This survey provides a comprehensive review of DRL-based algorithms for job scheduling and resource management in cloud computing, analyzing their methodologies, performance metrics, and practical applications. We also highlight emerging trends and future research directions, offering valuable insights into leveraging DRL to advance both job scheduling and resource management in cloud computing.
Published: 2025

32. Host-guided data placement: whose job is it anyway?

Author: Purandare, Devashish R., Alvaro, Peter, Wildani, Avani, Long, Darrell D. E., and Miller, Ethan L.
Subjects: Computer Science - Operating Systems, Computer Science - Emerging Technologies
Abstract: The increasing demand for SSDs coupled with scaling difficulties have left manufacturers scrambling for newer SSD interfaces which promise better performance and durability. While these interfaces reduce the rigidity of traditional abstractions, they require application or system-level changes that can impact the stability, security, and portability of systems. To make matters worse, such changes are rendered futile with the introduction of next-generation interfaces. Further, there is little guidance on data placement and hardware specifics are often abstracted from the application layer. It is no surprise therefore that such interfaces have seen limited adoption, leaving behind a graveyard of experimental interfaces ranging from open-channel SSDs to zoned namespaces. In this paper, we show how shim layers can shield systems from changing hardware interfaces while benefiting from them. We present Reshim, an all-userspace shim layer that performs affinity and lifetime based data placement with no change to the operating system or the application. We demonstrate Reshim's ease of adoption with host-device coordination for three widely-used data-intensive systems: RocksDB, MongoDB, and CacheLib. With Reshim, these systems see 2-6 times higher write throughput, up to 6 times lower latency, and reduced write amplification compared to filesystems like F2FS. Reshim performs on par with application-specific backends like ZenFS while offering more generality, lower latency, and richer data placement. With Reshim we demonstrate the value of isolating the complexity of the placement logic, allowing easy deployment of dynamic placement rules across several applications and storage interfaces., Comment: 14 pages, 10 figures, 3 tables
Published: 2025

33. Realistic overground gait transitions are not sharp but involve gradually changing walk-run mixtures as per energy optimality

Author: Baker, Nicholas S., Long, Leroy, and Srinivasan, Manoj
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: Humans use two qualitatively different gaits for locomotion, namely, walking and running -- usually using walking at lower speeds and running at higher speeds. Researchers have examined when humans switch between walking and running on treadmills and have noted hystereses in these gait transition speeds. Here, we consider an ecologically realistic overground locomotion task, one requiring traveling a given long distance (800 meters or 2400 meters) in a prescribed time duration. Unlike on a treadmill, this task allows the human to change speed or gait during the trial to reach the destination on time: this task is akin to traveling to an appointment at a particular time from your office to another office, arriving neither early nor late. We find that gait transition is not sharp, but instead involves a 'gait transition regime' in which humans use a mixture of walking and running, using mostly walking at lower speeds and mostly running at higher speeds -- supporting earlier results over short distances (120 m). The presence of this gradually changing walk-run mixture is predicted by energy optimality. We hypothesize that this energy optimal behavior in these realistic overground conditions accounts for the hysteretic behavior in treadmill experiments, apparently switching earlier than predicted by energy optimality.
Published: 2024

34. VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Author: Yuan, Yuqian, Zhang, Hang, Li, Wentong, Cheng, Zesen, Zhang, Boqiang, Li, Long, Li, Xin, Zhao, Deli, Zhang, Wenqiao, Zhuang, Yueting, Zhu, Jianke, and Bing, Lidong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Video Large Language Models (Video LLMs) have recently exhibited remarkable capabilities in general video understanding. However, they mainly focus on holistic comprehension and struggle with capturing fine-grained spatial and temporal details. Besides, the lack of high-quality object-level video instruction data and a comprehensive benchmark further hinders their advancements. To tackle these challenges, we introduce the VideoRefer Suite to empower Video LLM for finer-level spatial-temporal video understanding, i.e., enabling perception and reasoning on any objects throughout the video. Specially, we thoroughly develop VideoRefer Suite across three essential aspects: dataset, model, and benchmark. Firstly, we introduce a multi-agent data engine to meticulously curate a large-scale, high-quality object-level video instruction dataset, termed VideoRefer-700K. Next, we present the VideoRefer model, which equips a versatile spatial-temporal object encoder to capture precise regional and sequential representations. Finally, we meticulously create a VideoRefer-Bench to comprehensively assess the spatial-temporal understanding capability of a Video LLM, evaluating it across various aspects. Extensive experiments and analyses demonstrate that our VideoRefer model not only achieves promising performance on video referring benchmarks but also facilitates general video understanding capabilities., Comment: 17 pages, 14 figures, technical report
Published: 2024

35. Quasinormal Ringing of de Sitter Braneworlds

Author: Jia, Hai-Long, Guo, Wen-Di, Liu, Yu-Xiao, and Tan, Qin
Subjects: General Relativity and Quantum Cosmology
Abstract: Compared with the Poincaré braneworld, the de Sitter (dS) braneworld aligns more closely with the present universe characterized by a small but finite cosmological constant. To explore the quasinormal ringing properties within the dS brane scenario, we investigate the gravitational perturbations in both thin and thick dS brane configurations. Analysis of the perturbation equations reveals that the effective potential along the extra dimension exhibits the shape of Pöschl-Teller potential, asymptotically approaching a constant value (mass gap) at infinity. And analytical calculations further indicate that the gravitational perturbations, apart from the zero mode, possess a series of discrete, purely imaginary quasinormal modes in the late stages. This result implies that these perturbations decay without oscillation over time. The analytical findings also demonstrate that the brane structure primarily determines the distribution of the quasinormal spectrum while preserving the purely imaginary nature of the quasinormal frequencies. Subsequently, we further simulate the gravitational wave signal by numerically evolving the perturbation equations, which yield late-stage results consistent with the analytical predictions. Interestingly, these quasinormal modes carry information about the cosmological constant on the brane, which provides a potential new pathway for the study of cosmology in the dS brane scenario.
Published: 2024

36. VoxVietnam: a Large-Scale Multi-Genre Dataset for Vietnamese Speaker Recognition

Author: Vu, Hoang Long, Dat, Phuong Tuan, Nhi, Pham Thao, Hao, Nguyen Song, and Trang, Nguyen Thi Thu
Subjects: Computer Science - Sound, Computer Science - Computation and Language, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Recent research in speaker recognition aims to address vulnerabilities due to variations between enrolment and test utterances, particularly in the multi-genre phenomenon where the utterances are in different speech genres. Previous resources for Vietnamese speaker recognition are either limited in size or do not focus on genre diversity, leaving studies in multi-genre effects unexplored. This paper introduces VoxVietnam, the first multi-genre dataset for Vietnamese speaker recognition with over 187,000 utterances from 1,406 speakers and an automated pipeline to construct a dataset on a large scale from public sources. Our experiments show the challenges posed by the multi-genre phenomenon to models trained on a single-genre dataset, and demonstrate a significant increase in performance upon incorporating the VoxVietnam into the training process. Our experiments are conducted to study the challenges of the multi-genre phenomenon in speaker recognition and the performance gain when the proposed dataset is used for multi-genre training., Comment: Accepted to 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
Published: 2024

37. Retrieval-Augmented Generation with Graphs (GraphRAG)

Author: Han, Haoyu, Wang, Yu, Shomer, Harry, Guo, Kai, Ding, Jiayuan, Lei, Yongjia, Halappanavar, Mahantesh, Rossi, Ryan A., Mukherjee, Subhabrata, Tang, Xianfeng, He, Qi, Hua, Zhigang, Long, Bo, Zhao, Tong, Shah, Neil, Javari, Amin, Xia, Yinglong, and Tang, Jiliang
Subjects: Computer Science - Information Retrieval, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Retrieval-augmented generation (RAG) is a powerful technique that enhances downstream task execution by retrieving additional information, such as knowledge, skills, and tools from external sources. Graph, by its intrinsic "nodes connected by edges" nature, encodes massive heterogeneous and relational information, making it a golden resource for RAG in tremendous real-world applications. As a result, we have recently witnessed increasing attention on equipping RAG with Graph, i.e., GraphRAG. However, unlike conventional RAG, where the retriever, generator, and external data sources can be uniformly designed in the neural-embedding space, the uniqueness of graph-structured data, such as diverse-formatted and domain-specific relational knowledge, poses unique and significant challenges when designing GraphRAG for different domains. Given the broad applicability, the associated design challenges, and the recent surge in GraphRAG, a systematic and up-to-date survey of its key concepts and techniques is urgently desired. Following this motivation, we present a comprehensive and up-to-date survey on GraphRAG. Our survey first proposes a holistic GraphRAG framework by defining its key components, including query processor, retriever, organizer, generator, and data source. Furthermore, recognizing that graphs in different domains exhibit distinct relational patterns and require dedicated designs, we review GraphRAG techniques uniquely tailored to each domain. Finally, we discuss research challenges and brainstorm directions to inspire cross-disciplinary opportunities. Our survey repository is publicly maintained at https://github.com/Graph-RAG/GraphRAG/.
Published: 2024

38. Systematic study of large-momentum distribution in nuclei with the operator product expansion

Author: Yu, Jiexin and Long, Bingwei
Subjects: Nuclear Theory
Abstract: The operator product expansion (OPE) is applied in conjunction with Pionless effective field theory to study the short-range structure of nuclei. By matching the OPE with the selected nuclear potentials for nucleon-nucleon scattering states, we obtain the Wilson coefficients. The nucleon momentum distribution in the deuteron is then used to test the OPE against the predictions of these nuclear potentials. In order to achieve a systematic separation of short-range and long-range interactions, we discuss how the OPE approximation can be improved by including higher-order EFT potentials and higher-dimension local operators. Comment: 13 pages, 7 figures
Published: 2024

39. Quantitative Phase Retrieval and Characterization of Magnetic Nanostructures via Lorentz (Scanning) Transmission Electron Microscopy

Author: Mendoza, Kayna L., Ni, Haoyang, Varnavides, Georgios, Chi, Miaofang, Ophus, Colin, Petford-Long, Amanda, and Phatak, Charudatta
Subjects: Condensed Matter - Materials Science
Abstract: Magnetic materials phase reconstruction from Lorentz transmission electron microscopy (LTEM) measurements has traditionally been achieved using longstanding methods such as off-axis holography (OAH) and the transport-of-intensity equation (TIE). Amidst the increase in access to processing power and the development of advanced algorithms, phase retrieval of nanoscale magnetic materials with higher fidelity and resolution, potentially down to the few nanometer limit, becomes possible. Specifically, reverse-mode automatic differentiation (RMAD) and the extended electron ptychography iterative engine (ePIE) are two methods that have been utilized for high confidence phase reconstructions using LTEM through-focal series imaging and Lorentz scanning TEM (Ltz-4D-STEM), respectively. This work evaluates phase retrieval using TIE, RMAD, and ePIE in simulations consisting of an array of Permalloy (Ni80Fe20) nanoscale islands. Extending beyond simulations, we demonstrate total phase reconstructions of a NiFe nanowire using OAH and RMAD in LTEM and ePIE in Ltz-4D-STEM experiments and determine the magnetization saturation through corroborations with micromagnetic simulations. Finally, we show how the total phase shift gradient can be utilized to observe and characterize the proximity effects emanating from neighboring magnetic island interactions and an isolated NiFe nanowire. Comment: 14 pages, 5 figures, 3 supplementary figures, and government copyright
Published: 2024

40. Observation of metastability in open quantum dynamics of a solid-state system

Author: Zhang, Jun-Xiang, Jin, Yuan-De, Qiu, Chu-Dan, Ma, Wen-Long, and Liu, Gang-Qin
Subjects: Quantum Physics
Abstract: Metastability is a ubiquitous phenomenon in non-equilibrium physics and classical stochastic dynamics. It arises when the system dynamics settles in long-lived states before eventually decaying to true equilibria. Remarkably, it has been predicted that quantum metastability can also occur in continuous-time and discrete-time open quantum dynamics. However, the direct experimental observation of metastability in open quantum systems has remained elusive. Here, we experimentally observe metastability in the discrete-time evolution of a single nuclear spin in diamond, realized by sequential Ramsey interferometry measurements of a nearby nitrogen-vacancy electron spin. We demonstrate that the metastable polarization of the nuclear spin emerges at around 60,000-250,000 sequential measurements, enabling high-fidelity single-shot readout of the nuclear spin under a small magnetic field of 108.4 gauss. An ultra-long spin relaxation time of more than 10 s has been observed at room temperature. By further increasing the measurement number, the nuclear spin eventually relaxes into the maximally mixed state. Our results represent a concrete step towards uncovering non-equilibrium physics in open quantum dynamics, which is practically relevant for the utilization of metastable information in various quantum information processing tasks, such as accurate quantum operations, quantum channel discrimination and quantum error correction.
Published: 2024

41. Quantum flux operators in the fermionic theory and their supersymmetric extension

Author: Guo, Si-Mao, Liu, Wen-Bin, and Long, Jiang
Subjects: High Energy Physics - Theory, General Relativity and Quantum Cosmology
Abstract: We construct quantum flux operators with respect to the Poincaré symmetry in the massless Dirac theory at future null infinity. An anomalous helicity flux operator emerges from the commutator of the superrotation generators. The helicity flux operator corresponds to the local chiral symmetry which is the analog of superduality in the gauge theories. We also find its relation to the non-closure of the Lie transport of the spinor field around a loop. We discuss various algebras formed by these operators and constrain the test functions by the requirement of eliminating the non-local terms and satisfying the Jacobi identities. Furthermore, we explore their $\mathcal{N}=1$ supersymmetric extension in the Wess-Zumino model. There are four kinds of quantum flux operators, which correspond to the supertranslation, superrotation, superduality and supersymmetry, respectively. Interestingly, besides the expected supertranslation generator, a helicity flux operator will also emerge in the commutator between the superflux operators. We check that our flux algebra can give rise to the super-BMS and super-Poincaré algebras with appropriate choice of parameters. In the latter reduction, we find the helicity flux reduces to behaving like an $R$ symmetry generator in the commutator with the superflux. For completeness, we derive the $R$ flux which also includes a charge flux for complex scalar besides the helicity flux for spinor field. Comment: 48 pages, 2 figures
Published: 2024

42. Online Adaptive Platoon Control for Connected and Automated Vehicles via Physics Enhanced Residual Learning

Author: Zhang, Peng, Huang, Heye, Zhou, Hang, Shi, Haotian, Long, Keke, and Li, Xiaopeng
Subjects: Computer Science - Robotics, Electrical Engineering and Systems Science - Systems and Control
Abstract: This paper introduces a physics enhanced residual learning (PERL) framework for connected and automated vehicle (CAV) platoon control, addressing the dynamics and unpredictability inherent to platoon systems. The framework first develops a physics-based controller to model vehicle dynamics, using driving speed as input to optimize safety and efficiency. Then the residual controller, based on neural network (NN) learning, enriches the prior knowledge of the physical model and corrects residuals caused by vehicle dynamics. By integrating the physical model with data-driven online learning, the PERL framework retains the interpretability and transparency of physics-based models and enhances the adaptability and precision of data-driven learning, achieving significant improvements in computational efficiency and control accuracy in dynamic scenarios. Simulation and robot car platform tests demonstrate that PERL significantly outperforms pure physical and learning models, reducing average cumulative absolute position and speed errors by up to 58.5% and 40.1% (physical model) and 58.4% and 47.7% (NN model). The reduced-scale robot car platform tests further validate the adaptive PERL framework's superior accuracy and rapid convergence under dynamic disturbances, reducing position and speed cumulative errors by 72.73% and 99.05% (physical model) and 64.71% and 72.58% (NN model). PERL enhances platoon control performance through online parameter updates when external disturbances are detected. Results demonstrate the advanced framework's exceptional accuracy and rapid convergence capabilities, proving its effectiveness in maintaining platoon stability under diverse conditions. Comment: 25 pages, 12 figures
Published: 2024

43. Natural Language Fine-Tuning

Author: Liu, Jia, Wang, Yue, Lin, Zhiqi, Chen, Min, Hao, Yixue, and Hu, Long
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language model fine-tuning techniques typically depend on extensive labeled data, external guidance, and feedback, such as human alignment, scalar rewards, and demonstration. However, in practical application, the scarcity of specific knowledge poses unprecedented challenges to existing fine-tuning techniques. In this paper, focusing on fine-tuning tasks in specific domains with limited data, we introduce Natural Language Fine-Tuning (NLFT), which utilizes natural language for fine-tuning for the first time. By leveraging the strong language comprehension capability of the target LM, NLFT attaches the guidance of natural language to the token-level outputs. Then, saliency tokens are identified with calculated probabilities. Since linguistic information is effectively utilized in NLFT, our proposed method significantly reduces training costs. It markedly enhances training efficiency, comprehensively outperforming reinforcement fine-tuning algorithms in accuracy, time-saving, and resource conservation. Additionally, on the macro level, NLFT can be viewed as a token-level fine-grained optimization of SFT, thereby efficiently replacing the SFT process without the need for warm-up (as opposed to ReFT requiring multiple rounds of warm-up with SFT). Compared to SFT, NLFT does not increase the algorithmic complexity, maintaining O(n). Extensive experiments on the GSM8K dataset demonstrate that NLFT, with only 50 data instances, achieves an accuracy increase that exceeds SFT by 219%. Compared to ReFT, the time complexity and space complexity of NLFT are reduced by 78.27% and 92.24%, respectively. The superior technique of NLFT is paving the way for the deployment of various innovative LLM fine-tuning applications when resources are limited at network edges. Our code has been released at https://github.com/Julia-LiuJ/NLFT.
Published: 2024

44. Narrowband parallel coherent LiDAR with frequency interleaving

Author: Wang, Long, Hu, Liang, Jiao, Wenhai, Shang, Yaxin, Chen, Jianping, and Wu, Guiling
Subjects: Physics - Optics
Abstract: The high demand for 3D imaging in intelligent robotics is motivating the advances of coherent LiDARs towards high performances with low complexity/cost. However, the current coherent LiDARs suffer from the tight coupling between the high ranging-imaging performance and the high complexity/cost. Herein, we propose a narrowband parallel coherent LiDAR with frequency-interleaving architecture. The LiDAR architecture utilizes narrowband signals for ranging, and interleaves multi-channel sparse and narrowband signals in frequency domain at the receiving end to significantly reduce the required bandwidth and the number of detection branches, facilitating massive parallelization with low system complexity/cost. In experiments, a ranging precision of 0.49 mm that approaches the shot noise limit, and a power sensitivity of -95 dBm (~9 photons) are achieved. Parallel 3D imaging with an equivalent imaging rate of 10 Mpixel/s and a 2 cm ranging precision is also demonstrated using only two 150 MHz receiving branches. With these desirable properties, this new LiDAR opens an avenue for the LiDAR ecosystem.
Published: 2024

45. Linear Shrinkage Convexification of Penalized Linear Regression With Missing Data

Author: Park, Seongoh, Lee, Seongjin, Yen, Nguyen Thi Hai, Long, Nguyen Phuoc, and Lim, Johan
Subjects: Statistics - Methodology
Abstract: One of the common challenges faced by researchers in recent data analysis is missing values. In the context of penalized linear regression, which has been extensively explored over several decades, missing values introduce bias and yield a non-positive definite covariance matrix of the covariates, rendering the least square loss function non-convex. In this paper, we propose a novel procedure called the linear shrinkage positive definite (LPD) modification to address this issue. The LPD modification aims to modify the covariance matrix of the covariates in order to ensure consistency and positive definiteness. Employing the new covariance estimator, we are able to transform the penalized regression problem into a convex one, thereby facilitating the identification of sparse solutions. Notably, the LPD modification is computationally efficient and can be expressed analytically. In the presence of missing values, we establish the selection consistency and prove the convergence rate of the $\ell_1$-penalized regression estimator with LPD, showing an $\ell_2$-error convergence rate of square-root of $\log p$ over $n$ by a factor of $(s_0)^{3/2}$ ($s_0$: the number of non-zero coefficients). To further evaluate the effectiveness of our approach, we analyze real data from the Genomics of Drug Sensitivity in Cancer (GDSC) dataset. This dataset provides incomplete measurements of drug sensitivities of cell lines and their protein expressions. We conduct a series of penalized linear regression models with each sensitivity value serving as a response variable and protein expressions as explanatory variables.
Published: 2024

46. Flavor Physics at CEPC: a General Perspective

Author: Ai, Xiaocong, Altmannshofer, Wolfgang, Athron, Peter, Bai, Xiaozhi, Calibbi, Lorenzo, Cao, Lu, Che, Yuzhi, Chen, Chunhui, Chen, Ji-Yuan, Chen, Long, Chen, Mingshui, Chen, Shanzhen, Chen, Xuan, Cheng, Shan, Chiang, Cheng-Wei, Crivellin, Andreas, Cui, Hanhua, Deschamps, Olivier, Descotes-Genon, Sébastien, Du, Xiaokang, Fang, Shuangshi, Gao, Yu, Geng, Li-Sheng, Goldenzweig, Pablo, Gu, Jiayin, Guo, Feng-Kun, Guo, Yuchen, Guo, Zhi-Hui, Han, Tao, He, Hong-Jian, He, Jibo, He, Miao, Huang, Yanping, Isidori, Gino, Ji, Quan, Jiang, Jianfeng, Jiang, Xu-Hui, Kamenik, Jernej F., Kwok, Tsz Hong, Li, Gang, Li, Geng, Li, Haibo, Li, Haitao, Li, Hengne, Li, Honglei, Li, Liang, Li, Lingfeng, Li, Qiang, Li, Shu, Li, Xiaomei, Li, Xin-Qiang, Li, Yiming, Li, Yubo, Li, Yuji, Li, Zhao, Liang, Hao, Liang, Zhijun, Liao, Libo, Ligeti, Zoltan, Liu, Jia, Liu, Jianbei, Liu, Tao, Liu, Yi, Liu, Yong, Liu, Zhen, Lou, Xinchou, Lu, Peng-Cheng, Lusiani, Alberto, Ma, Hong-Hao, Ma, Kai, Mao, Yaxian, Marzocca, David, Niu, Juan-Juan, Prell, Soeren, Qi, Huirong, Qian, Sen, Qian, Wenbin, Qian, Zhuoni, Qin, Qin, Rock, Ariel, Rosner, Jonathan L., Ruan, Manqi, Shao, Dingyu, Shen, Chengping, Shen, Xiaoyan, Shi, Haoyu, Shi, Liaoshan, Si, Zong-Guo, Sierra, Cristian, Song, Huayang, Su, Shufang, Su, Wei, Tammaro, Michele, Wang, En, Wang, Fei, Wang, Hengyu, Wang, Jian, Wang, Jianchun, Wang, Kun, Wang, Lian-Tao, Wang, Wei, Wang, Xiaolong, Wang, Xiaoping, Wang, Yadi, Wang, Yifang, Wang, Yuexin, Wu, Xing-Gang, Wu, Yongcheng, Xiao, Rui-Qing, Xie, Ke-Pan, Xie, Yuehong, Xu, Zijun, Yang, Haijun, Yang, Hongtao, Yang, Lin, Yang, Shuo, Yin, Zhongbao, Yu, Fusheng, Yuan, Changzheng, Yuan, Xing-Bo, Yuan, Xuhao, Yue, Chongxing, Zhan, Xi-Jie, Zhang, Kaili, Zhang, Liming, Zhang, Xiaoming, Zhang, Yang, Zhang, Yanxi, Zhang, Yongchao, Zhang, Yu, Zhang, Zhen-Hua, Zhang, Zhong, Zhao, Mingrui, Zhao, Qiang, Zheng, Xu-Chang, Zheng, Yangheng, Zhou, Chen, Zhu, Pengxuan, Zhu, Yongfeng, Zuo, Xunwu, and Zupan, Jure
Subjects: High Energy Physics - Experiment, High Energy Physics - Phenomenology
Abstract: We discuss the landscape of flavor physics at the Circular Electron-Positron Collider (CEPC), based on the nominal luminosity outlined in its Technical Design Report. The CEPC is designed to operate in multiple modes to address a variety of tasks. At the $Z$ pole, the expected production of 4 Tera $Z$ bosons will provide unique and highly precise measurements of $Z$ boson couplings, while the substantial number of boosted heavy-flavored quarks and leptons produced in clean $Z$ decays will facilitate investigations into their flavor physics with unprecedented precision. We investigate the prospects of measuring various physics benchmarks and discuss their implications for particle theories and phenomenological models. Our studies indicate that, with its highlighted advantages and anticipated excellent detector performance, the CEPC can explore beauty and $\tau$ physics in ways that are superior to or complementary with the Belle II and Large-Hadron-Collider-beauty experiments, potentially enabling the detection of new physics at energy scales of 10 TeV and above. This potential also extends to the observation of yet-to-be-discovered rare and exotic processes, as well as testing fundamental principles such as lepton flavor universality, lepton and baryon number conservation, etc., making the CEPC a vibrant platform for flavor physics research. The $WW$ threshold scan, Higgs-factory operation and top-pair productions of the CEPC further enhance its merits in this regard, especially for measuring the Cabibbo-Kobayashi-Maskawa matrix elements, and Flavor-Changing-Neutral-Current physics of Higgs boson and top quarks. We outline the requirements for detector performance and considerations for future development to achieve the anticipated scientific goals.
Published: 2024

47. A General Framework of Brain Region Detection And Genetic Variants Selection in Imaging Genetics

Author: Su, Siqiang, Li, Zhenghao, Feng, Long, and Li, Ting
Subjects: Statistics - Applications
Abstract: Imaging genetics is a growing field that employs structural or functional neuroimaging techniques to study individuals with genetic risk variants potentially linked to specific illnesses. This area presents considerable challenges to statisticians due to the heterogeneous information and different data forms it involves. In addition, both imaging and genetic data are typically high-dimensional, creating a "big data squared" problem. Moreover, brain imaging data contains extensive spatial information. Simply vectorizing tensor images and treating voxels as independent features can lead to computational issues and disregard spatial structure. This paper presents a novel statistical method for imaging genetics modeling while addressing all these challenges. We explore a Canonical Correlation Analysis based linear model for the joint modeling of brain imaging, genetic information, and clinical phenotype, enabling the simultaneous detection of significant brain regions and selection of important genetic variants associated with the phenotype outcome. Scalable algorithms are developed to tackle the "big data squared" issue. We apply the proposed method to explore the reaction speed, an indicator of cognitive functions, and its associations with brain MRI and genetic factors using the UK Biobank database. Our study reveals a notable connection between the caudate nucleus region of the brain and specific significant SNPs, along with their respective regulated genes, and the reaction speed.
Published: 2024

48. DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

Author: Hu, Xiaotao, Yin, Wei, Jia, Mingkai, Deng, Junyuan, Guo, Xiaoyang, Zhang, Qian, Long, Xiaoxiao, and Tan, Ping
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recent successes in autoregressive (AR) generation models, such as the GPT series in natural language processing, have motivated efforts to replicate this success in visual tasks. Some works attempt to extend this approach to autonomous driving by building video-based world models capable of generating realistic future video sequences and predicting ego states. However, prior works tend to produce unsatisfactory results, as the classic GPT framework is designed to handle 1D contextual information, such as text, and lacks the inherent ability to model the spatial and temporal dynamics essential for video generation. In this paper, we present DrivingWorld, a GPT-style world model for autonomous driving, featuring several spatial-temporal fusion mechanisms. This design enables effective modeling of both spatial and temporal dynamics, facilitating high-fidelity, long-duration video generation. Specifically, we propose a next-state prediction strategy to model temporal coherence between consecutive frames and apply a next-token prediction strategy to capture spatial information within each frame. To further enhance generalization ability, we propose a novel masking strategy and reweighting strategy for token prediction to mitigate long-term drifting issues and enable precise control. Our work demonstrates the ability to produce high-fidelity and consistent video clips of over 40 seconds in duration, which is over 2 times longer than state-of-the-art driving world models. Experiments show that, in contrast to prior works, our method achieves superior visual quality and significantly more accurate controllable future video generation. Our code is available at https://github.com/YvanYin/DrivingWorld.
Published: 2024

49. Online distributed algorithms for mixed equilibrium problems in dynamic environments

Author: Xu, Hang, Lu, Kaihong, Wang, Yu-Long, and Zhu, Qixin
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: In this paper, the mixed equilibrium problem with coupled inequality constraints in dynamic environments is solved by employing a multi-agent system, where each agent only has access to its own bifunction, its own constraint function, and can only communicate with its immediate neighbors via a time-varying digraph. At each time, the goal of agents is to cooperatively find a point in the constraint set such that the sum of local bifunctions with a free variable is non-negative. Different from existing works, here the bifunctions and the constraint functions are time-varying and only available to agents after decisions are made. To tackle this problem, first, an online distributed algorithm involving accurate gradient information is proposed based on mirror descent algorithms and primal-dual strategies. Of particular interest is that dynamic regrets, whose offline benchmarks are to find the solution at each time, are employed to measure the performance of the algorithm. Under mild assumptions on the graph and the bifunctions, we prove that if the deviation in the solution sequence grows within a certain rate, then both the dynamic regret and the violation of coupled inequality constraints increase sublinearly. Second, considering the case where each agent only has access to a noisy estimate on the accurate gradient, we propose an online distributed algorithm involving the stochastic gradients. The result shows that under the same conditions as in the first case, if the noise distribution satisfies the sub-Gaussian condition, then dynamic regrets, as well as constraint violations, increase sublinearly with high probability. Finally, several simulation examples are presented to corroborate the validity of our results.
Published: 2024

50. Study of rare top quark decays into a jet plus a charged pseudo-scalar meson

Author: Lu, Long-Shun, Li, Lei-Yi, and Lü, Cai-Dian
Subjects: High Energy Physics - Phenomenology
Abstract: The semi-inclusive decay processes of a top quark into a charged pseudo-scalar meson and a jet are studied within the framework of QCD factorization. The leading power of the decay matrix elements can be factorized into heavy-to-light quark transition current and a hadron matrix element up to next-to-leading order QCD corrections. We calculate one-loop virtual corrections together with real gluon emission corrections at order $\alpha_s$. The numerical results of the branching ratios are presented for the sum of two-body and three-body decays. We also study the energy cut-off dependence of the gluon jet. These processes may be detectable in near-future experiments and can serve as probes for new physics.
Published: 2024
