211,829 results on '"Wang, Wei"'
Search Results
2. Association between single nucleotide polymorphisms in TYW5 locus and beef amino acids content in shuxuan cattle
- Author
-
Jia, Xianbo, Chen, Shiyi, Wang, Jie, Fu, Maozhong, Yi, Jun, Fang, Donghui, Wang, Wei, and Lai, Songjia
- Published
- 2022
- Full Text
- View/download PDF
3. Body Size Traits and Association with the Genetic Polymorphism of Melatonin Receptor 1A (MTNR1A) Gene in Shuxuan Cattle
- Author
-
Wang, Wei, Jia, Xian-bo, Gan, Jia, Fang, Dong-hui, Shi, Yi, He, Fang, and Yi, Jun
- Published
- 2021
- Full Text
- View/download PDF
4. Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection
- Author
-
Ma, Mingyu Derek, Ding, Yanna, Huang, Zijie, Gao, Jianxi, Sun, Yizhou, and Wang, Wei
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Generative Language Models rely on autoregressive decoding to produce the output sequence token by token. Many tasks such as preference optimization, require the model to produce task-level output consisting of multiple tokens directly by selecting candidates from a pool as predictions. Determining a task-level prediction from candidates using the ordinary token-level decoding mechanism is constrained by time-consuming decoding and interrupted gradients by discrete token selection. Existing works have been using decoding-free candidate selection methods to obtain candidate probability from initial output logits over vocabulary. Though these estimation methods are widely used, they are not systematically evaluated, especially on end tasks. We introduce an evaluation of a comprehensive collection of decoding-free candidate selection approaches on a comprehensive set of tasks, including five multiple-choice QA tasks with a small candidate pool and four clinical decision tasks with a massive amount of candidates, some with 10k+ options. We evaluate the estimation methods paired with a wide spectrum of foundation LMs covering different architectures, sizes and training paradigms. The results and insights from our analysis inform the future model design.
- Published
- 2025
5. Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction
- Author
-
Ma, Mingyu Derek, Wang, Xiaoxuan, Xiao, Yijia, Cuturrufo, Anthony, Nori, Vijay S, Halperin, Eran, and Wang, Wei
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Clinical diagnosis prediction models, when provided with a patient's medical history, aim to detect potential diseases early, facilitating timely intervention and improving prognostic outcomes. However, the inherent scarcity of patient data and large disease candidate space often pose challenges in developing satisfactory models for this intricate task. The exploration of leveraging Large Language Models (LLMs) for encapsulating clinical decision processes has been limited. We introduce MERA, a clinical diagnosis prediction model that bridges pertaining natural language knowledge with medical practice. We apply hierarchical contrastive learning on a disease candidate ranking list to alleviate the large decision space issue. With concept memorization through fine-tuning, we bridge the natural language clinical knowledge with medical codes. Experimental results on MIMIC-III and IV datasets show that MERA achieves the state-of-the-art diagnosis prediction performance and dramatically elevates the diagnosis prediction capabilities of generative LMs., Comment: To appear at AAAI 2025
- Published
- 2025
6. Tumor Detection, Segmentation and Classification Challenge on Automated 3D Breast Ultrasound: The TDSC-ABUS Challenge
- Author
-
Luo, Gongning, Xu, Mingwang, Chen, Hongyu, Liang, Xinjie, Tao, Xing, Ni, Dong, Jeong, Hyunsu, Kim, Chulhong, Stock, Raphael, Baumgartner, Michael, Kirchhoff, Yannick, Rokuss, Maximilian, Maier-Hein, Klaus, Yang, Zhikai, Fan, Tianyu, Boutry, Nicolas, Tereshchenko, Dmitry, Moine, Arthur, Charmetant, Maximilien, Sauer, Jan, Du, Hao, Bai, Xiang-Hui, Raikar, Vipul Pai, Montoya-del-Angel, Ricardo, Marti, Robert, Luna, Miguel, Lee, Dongmin, Qayyum, Abdul, Mazher, Moona, Guo, Qihui, Wang, Changyan, Awasthi, Navchetan, Zhao, Qiaochu, Wang, Wei, Wang, Kuanquan, Wang, Qiucheng, and Dong, Suyu
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Breast cancer is one of the most common causes of death among women worldwide. Early detection helps in reducing the number of deaths. Automated 3D Breast Ultrasound (ABUS) is a newer approach for breast screening, which has many advantages over handheld mammography such as safety, speed, and higher detection rate of breast cancer. Tumor detection, segmentation, and classification are key components in the analysis of medical images, especially challenging in the context of 3D ABUS due to the significant variability in tumor size and shape, unclear tumor boundaries, and a low signal-to-noise ratio. The lack of publicly accessible, well-labeled ABUS datasets further hinders the advancement of systems for breast tumor analysis. Addressing this gap, we have organized the inaugural Tumor Detection, Segmentation, and Classification Challenge on Automated 3D Breast Ultrasound 2023 (TDSC-ABUS2023). This initiative aims to spearhead research in this field and create a definitive benchmark for tasks associated with 3D ABUS image analysis. In this paper, we summarize the top-performing algorithms from the challenge and provide critical analysis for ABUS image examination. We offer the TDSC-ABUS challenge as an open-access platform at https://tdsc-abus2023.grand-challenge.org/ to benchmark and inspire future developments in algorithmic research.
- Published
- 2025
7. Observations of the X-ray Millihertz Quasiperiodic Oscillations in Hercules X-1
- Author
-
Yang, Wen and Wang, Wei
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
With a systematic timing investigation of the persistent X-ray binary pulsar Her X-1 based on a large number of Insight-HXMT observations between 2017 to 2019, we confirm the presence of X-ray millihertz quasi-periodic oscillations (mHz QPOs) at $\sim 0.01$ Hz. By applying wavelet analysis in our data analysis procedures, we firstly identified $\sim 0.005-0.009$ Hz QPOs coexisting with the $\sim 0.01$ Hz QPOs. Wavelet analysis suggests that these QPO features show transient behaviors, frequencies of mHz QPOs evolved in short time scales. There exists a positive relation between QPO centroid frequency (from $\sim 0.005-0.009$ Hz) and the X-ray luminosity, while the 10 mHz QPO frequencies keep nearly constant for different luminosities, which suggests different physical mechanisms for two types of mHz QPOs. The 10 mHz QPOs in both X-ray and UV bands would have the same origin related to the beat frequency where the Alfv$\acute{e}$n radius is close to the corotation radius, and the 5 mHz QPOs may originate from magnetic disk precession., Comment: 9 pages, 5 figures, 2 tables, accept for the publication in ApJ
- Published
- 2025
8. Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome
- Author
-
Wang, Wei, Chen, Zhaoxi, Zhang, Wenyu, Wang, Zetao, Zhao, Xiang, Li, Chenyang, Guan, Jian, Yin, Shankai, and Li, Gang
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Objective: The aim of the study is to develop a novel method for improved diagnosis of obstructive sleep apnea-hypopnea syndrome (OSAHS) in clinical or home settings, with the focus on achieving diagnostic performance comparable to the gold-standard polysomnography (PSG) with significantly reduced monitoring burden. Methods: We propose a method using millimeter-wave radar and pulse oximeter for OSAHS diagnosis (ROSA). It contains a sleep apnea-hypopnea events (SAE) detection network, which directly predicts the temporal localization of SAE, and a sleep staging network, which predicts the sleep stages throughout the night, based on radar signals. It also fuses oxygen saturation (SpO2) information from the pulse oximeter to adjust the score of SAE detected by radar. Results: Experimental results on a real-world dataset (>800 hours of overnight recordings, 100 subjects) demonstrated high agreement (ICC=0.9870) on apnea-hypopnea index (AHI) between ROSA and PSG. ROSA also exhibited excellent diagnostic performance, exceeding 90% in accuracy across AHI diagnostic thresholds of 5, 15 and 30 events/h. Conclusion: ROSA improves diagnostic accuracy by fusing millimeter-wave radar and pulse oximeter data. It provides a reliable and low-burden solution for OSAHS diagnosis. Significance: ROSA addresses the limitations of high complexity and monitoring burden associated with traditional PSG. The high accuracy and low burden of ROSA show its potential to improve the accessibility of OSAHS diagnosis among population.
- Published
- 2025
9. Estimating the Black Hole Spin for the X-Ray Binary MAXI J1727-203 Based on Insight-HXMT
- Author
-
Zhu, Haifan and Wang, Wei
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics ,High Energy Physics - Phenomenology - Abstract
We constrain the spin of the black hole (BH) candidate MAXI J1727-203 using Insight-HXMT data. Due to limited HXMT observations covering only part of the outburst, NICER data were used to analyze the full outburst's state transitions, we identified two of three HXMT observations in the high soft state and applied the continuum-fitting method to measure the spin. Based on previous estimates and continuum spectral fittings, we explored the parameter space and found that the best-fitting values were $(D, i, M) \approx (6\ \text{kpc}, 30^\circ, 12 M_{\odot})$. We also tested the variation of these parameters using Monte Carlo simulations, sampling over 3000 sets within the parameter ranges: $5.9 \text{kpc}< D<7 \text{kpc}$, $24^\circ
- Published
- 2025
10. Top Ten Challenges Towards Agentic Neural Graph Databases
- Author
-
Bai, Jiaxin, Wang, Zihao, Zhou, Yukun, Yin, Hang, Fei, Weizhi, Hu, Qi, Deng, Zheye, Cheng, Jiayang, Zheng, Tianshi, Tsang, Hong Ting, Gao, Yisen, Xie, Zhongwei, Li, Yufei, Fan, Lixin, Yuan, Binhang, Wang, Wei, Chen, Lei, Zhou, Xiaofang, and Song, Yangqiu
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Databases ,Computer Science - Machine Learning - Abstract
Graph databases (GDBs) like Neo4j and TigerGraph excel at handling interconnected data but lack advanced inference capabilities. Neural Graph Databases (NGDBs) address this by integrating Graph Neural Networks (GNNs) for predictive analysis and reasoning over incomplete or noisy data. However, NGDBs rely on predefined queries and lack autonomy and adaptability. This paper introduces Agentic Neural Graph Databases (Agentic NGDBs), which extend NGDBs with three core functionalities: autonomous query construction, neural query execution, and continuous learning. We identify ten key challenges in realizing Agentic NGDBs: semantic unit representation, abductive reasoning, scalable query execution, and integration with foundation models like large language models (LLMs). By addressing these challenges, Agentic NGDBs can enable intelligent, self-improving systems for modern data-driven applications, paving the way for adaptable and autonomous data management solutions., Comment: 12 Pages
- Published
- 2025
11. A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment
- Author
-
Hu, Bo, Wang, Wei, Li, Chunyi, He, Lihuo, Li, Leida, and Gao, Xinbo
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
Wide-angle video is favored for its wide viewing angle and ability to capture a large area of scenery, making it an ideal choice for sports and adventure recording. However, wide-angle video is prone to deformation, exposure and other distortions, resulting in poor video quality and affecting the perception and experience, which may seriously hinder its application in fields such as competitive sports. Up to now, few explorations focus on the quality assessment issue of wide-angle video. This deficiency primarily stems from the absence of a specialized dataset for wide-angle videos. To bridge this gap, we construct the first Multi-annotated and multi-modal Wide-angle Video quality assessment (MWV) dataset. Then, the performances of state-of-the-art video quality methods on the MWV dataset are investigated by inter-dataset testing and intra-dataset testing. Experimental results show that these methods impose significant limitations on their applicability.
- Published
- 2025
12. A White Dwarf Binary Candidate Discovered by LAMOST Using Dynamical Method
- Author
-
Zhu, Haifan, Wang, Wei, Lib, Xue, Li, Jia-jia, and Tian, Pengfu
- Subjects
Astrophysics - Solar and Stellar Astrophysics - Abstract
We present the discovery of a binary system containing a white dwarf candidate using data from the LAMOST. Our analysis of the radial velocity data allowed us to determine an orbital period of approximately 0.953 days and a mass function of 0.129 $M_\odot$. Through spectral energy distribution (SED) fitting, we obtained the stellar parameters of the visible star. By combining these results with the mass function, we established a relationship between the mass of the invisible star and the system's inclination angle, along with the Roche lobe radius. We find that the mass of the invisible star is below the Chandrasekhar limit when the inclination angle exceeds $35^\circ$. Given that systems with large variations in radial velocity typically have high inclination angles, we classify the invisible star as a white dwarf candidate. The Roche lobe radius exceeds the physical radius of the visible star, indicating that no mass transfer occurs, which results in a weak ellipsoidal modulation effect. Additionally, we obtained light curves from the TESS, ASAS-SN, and CRTS surveys. The light curves also exhibit a periodicity of approximately 0.95 days, with ellipsoidal modulation only in the 2019 TESS observations. Coupled with the strong $\rm H_{\alpha}$ emission line observed in the LAMOST MRS spectrum, we infer that the surface of the visible star contains significant hot spots. This obscures the system's inherently weak ellipsoidal modulation, resulting in a manifestation of rotational variables. Furthermore, an analysis of the dynamical characteristics of this system indicates that it has a high inclination angle ($>60$ degrees) and its orbital properties are consistent with those of typical thin disk stars, supporting the hypothesis that the invisible object is a white dwarf., Comment: 13pages, 10figures. Accepted by the Journal of High Energy Astrophysics
- Published
- 2025
13. MedFILIP: Medical Fine-grained Language-Image Pre-training
- Author
-
Liang, Xinjie, Li, Xiangyu, Li, Fanding, Jiang, Jie, Dong, Qing, Wang, Wei, Wang, Kuanquan, Dong, Suyu, Luo, Gongning, and Li, Shuo
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Medical vision-language pretraining (VLP) that leverages naturally-paired medical image-report data is crucial for medical image analysis. However, existing methods struggle to accurately characterize associations between images and diseases, leading to inaccurate or incomplete diagnostic results. In this work, we propose MedFILIP, a fine-grained VLP model, introduces medical image-specific knowledge through contrastive learning, specifically: 1) An information extractor based on a large language model is proposed to decouple comprehensive disease details from reports, which excels in extracting disease deals through flexible prompt engineering, thereby effectively reducing text complexity while retaining rich information at a tiny cost. 2) A knowledge injector is proposed to construct relationships between categories and visual attributes, which help the model to make judgments based on image features, and fosters knowledge extrapolation to unfamiliar disease categories. 3) A semantic similarity matrix based on fine-grained annotations is proposed, providing smoother, information-richer labels, thus allowing fine-grained image-text alignment. 4) We validate MedFILIP on numerous datasets, e.g., RSNA-Pneumonia, NIH ChestX-ray14, VinBigData, and COVID-19. For single-label, multi-label, and fine-grained classification, our model achieves state-of-the-art performance, the classification accuracy has increased by a maximum of 6.69\%. The code is available in https://github.com/PerceptionComputingLab/MedFILIP., Comment: 10 pages, 5 figures, IEEE Journal of Biomedical and Health Informatics 2025
- Published
- 2025
14. OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training
- Author
-
Yu, Yijiong, Dai, Ziyun, Wang, Zekun, Wang, Wei, Chen, Ran, and Pei, Ji
- Subjects
Computer Science - Computation and Language - Abstract
Large language models (LLMs) have demonstrated remarkable capabilities, but their success heavily relies on the quality of pretraining corpora. For Chinese LLMs, the scarcity of high-quality Chinese datasets presents a significant challenge, often limiting their performance. To address this issue, we propose the OpenCSG Chinese Corpus, a series of high-quality datasets specifically designed for LLM pretraining, post-training, and fine-tuning. This corpus includes Fineweb-edu-chinese, Fineweb-edu-chinese-v2, Cosmopedia-chinese, and Smoltalk-chinese, each with distinct characteristics: Fineweb-edu datasets focus on filtered, high-quality content derived from diverse Chinese web sources; Cosmopedia-chinese provides synthetic, textbook-style data for knowledge-intensive training; and Smoltalk-chinese emphasizes stylistic and diverse chat-format data. The OpenCSG Chinese Corpus is characterized by its high-quality text, diverse coverage across domains, and scalable, reproducible data curation processes. Additionally, we conducted extensive experimental analyses, including evaluations on smaller parameter models, which demonstrated significant performance improvements in tasks such as C-Eval, showcasing the effectiveness of the corpus for training Chinese LLMs., Comment: The datasets are available on https://huggingface.co/collections/opencsg/chinese-fineweb-66cfed105f502ece8f29643e ; The code is on https://github.com/yuyijiong/fineweb-edu-chinese
- Published
- 2025
15. Science objectives of the Einstein Probe mission
- Author
-
Yuan, Weimin, Dai, Lixin, Feng, Hua, Jin, Chichuan, Jonker, Peter, Kuulkers, Erik, Liu, Yuan, Nandra, Kirpal, O'Brien, Paul, Piro, Luigi, Rau, Arne, Rea, Nanda, Sanders, Jeremy, Tao, Lian, Wang, Junfeng, Wu, Xuefeng, Zhang, Bing, Zhang, Shuangnan, Ai, Shunke, Buchner, Johannes, Bulbul, Esra, Chen, Hechao, Chen, Minghua, Chen, Yong, Chen, Yu-Peng, Coleiro, Alexis, Zelati, Francesco Coti, Dai, Zigao, Fan, Xilong, Fan, Zhou, Friedrich, Susanne, Gao, He, Ge, Chong, Ge, Mingyu, Geng, Jinjun, Ghirlanda, Giancarlo, Gianfagna, Giulia, Gou, Lijun, Guillot, Sébastien, Hou, Xian, Hu, Jingwei, Huang, Yongfeng, Ji, Long, Jia, Shumei, Komossa, S., Kong, Albert K. H., Lan, Lin, Li, An, Li, Ang, Li, Chengkui, Li, Dongyue, Li, Jian, Li, Zhaosheng, Ling, Zhixing, Liu, Ang, Liu, Jinzhong, Liu, Liangduan, Liu, Zhu, Luo, Jiawei, Ma, Ruican, Maggi, Pierre, Maitra, Chandreyee, Marino, Alessio, Ng, Stephen Chi-Yung, Pan, Haiwu, Rukdee, Surangkhana, Soria, Roberto, Sun, Hui, Tam, Pak-Hin Thomas, Thakur, Aishwarya Linesh, Tian, Hui, Troja, Eleonora, Wang, Wei, Wang, Xiangyu, Wang, Yanan, Wei, Junjie, Wen, Sixiang, Wu, Jianfeng, Wu, Ting, Xiao, Di, Xu, Dong, Xu, Renxin, Xu, Yanjun, Xu, Yu, Yang, Haonan, You, Bei, Yu, Heng, Yu, Yunwei, Zhang, Binbin, Zhang, Chen, Zhang, Guobao, Zhang, Liang, Zhang, Wenda, Zhang, Yu, Zhou, Ping, and Zou, Zecheng
- Subjects
Astrophysics - High Energy Astrophysical Phenomena - Abstract
The Einstein Probe (EP) is an interdisciplinary mission of time-domain and X-ray astronomy. Equipped with a wide-field lobster-eye X-ray focusing imager, EP will discover cosmic X-ray transients and monitor the X-ray variability of known sources in 0.5-4 keV, at a combination of detecting sensitivity and cadence that is not accessible to the previous and current wide-field monitoring missions. EP can perform quick characterisation of transients or outbursts with a Wolter-I X-ray telescope onboard. In this paper, the science objectives of the Einstein Probe mission are presented. EP is expected to enlarge the sample of previously known or predicted but rare types of transients with a wide range of timescales. Among them, fast extragalactic transients will be surveyed systematically in soft X-rays, which include {\gamma}-ray bursts and their variants, supernova shock breakouts, and the predicted X-ray transients associated with binary neutron star mergers. EP will detect X-ray tidal disruption events and outbursts from active galactic nuclei, possibly at an early phase of the flares for some. EP will monitor the variability and outbursts of X-rays from white dwarfs, neutron stars and black holes in our and neighbouring galaxies at flux levels fainter than those detectable by the current instruments, and is expected to discover new objects. A large sample of stellar X-ray flares will also be detected and characterised. In the era of multi-messenger astronomy, EP has the potential of detecting the possible X-ray counterparts of gravitational wave events, neutrino sources, and ultra-high energy {\gamma}-ray and cosmic ray sources. EP is expected to help advance the studies of extreme objects/phenomena and their underlying physical processes revealed in the dynamic X-ray universe, as well as studies in other areas of X-ray astronomy., Comment: 67 pages, 24 figures, accepted for publication in SCIENCE CHINA Physics, Mechanics & Astronomy
- Published
- 2025
- Full Text
- View/download PDF
16. Detecting Linear Breit-Wheeler Signals with a Laser-Foil Setup
- Author
-
Song, Huai-Hang, Wang, Wei-Min, Li, Yu-Tong, He, Yutong, Tamburini, Matteo, Keitel, Christoph H., and Sheng, Zheng-Ming
- Subjects
Physics - Plasma Physics - Abstract
As a fundamental QED process, linear Breit-Wheeler (LBW) pair production predicted 90 years ago has not yet been demonstrated in experiments with real photons. Here, we propose an experimentally advantageous scheme to detect the LBW signal by irradiating a foil target with a single 10 PW-level laser. Our integrated QED particle-in-cell simulations demonstrate that the LBW signal can be explicitly distinguished from the Bethe-Heitler (BH) signal by comparing positron energy spectra behind the target at varying target thicknesses. The LBW positrons are created at the front of the target and subsequently experience both laser vacuum acceleration and sheath field acceleration to gain high energies, while BH positrons, originating within the target bulk, are only subjected to sheath field acceleration. As a result, the invariance of the high-energy tail of positron spectra with respect to the target thickness serves as a distinct signature of the LBW process. Notably, this scheme remains viable even when the BH yield dominates over the LBW yield., Comment: 6 pages, 4 figures
- Published
- 2025
17. Target Detection in ISAC Systems with Active RISs: A Multi-Perspective Observation Approach
- Author
-
Zhang, Shoushuo, Liu, Rang, Li, Ming, Wang, Wei, and Liu, Qian
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Integrated sensing and communication (ISAC) has emerged as a transformative technology for 6G networks, enabling the seamless integration of communication and sensing functionalities. Reconfigurable intelligent surfaces (RIS), with their capability to adaptively reconfigure the radio environment, have shown significant potential in enhancing communication quality and enabling advanced cooperative sensing. This paper investigates a multi-RIS-assisted ISAC system and introduces a novel multi-perspective observation framework that leverages the diversity of multiple observation paths, each exhibiting distinct spatial, delay, and Doppler characteristics for both target and clutter. The proposed framework integrates symbol-level precoding (SLP) and space-time adaptive processing (STAP) to fully exploit the benefits of multi-perspective observations, enabling superior target-clutter separation and significantly improving detection accuracy. The objective is to jointly design the transmit waveform, reflection coefficients of multiple active RISs, and spatial-temporal receive filters to maximize the radar output signal-to-clutter-plus-noise ratio (SCNR) for target detection, while ensuring the quality-of-service (QoS) requirements of communication users. To address the resulting non-convex optimization problem, an effective iterative algorithm is developed, combining fractional programming (FP), majorization-minimization (MM), and the alternating direction method of multipliers (ADMM). Extensive simulation results validate the effectiveness of the proposed multi-perspective observation strategy, demonstrating its advantages in improving target detection performance in challenging environments., Comment: Submitted to TCCN
- Published
- 2025
18. On the Computational Capability of Graph Neural Networks: A Circuit Complexity Bound Perspective
- Author
-
Li, Xiaoyu, Liang, Yingyu, Shi, Zhenmei, Song, Zhao, Wang, Wei, and Zhang, Jiahao
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computational Complexity - Abstract
Graph Neural Networks (GNNs) have become the standard approach for learning and reasoning over relational data, leveraging the message-passing mechanism that iteratively propagates node embeddings through graph structures. While GNNs have achieved significant empirical success, their theoretical limitations remain an active area of research. Existing studies primarily focus on characterizing GNN expressiveness through Weisfeiler-Lehman (WL) graph isomorphism tests. In this paper, we take a fundamentally different approach by exploring the computational limitations of GNNs through the lens of circuit complexity. Specifically, we analyze the circuit complexity of common GNN architectures and prove that under constraints of constant-depth layers, linear or sublinear embedding sizes, and polynomial precision, GNNs cannot solve key problems such as graph connectivity and graph isomorphism unless $\mathsf{TC}^0 = \mathsf{NC}^1$. These results reveal the intrinsic expressivity limitations of GNNs behind their empirical success and introduce a novel framework for analyzing GNN expressiveness that can be extended to a broader range of GNN models and graph decision problems.
- Published
- 2025
19. Uncovering Non-native Speakers' Experiences in Global Software Development Teams -- A Bourdieusian Perspective
- Author
-
Wang, Yi, Yue, Yang, Wang, Wei, and Zhang, Gaowei
- Subjects
Computer Science - Software Engineering - Abstract
Globally distributed software development has been a mainstream paradigm in developing modern software systems. We have witnessed a fast-growing population of software developers from areas where English is not a native language in the last several decades. Given that English is still the de facto working language in most global software engineering teams, we need to gain more knowledge about the experiences of developers who are non-native English speakers. We conducted an empirical study to fill this research gap. In this study, we interviewed 27 Chinese developers in commercial software development and open source global software development teams and applied Bourdieu's capital-field-habitus framework in an abductive data analysis process. Our study reveals four types of capital (language, social, symbolic, and economic) involved in their experiences and examines the interrelations among them. We found that non-native speakers' insufficient language capital played an essential role in prohibiting them from accessing and accumulating other capital, thus reproducing the sustained and systematic disadvantaged positions of non-native English speakers in GSD teams. We further discussed the theoretical and practical implications of the study.
- Published
- 2025
- Full Text
- View/download PDF
20. Optimizing Distributed Deployment of Mixture-of-Experts Model Inference in Serverless Computing
- Author
-
Liu, Mengfan, Wang, Wei, and Wu, Chuan
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Machine Learning - Abstract
With the advancement of serverless computing, running machine learning (ML) inference services over a serverless platform has been advocated, given its labor-free scalability and cost effectiveness. Mixture-of-Experts (MoE) models have been a dominant type of model architectures to enable large models nowadays, with parallel expert networks. Serving large MoE models on serverless computing is potentially beneficial, but has been underexplored due to substantial challenges in handling the skewed expert popularity and scatter-gather communication bottleneck in MoE model execution, for cost-efficient serverless MoE deployment and performance guarantee. We study optimized MoE model deployment and distributed inference serving on a serverless platform, that effectively predict expert selection, pipeline communication with model execution, and minimize the overall billed cost of serving MoE models. Especially, we propose a Bayesian optimization framework with multi-dimensional epsilon-greedy search to learn expert selections and optimal MoE deployment achieving optimal billed cost, including: 1) a Bayesian decision-making method for predicting expert popularity; 2) flexibly pipelined scatter-gather communication; and 3) an optimal model deployment algorithm for distributed MoE serving. Extensive experiments on AWS Lambda show that our designs reduce the billed cost of all MoE layers by at least 75.67% compared to CPU clusters while maintaining satisfactory inference throughput. As compared to LambdaML in serverless computing, our designs achieves 43.41% lower cost with a throughput decrease of at most 18.76%.
- Published
- 2025
21. LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation
- Author
-
Lu, Zhenkun, She, Chaoyin, Wang, Wei, and Huang, Qinghua
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Current medical image segmentation approaches have limitations in deeply exploring multi-scale information and effectively combining local detail textures with global contextual semantic information. This results in over-segmentation, under-segmentation, and blurred segmentation boundaries. To tackle these challenges, we explore multi-scale feature representations from different perspectives, proposing a novel, lightweight, and multi-scale architecture (LM-Net) that integrates advantages of both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to enhance segmentation accuracy. LM-Net employs a lightweight multi-branch module to capture multi-scale features at the same level. Furthermore, we introduce two modules to concurrently capture local detail textures and global semantics with multi-scale features at different levels: the Local Feature Transformer (LFT) and Global Feature Transformer (GFT). The LFT integrates local window self-attention to capture local detail textures, while the GFT leverages global self-attention to capture global contextual semantics. By combining these modules, our model achieves complementarity between local and global representations, alleviating the problem of blurred segmentation boundaries in medical image segmentation. To evaluate the feasibility of LM-Net, extensive experiments have been conducted on three publicly available datasets with different modalities. Our proposed model achieves state-of-the-art results, surpassing previous methods, while only requiring 4.66G FLOPs and 5.4M parameters. These state-of-the-art results on three datasets with different modalities demonstrate the effectiveness and adaptability of our proposed LM-Net for various medical image segmentation tasks.
- Published
- 2025
22. Splicer$^{+}$: Secure Hub Placement and Deadlock-Free Routing for Payment Channel Network Scalability
- Author
-
Yang, Lingxiao, Dong, Xuewen, Wang, Wei, Gao, Sheng, Qu, Qiang, Tian, Wensheng, and Shen, Yulong
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Payment channel hub (PCH) is a promising approach for payment channel networks (PCNs) to improve efficiency by deploying robust hubs to steadily process off-chain transactions. However, existing PCHs, often preplaced without considering payment request distribution across PCNs, can lead to load imbalance. PCNs' reliance on source routing, which makes decisions based solely on individual sender requests, can degrade performance by overlooking other requests, thus further impairing scalability. In this paper, we introduce Splicer$^{+}$, a highly scalable multi-PCH solution based on the trusted execution environment (TEE). We study tradeoffs in communication overhead between participants, transform the original NP-hard PCH placement problem by mixed-integer linear programming, and propose optimal/approximate solutions with load balancing for different PCN scales using supermodular techniques. Considering global PCN states and local directly connected sender requests, we design a deadlock-free routing protocol for PCHs. It dynamically adjusts the payment processing rate across multiple channels and, combined with TEE, ensures high-performance routing with confidential computation. We provide a formal security proof for the Splicer$^{+}$ protocol in the UC-framework. Extensive evaluations demonstrate the effectiveness of Splicer$^{+}$, with transaction success ratio ($\uparrow$51.1%), throughput ($\uparrow$181.5%), and latency outperforming state-of-the-art PCNs., Comment: Extended version of ICDCS 2023 (arXiv:2305.19182)
- Published
- 2025
23. AgentRefine: Enhancing Agent Generalization through Refinement Tuning
- Author
-
Fu, Dayuan, He, Keqing, Wang, Yejie, Hong, Wentao, Gongque, Zhuoma, Zeng, Weihao, Wang, Wei, Wang, Jingang, Cai, Xunliang, and Xu, Weiran
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Robotics - Abstract
Large Language Model (LLM) based agents have proved their ability to perform complex tasks like humans. However, there is still a large gap between open-sourced LLMs and commercial models like the GPT series. In this paper, we focus on improving the agent generalization capabilities of LLMs via instruction tuning. We first observe that the existing agent training corpus exhibits satisfactory results on held-in evaluation sets but fails to generalize to held-out sets. These agent-tuning works face severe formatting errors and are frequently stuck in the same mistake for a long while. We analyze that the poor generalization ability comes from overfitting to several manual agent environments and a lack of adaptation to new situations. They struggle with the wrong action steps and can not learn from the experience but just memorize existing observation-action relations. Inspired by the insight, we propose a novel AgentRefine framework for agent-tuning. The core idea is to enable the model to learn to correct its mistakes via observation in the trajectory. Specifically, we propose an agent synthesis framework to encompass a diverse array of environments and tasks and prompt a strong LLM to refine its error action according to the environment feedback. AgentRefine significantly outperforms state-of-the-art agent-tuning work in terms of generalization ability on diverse agent tasks. It also has better robustness facing perturbation and can generate diversified thought in inference. Our findings establish the correlation between agent generalization and self-refinement and provide a new paradigm for future research.
- Published
- 2025
24. M2I2: Learning Efficient Multi-Agent Communication via Masked State Modeling and Intention Inference
- Author
-
Sun, Chuxiong, He, Peng, Ji, Qirui, Zang, Zehua, Li, Jiangmeng, Wang, Rui, and Wang, Wei
- Subjects
Computer Science - Multiagent Systems ,Computer Science - Artificial Intelligence - Abstract
Communication is essential in coordinating the behaviors of multiple agents. However, existing methods primarily emphasize content, timing, and partners for information sharing, often neglecting the critical aspect of integrating shared information. This gap can significantly impact agents' ability to understand and respond to complex, uncertain interactions, thus affecting overall communication efficiency. To address this issue, we introduce M2I2, a novel framework designed to enhance the agents' capabilities to assimilate and utilize received information effectively. M2I2 equips agents with advanced capabilities for masked state modeling and joint-action prediction, enriching their perception of environmental uncertainties and facilitating the anticipation of teammates' intentions. This approach ensures that agents are furnished with both comprehensive and relevant information, bolstering more informed and synergistic behaviors. Moreover, we propose a Dimensional Rational Network, innovatively trained via a meta-learning paradigm, to identify the importance of dimensional pieces of information, evaluating their contributions to decision-making and auxiliary tasks. Then, we implement an importance-based heuristic for selective information masking and sharing. This strategy optimizes the efficiency of masked state modeling and the rationale behind information sharing. We evaluate M2I2 across diverse multi-agent tasks, the results demonstrate its superior performance, efficiency, and generalization capabilities, over existing state-of-the-art methods in various complex scenarios.
- Published
- 2024
25. Toward Scene Graph and Layout Guided Complex 3D Scene Generation
- Author
-
Huang, Yu-Hsiang, Wang, Wei, Huang, Sheng-Yu, and Wang, Yu-Chiang Frank
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent advancements in object-centric text-to-3D generation have shown impressive results. However, generating complex 3D scenes remains an open challenge due to the intricate relations between objects. Moreover, existing methods are largely based on score distillation sampling (SDS), which constrains the ability to manipulate multiobjects with specific interactions. Addressing these critical yet underexplored issues, we present a novel framework of Scene Graph and Layout Guided 3D Scene Generation (GraLa3D). Given a text prompt describing a complex 3D scene, GraLa3D utilizes LLM to model the scene using a scene graph representation with layout bounding box information. GraLa3D uniquely constructs the scene graph with single-object nodes and composite super-nodes. In addition to constraining 3D generation within the desirable layout, a major contribution lies in the modeling of interactions between objects in a super-node, while alleviating appearance leakage across objects within such nodes. Our experiments confirm that GraLa3D overcomes the above limitations and generates complex 3D scenes closely aligned with text prompts., Comment: 13 pages, 12 figures
- Published
- 2024
26. TradingAgents: Multi-Agents LLM Financial Trading Framework
- Author
-
Xiao, Yijia, Sun, Edward, Luo, Di, and Wang, Wei
- Subjects
Quantitative Finance - Trading and Market Microstructure ,Computer Science - Artificial Intelligence ,Computer Science - Computational Engineering, Finance, and Science ,Computer Science - Machine Learning - Abstract
Significant progress has been made in automated problem-solving using societies of agents powered by large language models (LLMs). In finance, efforts have largely focused on single-agent systems handling specific tasks or multi-agent frameworks independently gathering data. However, multi-agent systems' potential to replicate real-world trading firms' collaborative dynamics remains underexplored. TradingAgents proposes a novel stock trading framework inspired by trading firms, featuring LLM-powered agents in specialized roles such as fundamental analysts, sentiment analysts, technical analysts, and traders with varied risk profiles. The framework includes Bull and Bear researcher agents assessing market conditions, a risk management team monitoring exposure, and traders synthesizing insights from debates and historical data to make informed decisions. By simulating a dynamic, collaborative trading environment, this framework aims to improve trading performance. Detailed architecture and extensive experiments reveal its superiority over baseline models, with notable improvements in cumulative returns, Sharpe ratio, and maximum drawdown, highlighting the potential of multi-agent LLM frameworks in financial trading. More details on TradingAgents are available at https://TradingAgents-AI.github.io., Comment: Multi-Agent AI in the Real World @ AAAI 2025
- Published
- 2024
27. Graph isomorphism and multivariate graph spectrum
- Author
-
Wang, Wei and Zhao, Da
- Subjects
Mathematics - Combinatorics ,05C50, 05C60 - Abstract
We provide a criterion to distinguish two graphs which are indistinguishable by $2$-dimensional Weisfeiler-Lehman algorithm for almost all graphs. Haemers conjectured that almost all graphs are identified by their spectrum. Our approach suggests that almost all graphs are identified by their generalized block Laplacian spectrum., Comment: 21 pages, 1 figure
- Published
- 2024
28. Quark Transverse Spin-Momentum Correlation of the Pion from Lattice QCD: The Boer-Mulders Function
- Author
-
Walter, Lisa, Hua, Jun, Lahrtz, Sebastian, Ma, Lingquan, Schäfer, Andreas, Shu, Hai-Tao, Su, Yushan, Sun, Peng, Wang, Wei, Xiong, Xiaonu, Yang, Yi-Bo, Zhang, Jian-Hui, and Zhang, Qi-An
- Subjects
High Energy Physics - Lattice - Abstract
We present the first lattice QCD calculation of the quark transverse spin-momentum correlation, i.e., the T-odd Boer-Mulders function, of the pion, using large-momentum effective theory (LaMET). The calculation is done at three lattice spacings $a=(0.098, 0.085, 0.064)$ fm and pion masses $\sim350$ MeV, with pion momenta up to $1.8$ GeV. The matrix elements are renormalized in a state-of-the-art scheme and extrapolated to the continuum and infinite momentum limit. We have implemented the perturbative matching up to the next-to-next-to-leading order and carried out a renormalization-group resummation. Our results provide valuable input for phenomenological analyses of the Boer-Mulders single-spin asymmetry., Comment: 16 pages, 15 figures
- Published
- 2024
29. Search for Solar Boosted Dark Matter Particles at the PandaX-4T Experiment
- Author
-
Shen, Guofang, Bo, Zihao, Chen, Wei, Chen, Xun, Chen, Yunhua, Cheng, Zhaokan, Cui, Xiangyi, Fan, Yingjie, Fang, Deqing, Gao, Zhixing, Geng, Lisheng, Giboni, Karl, Guo, Xunan, Guo, Xuyuan, Guo, Zichao, Han, Chencheng, Han, Ke, He, Changda, He, Jinrong, Huang, Di, Huang, Houqi, Huang, Junting, Hou, Ruquan, Hou, Yu, Ji, Xiangdong, Ji, Xiangpan, Ju, Yonglin, Li, Chenxiang, Li, Jiafu, Li, Mingchuan, Li, Shuaijie, Li, Tao, Li, Zhiyuan, Lin, Qing, Liu, Jianglai, Lu, Congcong, Lu, Xiaoying, Luo, Lingyin, Luo, Yunyang, Ma, Wenbo, Ma, Yugang, Mao, Yajun, Meng, Yue, Ning, Xuyang, Pang, Binyu, Qi, Ningchun, Qian, Zhicheng, Ren, Xiangxiang, Shan, Dong, Shang, Xiaofeng, Shao, Xiyuan, Shen, Manbin, Sun, Wenliang, Tao, Yi, Wang, Anqing, Wang, Guanbo, Wang, Hao, Wang, Jiamin, Wang, Lei, Wang, Meng, Wang, Qiuhong, Wang, Shaobo, Wang, Siguang, Wang, Wei, Wang, Xiuli, Wang, Xu, Wang, Zhou, Wei, Yuehuan, Wu, Weihao, Wu, Yuan, Xiao, Mengjiao, Xiao, Xiang, Xiong, Kaizhi, Xu, Yifan, Yao, Shunyu, Yan, Binbin, Yan, Xiyu, Yang, Yong, Ye, Peihua, Yu, Chunxu, Yuan, Ying, Yuan, Zhe, Yun, Youhui, Zeng, Xinning, Zhang, Minzhen, Zhang, Peng, Zhang, Shibo, Zhang, Shu, Zhang, Tao, Zhang, Wei, Zhang, Yang, Zhang, Yingxin, Zhang, Yuanyuan, Zhao, Li, Zhou, Jifang, Zhou, Jiaxu, Zhou, Jiayi, Zhou, Ning, Zhou, Xiaopeng, Zhou, Yubo, Zhou, Zhizhen, An, Haipeng, and Nie, Haoming
- Subjects
High Energy Physics - Experiment ,High Energy Physics - Phenomenology - Abstract
We present a novel constraint on light dark matter utilizing $1.54$ tonne$\cdot$year of data acquired from the PandaX-4T dual-phase xenon time projection chamber. This constraint is derived through detecting electronic recoil signals resulting from the interaction with solar-enhanced dark matter flux. Low-mass dark matter particles, lighter than a few MeV/$c^2$, can scatter with the thermal electrons in the Sun. Consequently, with higher kinetic energy, the boosted dark matter component becomes detectable via contact scattering with xenon electrons, resulting in a few keV energy deposition that exceeds the threshold of PandaX-4T. We calculate the expected recoil energy in PandaX-4T considering the Sun's acceleration and the detection capabilities of the xenon detector. The first experimental search results using the xenon detector yield the most stringent cross-section of $3.51 \times 10^{-39}~\mathrm{cm}^2$ at $0.08~\mathrm{MeV}$/$c^2$ for a solar boosted dark matter mass ranging from $0.02$ to $10~ \mathrm{MeV}$/$c^2$, achieving a 23 fold improvement compared with earlier experimental studies.
- Published
- 2024
30. A Fully Hardware Implemented Accelerator Design in ReRAM Analog Computing without ADCs
- Author
-
Dang, Peng, Li, Huawei, and Wang, Wei
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Artificial Intelligence - Abstract
Emerging ReRAM-based accelerators process neural networks via analog Computing-in-Memory (CiM) for ultra-high energy efficiency. However, significant overhead in peripheral circuits and complex nonlinear activation modes constrain system energy efficiency improvements. This work explores the hardware implementation of the Sigmoid and SoftMax activation functions of neural networks with stochastically binarized neurons by utilizing sampled noise signals from ReRAM devices to achieve a stochastic effect. We propose a complete ReRAM-based Analog Computing Accelerator (RACA) that accelerates neural network computation by leveraging stochastically binarized neurons in combination with ReRAM crossbars. The novel circuit design removes significant sources of energy/area efficiency degradation, i.e., the Digital-to-Analog and Analog-to-Digital Converters (DACs and ADCs) as well as the components to explicitly calculate the activation functions. Experimental results show that our proposed design outperforms traditional architectures across all overall performance metrics without compromising inference accuracy.
- Published
- 2024
31. Flavor Physics at CEPC: a General Perspective
- Author
-
Ai, Xiaocong, Altmannshofer, Wolfgang, Athron, Peter, Bai, Xiaozhi, Calibbi, Lorenzo, Cao, Lu, Che, Yuzhi, Chen, Chunhui, Chen, Ji-Yuan, Chen, Long, Chen, Mingshui, Chen, Shanzhen, Chen, Xuan, Cheng, Shan, Chiang, Cheng-Wei, Crivellin, Andreas, Cui, Hanhua, Deschamps, Olivier, Descotes-Genon, Sébastien, Du, Xiaokang, Fang, Shuangshi, Gao, Yu, Geng, Li-Sheng, Goldenzweig, Pablo, Gu, Jiayin, Guo, Feng-Kun, Guo, Yuchen, Guo, Zhi-Hui, Han, Tao, He, Hong-Jian, He, Jibo, He, Miao, Huang, Yanping, Isidori, Gino, Ji, Quan, Jiang, Jianfeng, Jiang, Xu-Hui, Kamenik, Jernej F., Kwok, Tsz Hong, Li, Gang, Li, Geng, Li, Haibo, Li, Haitao, Li, Hengne, Li, Honglei, Li, Liang, Li, Lingfeng, Li, Qiang, Li, Shu, Li, Xiaomei, Li, Xin-Qiang, Li, Yiming, Li, Yubo, Li, Yuji, Li, Zhao, Liang, Hao, Liang, Zhijun, Liao, Libo, Ligeti, Zoltan, Liu, Jia, Liu, Jianbei, Liu, Tao, Liu, Yi, Liu, Yong, Liu, Zhen, Lou, Xinchou, Lu, Peng-Cheng, Lusiani, Alberto, Ma, Hong-Hao, Ma, Kai, Mao, Yaxian, Marzocca, David, Niu, Juan-Juan, Prell, Soeren, Qi, Huirong, Qian, Sen, Qian, Wenbin, Qian, Zhuoni, Qin, Qin, Rock, Ariel, Rosner, Jonathan L., Ruan, Manqi, Shao, Dingyu, Shen, Chengping, Shen, Xiaoyan, Shi, Haoyu, Shi, Liaoshan, Si, Zong-Guo, Sierra, Cristian, Song, Huayang, Su, Shufang, Su, Wei, Tammaro, Michele, Wang, En, Wang, Fei, Wang, Hengyu, Wang, Jian, Wang, Jianchun, Wang, Kun, Wang, Lian-Tao, Wang, Wei, Wang, Xiaolong, Wang, Xiaoping, Wang, Yadi, Wang, Yifang, Wang, Yuexin, Wu, Xing-Gang, Wu, Yongcheng, Xiao, Rui-Qing, Xie, Ke-Pan, Xie, Yuehong, Xu, Zijun, Yang, Haijun, Yang, Hongtao, Yang, Lin, Yang, Shuo, Yin, Zhongbao, Yu, Fusheng, Yuan, Changzheng, Yuan, Xing-Bo, Yuan, Xuhao, Yue, Chongxing, Zhan, Xi-Jie, Zhang, Kaili, Zhang, Liming, Zhang, Xiaoming, Zhang, Yang, Zhang, Yanxi, Zhang, Yongchao, Zhang, Yu, Zhang, Zhen-Hua, Zhang, Zhong, Zhao, Mingrui, Zhao, Qiang, Zheng, Xu-Chang, Zheng, Yangheng, Zhou, Chen, Zhu, Pengxuan, Zhu, Yongfeng, Zuo, Xunwu, and Zupan, Jure
- Subjects
High Energy Physics - Experiment ,High Energy Physics - Phenomenology - Abstract
We discuss the landscape of flavor physics at the Circular Electron-Positron Collider (CEPC), based on the nominal luminosity outlined in its Technical Design Report. The CEPC is designed to operate in multiple modes to address a variety of tasks. At the $Z$ pole, the expected production of 4 Tera $Z$ bosons will provide unique and highly precise measurements of $Z$ boson couplings, while the substantial number of boosted heavy-flavored quarks and leptons produced in clean $Z$ decays will facilitate investigations into their flavor physics with unprecedented precision. We investigate the prospects of measuring various physics benchmarks and discuss their implications for particle theories and phenomenological models. Our studies indicate that, with its highlighted advantages and anticipated excellent detector performance, the CEPC can explore beauty and $\tau$ physics in ways that are superior to or complementary with the Belle II and Large-Hadron-Collider-beauty experiments, potentially enabling the detection of new physics at energy scales of 10 TeV and above. This potential also extends to the observation of yet-to-be-discovered rare and exotic processes, as well as testing fundamental principles such as lepton flavor universality, lepton and baryon number conservation, etc., making the CEPC a vibrant platform for flavor physics research. The $WW$ threshold scan, Higgs-factory operation and top-pair productions of the CEPC further enhance its merits in this regard, especially for measuring the Cabibbo-Kobayashi-Maskawa matrix elements, and Flavor-Changing-Neutral-Current physics of Higgs boson and top quarks. We outline the requirements for detector performance and considerations for future development to achieve the anticipated scientific goals.
- Published
- 2024
32. MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
- Author
-
Qi, Jingyuan, Jia, Zian, Liu, Minqian, Zhan, Wangzhi, Zhang, Junkai, Wen, Xiaofei, Gan, Jingru, Chen, Jianpeng, Liu, Qin, Ma, Mingyu Derek, Li, Bangzheng, Wang, Haohui, Kulkarni, Adithya, Chen, Muhao, Zhou, Dawei, Li, Ling, Wang, Wei, and Huang, Lifu
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Human-Computer Interaction - Abstract
The discovery of novel mechanical metamaterials, whose properties are dominated by their engineered structures rather than chemical composition, is a knowledge-intensive and resource-demanding process. To accelerate the design of novel metamaterials, we present MetaScientist, a human-in-the-loop system that integrates advanced AI capabilities with expert oversight with two primary phases: (1) hypothesis generation, where the system performs complex reasoning to generate novel and scientifically sound hypotheses, supported with domain-specific foundation models and inductive biases retrieved from existing literature; (2) 3D structure synthesis, where a 3D structure is synthesized with a novel 3D diffusion model based on the textual hypothesis and refined it with a LLM-based refinement model to achieve better structure properties. At each phase, domain experts iteratively validate the system outputs, and provide feedback and supplementary materials to ensure the alignment of the outputs with scientific principles and human preferences. Through extensive evaluation from human scientists, MetaScientist is able to deliver novel and valid mechanical metamaterial designs that have the potential to be highly impactful in the metamaterial field.
- Published
- 2024
33. Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation
- Author
-
Kang, Xiaoqiang, Wang, Zimu, Jin, Xiaobo, Wang, Wei, Huang, Kaizhu, and Wang, Qiufeng
- Subjects
Computer Science - Computation and Language - Abstract
Solving tabular math word problems (TMWPs) has become a critical role in evaluating the mathematical reasoning ability of large language models (LLMs), where large-scale TMWP samples are commonly required for LLM fine-tuning. Since the collection of high-quality TMWP datasets is costly and time-consuming, recent research has concentrated on automatic TMWP generation. However, current generated samples usually suffer from issues of either correctness or diversity. In this paper, we propose a Template-driven LLM-paraphrased (TeLL) framework for generating high-quality TMWP samples with diverse backgrounds and accurate tables, questions, answers, and solutions. To this end, we first extract templates from existing real samples to generate initial problems, ensuring correctness. Then, we adopt an LLM to extend templates and paraphrase problems, obtaining diverse TMWP samples. Furthermore, we find the reasoning annotation is important for solving TMWPs. Therefore, we propose to enrich each solution with illustrative reasoning steps. Through the proposed framework, we construct a high-quality dataset TabMWP-TeLL by adhering to the question types in the TabMWP dataset, and we conduct extensive experiments on a variety of LLMs to demonstrate the effectiveness of TabMWP-TeLL in improving TMWP solving performance. The code and data of this paper are available at: https://github.com/Jason8Kang/TELL., Comment: Accepted at AAAI 2025, extended version with appendix
- Published
- 2024
34. Paradoxical non-Gaussian behavior in fractional Laplace motion with drift
- Author
-
Wang, Wei, Liang, Yingjie, Chechkin, Aleksei V., and Metzler, Ralf
- Subjects
Condensed Matter - Statistical Mechanics ,Physics - Biological Physics ,Quantitative Biology - Quantitative Methods - Abstract
We study fractional Laplace motion (FLM) obtained from subordination of fractional Brownian motion to a gamma process, in the presence of an external drift that acts on the composite process or of an internal drift acting solely on the parental process. We derive the statistical properties of this FLM process and find that the external drift does not influence the mean-squared displacement (MSD), whereas the internal drift leads to normal diffusion, dominating at long times in the subdiffusive Hurst exponent regime. We also investigate the intricate properties of the probability density function (PDF), demonstrating that it possesses a central Gaussian region, whose expansion in time is influenced by FBM's Hurst exponent. Outside of this region the PDF follows a non-Gaussian pattern. The kurtosis of this FLM process converges toward the Gaussian limit at long times insensitive to the extreme non-Gaussian tails. Additionally, in the presence of the external drift, the PDF remains symmetric and centered at $x=vt$. In contrast, for the internal drift this symmetry is broken. The results of our computer simulations are fully consistent with the theoretical predictions. The FLM model is suitable for describing stochastic processes with a non-Gaussian PDF and long-ranged correlations of the motion., Comment: 15 pages, 12 figures
- Published
- 2024
35. Promptable Representation Distribution Learning and Data Augmentation for Gigapixel Histopathology WSI Analysis
- Author
-
Tang, Kunming, Jiang, Zhiguo, Shi, Jun, Wang, Wei, Wu, Haibo, and Zheng, Yushan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Gigapixel image analysis, particularly for whole slide images (WSIs), often relies on multiple instance learning (MIL). Under the paradigm of MIL, patch image representations are extracted and then fixed during the training of the MIL classifiers for efficiency consideration. However, the invariance of representations makes it difficult to perform data augmentation for WSI-level model training, which significantly limits the performance of the downstream WSI analysis. The current data augmentation methods for gigapixel images either introduce additional computational costs or result in a loss of semantic information, which is hard to meet the requirements for efficiency and stability needed for WSI model training. In this paper, we propose a Promptable Representation Distribution Learning framework (PRDL) for both patch-level representation learning and WSI-level data augmentation. Meanwhile, we explore the use of prompts to guide data augmentation in feature space, which achieves promptable data augmentation for training robust WSI-level models. The experimental results have demonstrated that the proposed method stably outperforms state-of-the-art methods., Comment: Accepted by AAAI2025
- Published
- 2024
36. Searching for Neutrinoless Double-Beta Decay of $^{136}$Xe with PandaX-4T
- Author
-
PandaX Collaboration, Zhang, Shu, Bo, Zihao, Chen, Wei, Chen, Xun, Chen, Yunhua, Cheng, Zhaokan, Cui, Xiangyi, Fan, Yingjie, Fang, Deqing, Gao, Zhixing, Geng, Lisheng, Giboni, Karl, Guo, Xunan, Guo, Xuyuan, Guo, Zichao, Han, Chencheng, Han, Ke, He, Changda, He, Jinrong, Huang, Di, Huang, Houqi, Huang, Junting, Hou, Ruquan, Hou, Yu, Ji, Xiangdong, Ji, Xiangpan, Ju, Yonglin, Li, Chenxiang, Li, Jiafu, Li, Mingchuan, Li, Shuaijie, Li, Tao, Li, Zhiyuan, Lin, Qing, Liu, Jianglai, Lu, Congcong, Lu, Xiaoying, Luo, Lingyin, Luo, Yunyang, Ma, Wenbo, Ma, Yugang, Mao, Yajun, Meng, Yue, Ning, Xuyang, Pang, Binyu, Qi, Ningchun, Qian, Zhicheng, Ren, Xiangxiang, Shan, Dong, Shang, Xiaofeng, Shao, Xiyuan, Shen, Guofang, Shen, Manbin, Sun, Wenliang, Tao, Yi, Wang, Anqing, Wang, Guanbo, Wang, Hao, Wang, Jiamin, Wang, Lei, Wang, Meng, Wang, Qiuhong, Wang, Shaobo, Wang, Siguang, Wang, Wei, Wang, Xiuli, Wang, Xu, Wang, Zhou, Wei, Yuehuan, Wu, Weihao, Wu, Yuan, Xiao, Mengjiao, Xiao, Xiang, Xiong, Kaizhi, Xu, Yifan, Yao, Shunyu, Yan, Binbin, Yan, Xiyu, Yang, Yong, Ye, Peihua, Yu, Chunxu, Yuan, Ying, Yuan, Zhe, Yun, Youhui, Zeng, Xinning, Zhang, Minzhen, Zhang, Peng, Zhang, Shibo, Zhang, Tao, Zhang, Wei, Zhang, Yang, Zhang, Yingxin, Zhang, Yuanyuan, Zhao, Li, Zhou, Jifang, Zhou, Jiaxu, Zhou, Jiayi, Zhou, Ning, Zhou, Xiaopeng, Zhou, Yubo, and Zhou, Zhizhen
- Subjects
Nuclear Experiment ,High Energy Physics - Experiment - Abstract
We report the search for neutrinoless double-beta decay of $^{136}$Xe from the PandaX-4T experiment with a 3.7-tonne natural xenon target. The data reconstruction and the background modeling are optimized in the MeV energy region. A blind analysis is performed with data from the commissioning run and the first science run. No significant excess of signal over the background is observed. A lower limit on the half-life of $^{136}$Xe neutrinoless double-beta decay is established to be $2.1 \times 10^{24}$~yr at the 90\% confidence level, with a $^{136}$Xe exposure of 44.6~kg$\cdot$year. Our result represents the most stringent constraint from a natural xenon detector to date., Comment: 9 pages, 4 figures, 2 tables
- Published
- 2024
37. Spectrum and Lifshitz tails for the Anderson model on the Sierpinski gasket graph
- Author
-
Shou, Laura, Wang, Wei, and Zhang, Shiwen
- Subjects
Mathematical Physics - Abstract
In this work, we study the Anderson model on the Sierpinski gasket graph. We first identify the almost sure spectrum of the Anderson model when the support of the random potential has no gaps. We then prove the existence of the integrated density states of the Anderson model and show that it has Lifshitz tails with Lifshitz exponent determined by the ratio of the volume growth rate and the random walk dimension of the Sierpinski gasket graph.
- Published
- 2024
38. ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers
- Author
-
Huang, Lianghua, Wang, Wei, Wu, Zhi-Fan, Shi, Yupeng, Liang, Chen, Shen, Tong, Zhang, Han, Dou, Huanzhang, Liu, Yu, and Zhou, Jingren
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent research arXiv:2410.15027 arXiv:2410.23775 has highlighted the inherent in-context generation capabilities of pretrained diffusion transformers (DiTs), enabling them to seamlessly adapt to diverse visual tasks with minimal or no architectural modifications. These capabilities are unlocked by concatenating self-attention tokens across multiple input and target images, combined with grouped and masked generation pipelines. Building upon this foundation, we present ChatDiT, a zero-shot, general-purpose, and interactive visual generation framework that leverages pretrained diffusion transformers in their original form, requiring no additional tuning, adapters, or modifications. Users can interact with ChatDiT to create interleaved text-image articles, multi-page picture books, edit images, design IP derivatives, or develop character design settings, all through free-form natural language across one or more conversational rounds. At its core, ChatDiT employs a multi-agent system comprising three key components: an Instruction-Parsing agent that interprets user-uploaded images and instructions, a Strategy-Planning agent that devises single-step or multi-step generation actions, and an Execution agent that performs these actions using an in-context toolkit of diffusion transformers. We thoroughly evaluate ChatDiT on IDEA-Bench arXiv:2412.11767, comprising 100 real-world design tasks and 275 cases with diverse instructions and varying numbers of input and target images. Despite its simplicity and training-free approach, ChatDiT surpasses all competitors, including those specifically designed and trained on extensive multi-task datasets. We further identify key limitations of pretrained DiTs in zero-shot adapting to tasks. We release all code, agents, results, and intermediate outputs to facilitate further research at https://github.com/ali-vilab/ChatDiT, Comment: Tech report. Project page: https://ali-vilab.github.io/ChatDiT-Page/
- Published
- 2024
39. Unsupervised Region-Based Image Editing of Denoising Diffusion Models
- Author
-
Li, Zixiang, Song, Yue, Tao, Renshuai, Jia, Xiaohong, Zhao, Yao, and Wang, Wei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Although diffusion models have achieved remarkable success in the field of image generation, their latent space remains under-explored. Current methods for identifying semantics within latent space often rely on external supervision, such as textual information and segmentation masks. In this paper, we propose a method to identify semantic attributes in the latent space of pre-trained diffusion models without any further training. By projecting the Jacobian of the targeted semantic region into a low-dimensional subspace which is orthogonal to the non-masked regions, our approach facilitates precise semantic discovery and control over local masked areas, eliminating the need for annotations. We conducted extensive experiments across multiple datasets and various architectures of diffusion models, achieving state-of-the-art performance. In particular, for some specific face attributes, the performance of our proposed method even surpasses that of supervised approaches, demonstrating its superior ability in editing local image properties.
- Published
- 2024
40. The Hartogs-Bochner extension for monogenic functions of several vector variables and the Dirac complex
- Author
-
Shi, Yun and Wang, Wei
- Subjects
Mathematics - Complex Variables ,Mathematics - Analysis of PDEs - Abstract
Holomorphic functions in several complex variables are generalized to regular functions in several quaternionic variables, and further to monogenic functions of several vector variables, which are annihilated by several Dirac operators on $k$ copies of the Euclidean space $\mathbb R^n$. As the Dolbeault complex in complex analysis, the Dirac complex resolving several Dirac operators plays the fundamental role to investigate monogenic functions. Although the spaces in the Dirac complex are complicated irreducible modules of ${\rm GL}(k),$ we give a simple characterization of the first four spaces, which allows us to write down first three operators in the Dirac complex explicitly and to show this part to be an elliptic complex. Then the PDE method can be applied to obtain solutions to the non-homogeneous several Dirac equations under the compatibility condition, which implies the Hartogs' phenomenon for monogenic functions. Moreover, we find the boundary version of several Dirac operators and introduce the notion of a tangentially monogenic function, corresponding to tangential Cauchy-Riemann operator and CR functions in several complex variables, and establish the Hartogs-Bochner extension for tangentially monogenic functions on the boundary of a domain., Comment: 29 pages
- Published
- 2024
41. Echo: Simulating Distributed Training At Scale
- Author
-
Feng, Yicheng, Chen, Yuetao, Chen, Kaiwen, Li, Jingzong, Wu, Tianyuan, Cheng, Peng, Wu, Chuan, Wang, Wei, Ho, Tsung-Yi, and Xu, Hong
- Subjects
Computer Science - Machine Learning ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
Simulation offers unique values for both enumeration and extrapolation purposes, and is becoming increasingly important for managing the massive machine learning (ML) clusters and large-scale distributed training jobs. In this paper, we build Echo to tackle three key challenges in large-scale training simulation: (1) tracing the runtime training workloads at each device in an ex-situ fashion so we can use a single device to obtain the actual execution graphs of 1K-GPU training, (2) accurately estimating the collective communication without high overheads of discrete-event based network simulation, and (3) accounting for the interference-induced computation slowdown from overlapping communication and computation kernels on the same device. Echo delivers on average 8% error in training step -- roughly 3x lower than state-of-the-art simulators -- for GPT-175B on a 96-GPU H800 cluster with 3D parallelism on Megatron-LM under 2 minutes.
- Published
- 2024
42. IDEA-Bench: How Far are Generative Models from Professional Designing?
- Author
-
Liang, Chen, Huang, Lianghua, Fang, Jingwu, Dou, Huanzhang, Wang, Wei, Wu, Zhi-Fan, Shi, Yupeng, Zhang, Junge, Zhao, Xin, and Liu, Yu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Real-world design tasks - such as picture book creation, film storyboard development using character sets, photo retouching, visual effects, and font transfer - are highly diverse and complex, requiring deep interpretation and extraction of various elements from instructions, descriptions, and reference images. The resulting images often implicitly capture key features from references or user inputs, making it challenging to develop models that can effectively address such varied tasks. While existing visual generative models can produce high-quality images based on prompts, they face significant limitations in professional design scenarios that involve varied forms and multiple inputs and outputs, even when enhanced with adapters like ControlNets and LoRAs. To address this, we introduce IDEA-Bench, a comprehensive benchmark encompassing 100 real-world design tasks, including rendering, visual effects, storyboarding, picture books, fonts, style-based, and identity-preserving generation, with 275 test cases to thoroughly evaluate a model's general-purpose generation capabilities. Notably, even the best-performing model only achieves 22.48 on IDEA-Bench, while the best general-purpose model only achieves 6.81. We provide a detailed analysis of these results, highlighting the inherent challenges and providing actionable directions for improvement. Additionally, we provide a subset of 18 representative tasks equipped with multimodal large language model (MLLM)-based auto-evaluation techniques to facilitate rapid model development and comparison. We releases the benchmark data, evaluation toolkits, and an online leaderboard at https://github.com/ali-vilab/IDEA-Bench, aiming to drive the advancement of generative models toward more versatile and applicable intelligent design systems.
- Published
- 2024
43. Fractional Langevin equation far from equilibrium: Riemann-Liouville fractional Brownian motion, spurious nonergodicity and aging
- Author
-
Wei, Qing, Wang, Wei, Tang, Yifa, Metzler, Ralf, and Chechkin, Aleksei
- Subjects
Condensed Matter - Statistical Mechanics - Abstract
We consider the fractional Langevin equation far from equilibrium (FLEFE) to describe stochastic dynamics which do not obey the fluctuation-dissipation theorem, unlike the conventional fractional Langevin equation (FLE). The solution of this equation is Riemann-Liouville fractional Brownian motion (RL-FBM), also known in the literature as FBM II. Spurious nonergodicity, stationarity, and aging properties of the solution are explored for all admissible values $\alpha>1/2$ of the order $\alpha$ of the time-fractional Caputo derivative in the FLEFE. The increments of the process are asymptotically stationary. However when $1/2<\alpha<3/2$, the time-averaged mean-squared displacement (TAMSD) does not converge to the mean-squared displacement (MSD). Instead, it converges to the mean-squared increment (MSI) or structure function, leading to the phenomenon of spurious nonergodicity. When $\alpha\ge 3/2$, the increments of FLEFE motion are nonergodic, however the higher order increments are asymptotically ergodic. We also discuss the aging effect in the FLEFE by investigating the influence of an aging time $t_a$ on the mean-squared displacement, time-averaged mean-squared displacement and autocovariance function of the increments. We find that under strong aging conditions the process becomes ergodic, and the increments become stationary in the domain $1/2<\alpha<3/2$., Comment: 25 pages, 5 figures
- Published
- 2024
44. Rethinking Software Misconfigurations in the Real World: An Empirical Study and Literature Analysis
- Author
-
Liu, Yuhao, Zhou, Yingnan, Zhang, Hanfeng, Chang, Zhiwei, Xu, Sihan, Jia, Yan, Wang, Wei, and Liu, Zheli
- Subjects
Computer Science - Software Engineering ,D.2 - Abstract
Software misconfiguration has consistently been a major reason for software failures. Over the past twenty decades, much work has been done to detect and diagnose software misconfigurations. However, there is still a gap between real-world misconfigurations and the literature. It is desirable to investigate whether existing taxonomy and tools are applicable for real-world misconfigurations in modern software. In this paper, we conduct an empirical study on 823 real-world misconfiguration issues, based on which we propose a novel classification of the root causes of software misconfigurations, i.e., constraint violation, resource unavailability, component-dependency error, and misunderstanding of configuration effects. Then, we systematically review the literature on misconfiguration troubleshooting, and study the trends of research and the practicality of the tools and datasets in this field. We find that the research targets have changed from fundamental software to advanced applications (e.g., cloud service). In the meanwhile, the research on non-crash misconfigurations such as performance degradation and security risks also has a significant growth. Despite the progress, a majority of studies lack reproducibility due to the unavailable tools and evaluation datasets. In total, only six tools and two datasets are publicly available. However, the adaptability of these tools limit their practical use on real-world misconfigurations. We also summarize the important challenges and several suggestions to facilitate the research on software misconfiguration., Comment: 15 pages,6 figures, 7 tables
- Published
- 2024
45. APAR: Modeling Irregular Target Functions in Tabular Regression via Arithmetic-Aware Pre-Training and Adaptive-Regularized Fine-Tuning
- Author
-
Wu, Hong-Wei, Wang, Wei-Yao, Wang, Kuang-Da, and Peng, Wen-Chih
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Tabular data are fundamental in common machine learning applications, ranging from finance to genomics and healthcare. This paper focuses on tabular regression tasks, a field where deep learning (DL) methods are not consistently superior to machine learning (ML) models due to the challenges posed by irregular target functions inherent in tabular data, causing sensitive label changes with minor variations from features. To address these issues, we propose a novel Arithmetic-Aware Pre-training and Adaptive-Regularized Fine-tuning framework (APAR), which enables the model to fit irregular target function in tabular data while reducing the negative impact of overfitting. In the pre-training phase, APAR introduces an arithmetic-aware pretext objective to capture intricate sample-wise relationships from the perspective of continuous labels. In the fine-tuning phase, a consistency-based adaptive regularization technique is proposed to self-learn appropriate data augmentation. Extensive experiments across 10 datasets demonstrated that APAR outperforms existing GBDT-, supervised NN-, and pretrain-finetune NN-based methods in RMSE (+9.43% $\sim$ 20.37%), and empirically validated the effects of pre-training tasks, including the study of arithmetic operations. Our code and data are publicly available at https://github.com/johnnyhwu/APAR., Comment: AAAI 2025 Main Track
- Published
- 2024
46. Proposing and solving olympiad geometry with guided tree search
- Author
-
Zhang, Chi, Song, Jiajun, Li, Siyu, Liang, Yitao, Ma, Yuxi, Wang, Wei, Zhu, Yixin, and Zhu, Song-Chun
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Mathematics olympiads are prestigious competitions, with problem proposing and solving highly honored. Building artificial intelligence that proposes and solves olympiads presents an unresolved challenge in automated theorem discovery and proving, especially in geometry for its combination of numerical and spatial elements. We introduce TongGeometry, a Euclidean geometry system supporting tree-search-based guided problem proposing and solving. The efficient geometry system establishes the most extensive repository of geometry theorems to date: within the same computational budget as the existing state-of-the-art, TongGeometry discovers 6.7 billion geometry theorems requiring auxiliary constructions, including 4.1 billion exhibiting geometric symmetry. Among them, 10 theorems were proposed to regional mathematical olympiads with 3 of TongGeometry's proposals selected in real competitions, earning spots in a national team qualifying exam or a top civil olympiad in China and the US. Guided by fine-tuned large language models, TongGeometry solved all International Mathematical Olympiad geometry in IMO-AG-30, outperforming gold medalists for the first time. It also surpasses the existing state-of-the-art across a broader spectrum of olympiad-level problems. The full capabilities of the system can be utilized on a consumer-grade machine, making the model more accessible and fostering widespread democratization of its use. By analogy, unlike existing systems that merely solve problems like students, TongGeometry acts like a geometry coach, discovering, presenting, and proving theorems.
- Published
- 2024
47. Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance
- Author
-
Lyu, Jiahao, Wang, Wei, Yang, Dongbao, Zhong, Jinwen, and Zhou, Yu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Scene text spotting has attracted the enthusiasm of relative researchers in recent years. Most existing scene text spotters follow the detection-then-recognition paradigm, where the vanilla detection module hardly determines the reading order and leads to failure recognition. After rethinking the auto-regressive scene text recognition method, we find that a well-trained recognizer can implicitly perceive the local semantics of all characters in a complete word or a sentence without a character-level detection module. Local semantic knowledge not only includes text content but also spatial information in the right reading order. Motivated by the above analysis, we propose the Local Semantics Guided scene text Spotter (LSGSpotter), which auto-regressively decodes the position and content of characters guided by the local semantics. Specifically, two effective modules are proposed in LSGSpotter. On the one hand, we design a Start Point Localization Module (SPLM) for locating text start points to determine the right reading order. On the other hand, a Multi-scale Adaptive Attention Module (MAAM) is proposed to adaptively aggregate text features in a local area. In conclusion, LSGSpotter achieves the arbitrary reading order spotting task without the limitation of sophisticated detection, while alleviating the cost of computational resources with the grid sampling strategy. Extensive experiment results show LSGSpotter achieves state-of-the-art performance on the InverseText benchmark. Moreover, our spotter demonstrates superior performance on English benchmarks for arbitrary-shaped text, achieving improvements of 0.7\% and 2.5\% on Total-Text and SCUT-CTW1500, respectively. These results validate our text spotter is effective for scene texts in arbitrary reading order and shape., Comment: Accepted by AAAI2025
- Published
- 2024
48. ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression
- Author
-
Yao, Kai, Tan, Zhaorui, Ye, Tiandi, Li, Lichun, Zhao, Yuan, Liu, Wenyan, Wang, Wei, and Zhu, Jianke
- Subjects
Computer Science - Computation and Language ,Computer Science - Cryptography and Security - Abstract
Offsite-tuning is a privacy-preserving method for tuning large language models (LLMs) by sharing a lossy compressed emulator from the LLM owners with data owners for downstream task tuning. This approach protects the privacy of both the model and data owners. However, current offsite tuning methods often suffer from adaptation degradation, high computational costs, and limited protection strength due to uniformly dropping LLM layers or relying on expensive knowledge distillation. To address these issues, we propose ScaleOT, a novel privacy-utility-scalable offsite-tuning framework that effectively balances privacy and utility. ScaleOT introduces a novel layerwise lossy compression algorithm that uses reinforcement learning to obtain the importance of each layer. It employs lightweight networks, termed harmonizers, to replace the raw LLM layers. By combining important original LLM layers and harmonizers in different ratios, ScaleOT generates emulators tailored for optimal performance with various model scales for enhanced privacy protection. Additionally, we present a rank reduction method to further compress the original LLM layers, significantly enhancing privacy with negligible impact on utility. Comprehensive experiments show that ScaleOT can achieve nearly lossless offsite tuning performance compared with full fine-tuning while obtaining better model privacy., Comment: accepted by AAAI2025
- Published
- 2024
49. ALMA/SCUBA-2 COSMOS Survey: Properties of X-ray- and SED-selected AGNs in Bright Submillimeter Galaxies
- Author
-
Uematsu, Ryosuke, Ueda, Yoshihiro, Alexander, David M., Swinbank, A. M., Smail, Ian, Andonie, Carolina, Chen, Chian-Chou, Dudzeviciute, Ugne, Ikarashi, Soh, Kohno, Kotaro, Matsuda, Yuichi, Puglisi, Annagrazia, Umehata, Hideki, and Wang, Wei-Hao
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We investigate the properties of active galactic nuclei (AGNs) in the brightest submillimeter galaxies (SMGs) in the COSMOS field. We utilize the bright sample of ALMA/SCUBA-2 COSMOS Survey (AS2COSMOS), which consists of 260 SMGs with $S_{\mathrm{870}\, \mu \mathrm{m}}=0.7\text{--}19.2\,\mathrm{mJy}$ at $z=0\text{--}6$. We perform optical to millimeter spectral energy distribution (SED) modeling for the whole sample. We identify 24 AGN-host galaxies from the SEDs. Supplemented by 23 X-ray detected AGNs (X-ray AGNs), we construct an overall sample of 40 AGN-host galaxies. The X-ray luminosity upper bounds indicate that the X-ray undetected SED-identified AGNs are likely to be nearly Compton thick or have unusually suppressed X-ray emission. From visual classification, we identify $25^{+6}_{-5}$\% of the SMGs without AGNs as major merger candidates. This fraction is almost consistent with the general galaxy population at $z\sim2$, suggesting that major mergers are not necessarily required for the enhanced star formation in SMGs. We also identify $47^{+16}_{-15}$\% of the AGN hosts as major merger candidates, which is about twice as high as that in the SMGs without AGNs. This suggests that major mergers play a key role in triggering AGN activity in bright SMGs., Comment: 37 pages, 21 figures, accepted for The Astrophysical Journal
- Published
- 2024
50. DumpyOS: A Data-Adaptive Multi-ary Index for Scalable Data Series Similarity Search
- Author
-
Wang, Zeyu, Wang, Qitong, Wang, Peng, Palpanas, Themis, and Wang, Wei
- Subjects
Computer Science - Databases - Abstract
Data series indexes are necessary for managing and analyzing the increasing amounts of data series collections that are nowadays available. These indexes support both exact and approximate similarity search, with approximate search providing high-quality results within milliseconds, which makes it very attractive for certain modern applications. Reducing the pre-processing (i.e., index building) time and improving the accuracy of search results are two major challenges. DSTree and the iSAX index family are state-of-the-art solutions for this problem. However, DSTree suffers from long index building times, while iSAX suffers from low search accuracy. In this paper, we identify two problems of the iSAX index family that adversely affect the overall performance. First, we observe the presence of a proximity-compactness trade-off related to the index structure design (i.e., the node fanout degree), significantly limiting the efficiency and accuracy of the resulting index. Second, a skewed data distribution will negatively affect the performance of iSAX. To overcome these problems, we propose Dumpy, an index that employs a novel multi-ary data structure with an adaptive node splitting algorithm and an efficient building workflow. Furthermore, we devise Dumpy-Fuzzy as a variant of Dumpy which further improves search accuracy by proper duplication of series. To fully leverage the potential of modern hardware including multicore CPUs and Solid State Drives (SSDs), we parallelize Dumpy to DumpyOS with sophisticated indexing and pruning-based querying algorithms. An optimized approximate search algorithm, DumpyOS-F which prominently improves the search accuracy without violating the index, is also proposed., Comment: arXiv admin note: substantial text overlap with arXiv:2304.08264
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.