205,340 results on '"WANG, WEI"'
Search Results
2. Association between single nucleotide polymorphisms in TYW5 locus and beef amino acids content in shuxuan cattle
- Author
-
Jia, Xianbo, Chen, Shiyi, Wang, Jie, Fu, Maozhong, Yi, Jun, Fang, Donghui, Wang, Wei, and Lai, Songjia
- Published
- 2022
- Full Text
- View/download PDF
3. Body Size Traits and Association with the Genetic Polymorphism of Melatonin Receptor 1A (MTNR1A) Gene in Shuxuan Cattle
- Author
-
Wang, Wei, Jia, Xian-bo, Gan, Jia, Fang, Dong-hui, Shi, Yi, He, Fang, and Yi, Jun
- Published
- 2021
- Full Text
- View/download PDF
4. NCST: Neural-based Color Style Transfer for Video Retouching
- Author
-
Jiang, Xintao, Chen, Yaosen, Zhang, Siqin, Wang, Wei, and Wen, Xuming
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Neural and Evolutionary Computing ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
Video color style transfer aims to transform the color style of an original video by using a reference style image. Most existing methods employ neural networks, which come with challenges like opaque transfer processes and limited user control over the outcomes. Typically, users cannot fine-tune the resulting images or videos. To tackle this issue, we introduce a method that predicts specific parameters for color style transfer using two images. Initially, we train a neural network to learn the corresponding color adjustment parameters. When applying style transfer to a video, we fine-tune the network with key frames from the video and the chosen style image, generating precise transformation parameters. These are then applied to convert the color style of both images and videos. Our experimental results demonstrate that our algorithm surpasses current methods in color style transfer quality. Moreover, each parameter in our method has a specific, interpretable meaning, enabling users to understand the color style transfer process and allowing them to perform manual fine-tuning if desired., Comment: 10 pages, 8 figures
- Published
- 2024
5. In-Context LoRA for Diffusion Transformers
- Author
-
Huang, Lianghua, Wang, Wei, Wu, Zhi-Fan, Shi, Yupeng, Dou, Huanzhang, Liang, Chen, Feng, Yutong, Liu, Yu, and Zhou, Jingren
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Graphics - Abstract
Recent research arXiv:2410.15027 has explored the use of diffusion transformers (DiTs) for task-agnostic image generation by simply concatenating attention tokens across images. However, despite substantial computational resources, the fidelity of the generated images remains suboptimal. In this study, we reevaluate and streamline this framework by hypothesizing that text-to-image DiTs inherently possess in-context generation capabilities, requiring only minimal tuning to activate them. Through diverse task experiments, we qualitatively demonstrate that existing text-to-image DiTs can effectively perform in-context generation without any tuning. Building on this insight, we propose a remarkably simple pipeline to leverage the in-context abilities of DiTs: (1) concatenate images instead of tokens, (2) perform joint captioning of multiple images, and (3) apply task-specific LoRA tuning using small datasets (e.g., $20\sim 100$ samples) instead of full-parameter tuning with large datasets. We name our models In-Context LoRA (IC-LoRA). This approach requires no modifications to the original DiT models, only changes to the training data. Remarkably, our pipeline generates high-fidelity image sets that better adhere to prompts. While task-specific in terms of tuning data, our framework remains task-agnostic in architecture and pipeline, offering a powerful tool for the community and providing valuable insights for further research on product-level task-agnostic generation systems. We release our code, data, and models at https://github.com/ali-vilab/In-Context-LoRA, Comment: Tech report. Project page: https://ali-vilab.github.io/In-Context-LoRA-Page/
- Published
- 2024
6. Across-Platform Detection of Malicious Cryptocurrency Transactions via Account Interaction Learning
- Author
-
Che, Zheng, Shen, Meng, Tan, Zhehui, Du, Hanbiao, Zhu, Liehuang, Wang, Wei, Chen, Ting, Zhao, Qinglin, and Xie, Yong
- Subjects
Computer Science - Cryptography and Security - Abstract
With the rapid evolution of Web3.0, cryptocurrency has become a cornerstone of decentralized finance. While these digital assets enable efficient and borderless financial transactions, their pseudonymous nature has also attracted malicious activities such as money laundering, fraud, and other financial crimes. Effective detection of malicious transactions is crucial to maintaining the security and integrity of the Web 3.0 ecosystem. Existing malicious transaction detection methods rely on large amounts of labeled data and suffer from low generalization. Label-efficient and generalizable malicious transaction detection remains a challenging task. In this paper, we propose ShadowEyes, a novel malicious transaction detection method. Specifically, we first propose a generalized graph structure named TxGraph as a representation of malicious transaction, which captures the interaction features of each malicious account and its neighbors. Then we carefully design a data augmentation method tailored to simulate the evolution of malicious transactions to generate positive pairs. To alleviate account label scarcity, we further design a graph contrastive mechanism, which enables ShadowEyes to learn discriminative features effectively from unlabeled data, thereby enhancing its detection capabilities in real-world scenarios. We conduct extensive experiments using public datasets to evaluate the performance of ShadowEyes. The results demonstrate that it outperforms state-of-the-art (SOTA) methods in four typical scenarios. Specifically, in the zero-shot learning scenario, it can achieve an F1 score of 76.98% for identifying gambling transactions, surpassing the SOTA method by12.05%. In the scenario of across-platform malicious transaction detection, ShadowEyes maintains an F1 score of around 90%, which is 10% higher than the SOTA method.
- Published
- 2024
7. EVeCA: Efficient and Verifiable On-Chain Data Query Framework Using Challenge-Based Authentication
- Author
-
Shen, Meng, Liu, Yuzhi, Zhao, Qinglin, Wang, Wei, Ou, Wei, Han, Wenbao, and Zhu, Liehuang
- Subjects
Computer Science - Cryptography and Security ,Computer Science - Distributed, Parallel, and Cluster Computing - Abstract
As blockchain applications become increasingly widespread, there is a rising demand for on-chain data queries. However, existing schemes for on-chain data queries face a challenge between verifiability and efficiency. Queries on blockchain databases can compromise the authenticity of the query results, while schemes that utilize on-chain Authenticated Data Structure (ADS) have lower efficiency. To overcome this limitation, we propose an efficient and verifiable on-chain data query framework EVeCA. In our approach, we free the full nodes from the task of ADS maintenance by delegating it to a limited number of nodes, and full nodes verify the correctness of ADS by using challenge-based authentication scheme instead of reconstructing them, which prevents the service providers from maintaining incorrect ADS with overwhelming probability. By carefully designing the ADS verification scheme, EVeCA achieves higher efficiency while remaining resilient against adaptive attacks. Our framework effectively eliminates the need for on-chain ADS maintenance, and allows full nodes to participate in ADS maintenance in a cost-effective way. We demonstrate the effectiveness of the proposed scheme through security analysis and experimental evaluation. Compared to existing schemes, our approach improves ADS maintenance efficiency by about 20*.
- Published
- 2024
8. Are Large-Language Models Graph Algorithmic Reasoners?
- Author
-
Taylor, Alexander K, Cuturrufo, Anthony, Yathish, Vishal, Ma, Mingyu Derek, and Wang, Wei
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
We seek to address a core challenge facing current Large Language Models (LLMs). LLMs have demonstrated superior performance in many tasks, yet continue to struggle with reasoning problems on explicit graphs that require multiple steps. To address this gap, we introduce a novel benchmark designed to evaluate LLM performance on classical algorithmic reasoning tasks on explicit graphs. Our benchmark encompasses five fundamental algorithms: Breadth-First Search (BFS) and Depth-First Search (DFS) for connectivity, Dijkstra's algorithm and Floyd-Warshall algorithm for all nodes shortest path, and Prim's Minimum Spanning Tree (MST-Prim's) algorithm. Through extensive experimentation, we assess the capabilities of state-of-the-art LLMs in executing these algorithms step-by-step and systematically evaluate their performance at each stage. Our findings highlight the persistent challenges LLMs face in this domain and underscore the necessity for advanced prompting techniques and algorithmic instruction to enhance their graph reasoning abilities. This work presents MAGMA, the first comprehensive benchmark focused on LLMs completing classical graph algorithms, and provides a critical step toward understanding and improving their structured problem-solving skills., Comment: 9 pages, 13 Figures
- Published
- 2024
9. Temporal and spectral variations of the X-ray pulsar Cen X-3 observed by NuSTAR
- Author
-
Liu, Qi, Wang, Wei, Santangelo, Andrea, Kong, Lingda, Ji, Long, and Ducci, Lorenzo
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
We report a time-resolved analysis of the accreting X-ray pulsar Cen X-3 using observations carried out by NuSTAR, which cover approximately two binary orbits in different intensity states. The pulse profile is relatively stable over the orbital phase and shows energy dependence. It has an obvious double-peaked shape in the energy band below 15 keV -- with the second pulse peak decreasing as energy increases -- and is gradually dominated by a single peak in higher energy bands. We find that the pulse profile in the energy band of 3-5 keV at high-intensity states shows a subtle triple-peaked shape, with the main peak divided into two subpeaks. We also find a positive correlation between the pulse fraction and both energy and flux. Our spectral analysis reveals that the spectra can be well described by the continuum of Fermi-Dirac cutoff and NPEX models, and the cyclotron line is detected with the centroid energies varying from 26 keV to 29 keV, along with the iron emission line around 6.4 keV. We investigated the dependence between the cyclotron resonant scattering feature (CRSF) centroid energy and luminosity and discuss the theoretical critical luminosity. Although the variation of $E_{\rm cyc}- L_X$ is not distinct, there is a possibility that the critical luminosity lies within the range of $\sim (0.5-4)\times 10^{37}$ erg s$^{-1}$ in the band of $4-78$ keV. The photon index shows a strong positive correlation with luminosity. Our orbital-phase analysis reveals that the spectral parameters show orbital variability, and the highly variable photoelectric absorption may indicate the existence of clumpy stellar winds., Comment: 10 pages, 10 figures
- Published
- 2024
- Full Text
- View/download PDF
10. Studying the variations of the cyclotron line in Cen X-3 using Insight-HXMT
- Author
-
Liu, Qi, Wang, Wei, Yang, Wen, Chen, Xiao, and Wu, Hanji
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
We investigate the cyclotron resonant scattering features (CRSFs) of the accreting X-ray pulsar Cen X-3 and significantly detect the 29 keV cyclotron line features in the hard X-ray averaged spectroscopy studies based on the recent Insight-HXMT observations in 2022, when Cen X-3 has X-ray luminosity $L_{\rm X} > \sim 5 \times 10^{37}$ erg\ s$^{-1}$ in the bands of 2 -- 60 keV. We do not find a harmonic line in the average spectra based on different continuum models. We showed that the CRSF energies have no correlation with time or luminosity in the average spectra. In addition, by performing a pulse phase-dependent spectral analysis, we revealed the fundamental line with the centroid energy ranging from 25 to 29 keV with a high significance over the spin phases. The evolution of the cyclotron line centroid energies over pulse phase is similar to the shape of pulse profiles, illustrating a positive correlation between the energy of CRSFs and the pulse phase flux., Comment: 10 pages, 7 figures
- Published
- 2024
- Full Text
- View/download PDF
11. Causality-Respecting Adaptive Refinement for PINNs: Enabling Precise Interface Evolution in Phase Field Modeling
- Author
-
Wang, Wei, Wong, Tang Paai, Ruan, Haihui, and Goswami, Somdatta
- Subjects
Physics - Computational Physics ,Computer Science - Computational Engineering, Finance, and Science - Abstract
Physics-informed neural networks (PINNs) have emerged as a powerful tool for solving physical systems described by partial differential equations (PDEs). However, their accuracy in dynamical systems, particularly those involving sharp moving boundaries with complex initial morphologies, remains a challenge. This study introduces an approach combining residual-based adaptive refinement (RBAR) with causality-informed training to enhance the performance of PINNs in solving spatio-temporal PDEs. Our method employs a three-step iterative process: initial causality-based training, RBAR-guided domain refinement, and subsequent causality training on the refined mesh. Applied to the Allen-Cahn equation, a widely-used model in phase field simulations, our approach demonstrates significant improvements in solution accuracy and computational efficiency over traditional PINNs. Notably, we observe an 'overshoot and relocate' phenomenon in dynamic cases with complex morphologies, showcasing the method's adaptive error correction capabilities. This synergistic interaction between RBAR and causality training enables accurate capture of interface evolution, even in challenging scenarios where traditional PINNs fail. Our framework not only resolves the limitations of uniform refinement strategies but also provides a generalizable methodology for solving a broad range of spatio-temporal PDEs. The simplicity and effectiveness of our RBAR-causality combined PINN offer promising potential for applications across various physical systems characterized by complex, evolving interfaces., Comment: 22 Pages, 7 Figures
- Published
- 2024
12. Diffusion models for lattice gauge field simulations
- Author
-
Zhu, Qianteng, Aarts, Gert, Wang, Wei, Zhou, Kai, and Wang, Lingxiao
- Subjects
High Energy Physics - Lattice - Abstract
We develop diffusion models for lattice gauge theories which build on the concept of stochastic quantization. This framework is applied to $U(1)$ gauge theory in $1+1$ dimensions. We show that a model trained at one small inverse coupling can be effectively transferred to larger inverse coupling without encountering issues related to topological freezing, i.e., the model can generate configurations corresponding to different couplings by introducing the Boltzmann factors as physics conditions, while maintaining the correct physical distributions without any additional training. This demonstrates the potential of physics-conditioned diffusion models for efficient and flexible lattice gauge theory simulations., Comment: 7 pages, 3 figures, accepted at the NeurIPS 2024 workshop "Machine Learning and the Physical Sciences"
- Published
- 2024
13. Calculation of heavy meson light-cone distribution amplitudes from lattice QCD
- Author
-
Han, Xue-Ying, Hua, Jun, Ji, Xiangdong, Lü, Cai-Dian, Schäfer, Andreas, Su, Yushan, Wang, Wei, Xu, Ji, Yang, Yibo, Zhang, Jian-Hui, Zhang, Qi-An, and Zhao, Shuai
- Subjects
High Energy Physics - Lattice ,High Energy Physics - Phenomenology - Abstract
We develop an approach for calculating heavy quark effective theory (HQET) light-cone distribution amplitudes (LCDAs) by employing a sequential effective theory methodology. The theoretical foundation of the framework is established, elucidating how the quasi distribution amplitudes (quasi DAs) with three scales can be utilized to compute HQET LCDAs. We provide theoretical support for this approach by demonstrating the rationale behind devising a hierarchical ordering for the three involved scales, discussing the factorization at each step, clarifying the underlying reason for obtaining HQET LCDAs in the final phase, and addressing potential theoretical challenges. The lattice QCD simulation aspect is explored in detail, and the computations of quasi DAs are presented. We employ three fitting strategies to handle contributions from excited states and extract the bare matrix elements. For renormalization purposes, we apply hybrid renormalization schemes at short and long distance separations. To mitigate long-distance perturbations, we perform an extrapolation in $\lambda= z\cdot P^z$ and assess the stability against various parameters. After two-step matching, our results for HQET LCDAs are found in agreement with existing model parametrizations. The potential phenomenological implications of the results are discussed, shedding light on how these findings could impact our understanding of the strong interaction dynamics and physics beyond the standard model. It should be noted, however, that systematic uncertainties have not been accounted for yet., Comment: 27 pages, 23 figures
- Published
- 2024
14. SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness
- Author
-
Parekh, Tanmay, Kwan, Jeffrey, Yu, Jiarui, Johri, Sparsh, Ahn, Hyosang, Muppalla, Sreya, Chang, Kai-Wei, Wang, Wei, and Peng, Nanyun
- Subjects
Computer Science - Computation and Language ,Computer Science - Social and Information Networks - Abstract
Social media is often the first place where communities discuss the latest societal trends. Prior works have utilized this platform to extract epidemic-related information (e.g. infections, preventive measures) to provide early warnings for epidemic prediction. However, these works only focused on English posts, while epidemics can occur anywhere in the world, and early discussions are often in the local, non-English languages. In this work, we introduce the first multilingual Event Extraction (EE) framework SPEED++ for extracting epidemic event information for a wide range of diseases and languages. To this end, we extend a previous epidemic ontology with 20 argument roles; and curate our multilingual EE dataset SPEED++ comprising 5.1K tweets in four languages for four diseases. Annotating data in every language is infeasible; thus we develop zero-shot cross-lingual cross-disease models (i.e., training only on English COVID data) utilizing multilingual pre-training and show their efficacy in extracting epidemic-related events for 65 diverse languages across different diseases. Experiments demonstrate that our framework can provide epidemic warnings for COVID-19 in its earliest stages in Dec 2019 (3 weeks before global discussions) from Chinese Weibo posts without any training in Chinese. Furthermore, we exploit our framework's argument extraction capabilities to aggregate community epidemic discussions like symptoms and cure measures, aiding misinformation detection and public attention monitoring. Overall, we lay a strong foundation for multilingual epidemic preparedness., Comment: Accepted at EMNLP 2024
- Published
- 2024
15. Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection
- Author
-
He, Jianfei, Wang, Lilin, Wang, Jiaying, Liu, Zhenyu, Na, Hongbin, Wang, Zimu, Wang, Wei, and Chen, Qi
- Subjects
Computer Science - Computation and Language - Abstract
Identifying offensive language is essential for maintaining safety and sustainability in the social media era. Though large language models (LLMs) have demonstrated encouraging potential in social media analytics, they lack thorough evaluation when in offensive language detection, particularly in multilingual environments. We for the first time evaluate multilingual offensive language detection of LLMs in three languages: English, Spanish, and German with three LLMs, GPT-3.5, Flan-T5, and Mistral, in both monolingual and multilingual settings. We further examine the impact of different prompt languages and augmented translation data for the task in non-English contexts. Furthermore, we discuss the impact of the inherent bias in LLMs and the datasets in the mispredictions related to sensitive topics., Comment: Accepted at UIC 2024 proceedings. Accepted version
- Published
- 2024
16. Group Diffusion Transformers are Unsupervised Multitask Learners
- Author
-
Huang, Lianghua, Wang, Wei, Wu, Zhi-Fan, Dou, Huanzhang, Shi, Yupeng, Feng, Yutong, Liang, Chen, Liu, Yu, and Zhou, Jingren
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
While large language models (LLMs) have revolutionized natural language processing with their task-agnostic capabilities, visual generation tasks such as image translation, style transfer, and character customization still rely heavily on supervised, task-specific datasets. In this work, we introduce Group Diffusion Transformers (GDTs), a novel framework that unifies diverse visual generation tasks by redefining them as a group generation problem. In this approach, a set of related images is generated simultaneously, optionally conditioned on a subset of the group. GDTs build upon diffusion transformers with minimal architectural modifications by concatenating self-attention tokens across images. This allows the model to implicitly capture cross-image relationships (e.g., identities, styles, layouts, surroundings, and color schemes) through caption-based correlations. Our design enables scalable, unsupervised, and task-agnostic pretraining using extensive collections of image groups sourced from multimodal internet articles, image galleries, and video frames. We evaluate GDTs on a comprehensive benchmark featuring over 200 instructions across 30 distinct visual generation tasks, including picture book creation, font design, style transfer, sketching, colorization, drawing sequence generation, and character customization. Our models achieve competitive zero-shot performance without any additional fine-tuning or gradient updates. Furthermore, ablation studies confirm the effectiveness of key components such as data scaling, group size, and model design. These results demonstrate the potential of GDTs as scalable, general-purpose visual generation systems.
- Published
- 2024
17. A Hierarchical DRL Approach for Resource Optimization in Multi-RIS Multi-Operator Networks
- Author
-
Zhang, Haocheng, Wang, Wei, Zhou, Hao, Lu, Zhiping, and Li, Ming
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
As reconfigurable intelligent surfaces (RIS) emerge as a pivotal technology in the upcoming sixth-generation (6G) networks, their deployment within practical multiple operator (OP) networks presents significant challenges, including the coordination of RIS configurations among OPs, interference management, and privacy maintenance. A promising strategy is to treat RIS as a public resource managed by an RIS provider (RP), which can enhance resource allocation efficiency by allowing dynamic access for multiple OPs. However, the intricate nature of coordinating management and optimizing RIS configurations significantly complicates the implementation process. In this paper, we propose a hierarchical deep reinforcement learning (HDRL) approach that decomposes the complicated RIS resource optimization problem into several subtasks. Specifically, a top-level RP-agent is responsible for RIS allocation, while low-level OP-agents control their assigned RISs and handle beamforming, RIS phase-shifts, and user association. By utilizing the semi-Markov decision process (SMDP) theory, we establish a sophisticated interaction mechanism between the RP and OPs, and introduce an advanced hierarchical proximal policy optimization (HPPO) algorithm. Furthermore, we propose an improved sequential-HPPO (S-HPPO) algorithm to address the curse of dimensionality encountered with a single RP-agent. Experimental results validate the stability of the HPPO algorithm across various environmental parameters, demonstrating its superiority over other benchmarks for joint resource optimization. Finally, we conduct a detailed comparative analysis between the proposed S-HPPO and HPPO algorithms, showcasing that the S-HPPO algorithm achieves faster convergence and improved performance in large-scale RIS allocation scenarios.
- Published
- 2024
18. Self-Supervised Learning of Disentangled Representations for Multivariate Time-Series
- Author
-
Chang, Ching, Chan, Chiao-Tung, Wang, Wei-Yao, Peng, Wen-Chih, and Chen, Tien-Fu
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Multivariate time-series data in fields like healthcare and industry are informative but challenging due to high dimensionality and lack of labels. Recent self-supervised learning methods excel in learning rich representations without labels but struggle with disentangled embeddings and inductive bias issues like transformation-invariance. To address these challenges, we introduce TimeDRL, a framework for multivariate time-series representation learning with dual-level disentangled embeddings. TimeDRL features: (i) disentangled timestamp-level and instance-level embeddings using a [CLS] token strategy; (ii) timestamp-predictive and instance-contrastive tasks for representation learning; and (iii) avoidance of augmentation methods to eliminate inductive biases. Experiments on forecasting and classification datasets show TimeDRL outperforms existing methods, with further validation in semi-supervised settings with limited labeled data., Comment: This submission has been withdrawn to avoid duplication with a full version of the paper that is already available in another arXiv entry (arXiv:2410.12606). The withdrawn version was a short format prepared for a NeurIPS workshop and is no longer necessary as a separate arXiv submission
- Published
- 2024
19. FALCON: Pinpointing and Mitigating Stragglers for Large-Scale Hybrid-Parallel Training
- Author
-
Wu, Tianyuan, Wang, Wei, Yu, Yinghao, Yang, Siran, Wu, Wenchao, Duan, Qinkai, Yang, Guodong, Wang, Jiamang, Qu, Lin, and Zhang, Liping
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Operating Systems - Abstract
Fail-slows, or stragglers, are common but largely unheeded problems in large-scale hybrid-parallel training that spans thousands of GPU servers and runs for weeks to months. Yet, these problems are not well studied, nor can they be quickly detected and effectively mitigated. In this paper, we first present a characterization study on a shared production cluster with over 10,000 GPUs1. We find that fail-slows are caused by various CPU/GPU computation and cross-node networking issues, lasting from tens of seconds to nearly ten hours, and collectively delaying the average job completion time by 1.34%. The current practice is to manually detect these fail-slows and simply treat them as fail-stops using a checkpoint-and-restart failover approach, which are labor-intensive and time-consuming. In this paper, we propose FALCON, a framework that rapidly identifies fail-slowed GPUs and/or communication links, and effectively tackles them with a novel multi-level mitigation mechanism, all without human intervention. We have applied FALCON to detect human-labeled fail-slows in a production cluster with over 99% accuracy. Cluster deployment further demonstrates that FALCON effectively handles manually injected fail-slows, mitigating the training slowdown by 60.1%., Comment: 17 pages, 20 figures
- Published
- 2024
20. The Generation of All Regular Rational Orthogonal Matrices
- Author
-
Tang, Quanyu, Wang, Wei, and Zhang, Hao
- Subjects
Mathematics - Combinatorics ,05C50 - Abstract
A \emph{rational orthogonal matrix} $Q$ is an orthogonal matrix with rational entries, and $Q$ is called \emph{regular} if each of its row sum equals one, i.e., $Qe = e$ where $e$ is the all-one vector. This paper presents a method for generating all regular rational orthogonal matrices using the classic Cayley transformation. Specifically, we demonstrate that for any regular rational orthogonal matrix $Q$, there exists a permutation matrix $P$ such that $QP$ does not possess an eigenvalue of $-1$. Consequently, $Q$ can be expressed in the form $Q = (I_n + S)^{-1}(I_n - S)P$, where $I_n$ is the identity matrix of order $n$, $S$ is a rational skew-symmetric matrix satisfying $Se = 0$, and $P$ is a permutation matrix. Central to our approach is a pivotal intermediate result, which holds independent interest: given a square matrix $M$, then $MP$ has $-1$ as an eigenvalue for every permutation matrix $P$ if and only if either every row sum of $M$ is $-1$ or every column sum of $M$ is $-1$.
- Published
- 2024
21. Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models
- Author
-
Yao, Kai, Gao, Penlei, Li, Lichun, Zhao, Yuan, Wang, Xiaofeng, Wang, Wei, and Zhu, Jianke
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Parameter-Efficient Fine-Tuning (PEFT) methods have gained significant popularity for adapting pre-trained Large Language Models (LLMs) to downstream tasks, primarily due to their potential to significantly reduce memory and computational overheads. However, a common limitation in most PEFT approaches is their application of a uniform architectural design across all layers. This uniformity involves identical trainable modules and ignores the varying importance of each layer, leading to sub-optimal fine-tuning results. To overcome the above limitation and obtain better performance, we develop a novel approach, Importance-aware Sparse Tuning (IST), to fully utilize the inherent sparsity and select the most important subset of full layers with effective layer-wise importance scoring. The proposed IST is a versatile and plug-and-play technique compatible with various PEFT methods that operate on a per-layer basis. By leveraging the estimated importance scores, IST dynamically updates these selected layers in PEFT modules, leading to reduced memory demands. We further provide theoretical proof of convergence and empirical evidence of superior performance to demonstrate the advantages of IST over uniform updating strategies. Extensive experiments on a range of LLMs, PEFTs, and downstream tasks substantiate the effectiveness of our proposed method, showcasing IST's capacity to enhance existing layer-based PEFT methods. Our code is available at https://github.com/Kaiseem/IST., Comment: EMNLP 2024
- Published
- 2024
22. Parameter estimation of structural dynamics with neural operators enabled surrogate modeling
- Author
-
Zhou, Mingyuan, Song, Haoze, Ye, Wenjing, Wang, Wei, and Lai, Zhilu
- Subjects
Computer Science - Computational Engineering, Finance, and Science ,Physics - Data Analysis, Statistics and Probability - Abstract
Parameter estimation generally involves inferring the values of mathematical models derived from first principles or expert knowledge, which is challenging for complex structural systems. In this work, we present a unified deep learning-based framework for parameterization, forward modeling, and inverse modeling of structural dynamics. The parameterization is flexible and can be user-defined, including physical and/or non-physical (customized) parameters. In forward modeling, we train a neural operator for response prediction -- forming a surrogate model, which leverages the defined system parameters and excitation forces as inputs. The inverse modeling focuses on estimating system parameters. In particular, the learned forward surrogate model (which is differentiable) is utilized for preliminary parameter estimation via gradient-based optimization; to further boost the parameter estimation, we introduce a neural refinement method to mitigate ill-posed problems, which often occur in the former. The framework's effectiveness is verified numerically and experimentally, in both interpolation and extrapolation cases, indicating its capability to capture intrinsic dynamics of structural systems from both forward and inverse perspectives. Moreover, the framework's flexibility is expected to support a wide range of applications, including surrogate modeling, structural identification, damage detection, and inverse design of structural systems.
- Published
- 2024
23. Slide-based Graph Collaborative Training for Histopathology Whole Slide Image Analysis
- Author
-
Shi, Jun, Shu, Tong, Jiang, Zhiguo, Wang, Wei, Wu, Haibo, and Zheng, Yushan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The development of computational pathology lies in the consensus that pathological characteristics of tumors are significant guidance for cancer diagnostics. Most existing research focuses on the inner-contextual information within each WSI yet ignores the possible inter-correlations between slides. As the development of tumors is a continuous process involving a series of histological, morphological, and genetic changes that accumulate over time, the similarities and differences between WSIs across various stages, grades, locations and patients should potentially contribute to the representation of WSIs and deserve to be taken into account in WSI modeling. To verify the advancement of introducing the slide inter-correlations into the representation learning of WSIs, we proposed a generic WSI analysis pipeline SlideGCD that can be adapted to any existing Multiple Instance Learning (MIL) frameworks and improve their performance. With the new paradigm, the prior knowledge of cancer development can participate in the end-to-end workflow, which concurrently initializes and refines the slide representation, as a guide for message passing in the slide-based graph. Extensive comparisons and experiments are conducted to validate the effectiveness and robustness of the proposed pipeline across 4 different tasks, including cancer subtyping, cancer staging, survival prediction, and gene mutation prediction, with 7 representative SOTA WSI analysis frameworks as backbones.
- Published
- 2024
24. The Stellar Abundances and Galactic Evolution Survey (SAGES) III -- The g/r/i-band Data Release
- Author
-
Li, Chun, Fan, Zhou, Zhao, Gang, Wang, Wei, Zheng, Jie, Tan, Kefeng, Zhao, Jingkun, Huang, Yang, Yuan, Haibo, Xiao, Kai, Chen, Yuqin, Li, Haining, Liu, Yujuan, Song, Nan, Esamdin, Ali, Niu, Hu-Biao, Liu, Jin-Zhong, and Feng, Guo-Jie
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - Solar and Stellar Astrophysics - Abstract
The Stellar Abundances and Galactic Evolution Survey (SAGES) is a multi-band survey that covers the northern sky area of ~12000 deg2. Nanshan One-meter Wide-field Telescope (NOWT) of Xinjiang Astronomical Observatory (XAO) carried out observations on g/r/i bands. We present here the survey strategy, data processing, catalog construction, and database schema. The observations of NOWT started in 2016 August and was completed in 2018 January, total 17827 frames were obtained and ~4600 deg2 sky areas were covered. In this paper, we released the catalog of the data in the g/r/i bands observed with NOWT. In total, there are 109,197,578 items of the source records. The catalog is the supplement for the SDSS for the bright end, and the combination of our catalog and these catalogs could be helpful for source selections for other surveys and the Milky Way sciences, e.g., white dwarf candidates and stellar flares., Comment: 12 pages, 8 figures, accepted for publication in RAA
- Published
- 2024
25. Annihilating polynomial, Jordan canonical from, and generalized spectral characterizations of Eulerian graphs
- Author
-
Li, Kunyue, Wang, Wei, and Zhang, Hao
- Subjects
Mathematics - Combinatorics ,05C50 - Abstract
Let $G$ be an Eulerian graph on $n$ vertices with adjacency matrix $A$ and characteristic polynomial $\phi(x)$. We show that when $n$ is even (resp. odd), the square-root of $\phi(x)$ (resp. $x\phi(x)$) is an annihilating polynomial of $A$, over $\mathbb{F}_2$. The result was achieved by applying the Jordan canonical form of $A$ over the algebraic closure $\bar{\mathbb{F}}_2$. Based on this, we show a family of Eulerian graphs are determined by their generalized spectrum among all Eulerian graphs, which significantly simplifies and strengthens the previous result.
- Published
- 2024
26. On Positive and Negative $r$-th Power Energy of Graphs with Edge Addition
- Author
-
Tang, Quanyu, Liu, Yinchen, and Wang, Wei
- Subjects
Mathematics - Combinatorics ,05C50 - Abstract
In this paper, we investigate the positive and negative $r$-th power energy of graphs and their behavior under edge addition. Specifically, we extend the classical notions of positive and negative square energies to the $r$-th power energies, denoted as $s^{+}_r(G)$ and $s^{-}_r(G)$, respectively. We derive bounds for $s^{+}_r(G)$ and $s^{-}_r(G)$ under edge addition, which provide tighter bounds when $r=2$ compared to those of Abiad et al. Moreover, we address the problem of monotonicity of the positive $r$-th power energy under edge addition, providing a family of counterexamples for \( 1 \leq r < 3 \). Finally, three related conjectures are also proposed. One of them is a reformulation of Guo's conjecture, we believe the monotonicity property holds for $r\geq 3$; the other two generalize a conjecture of Elphick et al.
- Published
- 2024
27. A new criterion for oriented graphs to be determined by their generalized skew spectrum
- Author
-
Chao, Yiquan, Wang, Wei, and Zhang, Hao
- Subjects
Mathematics - Combinatorics ,05C50 - Abstract
Spectral characterizations of graphs is an important topic in spectral graph theory which has been studied extensively by researchers in recent years. The study of oriented graphs, however, has received less attention so far. In Qiu et al.~\cite{QWW} (Linear Algebra Appl. 622 (2021) 316-332), the authors gave an arithmetic criterion for an oriented graph to be determined by its \emph{generalized skew spectrum} (DGSS for short). More precisely, let $\Sigma$ be an $n$-vertex oriented graph with skew adjacency matrix $S$ and $W(\Sigma)=[e,Se,\ldots,S^{n-1}e]$ be the \emph{walk-matrix} of $\Sigma$, where $e$ is the all-one vector. A theorem of Qiu et al.~\cite{QWW} shows that a self-converse oriented graph $\Sigma$ is DGSS, provided that the Smith normal form of $W(\Sigma)$ is ${\rm diag}(1,\ldots,1,2,\ldots,2,2d)$, where $d$ is an odd and square-free integer and the number of $1$'s appeared in the diagonal is precisely $\lceil \frac{n}{2}\rceil$. In this paper, we show that the above square-freeness assumptions on $d$ can actually be removed, which significantly improves upon the above theorem. Our new ingredient is a key intermediate result, which is of independent interest: for a self-converse oriented graphs $\Sigma$ and an odd prime $p$, if the rank of $W(\Sigma)$ is $n-1$ over $\mathbb{F}_p$, then the kernel of $W(\Sigma)^{\rm T}$ over $\mathbb{F}_p$ is \emph{anisotropic}, i.e., $v^{\rm T}v\neq 0$ for any $0\ne v\in{{\rm ker}\,W(\Sigma)^{\rm T}}$ over $\mathbb{F}_p$.
- Published
- 2024
28. Bi-temporal Gaussian Feature Dependency Guided Change Detection in Remote Sensing Images
- Author
-
Xiao, Yi, Luo, Bin, Liu, Jun, Su, Xin, and Wang, Wei
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Change Detection (CD) enables the identification of alterations between images of the same area captured at different times. However, existing CD methods still struggle to address pseudo changes resulting from domain information differences in multi-temporal images and instances of detail errors caused by the loss and contamination of detail features during the upsampling process in the network. To address this, we propose a bi-temporal Gaussian distribution feature-dependent network (BGFD). Specifically, we first introduce the Gaussian noise domain disturbance (GNDD) module, which approximates distribution using image statistical features to characterize domain information, samples noise to perturb the network for learning redundant domain information, addressing domain information differences from a more fundamental perspective. Additionally, within the feature dependency facilitation (FDF) module, we integrate a novel mutual information difference loss ($L_{MI}$) and more sophisticated attention mechanisms to enhance the capabilities of the network, ensuring the acquisition of essential domain information. Subsequently, we have designed a novel detail feature compensation (DFC) module, which compensates for detail feature loss and contamination introduced during the upsampling process from the perspectives of enhancing local features and refining global features. The BGFD has effectively reduced pseudo changes and enhanced the detection capability of detail information. It has also achieved state-of-the-art performance on four publicly available datasets - DSIFN-CD, SYSU-CD, LEVIR-CD, and S2Looking, surpassing baseline models by +8.58%, +1.28%, +0.31%, and +3.76% respectively, in terms of the F1-Score metric.
- Published
- 2024
29. Score Neural Operator: A Generative Model for Learning and Generalizing Across Multiple Probability Distributions
- Author
-
Liao, Xinyu, Qin, Aoyang, Seidman, Jacob, Wang, Junqi, Wang, Wei, and Perdikaris, Paris
- Subjects
Computer Science - Machine Learning - Abstract
Most existing generative models are limited to learning a single probability distribution from the training data and cannot generalize to novel distributions for unseen data. An architecture that can generate samples from both trained datasets and unseen probability distributions would mark a significant breakthrough. Recently, score-based generative models have gained considerable attention for their comprehensive mode coverage and high-quality image synthesis, as they effectively learn an operator that maps a probability distribution to its corresponding score function. In this work, we introduce the $\emph{Score Neural Operator}$, which learns the mapping from multiple probability distributions to their score functions within a unified framework. We employ latent space techniques to facilitate the training of score matching, which tends to over-fit in the original image pixel space, thereby enhancing sample generation quality. Our trained Score Neural Operator demonstrates the ability to predict score functions of probability measures beyond the training space and exhibits strong generalization performance in both 2-dimensional Gaussian Mixture Models and 1024-dimensional MNIST double-digit datasets. Importantly, our approach offers significant potential for few-shot learning applications, where a single image from a new distribution can be leveraged to generate multiple distinct images from that distribution.
- Published
- 2024
30. GIVE: Structured Reasoning with Knowledge Graph Inspired Veracity Extrapolation
- Author
-
He, Jiashu, Ma, Mingyu Derek, Fan, Jinxuan, Roth, Dan, Wang, Wei, and Ribeiro, Alejandro
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
Existing retrieval-based reasoning approaches for large language models (LLMs) heavily rely on the density and quality of the non-parametric knowledge source to provide domain knowledge and explicit reasoning chain. However, inclusive knowledge sources are expensive and sometimes infeasible to build for scientific or corner domains. To tackle the challenges, we introduce Graph Inspired Veracity Extrapolation (GIVE), a novel reasoning framework that integrates the parametric and non-parametric memories to enhance both knowledge retrieval and faithful reasoning processes on very sparse knowledge graphs. By leveraging the external structured knowledge to inspire LLM to model the interconnections among relevant concepts, our method facilitates a more logical and step-wise reasoning approach akin to experts' problem-solving, rather than gold answer retrieval. Specifically, the framework prompts LLMs to decompose the query into crucial concepts and attributes, construct entity groups with relevant entities, and build an augmented reasoning chain by probing potential relationships among node pairs across these entity groups. Our method incorporates both factual and extrapolated linkages to enable comprehensive understanding and response generation. Extensive experiments on reasoning-intense benchmarks on biomedical and commonsense QA demonstrate the effectiveness of our proposed method. Specifically, GIVE enables GPT3.5-turbo to outperform advanced models like GPT4 without any additional training cost, thereby underscoring the efficacy of integrating structured information and internal reasoning ability of LLMs for tackling specialized tasks with limited external resources.
- Published
- 2024
31. Revealing COVID-19's Social Dynamics: Diachronic Semantic Analysis of Vaccine and Symptom Discourse on Twitter
- Author
-
Wang, Zeqiang, Wu, Jiageng, Wang, Yuqi, Wang, Wei, Yang, Jie, Johnson, Jon, Sastry, Nishanth, and De, Suparna
- Subjects
Computer Science - Computation and Language ,Computer Science - Information Retrieval ,Computer Science - Social and Information Networks - Abstract
Social media is recognized as an important source for deriving insights into public opinion dynamics and social impacts due to the vast textual data generated daily and the 'unconstrained' behavior of people interacting on these platforms. However, such analyses prove challenging due to the semantic shift phenomenon, where word meanings evolve over time. This paper proposes an unsupervised dynamic word embedding method to capture longitudinal semantic shifts in social media data without predefined anchor words. The method leverages word co-occurrence statistics and dynamic updating to adapt embeddings over time, addressing the challenges of data sparseness, imbalanced distributions, and synergistic semantic effects. Evaluated on a large COVID-19 Twitter dataset, the method reveals semantic evolution patterns of vaccine- and symptom-related entities across different pandemic stages, and their potential correlations with real-world statistics. Our key contributions include the dynamic embedding technique, empirical analysis of COVID-19 semantic shifts, and discussions on enhancing semantic shift modeling for computational social science research. This study enables capturing longitudinal semantic dynamics on social media to understand public discourse and collective phenomena.
- Published
- 2024
32. Representations of non-finitely graded Heisenberg-Virasoro type Lie algebras
- Author
-
Xia, Chunguang, Ma, Tianyu, Wang, Wei, and Zhang, Mingjing
- Subjects
Mathematics - Representation Theory - Abstract
We construct and study non-finitely graded Lie algebras $\mathcal{HV}(a,b;\epsilon)$ related to Heisenberg-Virasoro type Lie algebras, where $a,b$ are complex numbers, and $\epsilon = \pm 1$. Using combinatorial techniques, we completely classify the free $\mathcal{U}(\mathfrak h)$-modules of rank one over $\mathcal{HV}(a,b;\epsilon)$. It turns out that these modules are more varied and complex than those over non-finitely graded Virasoro algebras, and in particular admit infinitely many free parameters if $b=1$ and $\epsilon=-1$. Meanwhile, we also determine the simplicity and isomorphism classes of these modules., Comment: 19 pages
- Published
- 2024
33. Discovery of Two New Eruptions of the Ultrashort Recurrence Time Nova M31N 2017-01e
- Author
-
Shafter, Allen W., Zhao, Jingyuan, Hornoch, Kamil, Kučáková, Hana, Taguchi, Kenta, Zhang, Jiashuo, You, Jia, Wang, Binyu, Xu, Runwei, Wang, Weiye, Ren, Yuqing, Ding, Lanhe, Yan, Xiaochang, Zhang, Mi, Wang, Wei-Hao, Bond, Howard E., Williams, Robert, and Zeimann, Gregory R.
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - Astrophysics of Galaxies ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
We report the recent discovery of two new eruptions of the recurrent nova M31N 2017-01e in the Andromeda galaxy. The latest eruption, M31N 2024-08c, reached $R=17.8$ on 2024 August 06.85 UT, $\sim2$ months earlier than predicted. In addition to this recent eruption, a search of archival PTF data has revealed a previously unreported eruption on 2014 June 18.46 UT that reached a peak brightness of $R\sim17.9$ approximately a day later. The addition of these two eruption timings has allowed us to update the mean recurrence time of the nova. We find $\langle T_\mathrm{rec} \rangle = 924.0\pm7.0$ days ($2.53\pm0.02$ yr), which is slightly shorter than our previous determination. Thus, M31N 2017-01e remains the nova with the second shortest recurrence time known, with only M31N 2008-12a being shorter. We also present a low-resolution spectrum of the likely quiescent counterpart of the nova, a $\sim20.5$ mag evolved B star displaying an $\sim14.3$ d photometric modulation., Comment: 6 pages; 1 multi-panel figure; 1 table; expanded references; accepted for publication in the Research Notes of the AAS
- Published
- 2024
34. Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling
- Author
-
Huang, Zijie, Zhao, Wanjia, Gao, Jingdong, Hu, Ziniu, Luo, Xiao, Cao, Yadi, Chen, Yuanzhou, Sun, Yizhou, and Wang, Wei
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Learning complex physical dynamics purely from data is challenging due to the intrinsic properties of systems to be satisfied. Incorporating physics-informed priors, such as in Hamiltonian Neural Networks (HNNs), achieves high-precision modeling for energy-conservative systems. However, real-world systems often deviate from strict energy conservation and follow different physical priors. To address this, we present a framework that achieves high-precision modeling for a wide range of dynamical systems from the numerical aspect, by enforcing Time-Reversal Symmetry (TRS) via a novel regularization term. It helps preserve energies for conservative systems while serving as a strong inductive bias for non-conservative, reversible systems. While TRS is a domain-specific physical prior, we present the first theoretical proof that TRS loss can universally improve modeling accuracy by minimizing higher-order Taylor terms in ODE integration, which is numerically beneficial to various systems regardless of their properties, even for irreversible systems. By integrating the TRS loss within neural ordinary differential equation models, the proposed model TREAT demonstrates superior performance on diverse physical systems. It achieves a significant 11.5% MSE improvement in a challenging chaotic triple-pendulum scenario, underscoring TREAT's broad applicability and effectiveness., Comment: Accepted to The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
- Published
- 2024
35. Concurrent-Learning Based Relative Localization in Shape Formation of Robot Swarms
- Author
-
Lü, Jinhu, Ze, Kunrui, Yue, Shuoyu, Liu, Kexin, Wang, Wei, and Sun, Guibin
- Subjects
Computer Science - Robotics ,Computer Science - Multiagent Systems - Abstract
In this paper, we address the shape formation problem for massive robot swarms in environments where external localization systems are unavailable. Achieving this task effectively with solely onboard measurements is still scarcely explored and faces some practical challenges. To solve this challenging problem, we propose the following novel results. Firstly, to estimate the relative positions among neighboring robots, a concurrent-learning based estimator is proposed. It relaxes the persistent excitation condition required in the classical ones such as least-square estimator. Secondly, we introduce a finite-time agreement protocol to determine the shape location. This is achieved by estimating the relative position between each robot and a randomly assigned seed robot. The initial position of the seed one marks the shape location. Thirdly, based on the theoretical results of the relative localization, a novel behavior-based control strategy is devised. This strategy not only enables adaptive shape formation of large group of robots but also enhances the observability of inter-robot relative localization. Numerical simulation results are provided to verify the performance of our proposed strategy compared to the state-of-the-art ones. Additionally, outdoor experiments on real robots further demonstrate the practical effectiveness and robustness of our methods.
- Published
- 2024
36. Document-level Causal Relation Extraction with Knowledge-guided Binary Question Answering
- Author
-
Wang, Zimu, Xia, Lei, Wang, Wei, and Du, Xinya
- Subjects
Computer Science - Computation and Language - Abstract
As an essential task in information extraction (IE), Event-Event Causal Relation Extraction (ECRE) aims to identify and classify the causal relationships between event mentions in natural language texts. However, existing research on ECRE has highlighted two critical challenges, including the lack of document-level modeling and causal hallucinations. In this paper, we propose a Knowledge-guided binary Question Answering (KnowQA) method with event structures for ECRE, consisting of two stages: Event Structure Construction and Binary Question Answering. We conduct extensive experiments under both zero-shot and fine-tuning settings with large language models (LLMs) on the MECI and MAVEN-ERE datasets. Experimental results demonstrate the usefulness of event structures on document-level ECRE and the effectiveness of KnowQA by achieving state-of-the-art on the MECI dataset. We observe not only the effectiveness but also the high generalizability and low inconsistency of our method, particularly when with complete event structures after fine-tuning the models., Comment: Accepted at Findings of EMNLP 2024. Camera-ready version
- Published
- 2024
37. Hyper-multi-step: The Truth Behind Difficult Long-context Tasks
- Author
-
Yu, Yijiong, Xiufa, Ma, Jianwei, Fang, Xu, Zhi, Guangyao, Su, Jiancheng, Wang, Huang, Yongfeng, Qi, Zhixiao, Wang, Wei, Liu, Weifeng, Chen, Ran, and Pei, Ji
- Subjects
Computer Science - Computation and Language - Abstract
Long-context language models (LCLM), characterized by their extensive context window, is becoming increasingly popular. Meanwhile, many long-context benchmarks present challenging tasks that even the most advanced LCLMs struggle to complete. However, the underlying sources of various challenging long-context tasks have seldom been studied. To bridge this gap, we conduct experiments to indicate their difficulty stems primarily from two basic issues: "multi-matching retrieval," which requires the simultaneous retrieval of multiple items, and "logic-based retrieval," which necessitates logical judgment within retrieval criteria. These two problems, while seemingly straightforward, actually exceed the capabilities of LCLMs because they are proven to be hyper-multi-step (demanding numerous steps to solve) in nature. This finding could explain why LLMs struggle with more advanced long-context tasks, providing a more accurate perspective for rethinking solutions for them., Comment: Our code is publicly available at https://github.com/yuyijiong/hard_retrieval_for_llm and the datasets is at https://huggingface.co/datasets/yuyijiong/difficult_retrieval
- Published
- 2024
38. Deep Learning-based Automated Diagnosis of Obstructive Sleep Apnea and Sleep Stage Classification in Children Using Millimeter-wave Radar and Pulse Oximeter
- Author
-
Wang, Wei, Song, Ruobing, Wu, Yunxiao, Zheng, Li, Zhang, Wenyu, Chen, Zhaoxi, Li, Gang, and Xu, Zhifei
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Study Objectives: To evaluate the agreement between the millimeter-wave radar-based device and polysomnography (PSG) in diagnosis of obstructive sleep apnea (OSA) and classification of sleep stage in children. Methods: 281 children, aged 1 to 18 years, who underwent sleep monitoring between September and November 2023 at the Sleep Center of Beijing Children's Hospital, Capital Medical University, were recruited in the study. All enrolled children underwent sleep monitoring by PSG and the millimeter-wave radar-based device, QSA600, simultaneously. QSA600 recordings were automatically analyzed using a deep learning model meanwhile the PSG data was manually scored. Results: The Obstructive Apnea-Hypopnea Index (OAHI) obtained from QSA600 and PSG demonstrates a high level of agreement with an intraclass correlation coefficient of 0.945 (95% CI: 0.93 to 0.96). Bland-Altman analysis indicates that the mean difference of OAHI between QSA600 and PSG is -0.10 events/h (95% CI: -11.15 to 10.96). The deep learning model evaluated through cross-validation showed good sensitivity (81.8%, 84.3% and 89.7%) and specificity (90.5%, 95.3% and 97.1%) values for diagnosing children with OAHI>1, OAHI>5 and OAHI>10. The area under the receiver operating characteristic curve is 0.923, 0.955 and 0.988, respectively. For sleep stage classification, the model achieved Kappa coefficients of 0.854, 0.781, and 0.734, with corresponding overall accuracies of 95.0%, 84.8%, and 79.7% for Wake-sleep classification, Wake-REM-Light-Deep classification, and Wake-REM-N1-N2 N3 classification, respectively. Conclusions: QSA600 has demonstrated high agreement with PSG in diagnosing OSA and performing sleep staging in children. The device is portable, low-load and suitable for follow up and long-term pediatric sleep assessment.
- Published
- 2024
39. Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter
- Author
-
Wang, Wei, Li, Chenyang, Chen, Zhaoxi, Zhang, Wenyu, Wang, Zetao, Guo, Xi, Guan, Jian, and Li, Gang
- Subjects
Electrical Engineering and Systems Science - Signal Processing - Abstract
Obstructive Sleep Apnea-Hypopnea Syndrome (OSAHS) is a sleep-related breathing disorder associated with significant morbidity and mortality worldwide. The gold standard for OSAHS diagnosis, polysomnography (PSG), faces challenges in popularization due to its high cost and complexity. Recently, radar has shown potential in detecting sleep apnea-hypopnea events (SAE) with the advantages of low cost and non-contact monitoring. However, existing studies, especially those using deep learning, employ segment-based classification approach for SAE detection, making the task of event quantity estimation difficult. Additionally, radar-based SAE detection is susceptible to interference from body movements and the environment. Oxygen saturation (SpO2) can offer valuable information about OSAHS, but it also has certain limitations and cannot be used alone for diagnosis. In this study, we propose a method using millimeter-wave radar and pulse oximeter to detect SAE, called ROSA. It fuses information from both sensors, and directly predicts the temporal localization of SAE. Experimental results demonstrate a high degree of consistency (ICC=0.9864) between AHI from ROSA and PSG. This study presents an effective method with low-load device for the diagnosis of OSAHS.
- Published
- 2024
40. Magnon-mediated exciton-exciton interaction in a van der Waals antiferromagnet
- Author
-
Datta, Biswajit, Adak, Pratap Chandra, Yu, Sichao, Dharmapalan, Agneya V., Hall, Siedah J., Vakulenko, Anton, Komissarenko, Filipp, Kurganov, Egor, Quan, Jiamin, Wang, Wei, Mosina, Kseniia, Sofer, Zdeněk, Pashov, Dimitar, van Schilfgaarde, Mark, Acharya, Swagata, Kamra, Akashdeep, Sfeir, Matthew Y., Alù, Andrea, Khanikaev, Alexander B., and Menon, Vinod M.
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
Excitons are fundamental excitations that govern the optical properties of semiconductors. Interacting excitons can lead to various emergent phases of matter and large nonlinear optical responses. In most semiconductors, excitons interact via exchange interaction or phase space filling. Correlated materials that host excitons coupled to other degrees of freedom offer hitherto unexplored pathways for controlling these interactions. Here, we demonstrate magnon-mediated excitonic interactions in CrSBr, an antiferromagnetic semiconductor. This interaction manifests as the dependence of exciton energy on exciton density via a magnonic adjustment of the spin canting angle. Our study demonstrates the emergence of quasiparticle-mediated interactions in correlated quantum materials, leading to large nonlinear optical responses and potential device concepts such as magnon-mediated quantum transducers., Comment: 33 pages, 14 figures
- Published
- 2024
41. Dark Miner: Defend against unsafe generation for text-to-image diffusion models
- Author
-
Meng, Zheling, Peng, Bo, Jin, Xiaochuan, Jiang, Yue, Dong, Jing, Wang, Wei, and Tan, Tieniu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Text-to-image diffusion models have been demonstrated with unsafe generation due to unfiltered large-scale training data, such as violent, sexual, and shocking images, necessitating the erasure of unsafe concepts. Most existing methods focus on modifying the generation probabilities conditioned on the texts containing unsafe descriptions. However, they fail to guarantee safe generation for unseen texts in the training phase, especially for the prompts from adversarial attacks. In this paper, we re-analyze the erasure task and point out that existing methods cannot guarantee the minimization of the total probabilities of unsafe generation. To tackle this problem, we propose Dark Miner. It entails a recurring three-stage process that comprises mining, verifying, and circumventing. It greedily mines embeddings with maximum generation probabilities of unsafe concepts and reduces unsafe generation more effectively. In the experiments, we evaluate its performance on two inappropriate concepts, two objects, and two styles. Compared with 6 previous state-of-the-art methods, our method achieves better erasure and defense results in most cases, especially under 4 state-of-the-art attacks, while preserving the model's native generation capability. Our code will be available on GitHub.
- Published
- 2024
42. Exploring Nucleon Structure through Sub-threshold $\phi$-Meson Photoproduction at an Electron-Positron Collider
- Author
-
Wang, Wei, Xu, Ji, Yang, Xing-Hua, Zhang, Ya-Teng, and Zhao, Shuai
- Subjects
High Energy Physics - Phenomenology - Abstract
We propose to investigate short-range correlations (SRCs) in nuclei by studying sub-threshold photoproduction of $\phi$ particles in an electron-positron collision experiment. We present a direct experimental signature for SRCs, which is deemed achievable using the Beijing Spectrometer III (BESIII). The cross sections for sub-threshold production, as well as the likelihood of detection by BESIII, are calculated. These results underscore the substantial potential of BESIII in elucidating the fundamental physics behind the nuclear modification of parton distribution functions. This proposed experimental analysis of photon-nucleon interactions in electron-positron collisions represents uncharted territory, promising fresh prospects for applications in both particle and nuclear physics., Comment: 6 pages, 4 figures
- Published
- 2024
43. Vortex Interference Enables optimal 3D Interferometric Nanoscopy
- Author
-
Wang, Wei, Huang, Zengxin, Wang, Yilin, Li, Hangfeng, and Kanchanawong, Pakorn
- Subjects
Physics - Optics - Abstract
Super-resolution imaging methods that combine interferometric (z) analysis with single-molecule localization microscopy (iSMLM) have achieved ultra-high 3D precision and contributed to the elucidation of important biological ultrastructures. However, their dependence on imaging multiple phase-shifted output channels necessitates complex instrumentation and operation. To solve this problem, we develop an interferometric super-resolution microscope capable of optimal direct axial nanoscopy, termed VILM (Vortex Interference Localization Microscopy). Using a pair of vortex phase plates with opposite orientation for each dual-opposed objective lenses, the detection point-spread functions (PSFs) adopt a bilobed profile whose rotation encodes the axial position. Thus, direct 3D single-molecule coordinate determination can be achieved with a single output image. By reducing the number of output channels to as few as one and utilizing a simple 50:50 beamsplitter, the imaging system is significantly streamlined, while the optimal iSMLM imaging performance is retained, with axial resolution ~2 times better than the lateral. The capability of VILM is demonstrated by resolving the architecture of microtubules and probing the organization of tyrosine-phosphorylated signalling proteins in integrin-based cell adhesions.
- Published
- 2024
44. Flexible Swapping for the Cloud
- Author
-
Pandurov, Milan, Humbel, Lukas, Sepp, Dmitry, Ttofari, Adamos, Thomm, Leon, Quoc, Do Le, Chandrasekaran, Siddharth, Santhanam, Sharan, Ye, Chuan, Bergman, Shai, Wang, Wei, Lundgren, Sven, Sagonas, Konstantinos, and Ros, Alberto
- Subjects
Computer Science - Distributed, Parallel, and Cluster Computing ,Computer Science - Operating Systems ,D.4.2 - Abstract
Memory has become the primary cost driver in cloud data centers. Yet, a significant portion of memory allocated to VMs in public clouds remains unused. To optimize this resource, "cold" memory can be reclaimed from VMs and stored on slower storage or compressed, enabling memory overcommit. Current overcommit systems rely on general-purpose OS swap mechanisms, which are not optimized for virtualized workloads, leading to missed memory-saving opportunities and ineffective use of optimizations like prefetchers. This paper introduces a userspace memory management framework designed for VMs. It enables custom policies that have full control over the virtual machines' memory using a simple userspace API, supports huge page-based swapping to satisfy VM performance requirements, is easy to deploy by leveraging Linux/KVM, and supports zero-copy I/O virtualization with shared VM memory. Our evaluation demonstrates that an overcommit system based on our framework outperforms the state-of-the-art solutions on both micro-benchmarks and commonly used cloud workloads. Specifically our implementation outperforms the Linux Kernel baseline implementation by up to 25% while saving a similar amount of memory. We also demonstrate the benefits of custom policies by implementing workload-specific reclaimers and prefetchers that save $10\%$ additional memory, improve performance in a limited memory scenario by 30% over the Linux baseline, and recover faster from hard limit releases., Comment: 13 pages, 13 figures
- Published
- 2024
45. Towards LifeSpan Cognitive Systems
- Author
-
Wang, Yu, Han, Chi, Wu, Tongtong, He, Xiaoxin, Zhou, Wangchunshu, Sadeq, Nafis, Chen, Xiusi, He, Zexue, Wang, Wei, Haffari, Gholamreza, Ji, Heng, and McAuley, Julian
- Subjects
Computer Science - Computation and Language - Abstract
Building a human-like system that continuously interacts with complex environments -- whether simulated digital worlds or human society -- presents several key challenges. Central to this is enabling continuous, high-frequency interactions, where the interactions are termed experiences. We refer to this envisioned system as the LifeSpan Cognitive System (LSCS). A critical feature of LSCS is its ability to engage in incremental and rapid updates while retaining and accurately recalling past experiences. We identify two major challenges in achieving this: (1) Abstraction and Experience Merging, and (2) Long-term Retention with Accurate Recall. These properties are essential for storing new experiences, organizing past experiences, and responding to the environment in ways that leverage relevant historical data. Unlike language models with continual learning, which typically rely on large corpora for fine-tuning and focus on improving performance within specific domains or tasks, LSCS must rapidly and incrementally update with new information from its environment at a high frequency. Existing technologies with the potential of solving the above two major challenges can be classified into four classes based on a conceptual metric called Storage Complexity, which measures the relative space required to store past experiences. Each of these four classes of technologies has its own strengths and limitations. Given that none of the existing technologies can achieve LSCS alone, we propose a novel paradigm for LSCS that integrates all four classes of technologies. The new paradigm operates through two core processes: Absorbing Experiences and Generating Responses.
- Published
- 2024
46. Schrodinger's Memory: Large Language Models
- Author
-
Wang, Wei and Li, Qing
- Subjects
Computer Science - Computation and Language - Abstract
Memory is the foundation of all human activities; without memory, it would be nearly impossible for people to perform any task in daily life. With the development of Large Language Models (LLMs), their language capabilities are becoming increasingly comparable to those of humans. But do LLMs have memory? Based on current performance, LLMs do appear to exhibit memory. So, what is the underlying mechanism of this memory? Previous research has lacked a deep exploration of LLMs' memory capabilities and the underlying theory. In this paper, we use Universal Approximation Theorem (UAT) to explain the memory mechanism in LLMs. We also conduct experiments to verify the memory capabilities of various LLMs, proposing a new method to assess their abilities based on these memory ability. We argue that LLM memory operates like Schr\"odinger's memory, meaning that it only becomes observable when a specific memory is queried. We can only determine if the model retains a memory based on its output in response to the query; otherwise, it remains indeterminate. Finally, we expand on this concept by comparing the memory capabilities of the human brain and LLMs, highlighting the similarities and differences in their operational mechanisms.
- Published
- 2024
47. Communication-Assisted Sensing Systems: Fundamental Limits and ISAC Waveform Design
- Author
-
Dong, Fuwang, Liu, Fan, Xiong, Yifeng, Cui, Yuanhao, Wang, Wei, and Jin, Shi
- Subjects
Computer Science - Information Theory - Abstract
The communication-assisted sensing (CAS) systems are expected to endow the users with beyond-line-of-sight sensing capabilities without the aid of additional sensors. In this paper, we study the dual-functional signaling strategy, focusing on three primary aspects, namely, the information-theoretic framework, the optimal distribution of channel input, and the optimal waveform design for Gaussian signals. First, we establish the information-theoretic framework and develop a modified source-channel separation theorem (MSST) tailored for CAS systems. The proposed MSST elucidates the relationship between achievable distortion, coding rate, and communication channel capacity in cases where the distortion metric is separable for sensing and communication (S\&C) processes. Second, we present an optimal channel input design for dual-functional signaling, which aims to minimize total distortion under the constraints of the MSST and resource budget. We then conceive a two-step Blahut-Arimoto (BA)-based optimal search algorithm to numerically solve the functional optimization problem. Third, in light of the current signaling strategy, we further propose an optimal waveform design for Gaussian signaling in multi-input multi-output (MIMO) CAS systems. The associated covariance matrix optimization problem is addressed using a successive convex approximation (SCA)-based waveform design algorithm. Finally, we provide numerical simulation results to demonstrate the effectiveness of the proposed algorithms and to show the unique performance tradeoff between S\&C processes.
- Published
- 2024
48. Efficient light upconversion via resonant exciton-exciton annihilation of dark excitons in few-layer transition metal dichalcogenides
- Author
-
Chen, Yi-Hsun, Lo, Ping-Yuan, Boschen, Kyle W., Peng, Guan-Hao, Huang, Chun-Jui, Holtzman, Luke N., Hsu, Chih-En, Hsu, Yung-Ning, Holbrook, Madisen, Wang, Wei-Hua, Barmak, Katayun, Hone, James, Hawrylak, Pawel, Hsueh, Hung-Chung, Davis, Jeffrey A., Cheng, Shun-Jen, Fuhrer, Michael S., and Chen, Shao-Yu
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Mesoscale and Nanoscale Physics - Abstract
In this work, we report a pronounced light upconversion in few-layer transition metal dichalcogenides. Our joint theory-experiment study attributes the upconversion photoluminescence to a resonant exciton-exciton annihilation involving a pair of dark excitons with opposite momenta, followed by the spontaneous emission of upconverted bright excitons, which can have a high upconversion efficiency. Additionally, the upconversion photoluminescence is generic in MoS2, MoSe2, WS2, and WSe2, showing a high tuneability from green to ultraviolet light.
- Published
- 2024
49. Exploring the Feasibility of Multimodal Chatbot AI as Copilot in Pathology Diagnostics: Generalist Model's Pitfall
- Author
-
Liu, Mianxin, Wu, Jianfeng, Yan, Fang, Li, Hongjun, Wang, Wei, Zhang, Shaoting, and Wang, Zhe
- Subjects
Computer Science - Human-Computer Interaction ,Computer Science - Computers and Society - Abstract
Pathology images are crucial for diagnosing and managing various diseases by visualizing cellular and tissue-level abnormalities. Recent advancements in artificial intelligence (AI), particularly multimodal models like ChatGPT, have shown promise in transforming medical image analysis through capabilities such as medical vision-language question answering. However, there remains a significant gap in integrating pathology image data with these AI models for clinical applications. This study benchmarks the performance of GPT on pathology images, assessing their diagnostic accuracy and efficiency in real-word clinical records. We observe significant deficits of GPT in bone diseases and a fair-level performance in diseases from other three systems. Despite offering satisfactory abnormality annotations, GPT exhibits consistent disadvantage in terminology accuracy and multimodal integration. Specifically, we demonstrate GPT's failures in interpreting immunohistochemistry results and diagnosing metastatic cancers. This study highlight the weakness of current generalist GPT model and contribute to the integration of pathology and advanced AI.
- Published
- 2024
50. AllWeatherNet:Unified Image enhancement for autonomous driving under adverse weather and lowlight-conditions
- Author
-
Qian, Chenghao, Rezaei, Mahdi, Anwar, Saeed, Li, Wenjing, Hussain, Tanveer, Azarmi, Mohsen, and Wang, Wei
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence - Abstract
Adverse conditions like snow, rain, nighttime, and fog, pose challenges for autonomous driving perception systems. Existing methods have limited effectiveness in improving essential computer vision tasks, such as semantic segmentation, and often focus on only one specific condition, such as removing rain or translating nighttime images into daytime ones. To address these limitations, we propose a method to improve the visual quality and clarity degraded by such adverse conditions. Our method, AllWeather-Net, utilizes a novel hierarchical architecture to enhance images across all adverse conditions. This architecture incorporates information at three semantic levels: scene, object, and texture, by discriminating patches at each level. Furthermore, we introduce a Scaled Illumination-aware Attention Mechanism (SIAM) that guides the learning towards road elements critical for autonomous driving perception. SIAM exhibits robustness, remaining unaffected by changes in weather conditions or environmental scenes. AllWeather-Net effectively transforms images into normal weather and daytime scenes, demonstrating superior image enhancement results and subsequently enhancing the performance of semantic segmentation, with up to a 5.3% improvement in mIoU in the trained domain. We also show our model's generalization ability by applying it to unseen domains without re-training, achieving up to 3.9% mIoU improvement. Code can be accessed at: https://github.com/Jumponthemoon/AllWeatherNet.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.