120,925 results on '"LI, JING"'
Search Results
2. American Life: A Chinese Historian's Perspective by Cho-yun Hsu (review)
- Author
-
Li, Jing
- Published
- 2023
3. Can Foreign Direct Investment (FDI) Help Enhance the Capital Structure Stability of Host Companies?: Evidence from China
- Author
-
Li, Jing and Li, Yao
- Published
- 2022
- Full Text
- View/download PDF
4. State Leadership vs. Lawyers’ Entrepreneurship: The Globalization Trajectories of Chinese Legal Professionals Under the Belt & Road Initiative
- Author
-
Li, Jing
- Published
- 2022
5. The Dragon Daughter and Other Lin Lan Fairy Tales ed. by Juwen Zhang (review)
- Author
-
Li, Jing
- Published
- 2023
6. The Cold War and the Origins of Foreign Relations of the People's Republic of China by Niu Jun (review)
- Author
-
Li, Jing
- Published
- 2021
- Full Text
- View/download PDF
7. Natural Language Processing Methods for the Study of Protein-Ligand Interactions
- Author
-
Michels, James, Bandarupalli, Ramya, Akbari, Amin Ahangar, Le, Thai, Xiao, Hong, Li, Jing, and Hom, Erik F. Y.
- Subjects
Quantitative Biology - Quantitative Methods ,Computer Science - Computation and Language - Abstract
Recent advances in Natural Language Processing (NLP) have ignited interest in developing effective methods for predicting protein-ligand interactions (PLIs) given their relevance to drug discovery and protein engineering efforts and the ever-growing volume of biochemical sequence and structural data available. The parallels between human languages and the "languages" used to represent proteins and ligands have enabled the use of NLP machine learning approaches to advance PLI studies. In this review, we explain where and how such approaches have been applied in the recent literature and discuss useful mechanisms such as long short-term memory, transformers, and attention. We conclude with a discussion of the current limitations of NLP methods for the study of PLIs as well as key challenges that need to be addressed in future work., Comment: 52 Pages and 3 Figures
- Published
- 2024
8. StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
- Author
-
Zhou, Zhengguang, Li, Jing, Li, Huaxia, Chen, Nemo, and Tang, Xu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Tuning-free personalized image generation methods have achieved significant success in maintaining facial consistency, i.e., identities, even with multiple characters. However, the lack of holistic consistency in scenes with multiple characters hampers these methods' ability to create a cohesive narrative. In this paper, we introduce StoryMaker, a personalization solution that preserves not only facial consistency but also clothing, hairstyles, and body consistency, thus facilitating the creation of a story through a series of images. StoryMaker incorporates conditions based on face identities and cropped character images, which include clothing, hairstyles, and bodies. Specifically, we integrate the facial identity information with the cropped character images using the Positional-aware Perceiver Resampler (PPR) to obtain distinct character features. To prevent intermingling of multiple characters and the background, we separately constrain the cross-attention impact regions of different characters and the background using MSE loss with segmentation masks. Additionally, we train the generation network conditioned on poses to promote decoupling from poses. A LoRA is also employed to enhance fidelity and quality. Experiments underscore the effectiveness of our approach. StoryMaker supports numerous applications and is compatible with other societal plug-ins. Our source codes and model weights are available at https://github.com/RedAIGC/StoryMaker., Comment: 12 pages, 5 figures
- Published
- 2024
9. Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models
- Author
-
Rao, Jun, Liu, Xuebo, Lin, Zepeng, Ding, Liang, Li, Jing, Tao, Dacheng, and Zhang, Min
- Subjects
Computer Science - Computation and Language - Abstract
Knowledge distillation (KD) is a technique that compresses large teacher models by training smaller student models to mimic them. The success of KD in auto-regressive language models mainly relies on Reverse KL for mode-seeking and student-generated output (SGO) to combat exposure bias. Our theoretical analyses and experimental validation reveal that while Reverse KL effectively mimics certain features of the teacher distribution, it fails to capture most of its behaviors. Conversely, SGO incurs higher computational costs and presents challenges in optimization, particularly when the student model is significantly smaller than the teacher model. These constraints are primarily due to the immutable distribution of the teacher model, which fails to adjust adaptively to models of varying sizes. We introduce Online Knowledge Distillation (OKD), where the teacher network integrates small online modules to concurrently train with the student model. This strategy abolishes the necessity for on-policy sampling and merely requires minimal updates to the parameters of the teacher's online module during training, thereby allowing dynamic adaptation to the student's distribution to make distillation better. Extensive results across multiple generation datasets show that OKD achieves or exceeds the performance of leading methods in various model architectures and sizes, reducing training time by up to fourfold.
- Published
- 2024
10. A Chinese Continuous Sign Language Dataset Based on Complex Environments
- Author
-
Zhu, Qidan, Li, Jing, Yuan, Fei, Fan, Jiaojiao, and Gan, Quan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
The current bottleneck in continuous sign language recognition (CSLR) research lies in the fact that most publicly available datasets are limited to laboratory environments or television program recordings, resulting in a single background environment with uniform lighting, which significantly deviates from the diversity and complexity found in real-life scenarios. To address this challenge, we have constructed a new, large-scale dataset for Chinese continuous sign language (CSL) based on complex environments, termed the complex environment - chinese sign language dataset (CE-CSL). This dataset encompasses 5,988 continuous CSL video clips collected from daily life scenes, featuring more than 70 different complex backgrounds to ensure representativeness and generalization capability. To tackle the impact of complex backgrounds on CSLR performance, we propose a time-frequency network (TFNet) model for continuous sign language recognition. This model extracts frame-level features and then utilizes both temporal and spectral information to separately derive sequence features before fusion, aiming to achieve efficient and accurate CSLR. Experimental results demonstrate that our approach achieves significant performance improvements on the CE-CSL, validating its effectiveness under complex background conditions. Additionally, our proposed method has also yielded highly competitive results when applied to three publicly available CSL datasets., Comment: 11 pages, 3 figures
- Published
- 2024
11. Interfacial spin-orbitronic effects controlled with different oxidation levels at the Co|Al interface
- Author
-
Krishnia, Sachin, Vojáček, Libor, Gomes, Tristan Da Câmara Santa Clara, Sebe, Nicolas, Ibrahim, Fatima, Li, Jing, Vicente-Arche, Luis Moreno, Collin, Sophie, Denneulin, Thibaud, Dunin-Borkowski, Rafal E., Ohresser, Philippe, Jaouen, Nicolas, Thiaville, André, Fert, Albert, Jaffrès, Henri, Chshiev, Mairbek, Reyren, Nicolas, and Cros, Vincent
- Subjects
Condensed Matter - Materials Science - Abstract
Perpendicular magnetic anisotropy (PMA) and Dzyaloshinskii-Moriya interactions are key interactions in modern spintronics. These interactions are thought to be dominated by the oxidation of the Co|Al interface in the archetypal Platinum-Cobalt-Aluminum oxide system. Here, we observe a double sign change in the anisotropy and about threefold variation in interfacial chiral interaction, influenced not only by the oxidation, but also by the metallic Al thickness. Contrary to previous assumptions about negligible spin-orbit effects at light metal interfaces, we not only observe strong PMA with fully oxidized Al, decreasing and turning negative (in-plane) with less oxygen at the Co|Al interface, we also observe that the magnetic anisotropy reverts to positive (out-of-plane) values at fully metallic Co|Al interface. These findings suggest modification in Co d band via Co|Al orbital hybridization, an effect supported by X-ray absorption spectroscopy and ab initio theory calculations, highlighting the key impact of strain on interfacial mechanisms at fully metallic Co|Al interface., Comment: 6 pages, 5 figures
- Published
- 2024
12. Sketch: A Toolkit for Streamlining LLM Operations
- Author
-
Jiang, Xin, Li, Xiang, Ma, Wenjia, Fang, Xuezhi, Yao, Yiqun, Yu, Naitong, Meng, Xuying, Han, Peng, Li, Jing, Sun, Aixin, and Wang, Yequan
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Large language models (LLMs) represented by GPT family have achieved remarkable success. The characteristics of LLMs lie in their ability to accommodate a wide range of tasks through a generative approach. However, the flexibility of their output format poses challenges in controlling and harnessing the model's outputs, thereby constraining the application of LLMs in various domains. In this work, we present Sketch, an innovative toolkit designed to streamline LLM operations across diverse fields. Sketch comprises the following components: (1) a suite of task description schemas and prompt templates encompassing various NLP tasks; (2) a user-friendly, interactive process for building structured output LLM services tailored to various NLP tasks; (3) an open-source dataset for output format control, along with tools for dataset construction; and (4) an open-source model based on LLaMA3-8B-Instruct that adeptly comprehends and adheres to output formatting instructions. We anticipate this initiative to bring considerable convenience to LLM users, achieving the goal of ''plug-and-play'' for various applications. The components of Sketch will be progressively open-sourced at https://github.com/cofe-ai/Sketch.
- Published
- 2024
13. Bilateral boundary finite-time stabilization of 2x2 linear first-order hyperbolic systems with spatially varying coefficients
- Author
-
Sun, Wei, Li, Jing, and Xu, Liangyu
- Subjects
Mathematics - Optimization and Control ,Mathematics - Analysis of PDEs ,35L04, 35L40, 93D15 - Abstract
This paper presents bilateral control laws for one-dimensional(1-D) linear 2x2 hyperbolic first-order systems (with spatially varying coefficients). Bilateral control means there are two actuators at each end of the domain. This situation becomes more complex as the transport velocities are no longer constant, and this extension is nontrivial. By selecting the appropriate backstepping transformation and target system, the infinite-dimensional backstepping method is extended and a full-state feedback control law is given that ensures the closed-loop system converges to its zero equilibrium in finite time. The design of bilateral controllers enables a potential for fault-tolerant designs.
- Published
- 2024
14. Area under the ROC Curve has the Most Consistent Evaluation for Binary Classification
- Author
-
Li, Jing
- Subjects
Statistics - Machine Learning ,Computer Science - Machine Learning - Abstract
Evaluation Metrics is an important question for model evaluation and model selection in binary classification tasks. This study investigates how consistent metrics are at evaluating different models under different data scenarios. Analyzing over 150 data scenarios and 18 model evaluation metrics using statistical simulation, I find that for binary classification tasks, evaluation metrics that are less influenced by prevalence offer more consistent ranking of a set of different models. In particular, Area Under the ROC Curve (AUC) has smallest variance in ranking of different models. Matthew's correlation coefficient as a more strict measure of model performance has the second smallest variance. These patterns holds across a rich set of data scenarios and five commonly used machine learning models as well as a naive random guess model. The results have significant implications for model evaluation and model selection in binary classification tasks.
- Published
- 2024
15. BMX: Entropy-weighted Similarity and Semantic-enhanced Lexical Search
- Author
-
Li, Xianming, Lipp, Julius, Shakir, Aamir, Huang, Rui, and Li, Jing
- Subjects
Computer Science - Information Retrieval - Abstract
BM25, a widely-used lexical search algorithm, remains crucial in information retrieval despite the rise of pre-trained and large language models (PLMs/LLMs). However, it neglects query-document similarity and lacks semantic understanding, limiting its performance. We revisit BM25 and introduce BMX, a novel extension of BM25 incorporating entropy-weighted similarity and semantic enhancement techniques. Extensive experiments demonstrate that BMX consistently outperforms traditional BM25 and surpasses PLM/LLM-based dense retrieval in long-context and real-world retrieval benchmarks. This study bridges the gap between classical lexical search and modern semantic approaches, offering a promising direction for future information retrieval research. The reference implementation of BMX can be found in Baguetter, which was created in the context of this work. The code can be found here: https://github.com/mixedbread-ai/baguetter., Comment: correct the affiliation order
- Published
- 2024
16. Long working distance portable smartphone microscopy for metallic mesh defect detection
- Author
-
Lu, Zhengang, Qin, Hongsheng, Li, Jing, Sun, Ming, and Tan, Jiubin
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Metallic mesh is a transparent electromagnetic shielding film with a fine metal line structure. However, it can develop defects that affect the optoelectronic performance whether in the production preparation or in actual use. The development of in-situ non-destructive testing (NDT) devices for metallic mesh requires long working distances, reflective optical path design, and miniaturization. To address the limitations of existing smartphone microscopes, which feature short working distances and inadequate transmission imaging for industrial in-situ inspection, we propose a novel long-working distance reflective smartphone microscopy system (LD-RSM). LD-RSM builds a 4f optical imaging system with external optical components and a smartphone, utilizing a beam splitter to achieve reflective imaging with the illumination system and imaging system on the same side of the sample. It achieves an optical resolution of 4.92$\mu$m and a working distance of up to 22.23 mm. Additionally, we introduce a dual prior weighted Robust Principal Component Analysis (DW-RPCA) for defect detection. This approach leverages spectral filter fusion and Hough transform to model different defect types, enhancing the accuracy and efficiency of defect identification. Coupled with an optimized threshold segmentation algorithm, DW-RPCA method achieves a pixel-level accuracy of 84.8%. Our work showcases strong potential for growth in the field of in-situ on-line inspection of industrial products.
- Published
- 2024
17. Open-domain Implicit Format Control for Large Language Model Generation
- Author
-
Yao, Yiqun, Ma, Wenjia, Fang, Xuezhi, Jiang, Xin, Li, Xiang, Meng, Xuying, Han, Peng, Li, Jing, Sun, Aixin, and Wang, Yequan
- Subjects
Computer Science - Computation and Language - Abstract
Controlling the format of outputs generated by large language models (LLMs) is a critical functionality in various applications. Current methods typically employ constrained decoding with rule-based automata or fine-tuning with manually crafted format instructions, both of which struggle with open-domain format requirements. To address this limitation, we introduce a novel framework for controlled generation in LLMs, leveraging user-provided, one-shot QA pairs. This study investigates LLMs' capabilities to follow open-domain, one-shot constraints and replicate the format of the example answers. We observe that this is a non-trivial problem for current LLMs. We also develop a dataset collection methodology for supervised fine-tuning that enhances the open-domain format control of LLMs without degrading output quality, as well as a benchmark on which we evaluate both the helpfulness and format correctness of LLM outputs. The resulting datasets, named OIFC-SFT, along with the related code, will be made publicly available at https://github.com/cofe-ai/OIFC., Comment: 6 pages
- Published
- 2024
18. Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy
- Author
-
Tan, Xiaoheng, Zhang, Jiabin, Quan, Yuhui, Li, Jing, Wu, Yajing, and Bian, Zilin
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Deep Video Quality Assessment (VQA) methods have shown impressive high-performance capabilities. Notably, no-reference (NR) VQA methods play a vital role in situations where obtaining reference videos is restricted or not feasible. Nevertheless, as more streaming videos are being created in ultra-high definition (e.g., 4K) to enrich viewers' experiences, the current deep VQA methods face unacceptable computational costs. Furthermore, the resizing, cropping, and local sampling techniques employed in these methods can compromise the details and content of original 4K videos, thereby negatively impacting quality assessment. In this paper, we propose a highly efficient and novel NR 4K VQA technology. Specifically, first, a novel data sampling and training strategy is proposed to tackle the problem of excessive resolution. This strategy allows the VQA Swin Transformer-based model to effectively train and make inferences using the full data of 4K videos on standard consumer-grade GPUs without compromising content or details. Second, a weighting and scoring scheme is developed to mimic the human subjective perception mode, which is achieved by considering the distinct impact of each sub-region within a 4K frame on the overall perception. Third, we incorporate the frequency domain information of video frames to better capture the details that affect video quality, consequently further improving the model's generalizability. To our knowledge, this is the first technology for the NR 4K VQA task. Thorough empirical studies demonstrate it not only significantly outperforms existing methods on a specialized 4K VQA dataset but also achieves state-of-the-art performance across multiple open-source NR video quality datasets., Comment: Accepted by ACM MM 2024
- Published
- 2024
- Full Text
- View/download PDF
19. Spin hierarchy in van der Waals molecule formation via ultracold three-body recombination
- Author
-
Li, Jing-Lun, Julienne, Paul S., Denschlag, Johannes Hecker, and D'Incao, José P.
- Subjects
Physics - Atomic Physics - Abstract
We theoretically investigate the product-state distribution of weakly bound diatomic van der Waals molecules via ultracold three-body recombination of bosonic alkali atoms. We find a two-level hierarchy of spin propensity rules at zero magnetic field. The primary propensity rule states that nearly all molecular products conserve the total hyperfine spin of reactant atomic pairs, while molecular products not conserving the total spin are highly suppressed. For the dominant molecular products, there is a secondary propensity to conserve certain spin components of the reactant pair such as the atomic hyperfine spins, or the total electronic or nuclear spins. The second propensity varies across species and depends fundamentally on the interplay between effective electronic exchange and hyperfine interactions. The spin sensitivity of product-state distribution can potentially open up new avenues for controlling state-to-state reaction rates in ultracold three-body recombination.
- Published
- 2024
20. LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits
- Author
-
Chang, Chen-Chia, Shen, Yikang, Fan, Shaoze, Li, Jing, Zhang, Shun, Cao, Ningyuan, Chen, Yiran, and Zhang, Xin
- Subjects
Computer Science - Hardware Architecture ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
In the realm of electronic and electrical engineering, automation of analog circuit is increasingly vital given the complexity and customized requirements of modern applications. However, existing methods only develop search-based algorithms that require many simulation iterations to design a custom circuit topology, which is usually a time-consuming process. To this end, we introduce LaMAGIC, a pioneering language model-based topology generation model that leverages supervised finetuning for automated analog circuit design. LaMAGIC can efficiently generate an optimized circuit design from the custom specification in a single pass. Our approach involves a meticulous development and analysis of various input and output formulations for circuit. These formulations can ensure canonical representations of circuits and align with the autoregressive nature of LMs to effectively addressing the challenges of representing analog circuits as graphs. The experimental results show that LaMAGIC achieves a success rate of up to 96\% under a strict tolerance of 0.01. We also examine the scalability and adaptability of LaMAGIC, specifically testing its performance on more complex circuits. Our findings reveal the enhanced effectiveness of our adjacency matrix-based circuit formulation with floating-point input, suggesting its suitability for handling intricate circuit designs. This research not only demonstrates the potential of language models in graph generation, but also builds a foundational framework for future explorations in automated analog circuit design., Comment: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:6253-6262 https://proceedings.mlr.press/v235/chang24c.html
- Published
- 2024
21. MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis
- Author
-
Zhong, Ziming, Xu, Yanxu, Li, Jing, Xu, Jiale, Li, Zhengxin, Yu, Chaohui, and Gao, Shenghua
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
We present MeshSegmenter, a simple yet effective framework designed for zero-shot 3D semantic segmentation. This model successfully extends the powerful capabilities of 2D segmentation models to 3D meshes, delivering accurate 3D segmentation across diverse meshes and segment descriptions. Specifically, our model leverages the Segment Anything Model (SAM) model to segment the target regions from images rendered from the 3D shape. In light of the importance of the texture for segmentation, we also leverage the pretrained stable diffusion model to generate images with textures from 3D shape, and leverage SAM to segment the target regions from images with textures. Textures supplement the shape for segmentation and facilitate accurate 3D segmentation even in geometrically non-prominent areas, such as segmenting a car door within a car mesh. To achieve the 3D segments, we render 2D images from different views and conduct segmentation for both textured and untextured images. Lastly, we develop a multi-view revoting scheme that integrates 2D segmentation results and confidence scores from various views onto the 3D mesh, ensuring the 3D consistency of segmentation results and eliminating inaccuracies from specific perspectives. Through these innovations, MeshSegmenter offers stable and reliable 3D segmentation results both quantitatively and qualitatively, highlighting its potential as a transformative tool in the field of 3D zero-shot segmentation. The code is available at \url{https://github.com/zimingzhong/MeshSegmenter}., Comment: The paper was accepted by ECCV2024
- Published
- 2024
22. Nonreciprocal Single-Photon Band Structure in a Coupled-Spinning-Resonator chain
- Author
-
Li, Jing, Yang, Ya, Xu, Xun Wei, Lu, Jing, Jing, Hui, and Zhou, Lan
- Subjects
Physics - Optics ,Quantum Physics - Abstract
We analyze the single-photon band structure and the transport of a single photon in a one-dimensional coupled-spinning-resonator chain. The time-reversal symmetry of the resonators chain is broken by the spinning of the resonators, instead of external or synthetic magnetic field. Two nonreciprocal single-photon band gaps can be obtained in the coupled-spinning-resonator chain, whose width depends on the angular velocity of the spinning resonator. Based on the nonreciprocal band gaps, we can implement a single photon circulator at multiple frequency windows, and the direction of photon cycling is opposite for different band gaps. In addition, reciprocal single-photon band structures can also be realized in the coupled-spinning-resonator chain when all resonators rotate in the same direction with equal angular velocity. Our work open a new route to achieve, manipulate, and switch nonreciprocal or reciprocal single-photon band structures, and provides new opportunities to realize novel single-photon devices., Comment: 8 pages, 3 figures
- Published
- 2024
23. GRAPE: Generalizable and Robust Multi-view Facial Capture
- Author
-
Li, Jing, Kang, Di, and He, Zhenyu
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Deep learning-based multi-view facial capture methods have shown impressive accuracy while being several orders of magnitude faster than a traditional mesh registration pipeline. However, the existing systems (e.g. TEMPEH) are strictly restricted to inference on the data captured by the same camera array used to capture their training data. In this study, we aim to improve the generalization ability so that a trained model can be readily used for inference (i.e. capture new data) on a different camera array. To this end, we propose a more generalizable initialization module to extract the camera array-agnostic 3D feature, including a visual hull-based head localization and a visibility-aware 3D feature aggregation module enabled by the visual hull. In addition, we propose an ``update-by-disagreement'' learning strategy to better handle data noise (e.g. inaccurate registration, scan noise) by discarding potentially inaccurate supervision signals during training. The resultant generalizable and robust topologically consistent multi-view facial capture system (GRAPE) can be readily used to capture data on a different camera array, reducing great effort on data collection and processing. Experiments on the FaMoS and FaceScape datasets demonstrate the effectiveness of the proposed method.
- Published
- 2024
24. HSFusion: A high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation
- Author
-
Jiang, Chengjie, Liu, Xiaowen, Zheng, Bowen, Bai, Lu, and Li, Jing
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Infrared and visible image fusion has been developed from vision perception oriented fusion methods to strategies which both consider the vision perception and high-level vision task. However, the existing task-driven methods fail to address the domain gap between semantic and geometric representation. To overcome these issues, we propose a high-level vision task-driven infrared and visible image fusion network via semantic and geometric domain transformation, terms as HSFusion. Specifically, to minimize the gap between semantic and geometric representation, we design two separate domain transformation branches by CycleGAN framework, and each includes two processes: the forward segmentation process and the reverse reconstruction process. CycleGAN is capable of learning domain transformation patterns, and the reconstruction process of CycleGAN is conducted under the constraint of these patterns. Thus, our method can significantly facilitate the integration of semantic and geometric information and further reduces the domain gap. In fusion stage, we integrate the infrared and visible features that extracted from the reconstruction process of two seperate CycleGANs to obtain the fused result. These features, containing varying proportions of semantic and geometric information, can significantly enhance the high level vision tasks. Additionally, we generate masks based on segmentation results to guide the fusion task. These masks can provide semantic priors, and we design adaptive weights for two distinct areas in the masks to facilitate image fusion. Finally, we conducted comparative experiments between our method and eleven other state-of-the-art methods, demonstrating that our approach surpasses others in both visual appeal and semantic segmentation task.
- Published
- 2024
25. Detailed Mapping of the Galactic Disk Structure in the Solar Neighborhood through LAMOST K Dwarfs
- Author
-
Tang, Xi-Can, Tian, Hao, Li, Jing, Chen, Bing-qiu, Chen, Yi-Rong, Liu, Chao, and Qiu, Dan
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
The Galactic disk is one of the main components of the Milky Way, which contributes most of the luminosity. Its structure is essential for understanding the formation and evolution of the Milky Way. Using 174,443 K-type dwarf stars observed by both LAMOST and Gaia DR3, we study the disk density profile in the local volume within 1,200 pc. In the azimuthal dimension, we find strong asymmetric signal of the thin disk. The surface density and the scale height of the southern disk significantly change versus the azimuthal angle at the same galactocentric distance $R$. Meanwhile, in the vertical dimension, the scale height of the northern disk has quite different trend than that of the southern one. The scale height of the southern disk shows a decreasing trend with $\phi\sim-2.5^\circ$, and change to an increasing one with $\phi\sim5.0^\deg$. Meanwhile, the scale height of the northern disk has a consistently smaller increase. Finally, we divide the entire sample into three subsamples based on metallicity and all three subsamples show significant non-axisymmetric and north-south asymmetric signals in the Galactic disk. Furthermore, we find that the scale height of the metal-poor ([Fe/H] $<$ -0.4 dex) subsample in the northern disk is greater than that of the metal-rich ([Fe/H] $>$ -0.1 dex) subsample. However, in the southern disk, the scale height exhibits varying relationships across different metallicity slices., Comment: 15 pages, 24 figures, 6 tables; accepted for publication in MNRAS
- Published
- 2024
- Full Text
- View/download PDF
26. Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models
- Author
-
Zhang, Yuji, Li, Sha, Liu, Jiateng, Yu, Pengfei, Fung, Yi R., Li, Jing, Li, Manling, and Ji, Heng
- Subjects
Computer Science - Computation and Language - Abstract
Hallucination is often regarded as a major impediment for using large language models (LLMs), especially for knowledge-intensive tasks. Even when the training corpus consists solely of true statements, language models still generate hallucinations in the form of amalgamations of multiple facts. We coin this phenomenon as ``knowledge overshadowing'': when we query knowledge from a language model with multiple conditions, some conditions overshadow others, leading to hallucinated outputs. This phenomenon partially stems from training data imbalance, which we verify on both pretrained models and fine-tuned models, over a wide range of LM model families and sizes.From a theoretical point of view, knowledge overshadowing can be interpreted as over-generalization of the dominant conditions (patterns). We show that the hallucination rate grows with both the imbalance ratio (between the popular and unpopular condition) and the length of dominant condition description, consistent with our derived generalization bound. Finally, we propose to utilize overshadowing conditions as a signal to catch hallucination before it is produced, along with a training-free self-contrastive decoding method to alleviate hallucination during inference. Our proposed approach showcases up to 82% F1 for hallucination anticipation and 11.2% to 39.4% hallucination control, with different models and datasets.
- Published
- 2024
27. Activity-Induced Stiffness, Entanglement Network and Dynamic Slowdown in Unentangled Semidilute Polymer Solutions
- Author
-
Li, Jing, Zhang, Bokai, and Wang, Zhi-Yong
- Subjects
Condensed Matter - Soft Condensed Matter - Abstract
Active polymers possess numerous unique properties that are quite different from those observed in the system of small active molecule due to the intricate interplay between their activity and topological constraints. This study focuses on the conformational changes induced by activity, impacting effective stiffness and crucially influencing entanglement and dynamics. When the two terminals of a linear chain undergo active modification through coupling to a high-temperature thermal bath, there is a substantial increase in chain size, indicating a notable enhancement in effective stiffness. Unlike in passive semiflexible chains where stiffness predominantly affects local bond angles, activity-induced stiffness manifests at the scale of tens of monomers. While activity raises the ambient temperature, it significantly decreases diffusion by over an order of magnitude. The slowdown of dynamics observed can be attributed to increased entanglement due to chain elongation., Comment: 10 pages, 5 figures
- Published
- 2024
- Full Text
- View/download PDF
28. PROUD: PaRetO-gUided Diffusion Model for Multi-objective Generation
- Author
-
Yao, Yinghua, Pan, Yuangang, Li, Jing, Tsang, Ivor, and Yao, Xin
- Subjects
Computer Science - Machine Learning - Abstract
Recent advancements in the realm of deep generative models focus on generating samples that satisfy multiple desired properties. However, prevalent approaches optimize these property functions independently, thus omitting the trade-offs among them. In addition, the property optimization is often improperly integrated into the generative models, resulting in an unnecessary compromise on generation quality (i.e., the quality of generated samples). To address these issues, we formulate a constrained optimization problem. It seeks to optimize generation quality while ensuring that generated samples reside at the Pareto front of multiple property objectives. Such a formulation enables the generation of samples that cannot be further improved simultaneously on the conflicting property functions and preserves good quality of generated samples. Building upon this formulation, we introduce the PaRetO-gUided Diffusion model (PROUD), wherein the gradients in the denoising process are dynamically adjusted to enhance generation quality while the generated samples adhere to Pareto optimality. Experimental evaluations on image generation and protein generation tasks demonstrate that our PROUD consistently maintains superior generation quality while approaching Pareto optimality across multiple property functions compared to various baselines.
- Published
- 2024
- Full Text
- View/download PDF
29. VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values
- Author
-
Hu, Zhe, Ren, Yixiao, Li, Jing, and Yin, Yu
- Subjects
Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper introduces VIVA, a benchmark for VIsion-grounded decision-making driven by human VAlues. While most large vision-language models (VLMs) focus on physical-level skills, our work is the first to examine their multimodal capabilities in leveraging human values to make decisions under a vision-depicted situation. VIVA contains 1,062 images depicting diverse real-world situations and the manually annotated decisions grounded in them. Given an image there, the model should select the most appropriate action to address the situation and provide the relevant human values and reason underlying the decision. Extensive experiments based on VIVA show the limitation of VLMs in using human values to make multimodal decisions. Further analyses indicate the potential benefits of exploiting action consequences and predicted human values.
- Published
- 2024
30. Unveiling Mass Transfer in Solar Flares: Insights from Elemental Abundance Evolutions Observed by Chang'E-2 Solar X-ray Monitor
- Author
-
Ng, Man-Hei, Tang, Chi-Long, Zhang, Xiaoping, Tam, Kuan-Vai, Chen, Peng-Fei, Dong, Wudong, Li, Jing, and Tang, Chi-Pui
- Subjects
Astrophysics - Solar and Stellar Astrophysics - Abstract
Understanding how elemental abundances evolve during solar flares helps shed light on the mass and energy transfer between different solar atmospheric layers. However, prior studies have mostly concentrated on averaged abundances or specific flare phases, leaving a gap in exploring the comprehensive observations throughout the entire flare process. Consequently, investigations into this area are relatively scarce. Exploiting the Solar X-ray Monitor data obtained from the Chang'E-2 lunar orbiter, we present two comprehensive soft X-ray spectroscopic observations of flares in active regions, AR 11149 and 11158, demonstrating elemental abundance evolutions under different conditions. Our findings unveil the inverse first ionization potential (IFIP) effect during flares for Fe for the first time, and reaffirm its existence for Si. Additionally, we observed a rare depletion of elemental abundances, marking the second IFIP effect in flare decay phases. Our study offers a CSHKP model-based interpretation to elucidate the formation of both the FIP and IFIP effects in flare dynamics, with the inertia effect being incorporated into the ponderomotive force fractionation model., Comment: Accepted ApJ
- Published
- 2024
31. The Compressible Navier-Stokes Equations on the Multi-Connected Domains
- Author
-
Fan, Xinyu, Jiang, Song, and Li, Jing
- Subjects
Mathematics - Analysis of PDEs - Abstract
This paper investigates the isentropic compressible Navier-Stokes equations on k-connected domains under Navier-slip boundary conditions. We study the multi-solvability of the stationary systems on general domains, which is closely related with the Cauchy-Riemann systems and critical points of harmonic functions on the domain. Then based on the structure of Green's functions, the commutator estimates are obtained on the circular domains and extended to general domains with the help of conformal mappings. Moreover, we will utilize these assertions to discuss the global well-posedness and large time behaviours of the non-stationary systems on general domains with large initial values containing vacuum.
- Published
- 2024
32. Politics and Digital Literature in the Middle East: Perspectives on Online Text and Context by Nele Lenze (review)
- Author
-
Li, Jing and Ye, Fan
- Published
- 2020
33. Immunological characteristics of hepatic dendritic cells in patients and mouse model with liver 'Echinococcus multilocularis' infection
- Author
-
Wang, Hui, Li, Yinshi, Yu, Qian, Wang, Mingkun, Ainiwaer, Abidan, Tang, Na, Zheng, Xuran, Duolikun, Adilai, Deng, Bingqing, Li, Jing, Shen, Yujuan, and Zhang, Chuanshan
- Published
- 2024
34. COBALT: A Confirmatory Trial of Obeticholic Acid in Primary Biliary Cholangitis With Placebo and External Controls.
- Author
-
Kowdley, Kris V, Hirschfield, Gideon M, Coombs, Charles, Malecha, Elizabeth S, Bessonova, Leona, Li, Jing, Rathnayaka, Nuvan, Mells, George, Jones, David E, Trivedi, Palak J, Hansen, Bettina E, Smith, Rachel, Wason, James, Hiu, Shaun, Kareithi, Dorcas N, Mason, Andrew L, Bowlus, Christopher L, Muller, Kate, Carbone, Marco, Berenguer, Marina, Milkiewicz, Piotr, Adekunle, Femi, and Villamil, Alejandra
- Subjects
Biomedical and Clinical Sciences ,Clinical Sciences ,Clinical Trials and Supportive Activities ,Chronic Liver Disease and Cirrhosis ,Clinical Research ,Rare Diseases ,Digestive Diseases ,Women's Health ,Liver Disease ,6.1 Pharmaceuticals ,Oral and gastrointestinal ,Gastroenterology & Hepatology ,Clinical sciences - Abstract
ObjectivesObeticholic acid (OCA) treatment for primary biliary cholangitis (PBC) was conditionally approved in the phase 3 POISE trial. The COBALT confirmatory trial assessed whether clinical outcomes in PBC patients improve with OCA therapy.MethodsPatients randomized to OCA (5-10 mg) were compared with placebo (randomized controlled trial [RCT]) or external control (EC). The primary composite endpoint was time to death, liver transplant, model for end-stage liver disease score ≥15, uncontrolled ascites, or hospitalization for hepatic decompensation. A prespecified propensity score-weighted EC group was derived from a US healthcare claims database.ResultsIn the RCT, the primary endpoint occurred in 28.6% of OCA (n=168) and 28.9% of placebo patients (n=166; intent-to-treat [ITT] analysis hazard ratio [HR]=1.01, 95% CI=0.68-1.51), but functional unblinding and crossover to commercial therapy occurred, especially in the placebo arm. Correcting for these using inverse probability of censoring weighting (IPCW) and as-treated analyses shifted the HR to favor OCA. In the EC (n=1051), the weighted primary endpoint occurred in 10.1% of OCA and 21.5% of non-OCA patients (HR=0.39; 95% CI=0.22-0.69; P=0.001). No new safety signals were identified in the RCT.ConclusionsFunctional unblinding and treatment crossover, particularly in the placebo arm, confounded the ITT estimate of outcomes associated with OCA in the RCT. Comparison with the real-world EC showed that OCA treatment significantly reduced the risk of negative clinical outcomes. These analyses demonstrate the value of EC data in confirmatory trials and suggest that treatment with OCA improves clinical outcomes in patients with PBC.
- Published
- 2024
35. The next frontier in immunotherapy: potential and challenges of CAR-macrophages.
- Author
-
Li, Jing, Chen, Ping, and Ma, Wenxue
- Subjects
CAR macrophage (CAR-MΦ) ,Clinical trials ,Combination therapies ,Immunotherapy ,Tumor Microenvironment (TME) - Abstract
Chimeric antigen receptor macrophage (CAR-MΦ) represents a significant advancement in immunotherapy, especially for treating solid tumors where traditional CAR-T therapies face limitations. CAR-MΦ offers a promising approach to target and eradicate tumor cells by utilizing macrophages phagocytic and antigen-presenting abilities. However, challenges such as the complex tumor microenvironment (TME), variability in antigen expression, and immune suppression limit their efficacy. This review addresses these issues, exploring mechanisms of CAR-MΦ action, optimal construct designs, and interactions within the TME. It also delves into the ex vivo manufacturing challenges of CAR-MΦ, discussing autologous and allogeneic sources and the importance of stringent quality control. The potential synergies of integrating CAR-MΦ with existing cancer therapies like checkpoint inhibitors and conventional chemotherapeutics are examined to highlight possible enhanced treatment outcomes. Furthermore, regulatory pathways for CAR-MΦ therapies are scrutinized alongside established protocols for CAR-T cells, identifying unique considerations essential for clinical trials and market approval. Proposed safety monitoring frameworks aim to manage potential adverse events, such as cytokine release syndrome, crucial for patient safety. Consolidating current research and clinical insights, this review seeks to refine CAR-MΦ therapeutic applications, overcome barriers, and suggest future research directions to transition CAR-MΦ therapies from experimental platforms to standard cancer care options.
- Published
- 2024
36. Development and application of Breadth-Depth-Context (BDC), a conceptual framework for measuring technology engagement with a qualified clinical data registry
- Author
-
Kersey, Emma, Li, Jing, Kay, Julia, Adler-Milstein, Julia, Yazdany, Jinoos, and Schmajuk, Gabriela
- Subjects
Health Services and Systems ,Health Sciences ,Generic health relevance ,framework ,user engagement ,user profiles ,audit log data ,clinical dashboard ,Health services and systems - Abstract
ObjectivesDespite the proliferation of dashboards that display performance data derived from Qualified Clinical Data Registries (QCDR), the degree to which clinicians and practices engage with such dashboards has not been well described. We aimed to develop a conceptual framework for assessing user engagement with dashboard technology and to demonstrate its application to a rheumatology QCDR.Materials and methodsWe developed the BDC (Breadth-Depth-Context) framework, which included concepts of breadth (derived from dashboard sessions), depth (derived from dashboard actions), and context (derived from practice characteristics). We demonstrated its application via user log data from the American College of Rheumatology's Rheumatology Informatics System for Effectiveness (RISE) registry to define engagement profiles and characterize practice-level factors associated with different profiles.ResultsWe applied the BDC framework to 213 ambulatory practices from the RISE registry in 2020-2021, and classified practices into 4 engagement profiles: not engaged (8%), minimally engaged (39%), moderately engaged (34%), and most engaged (19%). Practices with more patients and with specific electronic health record vendors (eClinicalWorks and eMDs) had a higher likelihood of being in the most engaged group, even after adjusting for other factors.DiscussionWe developed the BDC framework to characterize user engagement with a registry dashboard and demonstrated its use in a specialty QCDR. The application of the BDC framework revealed a wide range of breadth and depth of use and that specific contextual factors were associated with nature of engagement.ConclusionGoing forward, the BDC framework can be used to study engagement with similar dashboards.
- Published
- 2024
37. Structuring medication signeturs as a language regression task: comparison of zero- and few-shot GPT with fine-tuned models.
- Author
-
Garcia-Agundez, Augusto, Kay, Julia, Li, Jing, Gianfrancesco, Milena, Rai, Baljeet, Hu, Angela, Schmajuk, Gabriela, and Yazdany, Jinoos
- Subjects
immunomodulating drugs ,in-context learning ,language regression ,large language models ,natural language processing - Abstract
IMPORTANCE: Electronic health record textual sources such as medication signeturs (sigs) contain valuable information that is not always available in structured form. Commonly processed through manual annotation, this repetitive and time-consuming task could be fully automated using large language models (LLMs). While most sigs include simple instructions, some include complex patterns. OBJECTIVES: We aimed to compare the performance of GPT-3.5 and GPT-4 with smaller fine-tuned models (ClinicalBERT, BlueBERT) in extracting the average daily dose of 2 immunomodulating medications with frequent complex sigs: hydroxychloroquine, and prednisone. METHODS: Using manually annotated sigs as the gold standard, we compared the performance of these models in 702 hydroxychloroquine and 22 104 prednisone prescriptions. RESULTS: GPT-4 vastly outperformed all other models for this task at any level of in-context learning. With 100 in-context examples, the model correctly annotates 94% of hydroxychloroquine and 95% of prednisone sigs to within 1 significant digit. Error analysis conducted by 2 additional manual annotators on annotator-model disagreements suggests that the vast majority of disagreements are model errors. Many model errors relate to ambiguous sigs on which there was also frequent annotator disagreement. DISCUSSION: Paired with minimal manual annotation, GPT-4 achieved excellent performance for language regression of complex medication sigs and vastly outperforms GPT-3.5, ClinicalBERT, and BlueBERT. However, the number of in-context examples needed to reach maximum performance was similar to GPT-3.5. CONCLUSION: LLMs show great potential to rapidly extract structured data from sigs in no-code fashion for clinical and research applications.
- Published
- 2024
38. Unlocking Varied Perspectives: A Persona-Based Multi-Agent Framework with Debate-Driven Text Planning for Argument Generation
- Author
-
Hu, Zhe, Chan, Hou Pong, Li, Jing, and Yin, Yu
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Writing persuasive arguments is a challenging task for both humans and machines. It entails incorporating high-level beliefs from various perspectives on the topic, along with deliberate reasoning and planning to construct a coherent narrative. Current language models often generate surface tokens autoregressively, lacking explicit integration of these underlying controls, resulting in limited output diversity and coherence. In this work, we propose a persona-based multi-agent framework for argument writing. Inspired by the human debate, we first assign each agent a persona representing its high-level beliefs from a unique perspective, and then design an agent interaction process so that the agents can collaboratively debate and discuss the idea to form an overall plan for argument writing. Such debate process enables fluid and nonlinear development of ideas. We evaluate our framework on argumentative essay writing. The results show that our framework can generate more diverse and persuasive arguments through both automatic and human evaluations.
- Published
- 2024
39. Cavity Modified Oscillating Bound States with a $\Lambda$-type giant emitter in a linear waveguide
- Author
-
Sun, Ge, Yang, Ya, Li, Jing, Lu, Jing, and Zhou, Lan
- Subjects
Quantum Physics - Abstract
We study a system composed by a three-level giant atom (3GA), a waveguide initially in the vacuum state, and a single-mode cavity. The 3GA-cavity system is in a strong-coupling regime, and the distance between the coupling points is comparable to the coherent length of a spontaneously emitted photon. The dynamics of the 3GA and its radiative field in the waveguide for long time are analyzed. Besides the steady value, we also found an oscillatory character of the excited state population, a signature of oscillating bound states which is generated by the superposition of two bound states in the continuum. The radiative field propagates in the cavity-like geometry formed by the coupling points. When one bound state is emergent, a sine-like interference pattern is visible for the emitted field intensity in spacetime. An oscillatory character in time and a beat in space for the emitted field intensity are observed when two bound states are emergent in a subspace. The wavelengths and the periods are controlled by the number of the photons in the cavity.
- Published
- 2024
40. Operating Single-Photon Circulator by Spinning Optical Resonators
- Author
-
Li, Jing, Lu, Tian-Xiang, Peng, Meiyu, Kuang, Le-Man, Jing, Hui, and Zhou, Lan
- Subjects
Quantum Physics - Abstract
A circulator is one of the crucial devices in quantum networks and simulations. We propose a four-port circulator that regulate the flow of single photons at muti-frequency points by studying the coherent transmission of a single photon in a coupled system of two resonators and two waveguides. When both resonators are static or rotate at the same angular velocity, single-photon transport demonstrates reciprocity; however, when the angular velocities differ, four distinct frequency points emerge where photon circulation can occur. In particular, when the angular velocities of the two resonators are equal and opposite, there are two different frequency points where photon circulation can be achieved, and there is a frequency point where a single photon input from any waveguide can be completely routed to the other waveguide. Interestingly, by rotating the two resonators, the single-photon circulation suppressed by the internal defect-induced backscattering can be restored., Comment: 12 pages, 5 figures
- Published
- 2024
41. CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference
- Author
-
Yu, Erxin, Li, Jing, Liao, Ming, Wang, Siqi, Gao, Zuchen, Mi, Fei, and Hong, Lanqing
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
As large language models (LLMs) constantly evolve, ensuring their safety remains a critical research problem. Previous red-teaming approaches for LLM safety have primarily focused on single prompt attacks or goal hijacking. To the best of our knowledge, we are the first to study LLM safety in multi-turn dialogue coreference. We created a dataset of 1,400 questions across 14 categories, each featuring multi-turn coreference safety attacks. We then conducted detailed evaluations on five widely used open-source LLMs. The results indicated that under multi-turn coreference safety attacks, the highest attack success rate was 56% with the LLaMA2-Chat-7b model, while the lowest was 13.9% with the Mistral-7B-Instruct model. These findings highlight the safety vulnerabilities in LLMs during dialogue coreference interactions., Comment: Submitted to EMNLP 2024
- Published
- 2024
42. Unveiling the Impact of Multi-Modal Interactions on User Engagement: A Comprehensive Evaluation in AI-driven Conversations
- Author
-
Zhang, Lichao, Yu, Jia, Zhang, Shuai, Li, Long, Zhong, Yangyang, Liang, Guanbao, Yan, Yuming, Ma, Qing, Weng, Fangsheng, Pan, Fayu, Li, Jing, Xu, Renjun, and Lan, Zhenzhong
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Large Language Models (LLMs) have significantly advanced user-bot interactions, enabling more complex and coherent dialogues. However, the prevalent text-only modality might not fully exploit the potential for effective user engagement. This paper explores the impact of multi-modal interactions, which incorporate images and audio alongside text, on user engagement in chatbot conversations. We conduct a comprehensive analysis using a diverse set of chatbots and real-user interaction data, employing metrics such as retention rate and conversation length to evaluate user engagement. Our findings reveal a significant enhancement in user engagement with multi-modal interactions compared to text-only dialogues. Notably, the incorporation of a third modality significantly amplifies engagement beyond the benefits observed with just two modalities. These results suggest that multi-modal interactions optimize cognitive processing and facilitate richer information comprehension. This study underscores the importance of multi-modality in chatbot design, offering valuable insights for creating more engaging and immersive AI communication experiences and informing the broader AI community about the benefits of multi-modal interactions in enhancing user engagement.
- Published
- 2024
43. Latent Variable Sequence Identification for Cognitive Models with Neural Bayes Estimation
- Author
-
Pan, Ti-Fen, Li, Jing-Jing, Thompson, Bill, and Collins, Anne
- Subjects
Computer Science - Machine Learning ,Statistics - Machine Learning - Abstract
Extracting time-varying latent variables from computational cognitive models is a key step in model-based neural analysis, which aims to understand the neural correlates of cognitive processes. However, existing methods only allow researchers to infer latent variables that explain subjects' behavior in a relatively small class of cognitive models. For example, a broad class of relevant cognitive models with analytically intractable likelihood is currently out of reach from standard techniques, based on Maximum a Posteriori parameter estimation. Here, we present an approach that extends neural Bayes estimation to learn a direct mapping between experimental data and the targeted latent variable space using recurrent neural networks and simulated datasets. We show that our approach achieves competitive performance in inferring latent variable sequences in both tractable and intractable models. Furthermore, the approach is generalizable across different computational models and is adaptable for both continuous and discrete latent spaces. We then demonstrate its applicability in real world datasets. Our work underscores that combining recurrent neural networks and simulation-based inference to identify latent variable sequences can enable researchers to access a wider class of cognitive models for model-based neural analyses, and thus test a broader set of theories.
- Published
- 2024
44. Knowledge Fusion By Evolving Weights of Language Models
- Author
-
Du, Guodong, Li, Jing, Liu, Hanting, Jiang, Runhua, Yu, Shuyang, Guo, Yifei, Goh, Sim Kuan, and Tang, Ho-Kin
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Neural and Evolutionary Computing - Abstract
Fine-tuning pre-trained language models, particularly large language models, demands extensive computing resources and can result in varying performance outcomes across different domains and datasets. This paper examines the approach of integrating multiple models from diverse training scenarios into a unified model. This unified model excels across various data domains and exhibits the ability to generalize well on out-of-domain data. We propose a knowledge fusion method named Evolver, inspired by evolutionary algorithms, which does not need further training or additional training data. Specifically, our method involves aggregating the weights of different language models into a population and subsequently generating offspring models through mutation and crossover operations. These offspring models are then evaluated against their parents, allowing for the preservation of those models that show enhanced performance on development datasets. Importantly, our model evolving strategy can be seamlessly integrated with existing model merging frameworks, offering a versatile tool for model enhancement. Experimental results on mainstream language models (i.e., encoder-only, decoder-only, encoder-decoder) reveal that Evolver outperforms previous state-of-the-art models by large margins. The code is publicly available at {https://github.com/duguodong7/model-evolution}., Comment: Accepted by ACL2024 Findings
- Published
- 2024
45. Direct observations of cross-scale energy transfer in space plasmas
- Author
-
Li, Jing-Huan, Zhou, Xu-Zhi, Liu, Zhi-Yang, Wang, Shan, Omura, Yoshiharu, Li, Li, Yue, Chao, Zong, Qiu-Gang, Le, Guan, Russell, Christopher T., and Burch, James L.
- Subjects
Physics - Space Physics - Abstract
The collisionless plasmas in space and astrophysical environments are intrinsically multiscale in nature, behaving as conducting fluids at macroscales and kinetically at microscales comparable to ion- and/or electron-gyroradii. A fundamental question in understanding the plasma dynamics is how energy is transported and dissipated across different scales. Here, we present spacecraft measurements in the solar wind upstream of the terrestrial bow shock, in which the macroscale ultra-low-frequency waves and microscale whistler waves simultaneously resonate with the ions. The ion acceleration from ultra-low-frequency waves leads to velocity distributions unstable to the growth of whistler waves, which in turn resonate with the electrons to complete cross-scale energy transfer. These observations, consistent with numerical simulations in the occurrence of phase-bunched ion and electron distributions, also highlight the importance of anomalous resonance, a nonlinear modification of the classical cyclotron resonance, in the cross-scale wave coupling and energy transfer processes., Comment: 22 pages, 7 figures and supplementary material
- Published
- 2024
46. Multimodal Reasoning with Multimodal Knowledge Graph
- Author
-
Lee, Junlin, Wang, Yequan, Li, Jing, and Zhang, Min
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Multimodal reasoning with large language models (LLMs) often suffers from hallucinations and the presence of deficient or outdated knowledge within LLMs. Some approaches have sought to mitigate these issues by employing textual knowledge graphs, but their singular modality of knowledge limits comprehensive cross-modal understanding. In this paper, we propose the Multimodal Reasoning with Multimodal Knowledge Graph (MR-MKG) method, which leverages multimodal knowledge graphs (MMKGs) to learn rich and semantic knowledge across modalities, significantly enhancing the multimodal reasoning capabilities of LLMs. In particular, a relation graph attention network is utilized for encoding MMKGs and a cross-modal alignment module is designed for optimizing image-text alignment. A MMKG-grounded dataset is constructed to equip LLMs with initial expertise in multimodal reasoning through pretraining. Remarkably, MR-MKG achieves superior performance while training on only a small fraction of parameters, approximately 2.25% of the LLM's parameter size. Experimental results on multimodal question answering and multimodal analogy reasoning tasks demonstrate that our MR-MKG method outperforms previous state-of-the-art models., Comment: Accepted by ACL 2024 (Main Conference)
- Published
- 2024
47. Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models
- Author
-
Chen, Tianrun, Yu, Chunan, Li, Jing, Zhang, Jianqi, Zhu, Lanyun, Ji, Deyi, Zhang, Yong, Zang, Ying, Li, Zejian, and Sun, Lingyun
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Graphics ,Computer Science - Human-Computer Interaction - Abstract
In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that transcends limitations for previous category-specific 3D semantic segmentation, 3D instance segmentation, and open-vocabulary 3D segmentation. We design a simple baseline method, Reasoning3D, with the capability to understand and execute complex commands for (fine-grained) segmenting specific parts for 3D meshes with contextual awareness and reasoned answers for interactive segmentation. Specifically, Reasoning3D leverages an off-the-shelf pre-trained 2D segmentation network, powered by Large Language Models (LLMs), to interpret user input queries in a zero-shot manner. Previous research have shown that extensive pre-training endows foundation models with prior world knowledge, enabling them to comprehend complex commands, a capability we can harness to "segment anything" in 3D with limited 3D datasets (source efficient). Experimentation reveals that our approach is generalizable and can effectively localize and highlight parts of 3D objects (in 3D mesh) based on implicit textual queries, including these articulated 3d objects and real-world scanned data. Our method can also generate natural language explanations corresponding to these 3D models and the decomposition. Moreover, our training-free approach allows rapid deployment and serves as a viable universal baseline for future research of part-level 3d (semantic) object understanding in various fields including robotics, object manipulation, part assembly, autonomous driving applications, augment reality and virtual reality (AR/VR), and medical applications. The code, the model weight, the deployment guide, and the evaluation protocol are: http://tianrun-chen.github.io/Reason3D/
- Published
- 2024
48. Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
- Author
-
Hu, Zhe, Liang, Tuo, Li, Jing, Lu, Yiren, Zhou, Yunlai, Qiao, Yiran, Ma, Jing, and Yin, Yu
- Subjects
Computer Science - Computation and Language ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Recent advancements in large multimodal language models have demonstrated remarkable proficiency across a wide range of tasks. Yet, these models still struggle with understanding the nuances of human humor through juxtaposition, particularly when it involves nonlinear narratives that underpin many jokes and humor cues. This paper investigates this challenge by focusing on comics with contradictory narratives, where each comic consists of two panels that create a humorous contradiction. We introduce the YesBut benchmark, which comprises tasks of varying difficulty aimed at assessing AI's capabilities in recognizing and interpreting these comics, ranging from literal content comprehension to deep narrative reasoning. Through extensive experimentation and analysis of recent commercial or open-sourced large (vision) language models, we assess their capability to comprehend the complex interplay of the narrative humor inherent in these comics. Our results show that even state-of-the-art models still lag behind human performance on this task. Our findings offer insights into the current limitations and potential improvements for AI in understanding human creative expressions.
- Published
- 2024
49. Emergent Oscillating bound states in a semi-infinite linear waveguide with a point-like $\Lambda$-type quantum emitter driven by a classical field
- Author
-
He, YuPing, Sun, Ge, Li, Jing, Yang, Ya, Lu, Ling, and Zhou, Lan
- Subjects
Quantum Physics - Abstract
An oscillating bound state is a phenomenon where excitations mediated by the continuum modes oscillate persistently. Although it is generated by the superposition of two bound states in the continuum (BICs), such phenomenon is said to be unique to giant atoms. We present the phenomenon of an oscillating bound state with an alternative waveguide QED system, which is a one-dimensional semi-infinite waveguide coupled to a \textit{point-like} quantum emitter. This \textit{point-like} quantum emitter is $\Lambda$-type quantum system with one transition driven by a classical field., Comment: 7 pages 4 figures
- Published
- 2024
50. Shape of a droplet on a surface in the presence of an external field and its critical disruption condition
- Author
-
Li, Jing, Wen, Kaiqiang, Xiao, Ke, Chen, Xiaoming, and Wu, Chen-Xu
- Subjects
Condensed Matter - Soft Condensed Matter ,Physics - Fluid Dynamics - Abstract
Due to the potential application of regulating droplet shape by external fields in microfluidic technology and micro devices, it becomes increasingly important to understand the shape formation of a droplet in the presence of an electric field. How to understand and determine such a deformable boundary shape at equilibrium has been a long-term physical and mathematical challenge. Here, based on the theoretical model we propose, and combining the finite element method and the gradient descent algorithm, we successfully obtain the droplet shape by considering the contributions made by electrostatic energy, surface tension energy, and gravitational potential energy. We also carry out scaling analyses and obtain an empirical critical disruption condition with a universal scaling exponent 1/2 for the contact angle in terms of normalized volume. The master curve fits both the experimental and the numerical results very well.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.