Author: "LI, Yan" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"LI, Yan"' showing total 139,952 results

Start Over Author "LI, Yan"

139,952 results on '"LI, Yan"'

1. A New Data-Mining Method for the Digital Great Wall Exemplified by Statistical Evaluation and Analysis of the Ming Great Wall Archery Windows

Author: Li, Zhe, Zhang, Mengdi, Tuo, Xiaolong, Zhang, Yukun, and Li, Yan
Published: 2024
Full Text: View/download PDF

2. MicroRNA-143 (miR-143) suppresses cell proliferation and invasion by downregulating AKT/STAT3/NF-κB pathway in tongue squamous cell carcinoma (TSCC)

Author: Xu, Ping, Li, Yan, Li, Chenjun, Zheng, Weiyin, Li, Hao, Shen, Lu, and Yang, Shuyong
Published: 2023
Full Text: View/download PDF

3. Establishment and Preliminary Application of Multiplex Fluorescent Quantitative PCR for Simultaneous Detection of BVDV, BRV and BCV

Author: Chang, Liyun, Liu, Zhiyong, Zhao, Yuelan, Li, Yan, and Qin, Jianhua
Published: 2021
Full Text: View/download PDF

4. Research on the changes and predictions of the burden of type 2 diabetes mellitus in Pacific Island countries from 1990 to 2019

Author: Li, Yan, Zhang, Hao, and Jiang, Yi
Published: 2023

5. A Study of the Latin Phonetic Scheme Used in Yingyu Guanhua Hejiang

Author: Li, Yan
Published: 2021
Full Text: View/download PDF

6. HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

Author: Liu, Xinyu, He, Yingqing, Guo, Lanqing, Li, Xiang, Jin, Bu, Li, Peng, Li, Yan, Chan, Chi-Min, Chen, Qifeng, Xue, Wei, Luo, Wenhan, Liu, Qingfeng, and Guo, Yike
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The potential for higher-resolution image generation using pretrained diffusion models is immense, yet these models often struggle with issues of object repetition and structural artifacts especially when scaling to 4K resolution and higher. We figure out that the problem is caused by that, a single prompt for the generation of multiple scales provides insufficient efficacy. In response, we propose HiPrompt, a new tuning-free solution that tackles the above problems by introducing hierarchical prompts. The hierarchical prompts offer both global and local guidance. Specifically, the global guidance comes from the user input that describes the overall content, while the local guidance utilizes patch-wise descriptions from MLLMs to elaborately guide the regional structure and texture generation. Furthermore, during the inverse denoising process, the generated noise is decomposed into low- and high-frequency spatial components. These components are conditioned on multiple prompt levels, including detailed patch-wise descriptions and broader image-level prompts, facilitating prompt-guided denoising under hierarchical semantic guidance. It further allows the generation to focus more on local spatial regions and ensures the generated images maintain coherent local and global semantics, structures, and textures with high definition. Extensive experiments demonstrate that HiPrompt outperforms state-of-the-art works in higher-resolution image generation, significantly reducing object repetition and enhancing structural quality., Comment: https://liuxinyv.github.io/HiPrompt/
Published: 2024

7. Revealing subterahertz atomic vibrations in quantum paraelectrics by surface-sensitive spintronic terahertz spectroscopy

Author: Chu, Zhaodong, Yang, Junyi, Li, Yan, Hwangbo, Kyle, Wen, Jianguo, Bielinski, Ashley R., Zhang, Qi, Martinson, Alex B. F., Hruszkewycz, Stephan, Fong, Dillon D., Xu, Xiaodong, Norman, Michael R., Bhattacharya, Anand, and Wen, Haidan
Subjects: Condensed Matter - Materials Science, Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Understanding surface collective dynamics in quantum materials is crucial for advancing quantum technologies. For example, surface phonon modes in quantum paraelectrics are thought to play an essential role in facilitating interfacial superconductivity. However, detecting these modes, especially below 1 terahertz (THz), is challenging due to limited sampling volumes and the need for high spectroscopic resolution. Here, we report surface soft transverse optical (TO1) phonon dynamics in KTaO3 and SrTiO3 by developing surface-sensitive spintronic THz spectroscopy that can sense the collective modes only a few nanometers deep from the surface. In KTaO3, the TO1 mode softens and sharpens with decreasing temperature, leveling off at 0.7 THz. In contrast, this mode in SrTiO3 broadens significantly below the quantum paraelectric crossover and coincides with the hardening of a sub-meV phonon mode related to the antiferrodistortive transition. These observations that deviate from their bulk properties may have implications for interfacial superconductivity and ferroelectricity. The developed technique opens opportunities for sensing low-energy surface excitations., Comment: The main text consists of 24 pages and includes 4 figures. Supplementary Information is also provided
Published: 2024

8. Numerical Analysis of the Parallel Orbital-Updating Approach for Eigenvalue Problems

Author: Dai, Xiaoying, Li, Yan, Yang, Bin, and Zhou, Aihui
Subjects: Mathematics - Numerical Analysis
Abstract: The parallel orbital-updating approach is an orbital iteration based approach for solving eigenvalue problems when many eigenpairs are required, and has been proven to be very efficient, for instance, in electronic structure calculations. In this paper, based on the investigation of a quasi-orthogonality, we present the numerical analysis of the parallel orbital-updating approach for linear eigenvalue problems, including convergence and error estimates of the numerical approximations.
Published: 2024

9. FBD-SV-2024: Flying Bird Object Detection Dataset in Surveillance Video

Author: Sun, Zi-Wei, Hua, Ze-Xi, Li, Heng-Chao, Qi, Zhi-Peng, Li, Xiang, Li, Yan, and Zhang, Jin-Chi
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: A Flying Bird Dataset for Surveillance Videos (FBD-SV-2024) is introduced and tailored for the development and performance evaluation of flying bird detection algorithms in surveillance videos. This dataset comprises 483 video clips, amounting to 28,694 frames in total. Among them, 23,833 frames contain 28,366 instances of flying birds. The proposed dataset of flying birds in surveillance videos is collected from realistic surveillance scenarios, where the birds exhibit characteristics such as inconspicuous features in single frames (in some instances), generally small sizes, and shape variability during flight. These attributes pose challenges that need to be addressed when developing flying bird detection methods for surveillance videos. Finally, advanced (video) object detection algorithms were selected for experimentation on the proposed dataset, and the results demonstrated that this dataset remains challenging for the algorithms above. The FBD-SV-2024 is now publicly available: Please visit https://github.com/Ziwei89/FBD-SV-2024_github for the dataset download link and related processing scripts.
Published: 2024

10. Broad-line Region of the Quasar PG 2130+099. II. Doubling the Size Over Four Years?

Author: Yao, Zhu-Heng, Yang, Sen, Guo, Wei-Jian, Chen, Yong-Jie, Songsheng, Yu-Yang, Bao, Dong-Wei, Jiang, Bo-Wei, Wang, Yi-Lin, Zhang, Hao, Hu, Chen, Li, Yan-Rong, Du, Pu, Xiao, Ming, Bai, Jin-Ming, Ho, Luis C., Brotherton, Michael S., Aceituno, Jesús, Winkler, Hartmut, and Wang, Jian-Min
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: Over the past three decades, multiple reverberation mapping (RM) campaigns conducted for the quasar PG 2130+099 have exhibited inconsistent findings with time delays ranging from $\sim$10 to $\sim$200 days. To achieve a comprehensive understanding of the geometry and dynamics of the broad-line region (BLR) in PG 2130+099, we continued an ongoing high-cadence RM monitoring campaign using the Calar Alto Observatory 2.2m optical telescope for an extra four years from 2019 to 2022. We measured the time lags of several broad emission lines (including He II, He I, H$\beta$, and Fe II) with respect to the 5100 {\AA} continuum, and their time lags continuously vary through the years. Especially, the H$\beta$ time lags exhibited approximately a factor of two increase in the last two years. Additionally, the velocity-resolved time delays of the broad H$\beta$ emission line reveal a back-and-forth change between signs of virial motion and inflow in the BLR. The combination of negligible ($\sim$10%) continuum change and substantial time-lag variation (over two times) results in significant scatter in the intrinsic $R_{\rm H\beta}-L_{\rm 5100}$ relationship for PG 2130+099. Taking into account the consistent changes in the continuum variability time scale and the size of the BLR, we tentatively propose that the changes in the measurement of the BLR size may be affected by 'geometric dilution'., Comment: 21 pages, 13 figures, 7 tables; accepted for publication in ApJ
Published: 2024

11. Order-preserving pattern mining with forgetting mechanism

Author: Li, Yan, Ma, Chenyu, Gao, Rong, Wu, Youxi, Li, Jinyan, Wang, Wenjian, and Wu, Xindong
Subjects: Computer Science - Databases
Abstract: Order-preserving pattern (OPP) mining is a type of sequential pattern mining method in which a group of ranks of time series is used to represent an OPP. This approach can discover frequent trends in time series. Existing OPP mining algorithms consider data points at different time to be equally important; however, newer data usually have a more significant impact, while older data have a weaker impact. We therefore introduce the forgetting mechanism into OPP mining to reduce the importance of older data. This paper explores the mining of OPPs with forgetting mechanism (OPF) and proposes an algorithm called OPF-Miner that can discover frequent OPFs. OPF-Miner performs two tasks, candidate pattern generation and support calculation. In candidate pattern generation, OPF-Miner employs a maximal support priority strategy and a group pattern fusion strategy to avoid redundant pattern fusions. For support calculation, we propose an algorithm called support calculation with forgetting mechanism, which uses prefix and suffix pattern pruning strategies to avoid redundant support calculations. The experiments are conducted on nine datasets and 12 alternative algorithms. The results verify that OPF-Miner is superior to other competitive algorithms. More importantly, OPF-Miner yields good clustering performance for time series, since the forgetting mechanism is employed.
Published: 2024

12. A Noncontact Technique for Wave Measurement Based on Thermal Stereography and Deep Learning

Author: Li, Deyu, Xiao, Longfei, Wei, Handi, Li, Yan, and Zhang, Binghua
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: The accurate measurement of the wave field and its spatiotemporal evolution is essential in many hydrodynamic experiments and engineering applications. The binocular stereo imaging technique has been widely used to measure waves. However, the optical properties of indoor water surfaces, including transparency, specular reflection, and texture absence, pose challenges for image processing and stereo reconstruction. This study proposed a novel technique that combined thermal stereography and deep learning to achieve fully noncontact wave measurements. The optical imaging properties of water in the long-wave infrared spectrum were found to be suitable for stereo matching, effectively avoiding the issues in the visible-light spectrum. After capturing wave images using thermal stereo cameras, a reconstruction strategy involving deep learning techniques was proposed to improve stereo matching performance. A generative approach was employed to synthesize a dataset with ground-truth disparity from unannotated infrared images. This dataset was then fed to a pretrained stereo neural network for fine-tuning to achieve domain adaptation. Wave flume experiments were conducted to validate the feasibility and accuracy of the proposed technique. The final reconstruction results indicated great agreement and high accuracy with a mean bias of less than 2.1% compared with the measurements obtained using wave probes, suggesting that the novel technique effectively measures the spatiotemporal distribution of wave surface in hydrodynamic experiments.
Published: 2024

13. Enhancing quantum phase synchronization through squeezed-reservoir engineering

Author: Xiao, Xing, Lu, Tian-Xiang, Zhong, Wo-Jun, and Li, Yan-Ling
Subjects: Quantum Physics
Abstract: We investigate the enhancement of quantum phase synchronization in a two-level system (TLS) coupled to a squeezed reservoir. Our study reveals that the squeezed reservoir induces a stable limit cycle in the TLS, enhancing the quantum phase synchronization. We utilize the Husimi $Q$-function to describe the phase portrait of the driven TLS, and the $S$-function to quantitatively illustrate the effects of signal strength and detuning on phase synchronization. Remarkably, we demonstrate that the squeezed reservoir imparts its squeezing characteristics to the TLS, leading to a more localized and pronounced synchronization. Additionally, we observe typical features of the Arnold tongue in the synchronization regions. The experimental feasibility of our findings is discussed in the context of a circuit QED system, suggesting that squeezed-reservoir engineering is an effective approach for achieving quantum phase synchronization., Comment: 6 pages,4 figures, comments are welcome!
Published: 2024

14. MIDAS: Multi-level Intent, Domain, And Slot Knowledge Distillation for Multi-turn NLU

Author: Li, Yan, Kim, So-Eon, Park, Seong-Bae, and Han, Soyeon Caren
Subjects: Computer Science - Computation and Language
Abstract: Although Large Language Models(LLMs) can generate coherent and contextually relevant text, they often struggle to recognise the intent behind the human user's query. Natural Language Understanding (NLU) models, however, interpret the purpose and key information of user's input to enable responsive interactions. Existing NLU models generally map individual utterances to a dual-level semantic frame, involving sentence-level intent and word-level slot labels. However, real-life conversations primarily consist of multi-turn conversations, involving the interpretation of complex and extended dialogues. Researchers encounter challenges addressing all facets of multi-turn dialogue conversations using a unified single NLU model. This paper introduces a novel approach, MIDAS, leveraging a multi-level intent, domain, and slot knowledge distillation for multi-turn NLU. To achieve this, we construct distinct teachers for varying levels of conversation knowledge, namely, sentence-level intent detection, word-level slot filling, and conversation-level domain classification. These teachers are then fine-tuned to acquire specific knowledge of their designated levels. A multi-teacher loss is proposed to facilitate the combination of these multi-level teachers, guiding a student model in multi-turn dialogue tasks. The experimental results demonstrate the efficacy of our model in improving the overall multi-turn conversation understanding, showcasing the potential for advancements in NLU models through the incorporation of multi-level dialogue knowledge distillation techniques.
Published: 2024

15. Production characteristics of light nuclei, hypertritons and $\Omega$-hypernuclei in Pb+Pb collisions at $\sqrt{s_{NN}}=5.02$ TeV

Author: Wang, Rui-Qin, Hou, Xin-Lei, Li, Yan-Hao, Song, Jun, and Shao, Feng-Lan
Subjects: High Energy Physics - Phenomenology, Nuclear Theory
Abstract: We extend an analytical nucleon coalescence model with hyperons to study productions of light nuclei, hypertritons and $\Omega$-hypernuclei in Pb+Pb collisions at $\sqrt{s_{NN}}=5.02$ TeV. We derive the formula of the momentum distribution of two bodies coalescing into dibaryon states and that of three bodies coalescing into tribaryon states. We explain the available data of the coalescence factors $B_2$ and $B_3$, the transverse momentum spectra, the averaged transverse momenta, the yield rapidity densities, yield ratios of the deuteron, antihelium-3, antitriton, hypertriton measured by the ALICE collaboration, and give predictions of different $\Omega$-hypernuclei, e.g., $H(p\Omega^-)$, $H(n\Omega^-)$ and $H(pn\Omega^-)$. We find two groups of interesting observables, the averaged transverse momentum ratios of light (hyper-)nuclei to protons (hyperons) and the centrality-dependent yield ratios of theirs. The former group exhibits a reverse-hierarchy of the nucleus size, and the latter is helpful for the judgements of the nucleus production mechanism as well as the nucleus own size., Comment: 16 pages, 13 figures. arXiv admin note: text overlap with arXiv:2309.16296
Published: 2024

16. Lithography-free patterning of chalcogenide materials for integrated photonic devices

Author: Hu, Zhen, Li, Yuru, Li, Yan, Yao, Shunyu, Chen, Hongfei, Zhang, Tao, Ao, Zhaohuan, and Li, Zhaohui
Subjects: Physics - Optics, Physics - Applied Physics
Abstract: Chalcogenide material-based integrated photonic devices have garnered widespread attention due to their unique wideband transparency. Despite their recognized CMOS compatibility, the fabrication of these devices relies predominantly on lithography techniques. However, chalcogenide thin films are highly susceptible to oxidation, necessitating customized process flows and complex protective measures during lithography. These requirements are hardly compatible with current commercial CMOS manufacturing platforms designed for silicon photonics, significantly limiting the practical applications of chalcogenide photonic devices. In this work, we ingeniously exploit the ease of oxidation of chalcogenide materials, presenting a novel laser-induced localized oxidation technique for spatial patterning on chalcogenide thin films, enabling concise lithography-free fabrication of chalcogenide integrated photonic devices. Using Sb2S3 as an example, we experimentally demonstrate localized multi-level oxidation with a sizable overall refractive index contrast of 0.7 at near-infrared, featuring a high spatial resolution of 0.6 um. Based on this technique, multiple integrated photonic devices are demonstrated, showing versatile functionalities, including color printing at visible and metasurface-based spatial light modulation at near-infrared regions. Leveraging the inherent phase-change property of Sb2S3, an active Fresnel zone plate, enabling switchable beam focusing, is further demonstrated, indicating the feasibility of concise fabrication of active photonic devices. Our work offers a brand-new modulation dimension for chalcogenide materials and provides a significantly simplified approach for realizing chalcogenide-integrated photonic devices.
Published: 2024

17. Investigation of pion-nucleon contributions to nucleon matrix elements

Author: Alexandrou, Constantia, Koutsou, Giannis, Li, Yan, Petschlies, Marcus, and Pittler, Ferenc
Subjects: High Energy Physics - Lattice, High Energy Physics - Phenomenology, Nuclear Theory
Abstract: We investigate contributions of excited states to nucleon matrix elements computed in lattice QCD by employing, in addition to the standard nucleon interpolating operator, pion-nucleon ($\pi$-$N$) operators. We solve a generalized eigenvalue problem (GEVP) to obtain an optimal interpolating operator that minimizes overlap with the $\pi$-$N$ states. We derive a variant of the standard application of the GEVP method, which allows for constructing 3-point correlation functions using the optimized interpolating operator without requiring the computationally demanding combination that includes $\pi$-$N$ operators in both sink and source. We extract nucleon matrix elements using two twisted mass fermion ensembles, one ensemble generated using pion mass of 346 MeV and one ensemble tuned to reproduce the physical value of the pion mass. Especially, we determine the isoscalar and isovector scalar, pseudoscalar, vector, axial, and tensor matrix elements. We include results obtained using a range of kinematic setups, including momentum in the sink. Our results using this variational approach are compared with previous results obtained using the same ensembles and multi-state fits without GEVP improvement. We find that for the physical mass point ensemble, the improvement, in terms of suppression of excited states using this method, is most significant for the case of the matrix elements of the isovector axial and pseudoscalar currents., Comment: 36 pages, 39 figures
Published: 2024

18. Formation of WNL stars for the MW and LMC based on the k-omega model

Author: Si, Jijuan, Li, Zhi, and Li, Yan
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: We adopt a set of second-order differential equations ($k-\omega$ model) to handle core convective overshooting in massive stars, simulate the evolution of WNL stars with different metallicities and initial masses, both rotating and non-rotating models, and compare the results with the classical overshooting model. The results indicate that under the same initial conditions, the $k-\omega$ model generally produces larger convective cores and wider overshooting regions, thereby increasing the mass ranges and extending the lifetimes of WNL stars, as well as the likelihood of forming WNL stars. The masses and lifetimes of WNL stars both increase with higher metallicities and initial masses. Under higher-metallicity conditions, the two overshooting schemes significantly differ in their impacts on lifetimes of the WNL stars, but insignificant in the mass ranges of the WNL stars. Rotation may drive the formation of WNL stars in low-mass, metal-poor counterparts, with this effect being more pronounced in the OV model. The surface nitrogen of metal-rich WNL stars formed during the MS phase is likely primarily from the CN-cycle, while it may come from both the CN- and NO-cycles for relatively metal-poor counterparts. Our model can effectively explain the distribution of WNL stars in the Milky Way, but appears to have inadequacies in explaining the WNL stars in the LMC., Comment: Accepted by the ApJ
Published: 2024

19. ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval

Author: Zhao, Ruixiang, Jia, Jian, Li, Yan, Bai, Xuehan, Chen, Quan, Li, Han, Jiang, Peng, and Li, Xirong
Subjects: Computer Science - Multimedia, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: E-commerce is increasingly multimedia-enriched, with products exhibited in a broad-domain manner as images, short videos, or live stream promotions. A unified and vectorized cross-domain production representation is essential. Due to large intra-product variance and high inter-product similarity in the broad-domain scenario, a visual-only representation is inadequate. While Automatic Speech Recognition (ASR) text derived from the short or live-stream videos is readily accessible, how to de-noise the excessively noisy text for multimodal representation learning is mostly untouched. We propose ASR-enhanced Multimodal Product Representation Learning (AMPere). In order to extract product-specific information from the raw ASR text, AMPere uses an easy-to-implement LLM-based ASR text summarizer. The LLM-summarized text, together with visual data, is then fed into a multi-branch network to generate compact multimodal embeddings. Extensive experiments on a large-scale tri-domain dataset verify the effectiveness of AMPere in obtaining a unified multimodal product representation that clearly improves cross-domain product retrieval., Comment: 10 pages, 5 figures
Published: 2024

20. A reference frame-based microgrid primary control for ensuring global convergence to a periodic orbit

Author: Jiang, Xinyuan, Lagoa, Constantino M., Huang, Daning, and Li, Yan
Subjects: Electrical Engineering and Systems Science - Systems and Control
Abstract: Electric power systems with growing penetration of renewable generation face problems of frequency oscillation and increased uncertainty as the operating point may veer close to instability. Traditionally the stability of these systems is studied either in terms of local stability or as an angle synchronization problem under the simplifying assumption that decouples the amplitude along with all dissipations. Without the simplifying assumption, however, the steady state being studied is basically a limit cycle with the convergence of its orbit in question. In this paper we present an analysis of the orbital stability of a microgrid integrating the proposed type of distributed generation controller, whose internal reference voltage arises from the rotation of the reference frame much like a rotating machine. We utilize the shifted passivity framework to prove that, with sufficient dissipation, such system is globally convergent to a nontrivial orbit. This is the first global stability result for the limit cycle of such system in the full state space, which provides new insight into the synchronization mechanism as well as how dissipation plays a role in the orbital stability. The proposed controller is verified with a test microgrid, demonstrating its stability and transient smoothness compared to the standard droop control.
Published: 2024

21. A Brief Discussion on the Philosophical Principles and Development Directions of Data Circulation

Author: Li, Zhi, Zhang, Lei, Xin, Junyi, He, Jianfei, Li, Yan, Ma, Zhenjun, and Sun, Qi
Subjects: Computer Science - Other Computer Science
Abstract: The data circulation is a complex scenario involving a large number of participants and different types of requirements, which not only has to comply with the laws and regulations, but also faces multiple challenges in technical and business areas. In order to systematically and comprehensively address these issues, it is essential to have a comprehensive and profound understanding of 'data circulation'. The traditional analysis method tends to proceed based on the traditional circulation model of commodities, that is, tangible objects, which has some defects and shortcomings, and tends to be a formalized approach, which is faced numerous challenges in practice. This paper analyzes the circulation of data with a philosophical approach, obtains the new explication of data and executing entity, and provides a new definition of the concepts of data utilization and data key stakeholders (objects). At the same time, it puts forward the idea of ``data alienation'', and constructs a new interpretive framework of ``data circulation''. Based on the framework of this interpretation, it is clearly proposed that ``data alienation'' is the core of ``data circulation'', benefit distribution is the driving force, and legal compliance is the foundation, and further discussed the three modes of ``data circulation''. It further discusses the three modes of ``data circulation''. It is pointed out that ``data circulation'' is different from traditional ``commodity circulation''. To achieve ``data circulation'',a comprehensive information infrastructure needs to be established. from a theoretical point of view, it lays a solid foundation for the development of ``data circulation''.
Published: 2024

22. Spatiotemporal Graph Guided Multi-modal Network for Livestreaming Product Retrieval

Author: Hu, Xiaowan, Chen, Yiyi, Li, Yan, Wang, Minquan, Wang, Haoqian, Chen, Quan, Li, Han, and Jiang, Peng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: With the rapid expansion of e-commerce, more consumers have become accustomed to making purchases via livestreaming. Accurately identifying the products being sold by salespeople, i.e., livestreaming product retrieval (LPR), poses a fundamental and daunting challenge. The LPR task encompasses three primary dilemmas in real-world scenarios: 1) the recognition of intended products from distractor products present in the background; 2) the video-image heterogeneity that the appearance of products showcased in live streams often deviates substantially from standardized product images in stores; 3) there are numerous confusing products with subtle visual nuances in the shop. To tackle these challenges, we propose the Spatiotemporal Graphing Multi-modal Network (SGMN). First, we employ a text-guided attention mechanism that leverages the spoken content of salespeople to guide the model to focus toward intended products, emphasizing their salience over cluttered background products. Second, a long-range spatiotemporal graph network is further designed to achieve both instance-level interaction and frame-level matching, solving the misalignment caused by video-image heterogeneity. Third, we propose a multi-modal hard example mining, assisting the model in distinguishing highly similar products with fine-grained features across the video-image-text domain. Through extensive quantitative and qualitative experiments, we demonstrate the superior performance of our proposed SGMN model, surpassing the state-of-the-art methods by a substantial margin. The code is available at https://github.com/Huxiaowan/SGMN., Comment: 16 pages, 12 figures
Published: 2024

23. Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base

Author: An, Zhiyu, Ding, Xianzhong, Fu, Yen-Chun, Chu, Cheng-Chung, Li, Yan, and Du, Wan
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Digital Libraries
Abstract: This paper introduces Golden-Retriever, designed to efficiently navigate vast industrial knowledge bases, overcoming challenges in traditional LLM fine-tuning and RAG frameworks with domain-specific jargon and context interpretation. Golden-Retriever incorporates a reflection-based question augmentation step before document retrieval, which involves identifying jargon, clarifying its meaning based on context, and augmenting the question accordingly. Specifically, our method extracts and lists all jargon and abbreviations in the input question, determines the context against a pre-defined list, and queries a jargon dictionary for extended definitions and descriptions. This comprehensive augmentation ensures the RAG framework retrieves the most relevant documents by providing clear context and resolving ambiguities, significantly improving retrieval accuracy. Evaluations using three open-source LLMs on a domain-specific question-answer dataset demonstrate Golden-Retriever's superior performance, providing a robust solution for efficiently integrating and querying industrial knowledge bases.
Published: 2024

24. Uniform K-stability of $G$-varieties of complexity 1

Author: Li, Yan and Li, Zhenye
Subjects: Mathematics - Algebraic Geometry, Mathematics - Differential Geometry
Abstract: Let ${\rm k}$ be an algebraically closed field of characteristic 0 and $G$ a connect, reductive group over it. Let $X$ be a projective $G$-variety of complexity 1. We classify $G$-equivariant normal test configurations of $X$ with integral central fibre via the combinatorial data. We also give a formula of anti-canonical divisors on $X$. Based on this formula, when $X$ is $\mathbb Q$-Fano, we give an expression of the Futaki invariant, and derive a criterion of uniform K-stability in terms of the combinatorial data.
Published: 2024

25. A cryogenic on-chip microwave pulse generator for large-scale superconducting quantum computing

Author: Bao, Zenghui, Li, Yan, Wang, Zhiling, Wang, Jiahui, Yang, Jize, Xiong, Haonan, Song, Yipu, Wu, Yukai, Zhang, Hongyi, and Duan, Luming
Subjects: Quantum Physics
Abstract: For superconducting quantum processors, microwave signals are delivered to each qubit from room-temperature electronics to the cryogenic environment through coaxial cables. Limited by the heat load of cabling and the massive cost of electronics, such an architecture is not viable for millions of qubits required for fault-tolerant quantum computing. Monolithic integration of the control electronics and the qubits provides a promising solution, which, however, requires a coherent cryogenic microwave pulse generator that is compatible with superconducting quantum circuits. Here, we report such a signal source driven by digital-like signals, generating pulsed microwave emission with well-controlled phase, intensity, and frequency directly at millikelvin temperatures. We showcase high-fidelity readout of superconducting qubits with the microwave pulse generator. The device demonstrated here has a small footprint, negligible heat load, great flexibility to operate, and is fully compatible with today's superconducting quantum circuits, thus providing an enabling technology for large-scale superconducting quantum computers., Comment: 12 pages, 4 figures
Published: 2024
Full Text: View/download PDF

26. Spectroastrometry and Reverberation Mapping (SARM) of Active Galactic Nuclei. I. The H$\beta$ Broad-line Region Structure and Black Hole Mass of Five Quasars

Author: Li, Yan-Rong, Hu, Chen, Yao, Zhu-Heng, Chen, Yong-Jie, Bai, Hua-Rui, Yang, Sen, Du, Pu, Fang, Feng-Na, Fu, Yi-Xin, Liu, Jun-Rong, Peng, Yue-Chang, Songsheng, Yu-Yang, Wang, Yi-Lin, Xiao, Ming, Zhai, Shuo, Winkler, Hartmut, Bai, Jin-Ming, Ho, Luis C., Petrov, Romain G., Aceituno, Jesus, and Wang, Jian-Min
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We conduct a reverberation mapping (RM) campaign to spectroscopically monitor a sample of selected bright active galactic nuclei with large anticipated broad-line region (BLR) sizes adequate for spectroastrometric observations by the GRAVITY instrument on the Very Large Telescope Interferometer. We report the first results for five objects, IC 4329A, Mrk 335, Mrk 509, Mrk 1239, and PDS 456, among which Mrk 1239 and PDS 456 are for the first time spectroscopically monitored. We obtain multi-year monitoring data and perform multi-component spectral decomposition to extract the broad H$\beta$ profiles. We detect significant time lags between the H$\beta$ and continuum variations, generally obeying the previously established BLR size-luminosity relation. Velocity-resolved H$\beta$ time lags illustrate diverse, possibly evolving BLR kinematics. We further measure the H$\beta$ line widths from mean and rms spectra and the resulting virial products show good consistency among different seasons. Adopting a unity virial factor and the full width at half maximum of the broad H$\beta$ line from the mean spectrum as the measure of velocity, the obtained black hole mass averaged over seasons is $\log M_\bullet/M_\odot=8.02_{-0.14}^{+0.09}$, $6.92_{-0.12}^{+0.12}$, $8.01_{-0.25}^{+0.16}$, $7.44_{-0.14}^{+0.13}$, and $8.59_{-0.11}^{+0.07}$ for the five objects, respectively. The black hole mass estimations using other line width measures are also reported (up to the virial factors). For objects with previous RM campaigns, our mass estimates are in agreement with earlier results. In a companion paper, we will employ BLR dynamical modeling to directly infer the black hole mass and thereby determine the virial factors., Comment: 32 pages, 6 tables, 20 figures. To appear in ApJ
Published: 2024

27. Efficient IoT Devices Localization Through Wi-Fi CSI Feature Fusion and Anomaly Detection

Author: Li, Yan, Yang, Jie, Shih, Shang-Ling, Shih, Wan-Ting, Wen, Chao-Kai, and Jin, Shi
Subjects: Computer Science - Information Theory
Abstract: Internet of Things (IoT) device localization is fundamental to smart home functionalities, including indoor navigation and tracking of individuals. Traditional localization relies on relative methods utilizing the positions of anchors within a home environment, yet struggles with precision due to inherent inaccuracies in these anchor positions. In response, we introduce a cutting-edge smartphone-based localization system for IoT devices, leveraging the precise positioning capabilities of smartphones equipped with motion sensors. Our system employs artificial intelligence (AI) to merge channel state information from proximal trajectory points of a single smartphone, significantly enhancing line of sight (LoS) angle of arrival (AoA) estimation accuracy, particularly under severe multipath conditions. Additionally, we have developed an AI-based anomaly detection algorithm to further increase the reliability of LoSAoA estimation. This algorithm improves measurement reliability by analyzing the correlation between the accuracy of reversed feature reconstruction and the LoS-AoA estimation. Utilizing a straightforward least squares algorithm in conjunction with accurate LoS-AoA estimation and smartphone positional data, our system efficiently identifies IoT device locations. Validated through extensive simulations and experimental tests with a receiving antenna array comprising just two patch antenna elements in the horizontal direction, our methodology has been shown to attain decimeter-level localization accuracy in nearly 90% of cases, demonstrating robust performance even in challenging real-world scenarios. Additionally, our proposed anomaly detection algorithm trained on Wi-Fi data can be directly applied to ultra-wideband, also outperforming the most advanced techniques., Comment: Accepted in IEEE Internet of Things Journal, Early Access, 2024
Published: 2024
Full Text: View/download PDF

28. Shyness and Socioemotional Functioning in Young Chinese Children: The Moderating Role of Insecure Attachment

Author: Zhu, Jingjing, Xiao, Bowen, Li, Yan, Hipson, Will E., and Coplan, Robert J.
Published: 2020

29. Monetary Policy and Housing Prices: Expansion of the Fictitious Economy in China

Author: Lin, Xiaoyan, Wang, Peng, and Li, Yan
Published: 2020
Full Text: View/download PDF

30. Cholesterol-dependent LXR transcription factor activity represses pronociceptive effects of estrogen in sensory neurons and pain induced by myelin basic protein fragments.

Author: Hullugundi, Swathi, Dolkas, Jennifer, Chernov, Andrei, Yaksh, Tony, Eddinger, Kelly, Angert, Mila, Catroli, Glaucilene, Strongin, Alex, Dougherty, Patrick, Li, Yan, Quehenberger, Oswal, Armando, Aaron, and Shubayev, Veronica
Subjects: Cholesterol, DRG culture, Estrogen, Interleukin 6, LXR, Liver x receptor, Myelin basic protein, Neuropathic pain, Oxysterol, Sensory neuron
Abstract: BACKGROUND: A bioactive myelin basic protein (MBP) fragment, comprising MBP84-104, is released in sciatic nerve after chronic constriction injury (CCI). Intraneural injection (IN) of MBP84-104 in an intact sciatic nerve is sufficient to induce persistent neuropathic pain-like behavior via robust transcriptional remodeling at the injection site and ipsilateral dorsal root ganglia (DRG) and spinal cord. The sex (female)-specific pronociceptive activity of MBP84-104 associates with sex-specific changes in cholesterol metabolism and activation of estrogen receptor (ESR)1 signaling. METHODS: In male and female normal and post-CCI rat sciatic nerves, we assessed: (i) cholesterol precursor and metabolite levels by lipidomics; (ii) MBP84-104 interactors by mass spectrometry of MBP84-104 pull-down; and (iii) liver X receptor (LXR)α protein expression by immunoblotting. To test the effect of LXRα stimulation on IN MBP84-104-induced mechanical hypersensitivity, the LXRα expression was confirmed along the segmental neuraxis, in DRG and spinal cord, followed by von Frey testing of the effect of intrathecally administered synthetic LXR agonist, GW3965. In cultured male and female rat DRGs exposed to MBP84-104 and/or estrogen treatments, transcriptional effect of LXR stimulation by GW3965 was assessed on downstream cholesterol transporter Abc, interleukin (IL)-6, and pronociceptive Cacna2d1 gene expression. RESULTS: CCI regulated LXRα ligand and receptor levels in nerves of both sexes, with cholesterol precursors, desmosterol and 7-DHC, and oxysterol elevated in females relative to males. MBP84-104 interacted with nuclear receptor coactivator (Ncoa)1, known to activate LXRα, injury-specific in nerves of both sexes. LXR stimulation suppressed ESR1-induced IL-6 and Cacna2d1 expression in cultured DRGs of both sexes and attenuated MBP84-104-induced pain in females. CONCLUSION: The injury-released bioactive MBP fragments induce pronociceptive changes by selective inactivation of nuclear transcription factors, including LXRα. By Ncoa1 sequestration, bioactive MBP fragments render LXRα function to counteract pronociceptive activity of estrogen/ESR1 in sensory neurons. This effect of MBP fragments is prevalent in females due to high circulating estrogen levels in females relative to males. Restoring LXR activity presents a promising therapeutic strategy in management of neuropathic pain induced by bioactive MBP.
Published: 2024

31. CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

Author: Wang, Zi, Wang, Fanwen, Qin, Chen, Lyu, Jun, Cheng, Ouyang, Wang, Shuo, Li, Yan, Yu, Mengyao, Zhang, Haoyu, Guo, Kunyuan, Shi, Zhang, Li, Qirong, Xu, Ziqiang, Zhang, Yajing, Li, Hao, Hua, Sha, Chen, Binghua, Sun, Longyu, Sun, Mengting, Li, Qin, Chu, Ying-Hua, Bai, Wenjia, Qin, Jing, Zhuang, Xiahai, Prieto, Claudia, Young, Alistair, Markl, Michael, Wang, He, Wu, Lianming, Yang, Guang, Qu, Xiaobo, and Wang, Chengyan
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Databases
Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover high-quality, clinically interpretable images from undersampled measurements. However, the lack of publicly available cardiac MRI k-space dataset in terms of both quantity and diversity has severely hindered substantial technological progress, particularly for data-driven artificial intelligence. Here, we provide a standardized, diverse, and high-quality CMRxRecon2024 dataset to facilitate the technical development, fair evaluation, and clinical transfer of cardiac MRI reconstruction approaches, towards promoting the universal frameworks that enable fast and robust reconstructions across different cardiac MRI protocols in clinical practice. To the best of our knowledge, the CMRxRecon2024 dataset is the largest and most diverse publicly available cardiac k-space dataset. It is acquired from 330 healthy volunteers, covering commonly used modalities, anatomical views, and acquisition trajectories in clinical cardiac MRI workflows. Besides, an open platform with tutorials, benchmarks, and data processing tools is provided to facilitate data usage, advanced method development, and fair performance evaluation., Comment: 19 pages, 3 figures, 2 tables
Published: 2024

32. Protecting three-dimensional entanglement from correlated amplitude damping channel

Author: Xiao, Xing, Huang, Wen-Rui, Lu, Tian-Xiang, and Li, Yan-Ling
Subjects: Quantum Physics
Abstract: Quantum entanglement is a crucial resource in quantum information processing, and protecting it against noise poses a significant challenge. This paper introduces two strategies for preserving qutrit-qutrit entanglement in the presence of correlated amplitude damping (CAD) noise: weak measurement (WM) and environment-assisted measurement (EAM), both combined with quantum measurement reversal (QMR). Two prototypical classes of three-dimensional entangled states are examined. The findings demonstrate that while the WM+QMR method can partially retain entanglement, the EAM+QMR approach is more effective at protecting entanglement as well as enhancing success probabilities, particularly for specific qutrit-qutrit entangled states. Additionally, we thoroughly discuss the impact of correlation effects on entanglement protection and the enhancement of success probability. Our results provide valuable insights into defending high-dimensional entanglement from CAD noise, thus offering practical solutions for the advancement of quantum information technologies., Comment: 14 pages,, 6 figures, comments are welcome!
Published: 2024

33. A Mathematical Aspect of Bloch's Theorem

Author: Li, Yan, Yang, Bin, and Zhou, Aihui
Subjects: Mathematical Physics
Abstract: In this paper, by studying a class of 1-D Sturm-Liouville problems with periodic coefficients, we show and classify the solutions of periodic Schrodinger equations in a multidimensional case, which tells that not all the solutions are Bloch solutions. In addition, we also provide several properties of the solutions and quasimomenta and illustrate the relationship between bounded solutions and Bloch solutions.
Published: 2024

34. An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation

Author: Li, Qin, Zhang, Yizhe, Li, Yan, Lyu, Jun, Liu, Meng, Sun, Longyu, Sun, Mengting, Li, Qirong, Mao, Wenyue, Wu, Xinran, Zhang, Yajing, Chu, Yinghua, Wang, Shuo, and Wang, Chengyan
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: The segmentation foundation model, e.g., Segment Anything Model (SAM), has attracted increasing interest in the medical image community. Early pioneering studies primarily concentrated on assessing and improving SAM's performance from the perspectives of overall accuracy and efficiency, yet little attention was given to the fairness considerations. This oversight raises questions about the potential for performance biases that could mirror those found in task-specific deep learning models like nnU-Net. In this paper, we explored the fairness dilemma concerning large segmentation foundation models. We prospectively curate a benchmark dataset of 3D MRI and CT scans of the organs including liver, kidney, spleen, lung and aorta from a total of 1056 healthy subjects with expert segmentations. Crucially, we document demographic details such as gender, age, and body mass index (BMI) for each subject to facilitate a nuanced fairness analysis. We test state-of-the-art foundation models for medical image segmentation, including the original SAM, medical SAM and SAT models, to evaluate segmentation efficacy across different demographic groups and identify disparities. Our comprehensive analysis, which accounts for various confounding factors, reveals significant fairness concerns within these foundational models. Moreover, our findings highlight not only disparities in overall segmentation metrics, such as the Dice Similarity Coefficient but also significant variations in the spatial distribution of segmentation errors, offering empirical evidence of the nuanced challenges in ensuring fairness in medical image segmentation., Comment: Accepted to MICCAI-2024
Published: 2024

35. A note on entanglement entropy and topological defects in symmetric orbifold CFTs

Author: Gutperle, Michael, Li, Yan-Yan, Rathore, Dikshant, and Roumpedakis, Konstantinos
Subjects: High Energy Physics - Theory
Abstract: In this brief note we calculate the entanglement entropy in $M^{\otimes N}/S_N$ symmetric orbifold CFTs in the presence of topological defects, which were recently constructed in \cite{Gutperle:2024vyp,Knighton:2024noc}. We consider both universal defects which realize $Rep(S_N)$ non-invertible symmetry and non-universal defects. We calculate the sub-leading defect entropy/g-factor for defects at the boundary entangling surface as well as inside it., Comment: 16 pages, 5 figures
Published: 2024

36. Center-Sensitive Kernel Optimization for Efficient On-Device Incremental Learning

Author: Zhang, Dingwen, Li, Yan, Cheng, De, Wang, Nannan, and Han, Junwei
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: To facilitate the evolution of edge intelligence in ever-changing environments, we study on-device incremental learning constrained in limited computation resource in this paper. Current on-device training methods just focus on efficient training without considering the catastrophic forgetting, preventing the model getting stronger when continually exploring the world. To solve this problem, a direct solution is to involve the existing incremental learning mechanisms into the on-device training framework. Unfortunately, such a manner cannot work well as those mechanisms usually introduce large additional computational cost to the network optimization process, which would inevitably exceed the memory capacity of the edge devices. To address this issue, this paper makes an early effort to propose a simple but effective edge-friendly incremental learning framework. Based on an empirical study on the knowledge intensity of the kernel elements of the neural network, we find that the center kernel is the key for maximizing the knowledge intensity for learning new data, while freezing the other kernel elements would get a good balance on the model's capacity for overcoming catastrophic forgetting. Upon this finding, we further design a center-sensitive kernel optimization framework to largely alleviate the cost of the gradient computation and back-propagation. Besides, a dynamic channel element selection strategy is also proposed to facilitate a sparse orthogonal gradient projection for further reducing the optimization complexity, upon the knowledge explored from the new task data. Extensive experiments validate our method is efficient and effective, e.g., our method achieves average accuracy boost of 38.08% with even less memory and approximate computation compared to existing on-device training methods, indicating its significant potential for on-device incremental learning.
Published: 2024

37. Distributionally robust stochastic optimal control

Author: Shapiro, Alexander and Li, Yan
Subjects: Mathematics - Optimization and Control
Abstract: The main goal of this paper is to discuss the construction of distributionally robust counterparts of stochastic optimal control problems. Randomized and non-randomized policies are considered. In particular, necessary and sufficient conditions for the existence of non-randomized policies are given.
Published: 2024

38. Characterization of Recirculating Waveguide Meshes Based on an Optimization Method with a Parameter Space Reduction Technology

Author: Tao, Ran, Qiu, Jifang, Chen, Yuchen, Zhang, Bowen, Li, Yan, Guo, Hongxiang, and Wu, Jian
Subjects: Physics - Optics
Abstract: Fabrication imperfections must be considered during configuration to ensure that the setup is suitable for the actual fabricated programmable photonic integrated circuits (PPICs). Therefore, characterization of imperfections is crucial but difficult, especially for PPICs made from recirculating waveguide meshes. The flexibility required by these meshes demands a more complex topology and compact TBU structure, complicating the characterization. In this paper, we propose a characterization method applicable to recirculating waveguide meshes based on an optimization approach, along with a step-by-step procedure to reduce the parameter space of optimization, allowing for characterizing imperfect parameters of each individual component within the waveguide mesh. To the best of our knowledge, this method can greatly broaden the range of characterized parameters compared to currently reported methods. In order to verify the effectiveness of our method, we used the characterized parameters to build a multi-frequency model of a mesh with fabrication errors and successfully demonstrated accurate prediction of its behavior. Furthermore, we applied our method on implementations of 6 different kind of FIR/IRR filters, to further prove the effectiveness of our method in configuring applications on meshes with fabrication errors. At last, our method was carried out under various scenarios considering beam splitter splitting ratio variance, inaccurate measurements of mesh and imprecise TBU insertion loss characterization, to demonstrate its strong robustness under various practical scenarios.
Published: 2024

39. Multi-scale Quaternion CNN and BiGRU with Cross Self-attention Feature Fusion for Fault Diagnosis of Bearing

Author: Liu, Huanbai, Zhang, Fanlong, Tan, Yin, Huang, Lian, Li, Yan, Huang, Guoheng, Luo, Shenghong, and Zeng, An
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: In recent years, deep learning has led to significant advances in bearing fault diagnosis (FD). Most techniques aim to achieve greater accuracy. However, they are sensitive to noise and lack robustness, resulting in insufficient domain adaptation and anti-noise ability. The comparison of studies reveals that giving equal attention to all features does not differentiate their significance. In this work, we propose a novel FD model by integrating multi-scale quaternion convolutional neural network (MQCNN), bidirectional gated recurrent unit (BiGRU), and cross self-attention feature fusion (CSAFF). We have developed innovative designs in two modules, namely MQCNN and CSAFF. Firstly, MQCNN applies quaternion convolution to multi-scale architecture for the first time, aiming to extract the rich hidden features of the original signal from multiple scales. Then, the extracted multi-scale information is input into CSAFF for feature fusion, where CSAFF innovatively incorporates cross self-attention mechanism to enhance discriminative interaction representation within features. Finally, BiGRU captures temporal dependencies while a softmax layer is employed for fault classification, achieving accurate FD. To assess the efficacy of our approach, we experiment on three public datasets (CWRU, MFPT, and Ottawa) and compare it with other excellent methods. The results confirm its state-of-the-art, which the average accuracies can achieve up to 99.99%, 100%, and 99.21% on CWRU, MFPT, and Ottawa datasets. Moreover, we perform practical tests and ablation experiments to validate the efficacy and robustness of the proposed approach. Code is available at https://github.com/mubai011/MQCCAF.
Published: 2024
Full Text: View/download PDF

40. Non-invertible symmetries in $S_N$ orbifold CFTs and holography

Author: Gutperle, Michael, Li, Yan-Yan, Rathore, Dikshant, and Roumpedakis, Konstantinos
Subjects: High Energy Physics - Theory
Abstract: We study non-invertible defects in two-dimensional $S_N$ orbifold CFTs. We construct universal defects which do not depend on the details of the seed CFT and hence exist in any orbifold CFT. Additionally, we investigate non-universal defects arising from the topological defects of the seed CFT. We argue that there exist universal defects that are non-trivial in the large-$N$ limit, making them relevant for the AdS$_3$/CFT$_2$ correspondence. We then focus on AdS$_3\times$S$^3\times \mathcal M_4$ with one unit of NS-NS flux and propose an explicit realization of these defects on the worldsheet.
Published: 2024

41. Learning Multi-dimensional Human Preference for Text-to-Image Generation

Author: Zhang, Sixian, Wang, Bohan, Wu, Junqiang, Li, Yan, Gao, Tingting, Zhang, Di, and Wang, Zhongyuan
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Current metrics for text-to-image models typically rely on statistical metrics which inadequately represent the real preference of humans. Although recent work attempts to learn these preferences via human annotated images, they reduce the rich tapestry of human preference to a single overall score. However, the preference results vary when humans evaluate images with different aspects. Therefore, to learn the multi-dimensional human preferences, we propose the Multi-dimensional Preference Score (MPS), the first multi-dimensional preference scoring model for the evaluation of text-to-image models. The MPS introduces the preference condition module upon CLIP model to learn these diverse preferences. It is trained based on our Multi-dimensional Human Preference (MHP) Dataset, which comprises 918,315 human preference choices across four dimensions (i.e., aesthetics, semantic alignment, detail quality and overall assessment) on 607,541 images. The images are generated by a wide range of latest text-to-image models. The MPS outperforms existing scoring methods across 3 datasets in 4 dimensions, enabling it a promising metric for evaluating and improving text-to-image generation.
Published: 2024

42. Learning Coarse-Grained Dynamics on Graph

Author: Yu, Yin, Harlim, John, Huang, Daning, and Li, Yan
Subjects: Mathematics - Numerical Analysis, Condensed Matter - Disordered Systems and Neural Networks, Computer Science - Machine Learning
Abstract: We consider a Graph Neural Network (GNN) non-Markovian modeling framework to identify coarse-grained dynamical systems on graphs. Our main idea is to systematically determine the GNN architecture by inspecting how the leading term of the Mori-Zwanzig memory term depends on the coarse-grained interaction coefficients that encode the graph topology. Based on this analysis, we found that the appropriate GNN architecture that will account for $K$-hop dynamical interactions has to employ a Message Passing (MP) mechanism with at least $2K$ steps. We also deduce that the memory length required for an accurate closure model decreases as a function of the interaction strength under the assumption that the interaction strength exhibits a power law that decays as a function of the hop distance. Supporting numerical demonstrations on two examples, a heterogeneous Kuramoto oscillator model and a power system, suggest that the proposed GNN architecture can predict the coarse-grained dynamics under fixed and time-varying graph topologies., Comment: 33 pages, 12 figures
Published: 2024

43. FORESEE: Multimodal and Multi-view Representation Learning for Robust Prediction of Cancer Survival

Author: Pan, Liangrui, Peng, Yijun, Li, Yan, Liang, Yiyi, Xu, Liwen, Liang, Qingchun, and Peng, Shaoliang
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Integrating the different data modalities of cancer patients can significantly improve the predictive performance of patient survival. However, most existing methods ignore the simultaneous utilization of rich semantic features at different scales in pathology images. When collecting multimodal data and extracting features, there is a likelihood of encountering intra-modality missing data, introducing noise into the multimodal data. To address these challenges, this paper proposes a new end-to-end framework, FORESEE, for robustly predicting patient survival by mining multimodal information. Specifically, the cross-fusion transformer effectively utilizes features at the cellular level, tissue level, and tumor heterogeneity level to correlate prognosis through a cross-scale feature cross-fusion method. This enhances the ability of pathological image feature representation. Secondly, the hybrid attention encoder (HAE) uses the denoising contextual attention module to obtain the contextual relationship features and local detail features of the molecular data. HAE's channel attention module obtains global features of molecular data. Furthermore, to address the issue of missing information within modalities, we propose an asymmetrically masked triplet masked autoencoder to reconstruct lost information within modalities. Extensive experiments demonstrate the superiority of our method over state-of-the-art methods on four benchmark datasets in both complete and missing settings.
Published: 2024

44. The jet problem for three-dimensional axisymmetric compressible subsonic flows with large vorticity

Author: Li, Yan
Subjects: Mathematics - Analysis of PDEs
Abstract: In this paper, we establish the existence of three-dimensional axisymmetric compressible jet flows for steady Euler system with large vorticity by using the variational method. More precisely, for given axial velocity of the flow at the upstream, if the mass flux is sufficiently large, we can find a unique outer pressure such that a smooth subsonic three-dimensional axisymmetric jet flow with large vorticity exists and has certain far fields behavior., Comment: arXiv admin note: substantial text overlap with arXiv:2405.06213; text overlap with arXiv:2404.16377, arXiv:2006.05672
Published: 2024

45. The jet problem for three-dimensional axially symmetric full compressible subsonic flows with nonzero vorticity

Author: Li, Yan
Subjects: Mathematics - Analysis of PDEs
Abstract: In this paper, we show that for given Bernoulli function and entropy function at the upstream, if the incoming mass flux is within a suitable range, then there exists a unique outer pressure such that smooth subsonic three-dimensional axially symmetric jet flows for steady full Euler system with nonzero vorticity exist and have certain far fields behavior. A key observation is that we can transform the jet problem for three-dimensional axially symmetric steady full Euler system with nonzero vorticity into a variational problem in terms of the stream function. Moreover, using uniform estimates of the stream function and the iteration method, we exclude the singularity of the jet flows near the symmetry axis., Comment: arXiv admin note: text overlap with arXiv:2006.05672. text overlap with arXiv:2404.16377
Published: 2024

46. A rigidity property for a type of wave-Klein-Gordon system

Author: Li, Yan-Tao and Ma, Yue
Subjects: Mathematics - Analysis of PDEs
Abstract: In this paper we investigate the rigidity property of a wave component coupled in a wave-Klein-Gordon system. We prove that when the radiation field of the wave component vanishes at the null infinity, the initial data of this component also vanish, therefor there is no wave in the whole spacetime, Comment: 29 pages
Published: 2024

47. Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application

Author: Jia, Jian, Wang, Yipei, Li, Yan, Chen, Honggang, Bai, Xuehan, Liu, Zhaocheng, Liang, Jian, Chen, Quan, Li, Han, Jiang, Peng, and Gai, Kun
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: Contemporary recommender systems predominantly rely on collaborative filtering techniques, employing ID-embedding to capture latent associations among users and items. However, this approach overlooks the wealth of semantic information embedded within textual descriptions of items, leading to suboptimal performance in cold-start scenarios and long-tail user recommendations. Leveraging the capabilities of Large Language Models (LLMs) pretrained on massive text corpus presents a promising avenue for enhancing recommender systems by integrating open-world domain knowledge. In this paper, we propose an Llm-driven knowlEdge Adaptive RecommeNdation (LEARN) framework that synergizes open-world knowledge with collaborative knowledge. We address computational complexity concerns by utilizing pretrained LLMs as item encoders and freezing LLM parameters to avoid catastrophic forgetting and preserve open-world knowledge. To bridge the gap between the open-world and collaborative domains, we design a twin-tower structure supervised by the recommendation task and tailored for practical industrial application. Through offline experiments on the large-scale industrial dataset and online experiments on A/B tests, we demonstrate the efficacy of our approach., Comment: 11 pages, 6 figures
Published: 2024

48. Co-occurrence order-preserving pattern mining

Author: Wu, Youxi, Wang, Zhen, Li, Yan, Guo, Yingchun, Jiang, He, Zhu, Xingquan, and Wu, Xindong
Subjects: Computer Science - Databases
Abstract: Recently, order-preserving pattern (OPP) mining has been proposed to discover some patterns, which can be seen as trend changes in time series. Although existing OPP mining algorithms have achieved satisfactory performance, they discover all frequent patterns. However, in some cases, users focus on a particular trend and its associated trends. To efficiently discover trend information related to a specific prefix pattern, this paper addresses the issue of co-occurrence OPP mining (COP) and proposes an algorithm named COP-Miner to discover COPs from historical time series. COP-Miner consists of three parts: extracting keypoints, preparation stage, and iteratively calculating supports and mining frequent COPs. Extracting keypoints is used to obtain local extreme points of patterns and time series. The preparation stage is designed to prepare for the first round of mining, which contains four steps: obtaining the suffix OPP of the keypoint sub-time series, calculating the occurrences of the suffix OPP, verifying the occurrences of the keypoint sub-time series, and calculating the occurrences of all fusion patterns of the keypoint sub-time series. To further improve the efficiency of support calculation, we propose a support calculation method with an ending strategy that uses the occurrences of prefix and suffix patterns to calculate the occurrences of superpatterns. Experimental results indicate that COP-Miner outperforms the other competing algorithms in running time and scalability. Moreover, COPs with keypoint alignment yield better prediction performance.
Published: 2024

49. Two-dimensional jet flows for compressible full Euler system with general vorticity

Author: Li, Yan
Subjects: Mathematics - Analysis of PDEs
Abstract: In this paper, we consider the well-posedness theory of two-dimensional compressible subsonic jet flows for steady full Euler system with general vorticity. Inspired by the analysis in arXiv:2006.05672, we show that the stream function formulation for such system admits a variational structure. Then the existence and uniqueness of a smooth subsonic jet flow can be established by the variational method developed by Alt, Caffarelli and Friedman. Furthermore, the far fields behavior of the flow and the existence of a critical upstream pressure are also obtained., Comment: arXiv admin note: text overlap with arXiv:2006.05672
Published: 2024

50. Generalized boost transformations in finite volumes and application to Hamiltonian methods

Author: Li, Yan, Wu, Jia-Jun, Lee, T. -S. H., and Young, R. D.
Subjects: High Energy Physics - Lattice, High Energy Physics - Phenomenology
Abstract: The investigation of hadron interactions within lattice QCD has been facilitated by the well-known quantisation condition, linking scattering phase shifts to finite-volume energies. Additionally, the ability to utilise systems at finite total boosts has been pivotal in smoothly charting the energy-dependent behaviour of these phase shifts. The existing implementations of the quantization condition at finite boosts rely on momentum transformations between rest and moving frames, defined directly in terms of the energy eigenvalues. This energy dependence is unsuitable in the formulation of a Hamiltonian.In this work, we introduce a novel approach to generalise the three-momentum boost prescription, enabling the incorporation of energy-independent finite-volume Hamiltonians within moving frames. We demonstrate the application of our method through numerical comparisons, employing a phenomenological $\pi\pi$ scattering example., Comment: 30 pages, 5 figures
Published: 2024

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Category

Publication Type

Journal

Region

Database

Publisher

139,952 results on '"LI, Yan"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources