Author: "AN, Yanfei" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"AN, Yanfei"' showing total 42,830 results

1. Effortless Efficiency: Low-Cost Pruning of Diffusion Models

Author: Zhang, Yang, Jin, Er, Dong, Yanfei, Khakzar, Ashkan, Torr, Philip, Stegmaier, Johannes, and Kawaguchi, Kenji
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Diffusion models have achieved impressive advancements in various vision tasks. However, these gains often rely on increasing model size, which escalates computational complexity and memory demands, complicating deployment, raising inference costs, and causing environmental impact. While some studies have explored pruning techniques to improve the memory efficiency of diffusion models, most existing methods require extensive retraining to retain the model performance. Retraining a modern large diffusion model is extremely costly and resource-intensive, which limits the practicality of these methods. In this work, we achieve low-cost diffusion pruning without retraining by proposing a model-agnostic structural pruning framework for diffusion models that learns a differentiable mask to sparsify the model. To ensure effective pruning that preserves the quality of the final denoised latent, we design a novel end-to-end pruning objective that spans the entire diffusion process. As end-to-end pruning is memory-intensive, we further propose time step gradient checkpointing, a technique that significantly reduces memory usage during optimization, enabling end-to-end pruning within a limited memory budget. Results on state-of-the-art U-Net diffusion models (SDXL) and diffusion transformers (FLUX) demonstrate that our method can effectively prune up to 20% of parameters with minimal perceptible performance degradation, and notably, without the need for model retraining. We also showcase that our method can still prune on top of time step distilled diffusion models.
Comment: Project page: https://yangzhang-v5.github.io/EcoDiff
Published: 2024

2. ADAF: An Artificial Intelligence Data Assimilation Framework for Weather Forecasting

Author: Xiang, Yanfei, Jin, Weixin, Dong, Haiyu, Bai, Mingliang, Fang, Zuliang, Zhao, Pengcheng, Sun, Hongyu, Thambiratnam, Kit, Zhang, Qi, and Huang, Xiaomeng
Subjects: Physics - Atmospheric and Oceanic Physics, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The forecasting skill of numerical weather prediction (NWP) models critically depends on accurate initial conditions, also known as analysis, provided by data assimilation (DA). Traditional DA methods often face a trade-off between computational cost and accuracy due to complex linear algebra computations and the high dimensionality of the model, especially in nonlinear systems. Moreover, processing massive data in real-time requires substantial computational resources. To address this, we introduce an artificial intelligence-based data assimilation framework (ADAF) to generate high-quality kilometer-scale analysis. This study is the pioneering work using real-world observations from varied locations and multiple sources to verify the AI method's efficacy in DA, including sparse surface weather observations and satellite imagery. We implemented ADAF for four near-surface variables in the Contiguous United States (CONUS). The results indicate that ADAF surpasses the High Resolution Rapid Refresh Data Assimilation System (HRRRDAS) in accuracy by 16% to 33% for near-surface atmospheric conditions, aligning more closely with actual observations, and can effectively reconstruct extreme events, such as tropical cyclone wind fields. Sensitivity experiments reveal that ADAF can generate high-quality analysis even with low-accuracy backgrounds and extremely sparse surface observations. ADAF can assimilate massive observations within a three-hour window at low computational cost, taking about two seconds on an AMD MI200 graphics processing unit (GPU). ADAF has been shown to be efficient and effective in real-world DA, underscoring its potential role in operational weather forecasting.
Comment: 29 pages, 15 figures
Published: 2024

3. Evaluating Moral Beliefs across LLMs through a Pluralistic Framework

Author: Liu, Xuelin, Zhu, Yanfei, Zhu, Shucheng, Liu, Pengyuan, Liu, Ying, and Yu, Dong
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Proper moral beliefs are fundamental for language models, yet assessing these beliefs poses a significant challenge. This study introduces a novel three-module framework to evaluate the moral beliefs of four prominent large language models. Initially, we constructed a dataset containing 472 moral choice scenarios in Chinese, derived from moral words. The decision-making process of the models in these scenarios reveals their moral principle preferences. By ranking these moral choices, we discern the varying moral beliefs held by different language models. Additionally, through moral debates, we investigate the firmness of these models to their moral choices. Our findings indicate that English language models, namely ChatGPT and Gemini, closely mirror moral decisions of the sample of Chinese university students, demonstrating strong adherence to their choices and a preference for individualistic moral beliefs. In contrast, Chinese models such as Ernie and ChatGLM lean towards collectivist moral beliefs, exhibiting ambiguity in their moral choices and debates. This study also uncovers gender bias embedded within the moral beliefs of all examined language models. Our methodology offers an innovative means to assess moral beliefs in both artificial and human intelligence, facilitating a comparison of moral values across different cultures.
Published: 2024

4. Formation mechanisms and fluorescence properties of carbon dots in coal burning dust from coal fired power plants

Author: Zhao, Zhexian, Zhang, Weizuo, Zhang, Jin, Li, Yuzhao, Bai, Han, Zhao, Fangming, Jin, Zhongcai, Tang, Ju, Xiao, Yiming, Xu, Wen, and Lü, Yanfei
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Carbon dots (CDs) show great application potential owing to their unique and excellent performance. Coal and its derivatives are rich in aromatic ring structures, which makes them microstructurally suitable for preparing CDs. Coal burning dust from coal-fired power plants can be utilized as a rich resource from which to separate and extract CDs. Our results show that there are two main possible mechanisms for the formation of CDs in coal burning dust. One is the self-assembly of polycyclic aromatic hydrocarbons contained in coal or produced by incomplete combustion of coal. The other is that the bridge bonds linking different aromatic structures in coal break when the coal burns at high temperature, forming CDs with different functional groups. Under violet light excitation at 310-340 nm or red light at 610-640 nm, CDs extracted from coal burning dust can emit purple fluorescence around 410 nm. The up-conversion fluorescence emission of CDs is due to a two-photon absorption process. Recycling CDs from the coal burning dust of coal-fired power plants not only helps protect the environment but would also be helpful for mass production of CDs.
Published: 2024

5. Fire and Smoke Detection with Burning Intensity Representation

Author: Han, Xiaoyi, Wu, Yanfei, Pu, Nan, Feng, Zunlei, Zhang, Qifei, Bei, Yijun, and Cheng, Lechao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: An effective Fire and Smoke Detection (FSD) and analysis system is of paramount importance due to the destructive potential of fire disasters. However, many existing FSD methods directly employ generic object detection techniques without considering the transparency of fire and smoke, which leads to imprecise localization and reduces detection performance. To address this issue, a new Attentive Fire and Smoke Detection Model (a-FSDM) is proposed. This model not only retains the robust feature extraction and fusion capabilities of conventional detection algorithms but also redesigns the detection head specifically for transparent targets in FSD, termed the Attentive Transparency Detection Head (ATDH). In addition, Burning Intensity (BI) is introduced as a pivotal feature for fire-related downstream risk assessments in traditional FSD methodologies. Extensive experiments on multiple FSD datasets showcase the effectiveness and versatility of the proposed FSD model. The project is available at https://xiaoyihan6.github.io/FSD/.
Published: 2024

6. Reverse Refinement Network for Narrow Rural Road Detection in High-Resolution Satellite Imagery

Author: Wang, Ningjing, Wang, Xinyu, Pan, Yang, Yao, Wanqiang, and Zhong, Yanfei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The automated extraction of rural roads is pivotal for rural development and transportation planning, serving as a cornerstone for socio-economic progress. Current research primarily focuses on road extraction in urban areas. However, rural roads present unique challenges due to their narrow and irregular nature, posing significant difficulties for road extraction. In this article, a reverse refinement network (R2-Net) is proposed to extract narrow rural roads, enhancing their connectivity and distinctiveness from the background. Specifically, to preserve the fine details of roads within high-resolution feature maps, R2-Net utilizes an axis context aware module (ACAM) to capture the long-distance spatial context information in various layers. Subsequently, the multi-level features are aggregated through a global aggregation module (GAM). Moreover, in the decoder stage, R2-Net employs a reverse-aware module (RAM) to direct the attention of the network to the complex background, thus amplifying its separability. In experiments, we compare R2-Net with several state-of-the-art methods using the DeepGlobe road extraction dataset and the WHU-RuR+ global large-scale rural road dataset. R2-Net achieved superior performance and especially excelled in accurately detecting narrow roads. Furthermore, we explored the applicability of R2-Net for large-scale rural road mapping. The results show that the proposed R2-Net has significant performance advantages for large-scale rural road mapping applications.
Published: 2024

7. TableRAG: Million-Token Table Understanding with Language Models

Author: Chen, Si-An, Miculicich, Lesly, Eisenschlos, Julian Martin, Wang, Zifeng, Wang, Zilong, Chen, Yanfei, Fujii, Yasuhisa, Lin, Hsuan-Tien, Lee, Chen-Yu, and Pfister, Tomas
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: Recent advancements in language models (LMs) have notably enhanced their ability to reason with tabular data, primarily through program-aided mechanisms that manipulate and analyze tables. However, these methods often require the entire table as input, leading to scalability challenges due to the positional bias or context length constraints. In response to these challenges, we introduce TableRAG, a Retrieval-Augmented Generation (RAG) framework specifically designed for LM-based table understanding. TableRAG leverages query expansion combined with schema and cell retrieval to pinpoint crucial information before providing it to the LMs. This enables more efficient data encoding and precise retrieval, significantly reducing prompt lengths and mitigating information loss. We have developed two new million-token benchmarks from the Arcade and BIRD-SQL datasets to thoroughly evaluate TableRAG's effectiveness at scale. Our results demonstrate that TableRAG's retrieval design achieves the highest retrieval quality, leading to the new state-of-the-art performance on large-scale table understanding.
Comment: Accepted to NeurIPS 2024
Published: 2024

8. The Effects of Air Pollution on Health: A Study of Los Angeles County

Author: Qu, Yanfei
Subjects: Statistics - Applications
Abstract: This study aims to develop and implement a Poisson regression model with measurement error using a Bayesian framework, with model fitting performed in Stan. The focus is on examining the relationship between air pollution exposure and health outcomes, such as respiratory and cardiovascular disease counts, while accounting for inaccuracies in pollution measurements. Air pollution data is often subject to measurement error due to imperfect monitoring or averaging, which, if ignored, can lead to biased estimates and incorrect conclusions. The Poisson regression will model count data, where the response variable, such as disease counts, follows a Poisson distribution. Covariates including pollution levels, demographic factors, and meteorological conditions will be incorporated to control for confounders. To address measurement error in the exposure data, a Bayesian hierarchical model will be used, where observed pollution levels are treated as noisy measurements of the true underlying exposure. Priors will be specified for both the regression coefficients and the measurement error parameters, and posterior distributions will be estimated via Markov Chain Monte Carlo (MCMC) sampling. This approach ensures that both the count nature of the response and the uncertainty in exposure measurements are properly accounted for, leading to more accurate estimates of the health risks associated with air pollution.
Published: 2024

9. Mediating Effect of Teacher Support and Student Engagement in Mathematics at Chinese Junior Middle School

Author: Yanfei Yang, Priyalatha a/p Govindasamy, and Nor Junainah binti Mohd Isa
Abstract: This study aimed to examine the influence of teacher support on student engagement in mathematics. A cross-sectional survey was employed to recruit 632 students from grades 7 to 9 in junior middle schools across China. Participants completed various scales, including a teacher support scale, an academic self-efficacy scale, and a student engagement scale. Data analysis was conducted using SPSS 20.0 and Amos 24.0. Results showed that, first, emotional support and competence support had a significant positive effect on student engagement in mathematics; the contribution from competence support was the strongest, and the contribution from autonomy support was the weakest. Second, academic self-efficacy mediated the relationship between autonomy support, emotional support, and competence support and student engagement in mathematics. These findings supplement and expand the understanding of the factors that affect student engagement and provide theoretical support and useful guidance for formulating effective education policies to improve student engagement in mathematics.
Published: 2024

10. Mechanism and Kinetics of Ethanol–Acetaldehyde Conversion to 1,3-Butadiene over Isolated Lewis Acid La Sites in Silanol Nests in Dealuminated Beta Zeolite

Author: Zhang, Yanfei, Qi, Liang, Li, Yuting, Yang, Tingshu, Meira, Debora M, Dun, Chaochao, Hu, Haocheng, Chen, Huihui, Xu, Shutao, Urban, Jeffrey J, Sadow, Aaron D, Kobayashi, Takeshi, Qi, Long, Tian, Peng, and Bell, Alexis T
Subjects: Inorganic Chemistry, Chemical Sciences, isolated La sites, ethanol-acetaldehyde conversion to 1,3-butadiene, C-C coupling, hydrogen-bonding interaction, reaction mechanism, kinetics, Organic Chemistry, Chemical Engineering, Industrial biotechnology, Organic chemistry, Physical chemistry
Abstract: Biomass-derived ethanol (EtOH) and acetaldehyde (AcH) conversion to 1,3-butadiene (1,3-BD) is an alternative process for 1,3-BD production. The present investigation reports the preparation and characterization of isolated La sites introduced into the silanol nests in DeAlBEA as well as detailed studies of the mechanism and kinetics for the conversion of an EtOH-AcH mixture to 1,3-BD. La sites supported on DeAlBEA are found to be present as (≡SiO)2La-OH groups that are H-bonded with adjacent Si-OH groups, possessing high C-C coupling activity and stability, superior to state-of-the-art Y-DeAlBEA. La sites supported on silica (La-SiO2) with a similar chemical structure but no H-bonding interaction with Si-OH groups were prepared for comparison. Lewis acid La sites promote AcH aldol condensation, and the activity of such sites is nearly identical for both La-DeAlBEA and La-SiO2. The rate of C4 product formation increases by a factor of 4.8 upon addition of EtOH to the feed of AcH over La-DeAlBEA, whereas that over La/SiO2 remains unchanged. Investigation of the mechanism and kinetics of AcH aldol condensation and EtOH-AcH conversion to 1,3-BD revealed two C-C bond forming pathways: AcH aldol condensation by Lewis acid La sites and direct coupling of EtOH-AcH over H-bonded (≡SiO)2La-OH···HO-Si≡ sites. This study provides important information about the role of the local environment of isolated Lewis acid sites and their effects on the direct coupling of EtOH and AcH to form 1,3-BD.
Published: 2024

11. Investigating Layer Importance in Large Language Models

Author: Zhang, Yang, Dong, Yanfei, and Kawaguchi, Kenji
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Large language models (LLMs) have gained increasing attention due to their prominent ability to understand and process texts. Nevertheless, LLMs largely remain opaque. The lack of understanding of LLMs has obstructed the deployment in safety-critical scenarios and hindered the development of better models. In this study, we advance the understanding of LLM by investigating the significance of individual layers in LLMs. We propose an efficient sampling method to faithfully evaluate the importance of layers using Shapley values, a widely used explanation framework in feature attribution and data valuation. In addition, we conduct layer ablation experiments to assess the performance degradation resulting from the exclusion of specific layers. Our findings reveal the existence of cornerstone layers, wherein certain early layers can exhibit a dominant contribution over others. Removing one cornerstone layer leads to a drastic collapse of the model performance, often reducing it to random guessing. Conversely, removing non-cornerstone layers results in only marginal performance changes. This study identifies cornerstone layers in LLMs and underscores their critical role for future research.
Published: 2024
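
Note: the abstract above ranks layers by Shapley values estimated with a sampling method it does not detail. As a rough illustration only, the sketch below uses generic permutation sampling, which is a standard Monte Carlo estimator and not necessarily the authors' method. The function names, the 4-layer setup, and the toy score function (in which layer 0 plays the role of a "cornerstone" layer) are all hypothetical.

```python
import random
from typing import Callable, FrozenSet, List


def shapley_permutation_estimate(
    n_players: int,
    value_fn: Callable[[FrozenSet[int]], float],
    n_permutations: int = 200,
    seed: int = 0,
) -> List[float]:
    """Estimate Shapley values by averaging marginal contributions
    over randomly sampled player orderings."""
    rng = random.Random(seed)
    totals = [0.0] * n_players
    order = list(range(n_players))
    for _ in range(n_permutations):
        rng.shuffle(order)
        coalition: set = set()
        previous = value_fn(frozenset(coalition))
        for player in order:
            # Marginal contribution of `player` when added to the coalition.
            coalition.add(player)
            current = value_fn(frozenset(coalition))
            totals[player] += current - previous
            previous = current
    return [total / n_permutations for total in totals]


def toy_score(kept_layers: FrozenSet[int]) -> float:
    """Hypothetical model score: collapses to 0 without layer 0,
    otherwise each additional layer adds a small gain."""
    if 0 not in kept_layers:
        return 0.0
    return 0.5 + 0.1 * (len(kept_layers) - 1)


phi = shapley_permutation_estimate(n_players=4, value_fn=toy_score)
print(phi)  # layer 0 receives by far the largest attribution
```

In a real setting `value_fn` would evaluate the language model with only the given layers active, which is the expensive step; the per-permutation marginals always sum to the full-model score minus the empty-model score, so the estimate preserves that total.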

12. Hybrid LSMR algorithms for large-scale general-form regularization

Author: Yang, Yanfei
Subjects: Mathematics - Numerical Analysis, 65F22, 65F10, 65J20, 65F35, 65F50
Abstract: The hybrid LSMR algorithm is proposed for large-scale general-form regularization. It is based on a Krylov subspace projection method where the matrix $A$ is first projected onto a subspace, typically a Krylov subspace, which is implemented via the Golub-Kahan bidiagonalization process applied to $A$, with starting vector $b$. Then a regularization term is applied to the projections. Finally, an iterative algorithm is exploited to solve a least squares problem with constraints. The resulting algorithms are called the hybrid LSMR algorithm. At every step, we exploit the LSQR algorithm to solve the inner least squares problem, which is proven to become better conditioned as $k$ increases, so that the LSQR algorithm converges faster. We prove how to select the stopping tolerances for LSQR in order to guarantee that the regularized solution obtained by iteratively computing the inner least squares problems and the one obtained by exactly computing the inner least squares problems have the same accuracy. Numerical experiments illustrate that the best regularized solution by the hybrid LSMR algorithm is as accurate as that by JBDQR, which is a joint bidiagonalization based algorithm.
Published: 2024

13. AnomalyCD: A benchmark for Earth anomaly change detection with high-resolution and time-series observations

Author: Li, Jingtao, Zhu, Qian, Wang, Xinyu, Zhao, Hengwei, and Zhong, Yanfei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Various Earth anomalies have destroyed the stable, balanced state, resulting in fatalities and serious destruction of property. With the advantages of large-scale and precise observation, high-resolution remote sensing images have been widely used for anomaly monitoring and localization. Powered by the deep representation, the existing methods have achieved remarkable advances, primarily in classification and change detection techniques. However, labeled samples are difficult to acquire due to the low probability of anomaly occurrence, and the trained models are limited to fixed anomaly categories, which hinders the application for anomalies with few samples or unknown anomalies. In this paper, to tackle this problem, we propose the anomaly change detection (AnomalyCD) technique, which accepts time-series observations and learns to identify anomalous changes by learning from the historical normal change pattern. Compared to the existing techniques, AnomalyCD processes an unfixed number of time steps and can localize the various anomalies in a unified manner, without human supervision. To benchmark AnomalyCD, we constructed a high-resolution dataset with time-series images dedicated to various Earth anomalies (the AnomalyCDD dataset). AnomalyCDD contains high-resolution (from 0.15 to 2.39 m/pixel), time-series (from 3 to 7 time steps), and large-scale images (1927.93 km² in total) collected globally. Furthermore, we developed a zero-shot baseline model (AnomalyCDM), which implements the AnomalyCD technique by extracting a general representation from the segment anything model (SAM) and conducting temporal comparison to distinguish the anomalous changes from normal changes. AnomalyCDM is designed as a two-stage workflow to enhance the efficiency, and has the ability to process the unseen images directly, without retraining for each scene.
Comment: remote sensing benchmark
Published: 2024

14. Hierarchical Neural Constructive Solver for Real-world TSP Scenarios

Author: Goh, Yong Liang, Cao, Zhiguang, Ma, Yining, Dong, Yanfei, Dupty, Mohammed Haroon, and Lee, Wee Sun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Existing neural constructive solvers for routing problems have predominantly employed transformer architectures, conceptualizing the route construction as a set-to-sequence learning task. However, their efficacy has primarily been demonstrated on entirely random problem instances that inadequately capture real-world scenarios. In this paper, we introduce realistic Traveling Salesman Problem (TSP) scenarios relevant to industrial settings and derive the following insights: (1) The optimal next node (or city) to visit often lies within proximity to the current node, suggesting the potential benefits of biasing choices based on current locations. (2) Effectively solving the TSP requires robust tracking of unvisited nodes and warrants succinct grouping strategies. Building upon these insights, we propose integrating a learnable choice layer inspired by Hypernetworks to prioritize choices based on the current location, and a learnable approximate clustering algorithm inspired by the Expectation-Maximization algorithm to facilitate grouping the unvisited cities. Together, these two contributions form a hierarchical approach towards solving the realistic TSP by considering both immediate local neighbourhoods and learning an intermediate set of node representations. Our hierarchical approach yields superior performance compared to both classical and recent transformer models, showcasing the efficacy of the key designs.
Comment: Accepted to KDD 2024
Published: 2024

15. Re-Invoke: Tool Invocation Rewriting for Zero-Shot Tool Retrieval

Author: Chen, Yanfei, Yoon, Jinsung, Sachan, Devendra Singh, Wang, Qingze, Cohen-Addad, Vincent, Bateni, Mohammadhossein, Lee, Chen-Yu, and Pfister, Tomas
Subjects: Computer Science - Computation and Language
Abstract: Recent advances in large language models (LLMs) have enabled autonomous agents with complex reasoning and task-fulfillment capabilities using a wide range of tools. However, effectively identifying the most relevant tools for a given task becomes a key bottleneck as the toolset size grows, hindering reliable tool utilization. To address this, we introduce Re-Invoke, an unsupervised tool retrieval method designed to scale effectively to large toolsets without training. Specifically, we first generate a diverse set of synthetic queries that comprehensively cover different aspects of the query space associated with each tool document during the tool indexing phase. Second, we leverage LLM's query understanding capabilities to extract key tool-related context and underlying intents from user queries during the inference phase. Finally, we employ a novel multi-view similarity ranking strategy based on intents to pinpoint the most relevant tools for each query. Our evaluation demonstrates that Re-Invoke significantly outperforms state-of-the-art alternatives in both single-tool and multi-tool scenarios, all within a fully unsupervised setting. Notably, on the ToolE datasets, we achieve a 20% relative improvement in nDCG@5 for single-tool retrieval and a 39% improvement for multi-tool retrieval.
Comment: EMNLP Findings 2024
Published: 2024

16. Changen2: Multi-Temporal Remote Sensing Generative Change Foundation Model

Author: Zheng, Zhuo, Ermon, Stefano, Kim, Dongjun, Zhang, Liangpei, and Zhong, Yanfei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Our understanding of the temporal dynamics of the Earth's surface has been advanced by deep vision models, which often require lots of labeled multi-temporal images for training. However, collecting, preprocessing, and annotating multi-temporal remote sensing images at scale is non-trivial since it is expensive and knowledge-intensive. In this paper, we present change data generators based on generative models, which are cheap and automatic, alleviating these data problems. Our main idea is to simulate a stochastic change process over time. We describe the stochastic change process as a probabilistic graphical model (GPCM), which factorizes the complex simulation problem into two more tractable sub-problems, i.e., change event simulation and semantic change synthesis. To solve these two problems, we present Changen2, a GPCM with a resolution-scalable diffusion transformer which can generate time series of images and their semantic and change labels from labeled or unlabeled single-temporal images. Changen2 is a generative change foundation model that can be trained at scale via self-supervision, and can produce change supervisory signals from unlabeled single-temporal images. Unlike existing foundation models, Changen2 synthesizes change data to train task-specific foundation models for change detection. The resulting model possesses inherent zero-shot change detection capabilities and excellent transferability. Experiments suggest Changen2 has superior spatiotemporal scalability, e.g., Changen2 model trained on 256$^2$ pixel single-temporal images can yield time series of any length and resolutions of 1,024$^2$ pixels. Changen2 pre-trained models exhibit superior zero-shot performance (narrowing the performance gap to 3% on LEVIR-CD and approximately 10% on both S2Looking and SECOND, compared to fully supervised counterparts) and transferability across multiple types of change tasks.
Comment: The enhanced extension of our ICCV 2023 (Changen)
Published: 2024

17. Single-Temporal Supervised Learning for Universal Remote Sensing Change Detection

Author: Zheng, Zhuo, Zhong, Yanfei, Ma, Ailong, and Zhang, Liangpei
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The bitemporal supervised learning paradigm always dominates remote sensing change detection using numerous labeled bitemporal image pairs, especially for high spatial resolution (HSR) remote sensing imagery. However, it is very expensive and labor-intensive to label change regions in large-scale bitemporal HSR remote sensing image pairs. In this paper, we propose single-temporal supervised learning (STAR) for universal remote sensing change detection from a new perspective of exploiting changes between unpaired images as supervisory signals. STAR enables us to train a high-accuracy change detector only using unpaired labeled images and can generalize to real-world bitemporal image pairs. To demonstrate the flexibility and scalability of STAR, we design a simple yet unified change detector, termed ChangeStar2, capable of addressing binary change detection, object change detection, and semantic change detection in one architecture. ChangeStar2 achieves state-of-the-art performances on eight public remote sensing change detection datasets, covering the above two supervised settings, multiple change types, and multiple scenarios. The code is available at https://github.com/Z-Zheng/pytorch-change-models.
Comment: IJCV 2024. arXiv admin note: text overlap with arXiv:2108.07002
Published: 2024

18. Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

Author: Zhang, Yusen, Sun, Ruoxi, Chen, Yanfei, Pfister, Tomas, Zhang, Rui, and Arik, Sercan Ö.
Subjects: Computer Science - Computation and Language
Abstract: Addressing the challenge of effectively processing long contexts has become a critical issue for Large Language Models (LLMs). Two common strategies have emerged: 1) reducing the input length, such as retrieving relevant chunks by Retrieval-Augmented Generation (RAG), and 2) expanding the context window limit of LLMs. However, both strategies have drawbacks: input reduction has no guarantee of covering the part with needed information, while window extension struggles with focusing on the pertinent information for solving the task. To mitigate these limitations, we propose Chain-of-Agents (CoA), a novel framework that harnesses multi-agent collaboration through natural language to enable information aggregation and context reasoning across various LLMs over long-context tasks. CoA consists of multiple worker agents who sequentially communicate to handle different segmented portions of the text, followed by a manager agent who synthesizes these contributions into a coherent final output. CoA processes the entire input by interleaving reading and reasoning, and it mitigates long context focus issues by assigning each agent a short context. We perform comprehensive evaluation of CoA on a wide range of long-context tasks in question answering, summarization, and code completion, demonstrating significant improvements by up to 10% over strong baselines of RAG, Full-Context, and multi-agent LLMs.
Comment: 19 pages, 6 figures
Published: 2024

19. Differentiable Cluster Graph Neural Network

Author: Dong, Yanfei, Dupty, Mohammed Haroon, Deng, Lambert, Liu, Zhuanghua, Goh, Yong Liang, and Lee, Wee Sun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Graph Neural Networks often struggle with long-range information propagation and in the presence of heterophilous neighborhoods. We address both challenges with a unified framework that incorporates a clustering inductive bias into the message passing mechanism, using additional cluster-nodes. Central to our approach is the formulation of an optimal transport based implicit clustering objective function. However, the algorithm for solving the implicit objective function needs to be differentiable to enable end-to-end learning of the GNN. To facilitate this, we adopt an entropy regularized objective function and propose an iterative optimization process, alternating between solving for the cluster assignments and updating the node/cluster-node embeddings. Notably, our derived closed-form optimization steps are themselves simple yet elegant message passing steps operating seamlessly on a bipartite graph of nodes and cluster-nodes. Our clustering-based approach can effectively capture both local and global information, demonstrated by extensive experiments on both heterophilous and homophilous datasets.
Published: 2024

20. Conformal Classification with Equalized Coverage for Adaptively Selected Groups

Author: Zhou, Yanfei and Sesia, Matteo
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: This paper introduces a conformal inference method to evaluate uncertainty in classification by generating prediction sets with valid coverage conditional on adaptively chosen features. These features are carefully selected to reflect potential model limitations or biases. This can be useful to find a practical compromise between efficiency -- by providing informative predictions -- and algorithmic fairness -- by ensuring equalized coverage for the most sensitive groups. We demonstrate the validity and effectiveness of this method on simulated and real data sets.
Published: 2024

21. MPI Progress For All

Author: Zhou, Hui, Latham, Robert, Raffenetti, Ken, Guo, Yanfei, and Thakur, Rajeev
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: The progression of communication in the Message Passing Interface (MPI) is not well defined, yet it is critical for application performance, particularly in achieving effective computation and communication overlap. The opaque nature of MPI progress poses significant challenges in advancing MPI within modern high-performance computing (HPC) practices. Firstly, the lack of clarity hinders the development of explicit guidelines for enhancing computation and communication overlap in applications. Secondly, it prevents MPI from seamlessly integrating with contemporary programming paradigms, such as task-based runtimes and event-driven programming. Thirdly, it limits the extension of MPI functionalities from the user space. In this paper, we examine the role of MPI progress by analyzing the implementation details of MPI messaging. We then generalize the asynchronous communication pattern and identify key factors influencing application performance. Based on this analysis, we propose a set of MPI extensions designed to enable users to explicitly construct and manage an efficient progress engine. We provide example codes to demonstrate the use of these proposed APIs in achieving improved performance, adapting MPI to task-based or event-driven programming styles, and constructing collective algorithms that rival the performance of native implementations. Our approach is compared to previous efforts in the field, highlighting its reduced complexity and increased effectiveness., Comment: Not submitted
Published: 2024

22. Lightweight Spatial Modeling for Combinatorial Information Extraction From Documents

Author: Dong, Yanfei, Deng, Lambert, Zhang, Jiazheng, Yu, Xiaodong, Lin, Ting, Gelli, Francesco, Poria, Soujanya, and Lee, Wee Sun
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Documents that consist of diverse templates and exhibit complex spatial structures pose a challenge for document entity classification. We propose KNN-former, which incorporates a new kind of spatial bias in attention calculation based on the K-nearest-neighbor (KNN) graph of document entities. We limit entities' attention only to their local radius defined by the KNN graph. We also use combinatorial matching to address the one-to-one mapping property that exists in many documents, where one field has only one corresponding entity. Moreover, our method is highly parameter-efficient compared to existing approaches in terms of the number of trainable parameters. Despite this, experiments across various datasets show our method outperforms baselines in most entity types. Many real-world documents exhibit combinatorial properties which can be leveraged as inductive biases to improve extraction accuracy, but existing datasets do not cover these documents. To facilitate future research into these types of documents, we release a new ID document dataset that covers diverse templates and languages. We also release enhanced annotations for an existing dataset.
Published: 2024
Full Text: View/download PDF

23. Enabling Roll-up and Drill-down Operations in News Exploration with Knowledge Graphs for Due Diligence and Risk Management

Author: Wang, Sha, Li, Yuchen, Xiao, Hanhua, Bao, Zhifeng, Deng, Lambert, and Dong, Yanfei
Subjects: Computer Science - Information Retrieval
Abstract: Efficient news exploration is crucial in real-world applications, particularly within the financial sector, where numerous control and risk assessment tasks rely on the analysis of public news reports. The current processes in this domain predominantly rely on manual efforts, often involving keyword-based searches and the compilation of extensive keyword lists. In this paper, we introduce NCEXPLORER, a framework designed with OLAP-like operations to enhance the news exploration experience. NCEXPLORER empowers users to use roll-up operations for a broader content overview and drill-down operations for detailed insights. These operations are achieved through integration with external knowledge graphs (KGs), encompassing both fact-based and ontology-based structures. This integration significantly augments exploration capabilities, offering a more comprehensive and efficient approach to unveiling the underlying structures and nuances embedded in news content. Extensive empirical studies through master-qualified evaluators on Amazon Mechanical Turk demonstrate NCEXPLORER's superiority over existing state-of-the-art news search methodologies across an array of topic domains, using real-world news datasets., Comment: The paper was accepted by ICDE 2024
Published: 2024

24. Adiabatic State Preparation in a Quantum Ising Spin Chain

Author: Kim, Sooshin, Lukin, Alexander, Rispoli, Matthew, Tai, M. Eric, Kaufman, Adam M., Segura, Perrin, Li, Yanfei, Kwan, Joyce, Léonard, Julian, Bakkali-Hassani, Brice, and Greiner, Markus
Subjects: Condensed Matter - Quantum Gases, Quantum Physics
Abstract: We report on adiabatic state preparation in the one-dimensional quantum Ising model using ultracold bosons in a tilted optical lattice. We prepare many-body ground states of controllable system sizes and observe enhanced fluctuations around the transition between paramagnetic and antiferromagnetic states, marking the precursor of quantum critical behavior. Furthermore, we find evidence for superpositions of domain walls and study their effect on the many-body ground state by measuring the populations of each spin configuration across the transition. These results shed new light on the effect of boundary conditions in finite-size quantum systems., Comment: 5+5 pages, 4+8 figures
Published: 2024

25. Significantly Enhanced Vacancy Diffusion in Mn-containing Alloys

Author: Guan, Huaqing, Cui, Hanwen, Ding, Ning, Yang, Kuo, Jiang, Siqi, Sui, Yanfei, Wang, Yuanyuan, Tian, Fuyang, Li, Zhe, Wang, Shuai, Zheng, Pengfei, Lu, Chenyang, Xu, Qiu, Vitos, Levente, and Huang, Shaosong
Subjects: Condensed Matter - Materials Science
Abstract: Manipulating point defects for tailored macroscopic properties remains a formidable challenge in materials science. This study demonstrates a proof-of-principle for a universal law involving element Mn, significantly enhancing vacancy diffusion through an unprecedented anomalous Friedel Oscillations phenomenon, across most metals in the periodic table. The correlation between Mn-induced point-defect dynamic changes and intrinsic macro-properties is robustly validated through the first-principles theory and well-designed experiments. The physical origin stems from Mn's exceptionally large effective intra-elemental 3d electron interactions, surpassing the Coulomb attraction induced by vacancy and disrupting the electron screening effect. Given the ubiquitous nature of vacancies and their recognition as the most crucial defects influencing nearly all physical and mechanical properties of crystalline materials, this outcome may drive advances in a broad domain.
Published: 2024

26. Cadmium Exposure Induces Inflammation Through Oxidative Stress-Mediated Activation of the NF-κB Signaling Pathway and Causes Heat Shock Response in a Piglet Testis

Author: Li, Yulong, Wang, Hongbao, and Wang, Yanfei
Published: 2024
Full Text: View/download PDF

27. Optimization Design and Experimental Analysis of Spraying Device for High Ground Clearance Plant Protection Machine

Author: Zhang, Changkun, Gong, Jinliang, and Zhang, Yanfei
Published: 2024
Full Text: View/download PDF

28. International Students in Counselor Education Programs: A 20-Year Content Analysis

Author: Xiong, Yiying, Lee, Rachel Kyeong Min, Huang, Renyi, and Mao, Yanfei
Published: 2024
Full Text: View/download PDF

29. Comparative Assessment of the Environmental and Economic Performance of Two Straw Utilization Pathways in China

Author: Li, Tong, Wei, Guoxia, Liu, Hanqiao, Zhu, Yuwen, Lin, Yanfei, and Han, Qianlong
Published: 2024
Full Text: View/download PDF

30. Research on resource allocation methods for traditional Chinese medicine services based on deep reinforcement learning

Author: Ma, Yuntao, Fang, Xiaolin, Qi, Jin, and Sun, Yanfei
Published: 2024
Full Text: View/download PDF

31. Inhibiting NLRP3 Inflammasome Activation to Alleviate Retinal Inflammation and Protect the Optic Nerve of OPTN(E50K) Mice

Author: Wang, Shujing, Xiao, Rong, Lu, Yanfei, Zhang, Yanfeng, Zhang, Shiqi, Liu, Xinna, and Yuan, Huiping
Published: 2024
Full Text: View/download PDF

32. Thermal and hydration properties of 3D printing gypsum materials incorporated with accelerators

Author: Zhi, Zhenzhen, Guo, Yanfei, Jin, Zihao, Ru, Xiaohong, Yang, Xinya, and Ma, Baoguo
Published: 2024
Full Text: View/download PDF

33. TRIM46 accelerates H1N1 influenza virus-induced ferroptosis and inflammatory response by regulating SLC7A11 ubiquitination

Author: Zhou, Chao, Bao, Genchong, and Chen, Yanfei
Published: 2024
Full Text: View/download PDF

34. A novel probabilistic analysis method for long-term dynamical response analysis

Author: Meng, Jingwei and Jin, Yanfei
Published: 2024
Full Text: View/download PDF

35. Activation of the microstructures in nickel-based bimetallic complexes for aqueous batteries

Author: Su, Yichun, Zhang, Yanfei, Yuan, Guoqiang, Tang, Yijian, Zhang, Guangxun, Shakouri, Mohsen, Chen, Hsiao-Chien, Zhou, Huijie, Liu, Zheng, and Pang, Huan
Published: 2024
Full Text: View/download PDF

36. Pathogenicity of Bipolaris oryzae isolated from banana to different plant species

Author: Zhang, Zhaojing, Zhao, Hong, Ouyang, Yanfei, Zhang, Xin, and Qi, Yanxiang
Published: 2024
Full Text: View/download PDF

37. An artificial intelligence platform for the screening and managing of strabismus

Author: Wu, Dawen, Li, Yanfei, Zhang, Haixian, Yang, Xubo, Mao, Yiji, Chen, Bingjie, Feng, Yi, Chen, Liang, Zou, Xingyu, Nie, Yan, Yin, Teng, Yang, Zeyi, Liu, Jingyu, Shang, Wenyi, Yang, Guoyuan, and Liu, Longqian
Published: 2024
Full Text: View/download PDF

38. Visible to mid-wave infrared PbS/HgTe colloidal quantum dot imagers

Author: Mu, Ge, Tan, Yimei, Bi, Cheng, Liu, Yanfei, Hao, Qun, and Tang, Xin
Published: 2024
Full Text: View/download PDF

39. A frequency-dependent model for bone remodeling using a micromorphic porous medium subjected to harmonic mechanical loading

Author: Lu, Yanfei
Published: 2024
Full Text: View/download PDF

40. Efficient epoxidation of propylene over non-noble nickel-based catalyst promoted by alkali metals

Author: Li, Wenqian, Li, Wanting, Cao, Xinxin, Chen, Longfei, Qin, Yibo, Zhu, Yanfeng, Zhang, Yanfei, Miao, Gai, Kong, Lingzhao, Li, Jiong, and Chen, Xinqing
Published: 2024
Full Text: View/download PDF

41. Comparative study on environmental and economic assessment of straw heating across administrative areas scale of rural China

Author: Li, Tong, Wei, Guoxia, Liu, Hanqiao, Zhao, Hailong, Zhu, Yuwen, Lin, Yanfei, Han, Qianlong, Chen, Yifan, and Wang, Yanzhang
Published: 2024
Full Text: View/download PDF

42. Thermal-Economic Comparative Analysis and Optimization of the Maisotsenko Gas Turbine Cycle under Different Configurations

Author: Shi, Qile, He, Weifeng, Yao, Zhaohui, Gao, Yanfei, Su, Pengfei, and Han, Dong
Published: 2024
Full Text: View/download PDF

43. Multifunctional metallized particleboard for enhanced electromagnetic interference shielding and mechanical thermal stability

Author: Guo, Qiang, Pan, Yanfei, Hu, Shuaiqi, Qing, Long, Wang, Yu, and Huang, Jintian
Published: 2024
Full Text: View/download PDF

44. The value of the radiological diameter-to-thickness ratio in patients with HER2-positive resectable advanced gastric cancer: implications for long survival and stage migration

Author: Hou, Bin, Guo, Tiantian, Gao, Jianbo, Cao, Yanfei, Lu, Hao, Ma, Tian, Zhang, Yan, and Zhao, Huiping
Published: 2024
Full Text: View/download PDF

45. Population pharmacokinetics of fluconazole for prevention or treatment of invasive candidiasis in Chinese young infants

Author: Wang, Honghong, Li, Dandan, Jiang, Yongjiang, Liang, Jing, Yu, Qiaoai, Kuang, Linghong, Huang, Yuling, Qin, Dongjie, Li, Ping, He, Jing, Xu, Feng, Li, Xueli, Wang, Fei, Wei, Yanfei, and Li, Xingang
Published: 2024
Full Text: View/download PDF

46. User language and cultural product innovation: insights from the global mobile gaming industry

Author: Zhang, Pengxiang, Yi, Jingtao, Niu, Chao, Zhao, Eric Yanfei, and Li, Sali
Published: 2024
Full Text: View/download PDF

47. Enantioselective copper-catalyzed dearomative borylative cyclization of indoles and mechanistic insights

Author: Li, Yanfei, Li, Xuetao, Zhang, Zheng, Zhang, Yiliang, Yuan, Xiuping, Yin, Jianjun, Wang, Simin, Guan, Wei, Zhang, Qian, and Xiong, Tao
Published: 2024
Full Text: View/download PDF

48. Research on the recovery method of disrupted flights considering passenger transfer and cancellation costs

Author: Lu, Liang, Xu, Yanfei, Fan, Wei, Pan, Haiying, Ip, Waihung, and Yung, Kai Leung
Published: 2024
Full Text: View/download PDF

49. Preclinical evaluation and pilot clinical study of [68Ga]Ga-NOTA-H006 for non-invasive PET imaging of 5T4 oncofetal antigen

Author: He, Yingfang, Tian, Ruhua, Xu, Dong, Wu, Yanfei, Rina, Sa, Chen, Tengxiang, Guan, Yihui, Xie, Tianwu, Ying, Tianlei, Xie, Fang, and Han, Junbin
Published: 2024
Full Text: View/download PDF

50. Design and Mechanical Performance Analysis of Ti6Al4V Biomimetic Bone with One-Dimensional Continuous Gradient Porous Structures

Author: Zhang, Hao, Zhang, Qiang, Zhou, Changchun, Tan, Yanfei, Hui, David, and Zhai, Yun
Published: 2024
Full Text: View/download PDF
