Author: "Wang, Haotian" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wang, Haotian"' showing total 1,880 results

1. InLINE: Inner-Layer Information Exchange for Multi-task Learning on Heterogeneous Graphs

Author: Feng, Xinyue, Hang, Jinquan, Zhang, Yuequn, Wang, Haotian, Zhang, Desheng, and Wang, Guang
Subjects: Computer Science - Machine Learning
Abstract: Heterogeneous graph is an important structure for modeling complex relational data in real-world scenarios and usually involves various node prediction tasks within a single graph. Training these tasks separately may neglect beneficial information sharing, hence a preferred way is to learn several tasks in the same model by Multi-Task Learning (MTL). However, MTL introduces the issue of negative transfer, where the training of different tasks interferes with each other as they may focus on different information from the data, resulting in suboptimal performance. To solve the issue, existing MTL methods use separate backbones for each task, then selectively exchange beneficial features through interactions among the output embeddings from each layer of different backbones, which we refer to as outer-layer exchange. However, the negative transfer in heterogeneous graphs arises not simply from the varying importance of an individual node feature across tasks, but also from the varying importance of inter-relation between two nodes across tasks. These inter-relations are entangled in the output embedding, making it difficult for existing methods to discriminate beneficial information from the embedding. To address this challenge, we propose the Inner-Layer Information Exchange (InLINE) model that facilitates fine-grained information exchanges within each graph layer rather than through output embeddings. Specifically, InLINE consists of (1) Structure Disentangled Experts for layer-wise structure disentanglement, and (2) Structure Disentangled Gates for assigning disentangled information to different tasks. Evaluations on two public datasets and a large industry dataset show that our model effectively alleviates the significant performance drop on specific tasks caused by negative transfer, improving Macro F1 by 6.3% on the DBLP dataset and AUC by 3.6% on the industry dataset compared to SoA methods.
Published: 2024

2. Scale Propagation Network for Generalizable Depth Completion

Author: Wang, Haotian, Yang, Meng, Zheng, Xinhu, and Hua, Gang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Depth completion, inferring dense depth maps from sparse measurements, is crucial for robust 3D perception. Although deep learning based methods have made tremendous progress in this problem, these models cannot generalize well across different scenes that are unobserved in training, posing a fundamental limitation that has yet to be overcome. A careful analysis of existing deep neural network architectures for depth completion, which largely borrow from successful backbones for image analysis tasks, reveals that a key design bottleneck actually resides in the conventional normalization layers. These normalization layers are designed, on one hand, to make training more stable, and on the other hand, to build more visual invariance across scene scales. However, in depth completion, the scale is actually what we want to robustly estimate in order to better generalize to unseen scenes. To mitigate this, we propose a novel scale propagation normalization (SP-Norm) method to propagate scales from input to output, and simultaneously preserve the normalization operator for easy convergence. More specifically, we rescale the input using learned features of a single-layer perceptron from the normalized input, rather than directly normalizing the input as conventional normalization layers do. We then develop a new network architecture based on SP-Norm and the ConvNeXt V2 backbone. We explore the composition of various basic blocks and architectures to achieve superior performance and efficient inference for generalizable depth completion. Extensive experiments are conducted on six unseen datasets with various types of sparse depth maps, i.e., randomly sampled 0.1%/1%/10% valid pixels, 4/8/16/32/64-line LiDAR points, and holes from Structured-Light. Our model consistently achieves the best accuracy with faster speed and lower memory when compared to state-of-the-art methods., Comment: Major revision in IEEE Transactions on Pattern Analysis and Machine Intelligence
Published: 2024

3. Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention

Author: Weng, Yuzhe, Wang, Haotian, Gao, Tian, Li, Kewei, Niu, Shutong, and Du, Jun
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In multimodal sentiment analysis, collecting text data is often more challenging than video or audio due to higher annotation costs and inconsistent automatic speech recognition (ASR) quality. To address this challenge, our study has developed a robust model that effectively integrates multimodal sentiment information, even in the absence of text modality. Specifically, we have developed a Double-Flow Self-Distillation Framework, including Unified Modality Cross-Attention (UMCA) and Modality Imagination Autoencoder (MIA), which excels at processing both scenarios with complete modalities and those with missing text modality. In detail, when the text modality is missing, our framework uses the LLM-based model to simulate the text representation from the audio modality, while the MIA module supplements information from the other two modalities to make the simulated text representation similar to the real text representation. To further align the simulated and real representations, and to enable the model to capture the continuous nature of sample orders in sentiment valence regression tasks, we have also introduced the Rank-N Contrast (RNC) loss function. When testing on the CMU-MOSEI, our model achieved outstanding performance on MAE and significantly outperformed other models when text modality is missing. The code is available at: https://github.com/WarmCongee/SDUMC
Published: 2024

4. HypomimiaCoach: An AU-based Digital Therapy System for Hypomimia Detection & Rehabilitation with Parkinson's Disease

Author: Xu, Yingjing, Cai, Xueyan, Zhou, Zihong, Xue, Mengru, Wang, Bo, Wang, Haotian, Li, Zhengke, Weng, Chentian, Luo, Wei, Yao, Cheng, Lin, Bo, and Yin, Jianwei
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence
Abstract: Hypomimia is a non-motor symptom of Parkinson's disease that manifests as delayed facial movements and expressions, along with challenges in articulation and emotion. Currently, subjective evaluation by neurologists is the primary method for hypomimia detection, and conventional rehabilitation approaches heavily rely on verbal prompts from rehabilitation physicians. There remains a deficiency in accessible, user-friendly and scientifically rigorous assistive tools for hypomimia treatments. To investigate this, we developed HypomimiaCoach, an Action Unit (AU)-based digital therapy system for hypomimia detection and rehabilitation in Parkinson's disease. The HypomimiaCoach system was designed to facilitate engagement through the incorporation of both relaxed and controlled rehabilitation exercises, while also stimulating initiative through the integration of digital therapies that incorporated traditional face training methods. We extract Action Unit (AU) features and their relationships for hypomimia detection. In order to facilitate rehabilitation, a series of training programmes have been devised based on the Action Units (AUs) and patients are provided with real-time feedback through an additional AU recognition model, which guides them through their training routines. A pilot study was conducted with seven participants in China, all of whom exhibited symptoms of Parkinson's disease hypomimia. The results of the pilot study demonstrated a positive impact on participants' self-efficacy, with favourable feedback received. Furthermore, physician evaluations validated the system's applicability in a therapeutic setting for patients with Parkinson's disease, as well as its potential value in clinical applications.
Published: 2024

5. BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering

Author: Chu, Zheng, Chen, Jingchang, Chen, Qianglong, Wang, Haotian, Zhu, Kun, Du, Xiyuan, Yu, Weijiang, Liu, Ming, and Qin, Bing
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Large language models (LLMs) have demonstrated strong reasoning capabilities. Nevertheless, they still suffer from factual errors when tackling knowledge-intensive tasks. Retrieval-augmented reasoning represents a promising approach. However, significant challenges still persist, including inaccurate and insufficient retrieval for complex questions, as well as difficulty in integrating multi-source knowledge. To address this, we propose Beam Aggregation Reasoning, BeamAggR, a reasoning framework for knowledge-intensive multi-hop QA. BeamAggR explores and prioritizes promising answers at each hop of the question. Concretely, we parse the complex questions into trees, which include atomic and composite questions, followed by bottom-up reasoning. For atomic questions, the LLM conducts reasoning on multi-source knowledge to get answer candidates. For composite questions, the LLM combines beam candidates, explores multiple reasoning paths through probabilistic aggregation, and prioritizes the most promising trajectory. Extensive experiments on four open-domain multi-hop reasoning datasets show that our method significantly outperforms SOTA methods by 8.5%. Furthermore, our analysis reveals that BeamAggR elicits better knowledge collaboration and answer aggregation., Comment: Accepted to ACL 2024
Published: 2024

6. UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions

Author: Wang, Xunzhi, Zhang, Zhuowei, Li, Qiongyu, Chen, Gaonan, Hu, Mengting, Li, Zhiyu, Luo, Bitong, Gao, Hang, Han, Zhixin, and Wang, Haotian
Subjects: Computer Science - Computation and Language
Abstract: The rapid development of large language models (LLMs) has shown promising practical results. However, their low interpretability often leads to errors in unforeseen circumstances, limiting their utility. Many works have focused on creating comprehensive evaluation systems, but previous benchmarks have primarily assessed problem-solving abilities while neglecting the response's uncertainty, which may result in unreliability. Recent methods for measuring LLM reliability are resource-intensive and unable to test black-box models. To address this, we propose UBENCH, a comprehensive benchmark for evaluating LLM reliability. UBENCH includes 3,978 multiple-choice questions covering knowledge, language, understanding, and reasoning abilities. Experimental results show that UBENCH has achieved state-of-the-art performance, while its single-sampling method significantly saves computational resources compared to baseline methods that require multiple samplings. Additionally, based on UBENCH, we evaluate the reliability of 15 popular LLMs, finding GLM4 to be the most outstanding, closely followed by GPT-4. We also explore the impact of Chain-of-Thought prompts, role-playing prompts, option order, and temperature on LLM reliability, analyzing the varying effects on different LLMs., Comment: Under review
Published: 2024

7. An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation

Author: Zhu, Kun, Feng, Xiaocheng, Du, Xiyuan, Gu, Yuxuan, Yu, Weijiang, Wang, Haotian, Chen, Qianglong, Chu, Zheng, Chen, Jingchang, and Qin, Bing
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Retrieval-augmented generation integrates the capabilities of large language models with relevant information retrieved from an extensive corpus, yet encounters challenges when confronted with real-world noisy data. One recent solution is to train a filter module to find relevant content, but this achieves only suboptimal noise compression. In this paper, we propose to introduce the information bottleneck theory into retrieval-augmented generation. Our approach involves the filtration of noise by simultaneously maximizing the mutual information between compression and ground output, while minimizing the mutual information between compression and retrieved passage. In addition, we derive the formula of information bottleneck to facilitate its application in novel comprehensive evaluations, the selection of supervised fine-tuning data, and the construction of reinforcement learning rewards. Experimental results demonstrate that our approach achieves significant improvements across various question answering datasets, not only in terms of the correctness of answer generation but also in conciseness, with a 2.5% compression rate., Comment: Accepted to ACL 2024
Published: 2024

8. PPA-Game: Characterizing and Learning Competitive Dynamics Among Online Content Creators

Author: Xu, Renzhe, Wang, Haotian, Zhang, Xingxuan, Li, Bo, and Cui, Peng
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Machine Learning
Abstract: We introduce the Proportional Payoff Allocation Game (PPA-Game) to model how agents, akin to content creators on platforms like YouTube and TikTok, compete for divisible resources and consumers' attention. Payoffs are allocated to agents based on heterogeneous weights, reflecting the diversity in content quality among creators. Our analysis reveals that although a pure Nash equilibrium (PNE) is not guaranteed in every scenario, it is commonly observed, with its absence being rare in our simulations. Beyond analyzing static payoffs, we further discuss the agents' online learning about resource payoffs by integrating a multi-player multi-armed bandit framework. We propose an online algorithm facilitating each agent's maximization of cumulative payoffs over $T$ rounds. Theoretically, we establish that the regret of any agent is bounded by $O(\log^{1 + \eta} T)$ for any $\eta > 0$. Empirical results further validate the effectiveness of our approach.
Published: 2024

9. Asteroseismological analysis of the non-Blazhko RRab star EPIC 248846335 in LAMOST-Kepler/K2 project

Author: Zong, Peng, Fu, Jian-Ning, Su, Jie, Hu, Xueying, Zhang, Bo, Wang, Jiaxin, Liu, Gao-Chao, Meng, Gang, Catanzaro, Gianni, Frasca, Antonio, Wang, Haotian, and Zong, Weikai
Subjects: Astrophysics - Solar and Stellar Astrophysics
Abstract: We conduct an asteroseismological analysis on the non-Blazhko ab-type RR Lyrae star EPIC 248846335 employing the Radial Stellar Pulsations (RSP) module of the Modules for Experiments in Stellar Astrophysics (MESA) based on the set of stellar parameters. The atmospheric parameters $T_\mathrm{eff}$ = 6933 $\pm$ 70 K, log $g$ = 3.35 $\pm$ 0.50 and [Fe/H] = -1.18 $\pm$ 0.14 are estimated from the Low-Resolution Spectra of LAMOST DR9. The luminosity $L$ = 49.70$_{-1.80}^{+2.99}$ $L_\odot$ and mass $M$ = 0.56 $\pm$ 0.07 $M_\odot$ are calculated, respectively, using the distance provided by Gaia and the metallicity estimated from the Low-Resolution Spectra. The Fourier parameters of the light curves observed by $K2$ and RV curves determined from the Medium-Resolution Spectra of LAMOST DR10 are also calculated in this work. The period of the fundamental mode of the star and the residuals $r$ of the Fourier parameters between the models and observations serve to select the optimal model, whose stellar parameters are $T_\mathrm{eff}$ = 6700 $\pm$ 220 K, log $g$ = 2.70, [Fe/H] = -1.20 $\pm$ 0.2, $M$ = 0.59 $\pm$ 0.05 $M_\odot$, and $L$ = 56.0 $\pm$ 4.2 $L_\odot$. The projection factors are constrained as 1.20 $\pm$ 0.02 and 1.59 $\pm$ 0.13 by the blue- and red-arm observed velocities with their corresponding RV curves derived from the best-fit model, respectively. The precise determination of stellar parameters in ab-type RR Lyrae stars is crucial for understanding the physical processes that occur during pulsation and for providing a deeper understanding of their Period-Luminosity relationship.
Published: 2024

10. A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Author: Dai, Yusheng, Chen, Hang, Du, Jun, Wang, Ruoyu, Chen, Shihao, Ma, Jiefeng, Wang, Haotian, and Lee, Chin-Hui
Subjects: Computer Science - Sound, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Multimedia, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Advanced Audio-Visual Speech Recognition (AVSR) systems have been observed to be sensitive to missing video frames, performing even worse than single-modality models. While applying the dropout technique to the video modality enhances robustness to missing frames, it simultaneously results in a performance loss when dealing with complete data input. In this paper, we investigate this contrasting phenomenon from the perspective of modality bias and reveal that an excessive modality bias on the audio caused by dropout is the underlying reason. Moreover, we present the Modality Bias Hypothesis (MBH) to systematically describe the relationship between modality bias and robustness against missing modality in multimodal systems. Building on these findings, we propose a novel Multimodal Distribution Approximation with Knowledge Distillation (MDA-KD) framework to reduce over-reliance on the audio modality and to maintain performance and robustness simultaneously. Finally, to address an entirely missing modality, we adopt adapters to dynamically switch decision strategies. The effectiveness of our proposed approach is evaluated and validated through a series of comprehensive experiments using the MISP2021 and MISP2022 datasets. Our code is available at https://github.com/dalision/ModalBiasAVSR, Comment: the paper is accepted by CVPR2024
Published: 2024

11. Identification of metabolic progression and subtypes in progressive supranuclear palsy by PET molecular imaging

Author: Wang, Haotian, Wang, Bo, Liao, Yi, Niu, Jiaqi, Chen, Miao, Chen, Xinhui, Dou, Xiaofeng, Yu, Congcong, Zhong, Yan, Wang, Jing, Jin, Nan, Kang, Yixin, Zhang, Hong, Tian, Mei, and Luo, Wei
Published: 2024

12. Ruthenium-lead oxide for acidic oxygen evolution reaction in proton exchange membrane water electrolysis

Author: Chen, Feng-Yang, Qiu, Chang, Wu, Zhen-Yu, Wi, Tae-Ung, Finfrock, Y. Zou, and Wang, Haotian
Published: 2024

13. Analysis of influence of thermal tooth backlash on nonlinear dynamic characteristics of planetary gear system

Author: Wang, Jingyue, Wu, Zhijian, Wang, Haotian, Ding, Jianming, and Yi, Cai
Published: 2024

14. Electrochemical nitrate reduction to ammonia with cation shuttling in a solid electrolyte reactor

Author: Chen, Feng-Yang, Elgazzar, Ahmad, Pecaut, Stephanie, Qiu, Chang, Feng, Yuge, Ashokkumar, Sushanth, Yu, Zhou, Sellers, Chase, Hao, Shaoyun, Zhu, Peng, and Wang, Haotian
Published: 2024

15. Comparative Study of Anti-Slide Pile Reinforcement Schemes for Expansive Soil Canal Slopes in Cold and Arid Regions

Author: Wang, Haotian, Zhang, Lingkai, Shi, Chong, Zhao, Lingfeng, and Zhang, Yonggang
Published: 2024

16. Flares hunting in hot subdwarf and white dwarf stars from Cycles 1-5 of TESS photometry

Author: Xing, Keyu, Zong, Weikai, Silvotti, Roberto, Fu, Jian-Ning, Charpinet, Stéphane, Cang, Tianqi, Hermes, J. J., Ma, Xiao-Yu, Wang, Haotian, Wang, Xuan, Wu, Tao, and Wang, Jiaxin
Subjects: Astrophysics - Solar and Stellar Astrophysics
Abstract: Stellar flares are critical phenomena on stellar surfaces, which are closely tied to stellar magnetism. While extensively studied in main-sequence (MS) stars, their occurrence in evolved compact stars, specifically hot subdwarfs and white dwarfs (WDs), remains scarcely explored. Based on Cycles 1-5 of TESS photometry, we conducted a pioneering survey of flare events in $\sim12,000$ compact stars, corresponding to $\sim38,000$ light curves with 2-minute cadence. Through dedicated techniques for detrending light curves, identifying preliminary flare candidates, and validating them via machine learning, we established a catalog of 1016 flares from 193 compact stars, including 182 from 58 sdB/sdO stars and 834 from 135 WDs, respectively. However, all flaring compact stars showed signs of contamination from nearby objects or companion stars, preventing sole attribution of the detected flares. For WDs, it is highly probable that the flares originated from their cool MS companions. In contrast, the higher luminosities of sdB/sdO stars diminish companion contributions, suggesting that detected flares originated from sdB/sdO stars themselves or through close magnetic interactions with companions. Focusing on a refined sample of 23 flares from 13 sdB/sdO stars, we found their flare frequency distributions were slightly divergent from those of cool MS stars; instead, they resemble those of hot B/A-type MS stars having radiative envelopes. This similarity implies the flares on sdB/sdO stars, if these flares did originate from them, may share underlying mechanisms with hot MS stars, which warrants further investigation., Comment: 25 pages, 11 figures, 5 tables, accepted for publication in ApJS
Published: 2024

17. Composite Active Learning: Towards Multi-Domain Active Learning with Theoretical Guarantees

Author: Hao, Guang-Yuan, Huang, Hengguan, Wang, Haotian, Gao, Jie, and Wang, Hao
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Neural and Evolutionary Computing
Abstract: Active learning (AL) aims to improve model performance within a fixed labeling budget by choosing the most informative data points to label. Existing AL focuses on the single-domain setting, where all data come from the same domain (e.g., the same dataset). However, many real-world tasks often involve multiple domains. For example, in visual recognition, it is often desirable to train an image classifier that works across different environments (e.g., different backgrounds), where images from each environment constitute one domain. Such a multi-domain AL setting is challenging for prior methods because they (1) ignore the similarity among different domains when assigning labeling budget and (2) fail to handle distribution shift of data across different domains. In this paper, we propose the first general method, dubbed composite active learning (CAL), for multi-domain AL. Our approach explicitly considers the domain-level and instance-level information in the problem; CAL first assigns domain-level budgets according to domain-level importance, which is estimated by optimizing an upper error bound that we develop; with the domain-level budgets, CAL then leverages a certain instance-level query strategy to select samples to label from each domain. Our theoretical analysis shows that our method achieves a better error bound compared to current AL methods. Our empirical results demonstrate that our approach significantly outperforms the state-of-the-art AL methods on both synthetic and real-world multi-domain datasets. Code is available at https://github.com/Wang-ML-Lab/multi-domain-active-learning.
Published: 2024

18. Learning to Break: Knowledge-Enhanced Reasoning in Multi-Agent Debate System

Author: Wang, Haotian, Du, Xiyuan, Yu, Weijiang, Chen, Qianglong, Zhu, Kun, Chu, Zheng, Yan, Lian, and Guan, Yi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Multi-agent debate system (MAD), imitating the process of human discussion in pursuit of truth, aims to align the correct cognition of different agents for the optimal solution. It is challenging to make various agents perform right and highly consistent cognition due to their limited and different knowledge backgrounds (i.e., cognitive islands), which hinders the search for the optimal solution. To address the challenge, we propose a novel Multi-Agent Debate with Knowledge-Enhanced framework (MADKE) to promote the system to find the solution. First, we involve a shared retrieval knowledge pool in the debate process to solve the problem of limited and different knowledge backgrounds. Then, we propose an adaptive knowledge selection method to guarantee the accuracy and personalization of knowledge. This method allows agents to choose whether to use external knowledge in each conversation round according to their own needs. Our experimental results on six datasets show that our method achieves state-of-the-art results compared to existing single-agent and multi-agent methods. Further analysis reveals that the introduction of retrieval knowledge can help the agent to break cognitive islands in the debate process and effectively improve the consistency and correctness of the model. Moreover, MADKE using Qwen1.5-72B-Chat surpasses GPT-4 by +1.26% on average in six datasets, which validates that our method can help open-source LLMs achieve or even surpass the performance of GPT-4. Our code is available at https://github.com/FutureForMe/MADKE., Comment: 18 pages, 10 figures, work in progress
Published: 2023

19. Laser frequency stabilization and photoacoustic detection based on the tapered fiber coupled crystalline resonator

Author: Xu, Yaohui, Liu, Xiaolan, Li, Wujun, Wang, Haotian, Guo, Jun, Ma, Jie, Zhang, Jianing, and Shen, Deyuan
Subjects: Physics - Optics
Abstract: We demonstrate laser frequency stabilization using a high-Q MgF2 crystalline whispering gallery mode resonator coupled with a tapered fiber. We discovered that the tapered fiber, acting as a microcantilever, exhibits mechanical resonance characteristics that are capable of transmitting acoustic perturbations to the frequency locking loop. Both experimental and theoretical investigations into the influence of external acoustic waves on the coupling system were conducted. After acoustic isolation, the locked laser exhibits a minimum frequency noise of 0.4 Hz$^2$/Hz at 7 kHz and an integral linewidth of 68 Hz (0.1 s integration time). Benefiting from the ultralow frequency noise of the stabilized laser, it achieves a minimum noise equivalent acoustic signal level of $4.76 \times 10^{-4}$ Pa/Hz$^{1/2}$. Our results not only facilitate the realization of ultralow noise lasers but also serve as a novel and sensitive photoacoustic detector.
Published: 2023

20. Spatial-temporal dynamic evolution of Lewy body dementia by metabolic PET imaging

Author: Niu, Jiaqi, Zhong, Yan, Xue, Le, Wang, Haotian, Hu, Daoyan, Liao, Yi, Zhang, Xiaohui, Dou, Xiaofeng, Yu, Congcong, Wang, Bo, Sun, Yuan, Tian, Mei, Zhang, Hong, and Wang, Jing
Published: 2024

21. CFD-Based Lift and Drag Estimations of a Novel Flight-Style AUV with Bow-Wings: Insights from Drag Polar Curves and Thrust Estimations

Author: Ahmed, Faheem, Xiang, Xianbo, Wang, Haotian, Xiang, Gong, and Yang, Shaolong
Published: 2024

22. Opencl-pytorch: an OpenCL-based extension of PyTorch

Author: Sui, Yicheng, Sun, Yufei, Shi, Changqing, Wang, Haotian, Zhang, Zhiqiang, Wang, Jiahao, and Zhang, Yuzhi
Published: 2024

23. oclCUB: an OpenCL parallel computing library for deep learning operators

Author: Shi, Changqing, Sun, Yufei, Sui, Yicheng, Chen, Yuqiao, Wang, Haotian, and Zhang, Yuzhi
Published: 2024

24. Diagnosis and classification of gear composite faults based on S-transform and improved 2D convolutional neural network

Author: Zheng, Junwen, Wang, Jingyue, Wang, Haotian, Ding, Jianming, and Yi, Cai
Published: 2024

25. TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models

Author: Chu, Zheng, Chen, Jingchang, Chen, Qianglong, Yu, Weijiang, Wang, Haotian, Liu, Ming, and Qin, Bing
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Grasping the concept of time is a fundamental facet of human cognition, indispensable for truly comprehending the intricacies of the world. Previous studies typically focus on specific aspects of time, lacking a comprehensive temporal reasoning benchmark. To address this, we propose TimeBench, a comprehensive hierarchical temporal reasoning benchmark that covers a broad spectrum of temporal reasoning phenomena. TimeBench provides a thorough evaluation for investigating the temporal reasoning capabilities of large language models. We conduct extensive experiments on GPT-4, LLaMA2, and other popular LLMs under various settings. Our experimental results indicate a significant performance gap between the state-of-the-art LLMs and humans, highlighting that there is still a considerable distance to cover in temporal reasoning. Besides, LLMs exhibit capability discrepancies across different reasoning categories. Furthermore, we thoroughly analyze the impact of multiple aspects on temporal reasoning and emphasize the associated challenges. We aspire for TimeBench to serve as a comprehensive benchmark, fostering research in temporal reasoning. Resources are available at: https://github.com/zchuz/TimeBench, Comment: Accepted to ACL 2024
Published: 2023

26. Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications

Author: Feng, Zhangyin, Ma, Weitao, Yu, Weijiang, Huang, Lei, Wang, Haotian, Chen, Qianglong, Peng, Weihua, Feng, Xiaocheng, Qin, Bing, and Liu, Ting
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) exhibit superior performance on various natural language tasks, but they are susceptible to issues stemming from outdated data and domain-specific limitations. In order to address these challenges, researchers have pursued two primary strategies, knowledge editing and retrieval augmentation, to enhance LLMs by incorporating external information from different aspects. Nevertheless, there is still a notable absence of a comprehensive survey. In this paper, we propose a review to discuss the trends in integration of knowledge and large language models, including taxonomy of methods, benchmarks, and applications. In addition, we conduct an in-depth analysis of different methods and point out potential research directions in the future. We hope this survey offers the community quick access and a comprehensive overview of this research area, with the intention of inspiring future research endeavors., Comment: Work in progress; 22 pages. This work has been submitted to the IEEE for possible publication
Published: 2023

27. A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Author: Huang, Lei, Yu, Weijiang, Ma, Weitao, Zhong, Weihong, Feng, Zhangyin, Wang, Haotian, Chen, Qianglong, Peng, Weihua, Feng, Xiaocheng, Qin, Bing, and Liu, Ting
Subjects: Computer Science - Computation and Language
Abstract: The emergence of large language models (LLMs) has marked a significant breakthrough in natural language processing (NLP), leading to remarkable advancements in text understanding and generation. Nevertheless, alongside these strides, LLMs exhibit a critical tendency to produce hallucinations, resulting in content that is inconsistent with real-world facts or user inputs. This phenomenon poses substantial challenges to their practical deployment and raises concerns over the reliability of LLMs in real-world scenarios, which attracts increasing attention to detect and mitigate these hallucinations. In this survey, we aim to provide a thorough and in-depth overview of recent advances in the field of LLM hallucinations. We begin with an innovative taxonomy of LLM hallucinations, then delve into the factors contributing to hallucinations. Subsequently, we present a comprehensive overview of hallucination detection methods and benchmarks. Additionally, representative approaches designed to mitigate hallucinations are introduced accordingly. Finally, we analyze the challenges that highlight the current limitations and formulate open questions, aiming to delineate pathways for future research on hallucinations in LLMs., Comment: Work in progress; 49 pages
Published: 2023

28. G2-MonoDepth: A General Framework of Generalized Depth Inference from Monocular RGB+X Data

Author: Wang, Haotian, Yang, Meng, and Zheng, Nanning
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Monocular depth inference is a fundamental problem for scene perception of robots. Specific robots may be equipped with a camera plus an optional depth sensor of any type and located in various scenes of different scales, whereas recent advances have derived multiple individual sub-tasks. This adds the burden of fine-tuning models for specific robots and thereby leads to high-cost customization in large-scale industrialization. This paper investigates a unified task of monocular depth inference, which infers high-quality depth maps from all kinds of input raw data from various robots in unseen scenes. A basic benchmark G2-MonoDepth is developed for this task, which comprises four components: (a) a unified data representation RGB+X to accommodate RGB plus raw depth with diverse scene scale/semantics, depth sparsity ([0%, 100%]) and errors (holes/noises/blurs), (b) a novel unified loss to adapt to diverse depth sparsity/errors of input raw data and diverse scales of output scenes, (c) an improved network to propagate diverse scene scales well from input to output, and (d) a data augmentation pipeline to simulate all types of real artifacts in raw depth maps for training. G2-MonoDepth is applied in three sub-tasks including depth estimation, depth completion with different sparsity, and depth enhancement in unseen scenes, and it always outperforms SOTA baselines on both real-world data and synthetic data.
Comment: 18 pages, 16 figures
Published: 2023

29. Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future

Author: Chu, Zheng, Chen, Jingchang, Chen, Qianglong, Yu, Weijiang, He, Tao, Wang, Haotian, Peng, Weihua, Liu, Ming, Qin, Bing, and Liu, Ting
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Reasoning, a fundamental cognitive process integral to human intelligence, has garnered substantial interest within artificial intelligence. Notably, recent studies have revealed that chain-of-thought prompting significantly enhances LLMs' reasoning capabilities, which attracts widespread attention from both academia and industry. In this paper, we systematically investigate relevant research, summarizing advanced methods through a meticulous taxonomy that offers novel perspectives. Moreover, we delve into the current frontiers and delineate the challenges and future directions, thereby shedding light on future research. Furthermore, we engage in a discussion about open questions. We hope this paper serves as an introduction for beginners and fosters future research. Resources have been made publicly available at https://github.com/zchuz/CoT-Reasoning-Survey
Comment: Accepted to ACL 2024
Published: 2023

30. Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023

Author: Wang, Haotian, Xi, Yuxuan, Chen, Hang, Du, Jun, Song, Yan, Wang, Qing, Zhou, Hengshun, Wang, Chenxi, Ma, Jiefeng, Hu, Pengfei, Jiang, Ya, Cheng, Shi, Zhang, Jie, and Weng, Yuzhe
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Multimedia, Computer Science - Sound
Abstract: In this paper, we propose a novel framework for recognizing both discrete and dimensional emotions. In our framework, deep features extracted from foundation models are used as robust acoustic and visual representations of raw video. Three different structures based on attention-guided feature gathering (AFG) are designed for deep feature fusion. Then, we introduce a joint decoding structure for emotion classification and valence regression in the decoding stage. A multi-task loss based on uncertainty is also designed to optimize the whole process. Finally, by combining three different structures on the posterior probability level, we obtain the final predictions of discrete and dimensional emotions. When tested on the dataset of multimodal emotion recognition challenge (MER 2023), the proposed framework yields consistent improvements in both emotion classification and valence regression. Our final system achieves state-of-the-art performance and ranks third on the leaderboard on MER-MULTI sub-challenge.
Comment: 5 pages, 4 figures
Published: 2023
Full Text: View/download PDF

31. Response of tetraploid Citrus wilsonii Tanaka to drought stress by phosphoproteomics analysis

Author: DENG Xixi, REN Kexin, WEI Li'na, WANG Haotian, HU Jia, and JIANG Jinglong
Subjects: drought stress, tetraploid, citrus wilsonii tanaka, protein phosphorylation, Biology (General), QH301-705.5, Botany, QK1-989
Abstract: [Objective] Using phosphoproteomics, expression patterns of the phosphorylated proteins were analyzed in tetraploid Citrus wilsonii leaves under drought stress, aiming to reveal the mechanism of tetraploid C. wilsonii in response to drought stress and provide support for the improvement of drought-tolerant citrus rootstock varieties. [Methods] Phosphorylated proteins in the leaves of tetraploid C. wilsonii after drought stress were identified and analyzed using IMAC affinity enrichment and TMT labeling technology. Functional annotation and metabolic pathway analysis were performed for the differentially expressed phosphorylated proteins. [Results] (1) A total of 3 794 phosphorylation sites and 1 521 phosphorylated proteins were quantified. There were 662 phosphorylated proteins with a fold change exceeding 1.3 (αFC>1.3), which were mainly located in the nucleus (46.07%) and chloroplasts (24.62%). (2) The differentially expressed phosphorylated proteins were mainly involved in binding RNA and Ca2+, and participating in metabolic pathways such as RNA splicing, photosynthesis, and SNARE interaction in vesicle transport. (3) RT-qPCR results showed that 92.86% of the genes coding the differentially expressed phosphorylated proteins showed a similar trend of change at the transcriptional and protein levels, with a Pearson correlation coefficient of 0.893. [Conclusion] Tetraploid C. wilsonii regulates the proteins involved in RNA splicing and the photosynthesis pathway through phosphorylation in response to drought stress.
Published: 2024
Full Text: View/download PDF

32. Breast Ultrasound Tumor Classification Using a Hybrid Multitask CNN-Transformer Network

Author: Shareef, Bryar, Xian, Min, Vakanski, Aleksandar, and Wang, Haotian
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Capturing global contextual information plays a critical role in breast ultrasound (BUS) image classification. Although convolutional neural networks (CNNs) have demonstrated reliable performance in tumor classification, they have inherent limitations for modeling global and long-range dependencies due to the localized nature of convolution operations. Vision Transformers have an improved capability of capturing global contextual information but may distort the local image patterns due to the tokenization operations. In this study, we proposed a hybrid multitask deep neural network called Hybrid-MT-ESTAN, designed to perform BUS tumor classification and segmentation using a hybrid architecture composed of CNNs and Swin Transformer components. The proposed approach was compared to nine BUS classification methods and evaluated using seven quantitative metrics on a dataset of 3,320 BUS images. The results indicate that Hybrid-MT-ESTAN achieved the highest accuracy, sensitivity, and F1 score of 82.7%, 86.4%, and 86.0%, respectively.
Comment: 10 pages, 3 figures, 3 tables
Published: 2023

33. Transactional Indexes on (RDMA or CXL-based) Disaggregated Memory with Repairable Transaction

Author: Wei, Xingda, Wang, Haotian, Wang, Tianxia, Chen, Rong, Gu, Jinyu, Zuo, Pengfei, and Chen, Haibo
Subjects: Computer Science - Databases
Abstract: The failure-atomic and isolated execution of clients' operations is a default requirement for a system that serves multiple loosely coupled clients at a server. However, disaggregated memory breaks this requirement in remote indexes because a client operation is disaggregated to multiple remote reads/writes. Current indexes focus on performance improvements and largely ignore tolerating client failures. We argue that a practical DM index should be transactional: each index operation should be failure atomic and isolated in addition to being concurrency isolated. We present repairable transaction (rTX), a lightweight primitive to execute DM index operations. Each rTX can detect other failed rTXes on-the-fly with the help of concurrency control. Upon detection, it will repair their non-atomic updates online with the help of logging, thus hiding their failures from healthy clients. By further removing unnecessary logging and delegating concurrency control to existing carefully-tuned index algorithms, we show that transactional indexes can be built at a low performance overhead on disaggregated memory. We have refactored two state-of-the-art DM indexes, RaceHashing and Sherman (B+Tree), with rTX. Evaluations show that rTX is 1.2 to 2X faster than other alternatives, e.g., distributed transaction. Meanwhile, its overhead is up to 42% compared to non-fault-tolerant indexes.
Published: 2023

34. Associations of metabolic changes and polygenic risk scores with cardiovascular outcomes and all-cause mortality across BMI categories: a prospective cohort study

Author: Li, Cancan, Meng, Xiaoni, Zhang, Jie, Wang, Haotian, Lu, Huimin, Cao, Meiling, Sun, Shengzhi, and Wang, Youxin
Published: 2024
Full Text: View/download PDF

35. Association between inflammatory bowel disease and cancer risk: evidence triangulation from genetic correlation, Mendelian randomization, and colocalization analyses across East Asian and European populations

Author: Liu, Di, Cao, Meiling, Wang, Haotian, Cao, Weijie, Zheng, Chenguang, Li, Yun, and Wang, Youxin
Published: 2024
Full Text: View/download PDF

36. Clothing-change person re-identification based on fusion of RGB modality and gait features

Author: Tu, Hongbin, Liu, Chao, Peng, Yuanyuan, Xiong, Haibo, and Wang, Haotian
Published: 2024
Full Text: View/download PDF

37. Char formation and smoke suppression mechanism of montmorillonite modified by ammonium polyphosphate/silane towards fire safety enhancement for wood composites

Author: Zhang, Liangliang, Niu, Kangren, Wang, Haotian, Wang, Jiamin, Liu, Meihong, Lei, Yafang, and Yan, Li
Published: 2024
Full Text: View/download PDF

38. Competing for Shareable Arms in Multi-Player Multi-Armed Bandits

Author: Xu, Renzhe, Wang, Haotian, Zhang, Xingxuan, Li, Bo, and Cui, Peng
Subjects: Computer Science - Machine Learning, Computer Science - Computers and Society, Computer Science - Computer Science and Game Theory, Computer Science - Multiagent Systems
Abstract: Competitions for shareable and limited resources have long been studied with strategic agents. In reality, agents often have to learn and maximize the rewards of the resources at the same time. To design an individualized competing policy, we model the competition between agents in a novel multi-player multi-armed bandit (MPMAB) setting where players are selfish and aim to maximize their own rewards. In addition, when several players pull the same arm, we assume that these players share the arm's reward equally in expectation. Under this setting, we first analyze the Nash equilibrium when arms' rewards are known. Subsequently, we propose a novel Selfish MPMAB with Averaging Allocation (SMAA) approach based on the equilibrium. We theoretically demonstrate that SMAA could achieve a good regret guarantee for each player when all players follow the algorithm. Additionally, we establish that no single selfish player can significantly increase their rewards through deviation, nor can they detrimentally affect other players' rewards without incurring substantial losses for themselves. We finally validate the effectiveness of the method in extensive synthetic experiments.
Comment: ICML 2023
Published: 2023

39. E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition

Author: Zhang, Zhen, Hu, Mengting, Zhao, Shiwan, Huang, Minlie, Wang, Haotian, Liu, Lemao, Zhang, Zhirui, Liu, Zhe, and Wu, Bingzhe
Subjects: Computer Science - Computation and Language
Abstract: Most named entity recognition (NER) systems focus on improving model performance, ignoring the need to quantify model uncertainty, which is critical to the reliability of NER systems in open environments. Evidential deep learning (EDL) has recently been proposed as a promising solution to explicitly model predictive uncertainty for classification tasks. However, directly applying EDL to NER applications faces two challenges, i.e., the problems of sparse entities and OOV/OOD entities in NER tasks. To address these challenges, we propose a trustworthy NER framework named E-NER by introducing two uncertainty-guided loss terms to the conventional EDL, along with a series of uncertainty-guided training strategies. Experiments show that E-NER can be applied to multiple NER paradigms to obtain accurate uncertainty estimation. Furthermore, compared to state-of-the-art baselines, the proposed method achieves a better OOV/OOD detection performance and better generalization ability on OOV entities.
Comment: accepted by ACL Findings (2023)
Published: 2023

40. Large Language Models are Few-Shot Summarizers: Multi-Intent Comment Generation via In-Context Learning

Author: Geng, Mingyang, Wang, Shangwen, Dong, Dezun, Wang, Haotian, Li, Ge, Jin, Zhi, Mao, Xiaoguang, and Liao, Xiangke
Subjects: Computer Science - Software Engineering
Abstract: Code comment generation aims at generating natural language descriptions for a code snippet to facilitate developers' program comprehension activities. Despite being studied for a long time, a bottleneck for existing approaches is that given a code snippet, they can only generate one comment while developers usually need to know information from diverse perspectives such as what is the functionality of this code snippet and how to use it. To tackle this limitation, this study empirically investigates the feasibility of utilizing large language models (LLMs) to generate comments that can fulfill developers' diverse intents. Our intuition is based on the facts that (1) the code and its pairwise comment are used during the pre-training process of LLMs to build the semantic connection between the natural language and programming language, and (2) comments in the real-world projects, which are collected for the pre-training, usually contain different developers' intents. We thus postulate that the LLMs can already understand the code from different perspectives after the pre-training. Indeed, experiments on two large-scale datasets demonstrate the rationale of our insights: by adopting the in-context learning paradigm and giving adequate prompts to the LLM (e.g., providing it with ten or more examples), the LLM can significantly outperform a state-of-the-art supervised learning approach on generating comments with multiple intents. Results also show that customized strategies for constructing the prompts and post-processing strategies for reranking the results can both boost the LLM's performances, which shed light on future research directions for using LLMs to achieve comment generation.
Comment: Accepted by the 46th International Conference on Software Engineering (ICSE 2024)
Published: 2023

41. A Study of Pulsation properties of 57 Non-Blazhko effect ab-type RR Lyrae stars with homogeneous metallicities from the LAMOST-Kepler/K2 survey

Author: Zong, Peng, Fu, Jian-Ning, Wang, Jiaxin, Cang, Tian-Qi, Wang, HaoTian, Ma, Xiao-Yu, and Zong, Weikai
Subjects: Astrophysics - Solar and Stellar Astrophysics
Abstract: Homogeneous metallicities and continuous high-precision light curves play key roles in studying the pulsation properties of RR Lyrae stars. By cross-matching with LAMOST DR6, we have determined 7 and 50 Non-Blazhko RRab stars in the Kepler and K2 fields, respectively, which have homogeneous metallicities determined from low-resolution spectra of the LAMOST-Kepler/K2 project. The Fourier Decomposition method is applied to the light curves of these stars provided by the Kepler space-based telescope to determine the fundamental pulsation periods and the pulsation parameters. The calculated amplitude ratios R21 and R31 and the phase differences φ21 and φ31 are consistent with the parameters of the RRab stars in both the Globular Clusters and the Large Magellanic Cloud. We find a linear relationship between the phase differences φ21 and φ31, which is in good agreement with the results in previous literature. As for the amplitude, we find that the amplitude of the primary frequency A1 and the total amplitude Atot follow either a cubic or linear relationship. For the rise time RT, we do not find a correlation with the period of the fundamental pulsation mode P1, or with Atot and φ21. However, it might follow a linear relationship with R31. Based on the homogeneous metallicities, we have derived a new calibration formula for the period-φ31-[Fe/H] relationship, which agrees well with the previous studies.
Published: 2023
Full Text: View/download PDF

42. Enhanced Sharp-GAN For Histopathology Image Synthesis

Author: Butte, Sujata, Wang, Haotian, Vakanski, Aleksandar, and Xian, Min
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Histopathology image synthesis aims to address the data shortage issue in training deep learning approaches for accurate cancer detection. However, existing methods struggle to produce realistic images that have accurate nuclei boundaries and fewer artifacts, which limits the application in downstream tasks. To address the challenges, we propose a novel approach that enhances the quality of synthetic images by using nuclei topology and contour regularization. The proposed approach uses the skeleton map of nuclei to integrate nuclei topology and separate touching nuclei. In the loss function, we propose two new contour regularization terms that enhance the contrast between contour and non-contour pixels and increase the similarity between contour pixels. We evaluate the proposed approach on two datasets using image quality metrics and a downstream task (nuclei segmentation). The proposed approach outperforms Sharp-GAN in all four image quality metrics on two datasets. By integrating 6k synthetic images from the proposed approach into training, a nuclei segmentation model achieves the state-of-the-art segmentation performance on the TNBC dataset, and its detection quality (DQ), segmentation quality (SQ), panoptic quality (PQ), and aggregated Jaccard index (AJI) are 0.855, 0.863, 0.691, and 0.683, respectively.
Published: 2023

43. Frequency Enhanced Carbon Dioxide Emissions Forecasting Model with Missing Values Encoding

Author: Yu, Zhenda, Wang, Haotian, Li, Zerui, Li, Kun, Ma, Dawei, Lv, Wenjun, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Huang, De-Shuang, editor, Zhang, Chuanlei, editor, and Pan, Yijie, editor
Published: 2024
Full Text: View/download PDF

44. Design and Experiments of a Regolith Environment Simulator for Lunar Polar Region Exploration

Author: Zhong, Peineng, Xu, Jinchang, Wang, Lusi, Zhao, Zeng, Sun, Qichen, Zeng, Ting, Wang, Haotian, Liu, Jiabin, Wu, Riyue, Dong, Jiawei, Xu, Kun, Zhang, Tao, Ceccarelli, Marco, Series Editor, Corves, Burkhard, Advisory Editor, Glazunov, Victor, Advisory Editor, Hernández, Alfonso, Advisory Editor, Huang, Tian, Advisory Editor, Jauregui Correa, Juan Carlos, Advisory Editor, Takeda, Yukio, Advisory Editor, Agrawal, Sunil K., Advisory Editor, Tan, Jianrong, editor, Liu, Yu, editor, Huang, Hong-Zhong, editor, Yu, Jingjun, editor, and Wang, Zequn, editor
Published: 2024
Full Text: View/download PDF

45. Blockchain-Based Multi-factor K-Anonymity Group Location Privacy Protection Scheme

Author: Wang, Haotian, Wang, Shang, Zhao, Mingzhu, Yu, Meiju, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Sun, Yuqing, editor, Lu, Tun, editor, Wang, Tong, editor, Fan, Hongfei, editor, Liu, Dongning, editor, and Du, Bowen, editor
Published: 2024
Full Text: View/download PDF

46. Stacking order and interlayer coupling tuning the properties of charge density waves in layered 1T-NbSe₂

Author: Jiang, Tao, Wang, Haotian, Gao, Heng, Zheng, Qinghe, Li, Zhenya, and Ren, Wei
Subjects: Condensed Matter - Strongly Correlated Electrons
Abstract: Layered transition metal dichalcogenide 1T-NbSe₂ is a good candidate to explore the charge density wave (CDW) and Mott physics. However, the effects of stacking orders and interlayer coupling in CDW 1T-NbSe₂ are still less explored and understood. Using density functional theory calculations, we present a systematic study of the electronic and magnetic properties of monolayer and layered CDW 1T-NbSe₂. Our results indicate that monolayer CDW 1T-NbSe₂ is a magnetic insulator with √13 × √13 periodic lattice modulation. Nevertheless, the magnetic properties of bilayer CDW 1T-NbSe₂ are found to depend on the stacking order. The mechanism is understood by the changes of local magnetic moments in each layer due to spin charge transfer between layers. Furthermore, the bulk CDW 1T-NbSe₂ opens a band gap of 0.02 eV in a 1 × 1 × 2 supercell due to the interlayer spin coupling. We also discover that the electronic structures of layered 1T-NbSe₂ show a strong dependence on stacking configurations and dimensionality.
Published: 2022

47. SIAN: Style-Guided Instance-Adaptive Normalization for Multi-Organ Histopathology Image Synthesis

Author: Wang, Haotian, Xian, Min, Vakanski, Aleksandar, and Shareef, Bryar
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Existing deep neural networks for histopathology image synthesis cannot generate image styles that align with different organs, and cannot produce accurate boundaries of clustered nuclei. To address these issues, we propose a style-guided instance-adaptive normalization (SIAN) approach to synthesize realistic color distributions and textures for histopathology images from different organs. SIAN contains four phases: semantization, stylization, instantiation, and modulation. The first two phases synthesize image semantics and styles by using semantic maps and learned image style vectors. The instantiation module integrates geometrical and topological information and generates accurate nuclei boundaries. We validate the proposed approach on a multiple-organ dataset. Extensive experimental results demonstrate that the proposed method generates more realistic histopathology images than four state-of-the-art approaches for five organs. By incorporating synthetic images from the proposed approach into model training, an instance segmentation network can achieve state-of-the-art performance.
Published: 2022

48. Study on propagation properties of fractional soliton in the inhomogeneous fiber with higher-order effects

Author: Liu, Muwei, Wang, Haotian, Yang, Hujiang, and Liu, Wenjun
Published: 2024
Full Text: View/download PDF

49. Environmental cadmium pollution and health risk assessment in rice–wheat rotation area around a smelter

Author: Liu, Hailong, Wang, Hu, Zhou, Jun, Zhang, Ying, Wang, Haotian, Li, Min, and Wang, Xiaozhi
Published: 2024
Full Text: View/download PDF

50. AUV-assisted information collection scheme with energy balance and low delay of underwater things

Author: Chi, Dingwen, Tao, Jun, Hu, Yulai, Wang, Haotian, Wang, Zuyan, and Xu, Yifan
Published: 2024
Full Text: View/download PDF
