Author: "Shen, Han" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Shen, Han"' showing total 3,406 results

Start Over Author "Shen, Han"

3,406 results on '"Shen, Han"'

1. Mitigating Forgetting in LLM Supervised Fine-Tuning and Preference Learning

Author: Fernando, Heshan, Shen, Han, Ram, Parikshit, Zhou, Yi, Samulowitz, Horst, Baracaldo, Nathalie, and Chen, Tianyi
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Post-training of pre-trained LLMs, which typically consists of the supervised fine-tuning (SFT) stage and the preference learning (RLHF or DPO) stage, is crucial to effective and safe LLM applications. The widely adopted approach in post-training popular open-source LLMs is to sequentially perform SFT and RLHF/DPO. However, sequential training is sub-optimal in terms of SFT and RLHF/DPO trade-off: the LLM gradually forgets about the first stage's training when undergoing the second stage's training. We theoretically prove the sub-optimality of sequential post-training. Furthermore, we propose a practical joint post-training framework with theoretical convergence guarantees and empirically outperforms sequential post-training framework, while having similar computational cost. Our code is available at https://github.com/heshandevaka/XRIGHT.
Published: 2024

2. SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection

Author: Shen, Han, Chen, Pin-Yu, Das, Payel, and Chen, Tianyi
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Fine-tuning on task-specific data to boost downstream performance is a crucial step for leveraging Large Language Models (LLMs). However, previous studies have demonstrated that fine-tuning the models on several adversarial samples or even benign data can greatly comprise the model's pre-equipped alignment and safety capabilities. In this work, we propose SEAL, a novel framework to enhance safety in LLM fine-tuning. SEAL learns a data ranker based on the bilevel optimization to up rank the safe and high-quality fine-tuning data and down rank the unsafe or low-quality ones. Models trained with SEAL demonstrate superior quality over multiple baselines, with 8.5% and 9.7% win rate increase compared to random selection respectively on Llama-3-8b-Instruct and Merlinite-7b models. Our code is available on github https://github.com/hanshen95/SEAL.
Published: 2024

3. FedNE: Surrogate-Assisted Federated Neighbor Embedding for Dimensionality Reduction

Author: Li, Ziwei, Wang, Xiaoqi, Chen, Hong-You, Shen, Han-Wei, and Chao, Wei-Lun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Federated learning (FL) has rapidly evolved as a promising paradigm that enables collaborative model training across distributed participants without exchanging their local data. Despite its broad applications in fields such as computer vision, graph learning, and natural language processing, the development of a data projection model that can be effectively used to visualize data in the context of FL is crucial yet remains heavily under-explored. Neighbor embedding (NE) is an essential technique for visualizing complex high-dimensional data, but collaboratively learning a joint NE model is difficult. The key challenge lies in the objective function, as effective visualization algorithms like NE require computing loss functions among pairs of data. In this paper, we introduce \textsc{FedNE}, a novel approach that integrates the \textsc{FedAvg} framework with the contrastive NE technique, without any requirements of shareable data. To address the lack of inter-client repulsion which is crucial for the alignment in the global embedding space, we develop a surrogate loss function that each client learns and shares with each other. Additionally, we propose a data-mixing strategy to augment the local data, aiming to relax the problems of invisible neighbors and false neighbors constructed by the local $k$NN graphs. We conduct comprehensive experiments on both synthetic and real-world datasets. The results demonstrate that our \textsc{FedNE} can effectively preserve the neighborhood data structures and enhance the alignment in the global embedding space compared to several baseline methods.
Published: 2024

4. Regularized Multi-Decoder Ensemble for an Error-Aware Scene Representation Network

Author: Xiong, Tianyu, Wurster, Skylar W., Guo, Hanqi, Peterka, Tom, and Shen, Han-Wei
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Human-Computer Interaction
Abstract: Feature grid Scene Representation Networks (SRNs) have been applied to scientific data as compact functional surrogates for analysis and visualization. As SRNs are black-box lossy data representations, assessing the prediction quality is critical for scientific visualization applications to ensure that scientists can trust the information being visualized. Currently, existing architectures do not support inference time reconstruction quality assessment, as coordinate-level errors cannot be evaluated in the absence of ground truth data. We propose a parameter-efficient multi-decoder SRN (MDSRN) ensemble architecture consisting of a shared feature grid with multiple lightweight multi-layer perceptron decoders. MDSRN can generate a set of plausible predictions for a given input coordinate to compute the mean as the prediction of the multi-decoder ensemble and the variance as a confidence score. The coordinate-level variance can be rendered along with the data to inform the reconstruction quality, or be integrated into uncertainty-aware volume visualization algorithms. To prevent the misalignment between the quantified variance and the prediction quality, we propose a novel variance regularization loss for ensemble learning that promotes the Regularized multi-decoder SRN (RMDSRN) to obtain a more reliable variance that correlates closely to the true model error. We comprehensively evaluate the quality of variance quantification and data reconstruction of Monte Carlo Dropout, Mean Field Variational Inference, Deep Ensemble, and Predicting Variance compared to the proposed MDSRN and RMDSRN across diverse scalar field datasets. We demonstrate that RMDSRN attains the most accurate data reconstruction and competitive variance-error correlation among uncertain SRNs under the same neural network parameter budgets., Comment: To be published in Proc. IEEE VIS 2024
Published: 2024

5. SurroFlow: A Flow-Based Surrogate Model for Parameter Space Exploration and Uncertainty Quantification

Author: Shen, Jingyi, Duan, Yuhan, and Shen, Han-Wei
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Human-Computer Interaction
Abstract: Existing deep learning-based surrogate models facilitate efficient data generation, but fall short in uncertainty quantification, efficient parameter space exploration, and reverse prediction. In our work, we introduce SurroFlow, a novel normalizing flow-based surrogate model, to learn the invertible transformation between simulation parameters and simulation outputs. The model not only allows accurate predictions of simulation outcomes for a given simulation parameter but also supports uncertainty quantification in the data generation process. Additionally, it enables efficient simulation parameter recommendation and exploration. We integrate SurroFlow and a genetic algorithm as the backend of a visual interface to support effective user-guided ensemble simulation exploration and visualization. Our framework significantly reduces the computational costs while enhancing the reliability and exploration capabilities of scientific surrogate models., Comment: To be published in Proc. IEEE VIS 2024
Published: 2024

6. USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

Author: Wang, Xiaoqi, He, Wenbin, Xuan, Xiwei, Sebastian, Clint, Ono, Jorge Piazentin, Li, Xin, Behpour, Sima, Doan, Thang, Gou, Liang, Shen, Han Wei, and Ren, Liu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The open-vocabulary image segmentation task involves partitioning images into semantically meaningful segments and classifying them with flexible text-defined categories. The recent vision-based foundation models such as the Segment Anything Model (SAM) have shown superior performance in generating class-agnostic image segments. The main challenge in open-vocabulary image segmentation now lies in accurately classifying these segments into text-defined categories. In this paper, we introduce the Universal Segment Embedding (USE) framework to address this challenge. This framework is comprised of two key components: 1) a data pipeline designed to efficiently curate a large amount of segment-text pairs at various granularities, and 2) a universal segment embedding model that enables precise segment classification into a vast range of text-defined categories. The USE model can not only help open-vocabulary image segmentation but also facilitate other downstream tasks (e.g., querying and ranking). Through comprehensive experimental studies on semantic segmentation and part segmentation benchmarks, we demonstrate that the USE framework outperforms state-of-the-art open-vocabulary segmentation methods.
Published: 2024

7. MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing

Author: Wang, Cong, Kang, Di, Sun, He-Yi, Qian, Shen-Han, Wang, Zi-Xuan, Bao, Linchao, and Zhang, Song-Hai
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Creating high-fidelity head avatars from multi-view videos is a core issue for many AR/VR applications. However, existing methods usually struggle to obtain high-quality renderings for all different head components simultaneously since they use one single representation to model components with drastically different characteristics (e.g., skin vs. hair). In this paper, we propose a Hybrid Mesh-Gaussian Head Avatar (MeGA) that models different head components with more suitable representations. Specifically, we select an enhanced FLAME mesh as our facial representation and predict a UV displacement map to provide per-vertex offsets for improved personalized geometric details. To achieve photorealistic renderings, we obtain facial colors using deferred neural rendering and disentangle neural textures into three meaningful parts. For hair modeling, we first build a static canonical hair using 3D Gaussian Splatting. A rigid transformation and an MLP-based deformation field are further applied to handle complex dynamic expressions. Combined with our occlusion-aware blending, MeGA generates higher-fidelity renderings for the whole head and naturally supports more downstream tasks. Experiments on the NeRSemble dataset demonstrate the effectiveness of our designs, outperforming previous state-of-the-art methods and supporting various editing functionalities, including hairstyle alteration and texture editing., Comment: Project page: https://conallwang.github.io/MeGA_Pages/
Published: 2024

8. Compressing and Interpreting Word Embeddings with Latent Space Regularization and Interactive Semantics Probing

Author: Li, Haoyu, Wang, Junpeng, Zheng, Yan, Wang, Liang, Zhang, Wei, and Shen, Han-Wei
Subjects: Computer Science - Human-Computer Interaction
Abstract: Word embedding, a high-dimensional (HD) numerical representation of words generated by machine learning models, has been used for different natural language processing tasks, e.g., translation between two languages. Recently, there has been an increasing trend of transforming the HD embeddings into a latent space (e.g., via autoencoders) for further tasks, exploiting various merits the latent representations could bring. To preserve the embeddings' quality, these works often map the embeddings into an even higher-dimensional latent space, making the already complicated embeddings even less interpretable and consuming more storage space. In this work, we borrow the idea of $\beta$VAE to regularize the HD latent space. Our regularization implicitly condenses information from the HD latent space into a much lower-dimensional space, thus compressing the embeddings. We also show that each dimension of our regularized latent space is more semantically salient, and validate our assertion by interactively probing the encoding-level of user-proposed semantics in the dimensions. To the end, we design a visual analytics system to monitor the regularization process, explore the HD latent space, and interpret latent dimensions' semantics. We validate the effectiveness of our embedding regularization and interpretation approach through both quantitative and qualitative evaluations.
Published: 2024
Full Text: View/download PDF

9. On the chemical and kinematic signatures of the resonances of the Galactic bar as revealed by the LAMOST-APOGEE red clump stars

Author: Sun, Weixiang, Shen, Han, Jiang, Biwei, and Liu, Xiaowei
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: The Milky Way is widely considered to exhibit features of a rotational bar or quadrupole bar. In either case, the feature of the resonance of the Galactic bar should be present in the properties of the chemistry and kinematics, over a large area of the disk. With a sample of over 170,000 red clump (RC) stars from LAMOST-APOGEE data, we attempt to detect the chemical and kinematic signatures of the resonances of the Galactic bar, within 4.0 $\leq$ $R$ $\leq$ 15.0 kpc and $|Z|$ $\leq$ 3.0 kpc. The measurement of the $\Delta$[Fe/H]/$\Delta|Z|$ $-$ $R$ with subtracted the global profiles trends, shows that the thin and thick disks values are Cor_$\Delta$[Fe/H]/$\Delta|Z|$ = 0.010 $\mathrm{sin}$ (1.598 $R$ + 2.551) and Cor_$\Delta$[Fe/H]/$\Delta|Z|$ = 0.006 $\mathrm{sin}$ (1.258 $R$ $-$ 0.019), respectively. The analysis of the tilt angle of the velocity ellipsoid indicates that the thin and thick disks are accurately described as $\alpha$ = $\alpha_{0}$ arctan (Z/R), with $\alpha_{0}$ = 0.198 $\mathrm{sin}$ (0.853 $R$ + 1.982) + 0.630 and $\alpha_{0}$ = 0.220 $\mathrm{sin}$ (0.884 $R$ + 2.012) + 0.679 for thin and thick disks, respectively. These periodic oscillations in Cor_$\Delta$[Fe/H]/$\Delta|Z|$ and $\alpha_{0}$ with $R$ appear in both thin and thick disks, are the most likely chemical and kinematic signatures of the resonance of the Galactic bar. The difference in the phase of the functions of the fitted periodic oscillations for the thin and thick disks may be related to the presence of a second Galactic bar., Comment: 6 pages, 6 figures, 1 table, accepted for publication in ApJL
Published: 2024

10. Mapping the Chemo-dynamics of the Galactic disk using the LAMOST and APOGEE red clump stars

Author: Sun, Weixiang, Shen, Han, Jiang, Biwei, and Liu, Xiaowei
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: A detailed measurement is made of the metallicity distributions, kinematics and dynamics of the thin and thick disks, across a large disk volume (5.0 $\leq$ $R$ $\leq$ 15.0 kpc and $|Z|$ $\leq$3.0 kpc), by using the LAMOST-APOGEE red clump stars. The metallicity distributions results show that the radial metallicity gradient $\Delta$[Fe/H]/$\Delta$R of the thin disk weakens with $|Z|$ from $-$0.06 dex kpc$^{-1}$ at around $|Z|$ $<$ 0.25 kpc to $-$0.02 dex kpc$^{-1}$ at around $|Z|$ $>$ 2.75 kpc, while the thick disk displays a global weak positive $\Delta$[Fe/H]/$\Delta$R, generally weaker than 0.01 dex kpc$^{-1}$. The vertical metallicity gradient $\Delta$[Fe/H]/$\Delta|Z|$ weakened steadily from $-$0.36 dex kpc$^{-1}$ at $R$ $\sim$ 5.5 kpc to $-$0.05 dex kpc$^{-1}$ at around R $>$ 11.5 kpc for the thin disk, while the thick disk presents an almost constant value (nearly $-$0.06 $\sim$ $-$0.08 dex kpc$^{-1}$) for all the $R$ bins. These results indicate the contribution of the radial migration to the disk evolution, and the obvious north-south asymmetry in [Fe/H] may be linked to the disk warp and/or the disk perturbation events. The oscillations of the corrected $\Delta$[Fe/H]/$\Delta|Z|$ with $R$ are likely because of the resonances with the Galactic Bar. Our detailed measurements of $\Delta$V$_{\phi}$/$\Delta$[Fe/H] indicate an "inside-out" and "upside-down" star formation scenario for the thick disk. The results of eccentricity distributions and [$\alpha$/Fe]--velocity dispersion relations are likely to suggest that the thick disk stars require an obvious contribution from other heating mechanisms such as merger and accretion, or born in the chaotic mergers of gas-rich systems and/or turbulent interstellar medium., Comment: 12 pages, 11 figures, 1 table, accepted for publication in ApJS
Published: 2024

11. Improving Efficiency of Iso-Surface Extraction on Implicit Neural Representations Using Uncertainty Propagation

Author: Li, Haoyu and Shen, Han-Wei
Subjects: Computer Science - Graphics, Computer Science - Machine Learning
Abstract: Implicit Neural representations (INRs) are widely used for scientific data reduction and visualization by modeling the function that maps a spatial location to a data value. Without any prior knowledge about the spatial distribution of values, we are forced to sample densely from INRs to perform visualization tasks like iso-surface extraction which can be very computationally expensive. Recently, range analysis has shown promising results in improving the efficiency of geometric queries, such as ray casting and hierarchical mesh extraction, on INRs for 3D geometries by using arithmetic rules to bound the output range of the network within a spatial region. However, the analysis bounds are often too conservative for complex scientific data. In this paper, we present an improved technique for range analysis by revisiting the arithmetic rules and analyzing the probability distribution of the network output within a spatial region. We model this distribution efficiently as a Gaussian distribution by applying the central limit theorem. Excluding low probability values, we are able to tighten the output bounds, resulting in a more accurate estimation of the value range, and hence more accurate identification of iso-surface cells and more efficient iso-surface extraction on INRs. Our approach demonstrates superior performance in terms of the iso-surface extraction time on four datasets compared to the original range analysis method and can also be generalized to other geometric query tasks., Comment: Accepted to IEEE Transactions on Visualization and Computer Graphics, presented in VIS 2024
Published: 2024
Full Text: View/download PDF

12. Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Author: Shen, Han, Yang, Zhuoran, and Chen, Tianyi
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Bilevel optimization has been recently applied to many machine learning tasks. However, their applications have been restricted to the supervised learning setting, where static objective functions with benign structures are considered. But bilevel problems such as incentive design, inverse reinforcement learning (RL), and RL from human feedback (RLHF) are often modeled as dynamic objective functions that go beyond the simple static objective structures, which pose significant challenges of using existing bilevel solutions. To tackle this new class of bilevel problems, we introduce the first principled algorithmic framework for solving bilevel RL problems through the lens of penalty formulation. We provide theoretical studies of the problem landscape and its penalty-based (policy) gradient algorithms. We demonstrate the effectiveness of our algorithms via simulations in the Stackelberg Markov game, RL from human feedback and incentive design., Comment: Shorter version accepted to ICML 2024
Published: 2024

13. Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization

Author: Saif, A F M, Cui, Xiaodong, Shen, Han, Lu, Songtao, Kingsbury, Brian, and Chen, Tianyi
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: In this paper, we present a novel bilevel optimization-based training approach to training acoustic models for automatic speech recognition (ASR) tasks that we term {bi-level joint unsupervised and supervised training (BL-JUST)}. {BL-JUST employs a lower and upper level optimization with an unsupervised loss and a supervised loss respectively, leveraging recent advances in penalty-based bilevel optimization to solve this challenging ASR problem with affordable complexity and rigorous convergence guarantees.} To evaluate BL-JUST, extensive experiments on the LibriSpeech and TED-LIUM v2 datasets have been conducted. BL-JUST achieves superior performance over the commonly used pre-training followed by fine-tuning strategy., Comment: This paper has been accepted in ICASSP-2024 conference
Published: 2024

14. Fabrication and Properties of a New Reactive Diluent for Cationic UV Curing

Author: Wu, Zhengsen, Huang, Biwu, Liu, Yuansheng, and Shen, Han
Published: 2024
Full Text: View/download PDF

15. Mapping the Galactic disk with the LAMOST and Gaia Red clump sample: VIII: Mapping the kinematics of the Galactic disk using mono-age and mono-abundance stellar populations

Author: Sun, Weixiang, Huang, Yang, Shen, Han, Wang, Chun, Zhang, Huawei, Tian, Zhijia, Liu, Xiaowei, and Jiang, Biwei
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: We present a comprehensive study of the kinematic properties of the different Galactic disk populations, as defined by the chemical abundance ratios and stellar ages, across a large disk volume (4.5 $\leq$ R $\leq$ 15.0 kpc and $|Z|$ $\leq$ 3.0 kpc), by using the LAMOST-Gaia red clump sample stars. We determine the median velocities for various spatial and population bins, finding large-scale bulk motions, such as the wave-like behavior in radial velocity, the north-south discrepancy in azimuthal velocity and the warp signal in vertical velocity, and the amplitudes and spatial-dependences of those bulk motions show significant variations for different mono-age and mono-abundance populations. The global spatial behaviors of the velocity dispersions clearly show a signal of spiral arms and, a signal of the disk perturbation event within 4 Gyr, as well as the disk flaring in the outer region (i.e., $R \ge 12$ kpc) mostly for young or alpha-poor stellar populations. Our detailed measurements of age/[$\alpha$/Fe]-velocity dispersion relations for different disk volumes indicate that young/$\alpha$-poor populations are likely originated from dynamically heated by both giant molecular clouds and spiral arms, while old/$\alpha$-enhanced populations require an obvious contribution from other heating mechanisms such as merger and accretion, or born in the chaotic mergers of gas-rich systems and/or turbulent interstellar medium., Comment: 35 pages, 31 figures, 3 tables, accepted for publication in ApJ
Published: 2023

16. A human monoclonal antibody targeting the monomeric N6 neuraminidase confers protection against avian H5N6 influenza virus infection

Author: Wang, Min, Gao, Yuan, Shen, Chenguang, Yang, Wei, Peng, Qi, Cheng, Jinlong, Shen, Han-Ming, Yang, Yang, Gao, George Fu, and Shi, Yi
Published: 2024
Full Text: View/download PDF

17. Identification of oxylipins and lipid mediators in pulmonary embolism

Author: Chen, Fei, Peng, Daibao, Xia, Yanyan, Sun, Haixuan, Shen, Han, and Xia, Mao
Published: 2024
Full Text: View/download PDF

18. The potential of a nomogram risk assessment model for the diagnosis of abdominal aortic aneurysm: a multicenter retrospective study

Author: Huo, Guijun, Shen, Han, Zheng, Jin, Zeng, Yuqi, Yao, Zhichao, Cao, Junjie, Tang, Yao, Huang, Jian, Liu, Zhanao, and Zhou, Dayong
Published: 2024
Full Text: View/download PDF

19. Macular vascular and photoreceptor changes for diabetic macular edema at early stage

Author: Gu, Qinyuan, Pan, Ting, Cheng, Ruiwen, Huang, Junlong, Zhang, Kang, Zhang, Junyan, Yang, Yang, Cheng, Peng, Liu, Qinghuai, and Shen, Han
Published: 2024
Full Text: View/download PDF

20. Correction: Follistatin-like 1 protects mesenchymal stem cells from hypoxic damage and enhances their therapeutic efficacy in a mouse myocardial infarction model

Author: Shen, Han, Cui, Guanghao, Li, Yanqiong, Ye, Wenxue, Sun, Yimin, Zhang, Zihan, Li, Jingjing, Xu, Guiying, Zeng, Xiansheng, Zhang, Yanxia, Zhang, Wencheng, Huang, Zan, Chen, Weiqian, and Shen, Zhenya
Published: 2024
Full Text: View/download PDF

21. Prevalence and causes of blindness and distance visual impairment in Chinese adult population in 2022 during the COVID-19 pandemic: a cross-sectional study

Author: Wang, Hua, Xu, Zhi, Chen, Dandan, Li, Huihui, Zhang, Junyan, Liu, Qinghuai, and Shen, Han
Published: 2024
Full Text: View/download PDF

22. FBP1 inhibits NSCLC stemness by promoting ubiquitination of Notch1 intracellular domain and accelerating degradation

Author: He, Tianyu, Wang, Yanye, Lv, Wang, Wang, Yiqing, Li, Xinye, Zhang, Qingyi, Shen, Han-Ming, and Hu, Jian
Published: 2024
Full Text: View/download PDF

23. PSRFlow: Probabilistic Super Resolution with Flow-Based Models for Scientific Data

Author: Shen, Jingyi and Shen, Han-Wei
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Machine Learning
Abstract: Although many deep-learning-based super-resolution approaches have been proposed in recent years, because no ground truth is available in the inference stage, few can quantify the errors and uncertainties of the super-resolved results. For scientific visualization applications, however, conveying uncertainties of the results to scientists is crucial to avoid generating misleading or incorrect information. In this paper, we propose PSRFlow, a novel normalizing flow-based generative model for scientific data super-resolution that incorporates uncertainty quantification into the super-resolution process. PSRFlow learns the conditional distribution of the high-resolution data based on the low-resolution counterpart. By sampling from a Gaussian latent space that captures the missing information in the high-resolution data, one can generate different plausible super-resolution outputs. The efficient sampling in the Gaussian latent space allows our model to perform uncertainty quantification for the super-resolved results. During model training, we augment the training data with samples across various scales to make the model adaptable to data of different scales, achieving flexible super-resolution for a given input. Our results demonstrate superior performance and robust uncertainty quantification compared with existing methods such as interpolation and GAN-based super-resolution networks., Comment: To be published in Proc. IEEE VIS 2023
Published: 2023

24. Adaptively Placed Multi-Grid Scene Representation Networks for Large-Scale Data Visualization

Author: Wurster, Skylar Wolfgang, Xiong, Tianyu, Shen, Han-Wei, Guo, Hanqi, and Peterka, Tom
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: Scene representation networks (SRNs) have been recently proposed for compression and visualization of scientific data. However, state-of-the-art SRNs do not adapt the allocation of available network parameters to the complex features found in scientific data, leading to a loss in reconstruction quality. We address this shortcoming with an adaptively placed multi-grid SRN (APMGSRN) and propose a domain decomposition training and inference technique for accelerated parallel training on multi-GPU systems. We also release an open-source neural volume rendering application that allows plug-and-play rendering with any PyTorch-based SRN. Our proposed APMGSRN architecture uses multiple spatially adaptive feature grids that learn where to be placed within the domain to dynamically allocate more neural network resources where error is high in the volume, improving state-of-the-art reconstruction accuracy of SRNs for scientific data without requiring expensive octree refining, pruning, and traversal like previous adaptive models. In our domain decomposition approach for representing large-scale data, we train an set of APMGSRNs in parallel on separate bricks of the volume to reduce training time while avoiding overhead necessary for an out-of-core solution for volumes too large to fit in GPU memory. After training, the lightweight SRNs are used for realtime neural volume rendering in our open-source renderer, where arbitrary view angles and transfer functions can be explored. A copy of this paper, all code, all models used in our experiments, and all supplemental materials and videos are available at https://github.com/skywolf829/APMGSRN., Comment: Accepted to IEEE VIS 2023. https://www.computer.org/csdl/journal/tg/2024/01/10297599/1RyYguiNBLO
Published: 2023
Full Text: View/download PDF

25. Neural Stream Functions

Author: Wurster, Skylar Wolfgang, Guo, Hanqi, Peterka, Tom, and Shen, Han-Wei
Subjects: Computer Science - Graphics, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: We present a neural network approach to compute stream functions, which are scalar functions with gradients orthogonal to a given vector field. As a result, isosurfaces of the stream function extract stream surfaces, which can be visualized to analyze flow features. Our approach takes a vector field as input and trains an implicit neural representation to learn a stream function for that vector field. The network learns to map input coordinates to a stream function value by minimizing the inner product of the gradient of the neural network's output and the vector field. Since stream function solutions may not be unique, we give optional constraints for the network to learn particular stream functions of interest. Specifically, we introduce regularizing loss functions that can optionally be used to generate stream function solutions whose stream surfaces follow the flow field's curvature, or that can learn a stream function that includes a stream surface passing through a seeding rake. We also discuss considerations for properly visualizing the trained implicit network and extracting artifact-free surfaces. We compare our results with other implicit solutions and present qualitative and quantitative results for several synthetic and simulated vector fields., Comment: Accepted and presented at PVIS2023 in Seoul, South Korea
Published: 2023
Full Text: View/download PDF

26. SKG: A Versatile Information Retrieval and Analysis Framework for Academic Papers with Semantic Knowledge Graphs

Author: Tu, Yamei, Qiu, Rui, and Shen, Han-Wei
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence, Computer Science - Graphics, Computer Science - Human-Computer Interaction
Abstract: The number of published research papers has experienced exponential growth in recent years, which makes it crucial to develop new methods for efficient and versatile information extraction and knowledge discovery. To address this need, we propose a Semantic Knowledge Graph (SKG) that integrates semantic concepts from abstracts and other meta-information to represent the corpus. The SKG can support various semantic queries in academic literature thanks to the high diversity and rich information content stored within. To extract knowledge from unstructured text, we develop a Knowledge Extraction Module that includes a semi-supervised pipeline for entity extraction and entity normalization. We also create an ontology to integrate the concepts with other meta information, enabling us to build the SKG. Furthermore, we design and develop a dataflow system that demonstrates how to conduct various semantic queries flexibly and interactively over the SKG. To demonstrate the effectiveness of our approach, we conduct the research based on the visualization literature and provide real-world use cases to show the usefulness of the SKG. The dataset and codes for this work are available at https://osf.io/aqv8p/?view_only=2c26b36e3e3941ce999df47e4616207f.
Published: 2023

27. The tilt of the velocity ellipsoid of different Galactic disk populations

Author: Sun, Weixiang, Shen, Han, and Liu, Xiaowei
Subjects: Astrophysics - Astrophysics of Galaxies
Abstract: The tilt of the velocity ellipsoid is a helpful tracer of the gravitational potential of the Milky Way. In this paper, we use nearly 140,000 RC stars selected from the LAMOST and Gaia to make a detailed analysis of the tilt of the velocity ellipsoid for various populations, as defined by the stellar ages and chemical information, within 4.5 $\leq$ $R$ $\leq$ 15.0 kpc and $|Z|$ $\leq$ 3.0 kpc. The tilt angles of the velocity ellipsoids of the RC sample stars are accurately described as $\alpha$ = $\alpha_{0}$ $\mathrm{arctan}$ ($Z$/$R$) with $\alpha_{0}$ = (0.68 $\pm$ 0.05). This indicates the alignment of velocity ellipsoids is between cylindrical and spherical, implying that any deviation from the spherical alignment of the velocity ellipsoids may be caused by the gravitational potential of the baryonic disk. The results of various populations suggest that the $\alpha_{0}$ displays an age and population dependence, with the thin and thick disks respectively values $\alpha_{0}$ = (0.72 $\pm$ 0.08) and $\alpha_{0}$ = (0.64 $\pm$ 0.07), and the $\alpha_{0}$ displays a decreasing trend with age (and [$\alpha$/Fe]) increases, meaning that the velocity ellipsoids of the kinematically relaxed stars are mainly dominated by the gravitational potential of the baryonic disk. We determine the $\alpha_{0} - R$ for various populations, finding that the $\alpha_{0}$ displays oscillations with $R$ for all the different populations. The oscillations in $\alpha_{0}$ appear in both kinematically hot and cold populations, indicating that resonances with the Galactic bar are the most likely origin for these oscillations., Comment: 10 pages, 8 figures, 3 tables, accepted for publication in ApJ
Published: 2023
Full Text: View/download PDF

28. VMap: An Interactive Rectangular Space-filling Visualization for Map-like Vertex-centric Graph Exploration

Author: Xu, Jiayi and Shen, Han-Wei
Subjects: Computer Science - Graphics, Computer Science - Discrete Mathematics, Computer Science - Human-Computer Interaction, Computer Science - Information Retrieval, Computer Science - Social and Information Networks
Abstract: We present VMap, a map-like rectangular space-filling visualization, to perform vertex-centric graph exploration. Existing visualizations have limited support for quality optimization among rectangular aspect ratios, vertex-edge intersection, and data encoding accuracy. To tackle this problem, VMap integrates three novel components: (1) a desired-aspect-ratio (DAR) rectangular partitioning algorithm, (2) a two-stage rectangle adjustment algorithm, and (3) a simulated annealing based heuristic optimizer. First, to generate a rectangular space-filling layout of an input graph, we subdivide the 2D embedding of the graph into rectangles with optimization of rectangles' aspect ratios toward a desired aspect ratio. Second, to route graph edges between rectangles without vertex-edge occlusion, we devise a two-stage algorithm to adjust a rectangular layout to insert border space between rectangles. Third, to produce and arrange rectangles by considering multiple visual criteria, we design a simulated annealing based heuristic optimization to adjust vertices' 2D embedding to support trade-offs among aspect ratio quality and the encoding accuracy of vertices' weights and adjacency. We evaluated the effectiveness of VMap on both synthetic and application datasets. The resulting rectangular layout has better aspect ratio quality on synthetic data compared with the existing method for the rectangular partitioning of 2D points. On three real-world datasets, VMap achieved better encoding accuracy and attained faster generation speed compared with existing methods on graphs' rectangular layout generation. We further illustrate the usefulness of VMap for vertex-centric graph exploration through three case studies on visualizing social networks, representing academic communities, and displaying geographic information., Comment: Submitted to IEEE Visualization Conference (IEEE VIS) 2019 and 2022
Published: 2023

29. Progress and perspectives of metabolic biomarkers in human aortic dissection

Author: Mu, Gaohang, Cao, Xiangyu, Shao, Lianbo, Shen, Han, Guo, Xingyou, Gao, Yamei, Su, Chengkai, Fan, Hongyou, Yu, You, and Shen, Zhenya
Published: 2024
Full Text: View/download PDF

30. On Penalty-based Bilevel Gradient Descent Method

Author: Shen, Han, Xiao, Quan, and Chen, Tianyi
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Bilevel optimization enjoys a wide range of applications in hyper-parameter optimization, meta-learning and reinforcement learning. However, bilevel optimization problems are difficult to solve. Recent progress on scalable bilevel algorithms mainly focuses on bilevel optimization problems where the lower-level objective is either strongly convex or unconstrained. In this work, we tackle the bilevel problem through the lens of the penalty method. We show that under certain conditions, the penalty reformulation recovers the solutions of the original bilevel problem. Further, we propose the penalty-based bilevel gradient descent (PBGD) algorithm and establish its finite-time convergence for the constrained bilevel problem without lower-level strong convexity. Experiments showcase the efficiency of the proposed PBGD algorithm., Comment: Improved Section 4 by removing a critical assumption; Added Section 5 and citations
Published: 2023

31. SIGformer: Sign-aware Graph Transformer for Recommendation.

Author: Sirui Chen, Jiawei Chen 0007, Sheng Zhou 0004, Bohao Wang, Shen Han, Chanfei Su, Yuqing Yuan, and Can Wang 0001
Published: 2024
Full Text: View/download PDF

32. Extraction of Frequently Active Areas of Ships Based on Advanced Grid Density Peak Clustering

Author: Xiong, Xuanrui, Shen, Han, Zhu, Lanke, Zheng, Jianbo, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Leung, Victor C.M., editor, Li, Hezhang, editor, Hu, Xiping, editor, and Ning, Zhaolong, editor
Published: 2024
Full Text: View/download PDF

33. Enhancing Network Intrusion Detection with Deep Oversampling and Convolutional Autoencoder for Imbalanced Dataset

Author: Xiong, Xuanrui, Li, Junfeng, Zhang, Huijun, Shen, Han, Liu, Mengru, Peng, Wei, Huang, Qi, Zhang, Yuan, Akan, Ozgur, Editorial Board Member, Bellavista, Paolo, Editorial Board Member, Cao, Jiannong, Editorial Board Member, Coulson, Geoffrey, Editorial Board Member, Dressler, Falko, Editorial Board Member, Ferrari, Domenico, Editorial Board Member, Gerla, Mario, Editorial Board Member, Kobayashi, Hisashi, Editorial Board Member, Palazzo, Sergio, Editorial Board Member, Sahni, Sartaj, Editorial Board Member, Shen, Xuemin, Editorial Board Member, Stan, Mircea, Editorial Board Member, Jia, Xiaohua, Editorial Board Member, Zomaya, Albert Y., Editorial Board Member, Gao, Feifei, editor, Wu, Jun, editor, Li, Yun, editor, Gao, Honghao, editor, and Wang, Shangguang, editor
Published: 2024
Full Text: View/download PDF

34. An Improved Path Planning Algorithm for Heterogeneous Marine Unmanned Systems

Author: Liu, Muyi, Shen, Han, Wang, Shuwang, Wang, Linan, Zhou, Yan, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Li, Xiaoduo, editor, Song, Xun, editor, and Zhou, Yingjiang, editor
Published: 2024
Full Text: View/download PDF

35. Alternating Implicit Projected SGD and Its Efficient Variants for Equality-constrained Bilevel Optimization

Author: Xiao, Quan, Shen, Han, Yin, Wotao, and Chen, Tianyi
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Stochastic bilevel optimization, which captures the inherent nested structure of machine learning problems, is gaining popularity in many recent applications. Existing works on bilevel optimization mostly consider either unconstrained problems or constrained upper-level problems. This paper considers the stochastic bilevel optimization problems with equality constraints both in the upper and lower levels. By leveraging the special structure of the equality constraints problem, the paper first presents an alternating implicit projected SGD approach and establishes the $\tilde{\cal O}(\epsilon^{-2})$ sample complexity that matches the state-of-the-art complexity of ALSET \citep{chen2021closing} for unconstrained bilevel problems. To further save the cost of projection, the paper presents two alternating implicit projection-efficient SGD approaches, where one algorithm enjoys the $\tilde{\cal O}(\epsilon^{-2}/T)$ upper-level and $\tilde{\cal O}(\epsilon^{-1.5}/T^{\frac{3}{4}})$ lower-level projection complexity with ${\cal O}(T)$ lower-level batch size, and the other one enjoys $\tilde{\cal O}(\epsilon^{-1.5})$ upper-level and lower-level projection complexity with ${\cal O}(1)$ batch size. Application to federated bilevel optimization has been presented to showcase the empirical performance of our algorithms. Our results demonstrate that equality-constrained bilevel optimization with strongly-convex lower-level problems can be solved as efficiently as stochastic single-level optimization problems., Comment: Submitted to conference in Oct 2022
Published: 2022

36. Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach

Author: Fernando, Heshan, Shen, Han, Liu, Miao, Chaudhury, Subhajit, Murugesan, Keerthiram, and Chen, Tianyi
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Machine learning problems with multiple objective functions appear either in learning with multiple criteria where learning has to make a trade-off between multiple performance metrics such as fairness, safety and accuracy; or, in multi-task learning where multiple tasks are optimized jointly, sharing inductive bias between them. This problems are often tackled by the multi-objective optimization framework. However, existing stochastic multi-objective gradient methods and its variants (e.g., MGDA, PCGrad, CAGrad, etc.) all adopt a biased noisy gradient direction, which leads to degraded empirical performance. To this end, we develop a stochastic Multi-objective gradient Correction (MoCo) method for multi-objective optimization. The unique feature of our method is that it can guarantee convergence without increasing the batch size even in the non-convex setting. Simulations on multi-task supervised and reinforcement learning demonstrate the effectiveness of our method relative to state-of-the-art methods., Comment: Changed hyper-parameter choice which affects some of the convergence rate results in the paper
Published: 2022

37. CircUSP1 as a novel marker promotes gastric cancer progression via stabilizing HuR to upregulate USP1 and Vimentin

Author: Li, Rong, Wang, Junyi, Xie, Zhenfan, Tian, Xinyu, Hou, Jie, Wang, Dongli, Qian, Hui, Shen, Han, and Xu, Wenrong
Published: 2024
Full Text: View/download PDF

38. TransPOS: Transformers for Consolidating Different POS Tagset Datasets

Author: Li, Alex, Bankole-Hameed, Ilyas, Singh, Ranadeep, Ng, Gabriel Shen Han, and Gupta, Akshat
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: In hope of expanding training data, researchers often want to merge two or more datasets that are created using different labeling schemes. This paper considers two datasets that label part-of-speech (POS) tags under different tagging schemes and leverage the supervised labels of one dataset to help generate labels for the other dataset. This paper further discusses the theoretical difficulties of this approach and proposes a novel supervised architecture employing Transformers to tackle the problem of consolidating two completely disjoint datasets. The results diverge from initial expectations and discourage exploration into the use of disjoint labels to consolidate datasets with different labels., Comment: Accepted in 2022 Workshop on Noisy User-generated Text (W-NUT)
Published: 2022

39. GNNInterpreter: A Probabilistic Generative Model-Level Explanation for Graph Neural Networks

Author: Wang, Xiaoqi and Shen, Han-Wei
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Recently, Graph Neural Networks (GNNs) have significantly advanced the performance of machine learning tasks on graphs. However, this technological breakthrough makes people wonder: how does a GNN make such decisions, and can we trust its prediction with high confidence? When it comes to some critical fields, such as biomedicine, where making wrong decisions can have severe consequences, it is crucial to interpret the inner working mechanisms of GNNs before applying them. In this paper, we propose a model-agnostic model-level explanation method for different GNNs that follow the message passing scheme, GNNInterpreter, to explain the high-level decision-making process of the GNN model. More specifically, GNNInterpreter learns a probabilistic generative graph distribution that produces the most discriminative graph pattern the GNN tries to detect when making a certain prediction by optimizing a novel objective function specifically designed for the model-level explanation for GNNs. Compared to existing works, GNNInterpreter is more flexible and computationally efficient in generating explanation graphs with different types of node and edge features, without introducing another blackbox or requiring manually specified domain-specific rules. In addition, the experimental studies conducted on four different datasets demonstrate that the explanation graphs generated by GNNInterpreter match the desired graph pattern if the model is ideal; otherwise, potential model pitfalls can be revealed by the explanation. The official implementation can be found at https://github.com/yolandalalala/GNNInterpreter.
Published: 2022

40. Not As Easy As You Think -- Experiences and Lessons Learnt from Trying to Create a Bottom-Up Visualization Image Typology

Author: Chen, Jian, Isenberg, Petra, Laramee, Robert S., Isenberg, Tobias, Sedlmair, Michael, Moeller, Torsten, and Shen, Han-Wei
Subjects: Computer Science - Graphics, Computer Science - Multimedia
Abstract: We present and discuss the results of a two-year qualitative analysis of images published in IEEE Visualization (VIS) papers. Specifically, we derive a typology of 13 visualization image types, coded to distinguish visualizations and several image characteristics. The categorization process required much more time and was more difficult than we initially thought. The resulting typology and image analysis may serve a number of purposes: to study the evolution of the community and its research output over time, to facilitate the categorization of visualization images for the purpose of teaching, to identify visual designs for evaluation purposes, or to enable progress towards standardization in visualization. In addition to the typology and image characterization, we provide a dataset of 6,833 tagged images and an online tool that can be used to explore and analyze the large set of tagged images. We thus facilitate a discussion of the diverse visualizations used and how they are published and communicated in our community.
Published: 2022

41. IDLat: An Importance-Driven Latent Generation Method for Scientific Data

Author: Shen, Jingyi, Li, Haoyu, Xu, Jiayi, Biswas, Ayan, and Shen, Han-Wei
Subjects: Computer Science - Machine Learning, Computer Science - Graphics
Abstract: Deep learning based latent representations have been widely used for numerous scientific visualization applications such as isosurface similarity analysis, volume rendering, flow field synthesis, and data reduction, just to name a few. However, existing latent representations are mostly generated from raw data in an unsupervised manner, which makes it difficult to incorporate domain interest to control the size of the latent representations and the quality of the reconstructed data. In this paper, we present a novel importance-driven latent representation to facilitate domain-interest-guided scientific data visualization and analysis. We utilize spatial importance maps to represent various scientific interests and take them as the input to a feature transformation network to guide latent generation. We further reduced the latent size by a lossless entropy encoding algorithm trained together with the autoencoder, improving the storage and memory efficiency. We qualitatively and quantitatively evaluate the effectiveness and efficiency of latent representations generated by our method with data from multiple scientific visualization applications., Comment: 11 pages, 12 figures, Proc. IEEE VIS 2022
Published: 2022
Full Text: View/download PDF

42. VDL-Surrogate: A View-Dependent Latent-based Model for Parameter Space Exploration of Ensemble Simulations

Author: Shi, Neng, Xu, Jiayi, Li, Haoyu, Guo, Hanqi, Woodring, Jonathan, and Shen, Han-Wei
Subjects: Computer Science - Graphics, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: We propose VDL-Surrogate, a view-dependent neural-network-latent-based surrogate model for parameter space exploration of ensemble simulations that allows high-resolution visualizations and user-specified visual mappings. Surrogate-enabled parameter space exploration allows domain scientists to preview simulation results without having to run a large number of computationally costly simulations. Limited by computational resources, however, existing surrogate models may not produce previews with sufficient resolution for visualization and analysis. To improve the efficient use of computational resources and support high-resolution exploration, we perform ray casting from different viewpoints to collect samples and produce compact latent representations. This latent encoding process reduces the cost of surrogate model training while maintaining the output quality. In the model training stage, we select viewpoints to cover the whole viewing sphere and train corresponding VDL-Surrogate models for the selected viewpoints. In the model inference stage, we predict the latent representations at previously selected viewpoints and decode the latent representations to data space. For any given viewpoint, we make interpolations over decoded data at selected viewpoints and generate visualizations with user-specified visual mappings. We show the effectiveness and efficiency of VDL-Surrogate in cosmological and ocean simulations with quantitative and qualitative evaluations. Source code is publicly available at https://github.com/trainsn/VDL-Surrogate., Comment: Accepted by IEEE Transactions on Visualization and Computer Graphics (Proc. IEEE VIS 2022)
Published: 2022

43. Efficient Interpolation-based Pathline Tracing with B-spline Curves in Particle Dataset

Author: Li, Haoyu, Xiong, Tianyu, and Shen, Han-Wei
Subjects: Computer Science - Graphics, Computer Science - Human-Computer Interaction
Abstract: Particle tracing through numerical integration is a well-known approach to generating pathlines for visualization. However, for particle simulations, the computation of pathlines is expensive, since the interpolation method is complicated due to the lack of connectivity information. Previous studies utilize the k-d tree to reduce the time for neighborhood search. However, the efficiency is still limited by the number of tracing time steps. Therefore, we propose a novel interpolation-based particle tracing method that first represents particle data as B-spline curves and interpolates B-spline control points to reduce the number of interpolation time steps. We demonstrate our approach achieves good tracing accuracy with much less computation time., Comment: To be included in 2022 IEEE VIS short papers
Published: 2022

44. A Study on the Influence of Cultural Heritage Tourism Perception on the National Tourism Image and Cultural Identity of Overseas Chinese Youth

Author: Zhang, Han, Wang, Jingqi, and Shen, Han
Published: 2023
Full Text: View/download PDF

45. Digital marketing practices in tourism: advances in the cultural heritage of China

Author: Yang, Yang, primary and Shen, Han, additional
Published: 2024
Full Text: View/download PDF

46. On the Importance and Applicability of Pre-Training for Federated Learning

Author: Chen, Hong-You, Tu, Cheng-Hao, Li, Ziwei, Shen, Han-Wei, and Chao, Wei-Lun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Pre-training is prevalent in nowadays deep learning to improve the learned model's performance. However, in the literature on federated learning (FL), neural networks are mostly initialized with random weights. These attract our interest in conducting a systematic study to explore pre-training for FL. Across multiple visual recognition benchmarks, we found that pre-training can not only improve FL, but also close its accuracy gap to the counterpart centralized learning, especially in the challenging cases of non-IID clients' data. To make our findings applicable to situations where pre-trained models are not directly available, we explore pre-training with synthetic data or even with clients' data in a decentralized manner, and found that they can already improve FL notably. Interestingly, many of the techniques we explore are complementary to each other to further boost the performance, and we view this as a critical result toward scaling up deep FL for real-world applications. We conclude our paper with an attempt to understand the effect of pre-training on FL. We found that pre-training enables the learned global models under different clients' data conditions to converge to the same loss basin, and makes global aggregation in FL more stable. Nevertheless, pre-training seems to not alleviate local model drifting, a fundamental problem in FL under non-IID data., Comment: Accepted to ICLR 2023
Published: 2022

47. A Single-Timescale Analysis For Stochastic Approximation With Multiple Coupled Sequences

Author: Shen, Han and Chen, Tianyi
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control, Statistics - Machine Learning
Abstract: Stochastic approximation (SA) with multiple coupled sequences has found broad applications in machine learning such as bilevel learning and reinforcement learning (RL). In this paper, we study the finite-time convergence of nonlinear SA with multiple coupled sequences. Different from existing multi-timescale analysis, we seek for scenarios where a fine-grained analysis can provide the tight performance guarantee for multi-sequence single-timescale SA (STSA). At the heart of our analysis is the smoothness property of the fixed points in multi-sequence SA that holds in many applications. When all sequences have strongly monotone increments, we establish the iteration complexity of $\mathcal{O}(\epsilon^{-1})$ to achieve $\epsilon$-accuracy, which improves the existing $\mathcal{O}(\epsilon^{-1.5})$ complexity for two coupled sequences. When all but the main sequence have strongly monotone increments, we establish the iteration complexity of $\mathcal{O}(\epsilon^{-2})$. The merit of our results lies in that applying them to stochastic bilevel and compositional optimization problems, as well as RL problems leads to either relaxed assumptions or improvements over their existing performance guarantees.
Published: 2022

48. SmartGD: A GAN-Based Graph Drawing Framework for Diverse Aesthetic Goals

Author: Wang, Xiaoqi, Yen, Kevin, Hu, Yifan, and Shen, Han-Wei
Subjects: Computer Science - Machine Learning
Abstract: While a multitude of studies have been conducted on graph drawing, many existing methods only focus on optimizing a single aesthetic aspect of graph layouts, which can lead to sub-optimal results. There are a few existing methods that have attempted to develop a flexible solution for optimizing different aesthetic aspects measured by different aesthetic criteria. Furthermore, thanks to the significant advance in deep learning techniques, several deep learning-based layout methods were proposed recently. These methods have demonstrated the advantages of deep learning approaches for graph drawing. However, none of these existing methods can be directly applied to optimizing non-differentiable criteria without special accommodation. In this work, we propose a novel Generative Adversarial Network (GAN) based deep learning framework for graph drawing, called SmartGD, which can optimize different quantitative aesthetic goals, regardless of their differentiability. To demonstrate the effectiveness and efficiency of SmartGD, we conducted experiments on minimizing stress, minimizing edge crossing, maximizing crossing angle, maximizing shape-based metrics, and a combination of multiple aesthetics. Compared with several popular graph drawing algorithms, the experimental results show that SmartGD achieves good performance both quantitatively and qualitatively.
Published: 2022

49. Estimating accurate reddening values of LAMOST M dwarfs

Author: Shen, Han, Chen, Bingqiu, Guo, Helong, Yuan, Haibo, Sun, Weixiang, and Li, Jing
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - Astrophysics of Galaxies
Abstract: M dwarfs are the dominating type of stars in the solar neighbourhood. They serve as excellent tracers for the study of the distribution and properties of the nearby interstellar dust. In this work, we aim to obtain high accuracy reddening values of M dwarf stars from the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) Data Release 8 (DR8). Combining the LAMOST spectra with the high-quality optical photometry from the Gaia Early Data Release 3 (Gaia EDR3), we have estimated the reddening values $E(G_{\rm BP}-G_{\rm RP})$ of 641,426 M dwarfs with the machine-learning algorithm Random Forest regression. The typical reddening uncertainty is only 0.03 mag in $E(G_{\rm BP}-G_{\rm RP})$. We have obtained the reddening coefficient $R_{(G_{\rm BP}-G_{\rm RP})}$, which is a function of the stellar intrinsic colour $(G_{\rm BP}-G_{\rm RP})_0$ and reddening value $E(B-V)$. The values of $E(B-V)$ are also provided for the individual stars in our catalogue. Our resultant high accuracy reddening values of M dwarfs, combined with the Gaia parallaxes, will be very powerful to map the fine structures of the dust in the solar neighbourhood., Comment: 8 pages, 8 figures, accepted for publication in MNRAS
Published: 2022
Full Text: View/download PDF

50. Parametric study for thermal uniformity analysis on vertical fin located in novel designed impacting-jet double-layer nested microchannel heat sinks verified by SLM 3D printing method

Author: Shen, Han, Lan, Xinyue, Xie, Gongnan, and Wang, Chi-Chuan
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

3,406 results on '"Shen, Han"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources