Author: "Qin, Yipeng" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Qin, Yipeng"' showing total 207 results

Start Over Author "Qin, Yipeng"

207 results on '"Qin, Yipeng"'

1. Training-free Editioning of Text-to-Image Models

Author: Wang, Jinqi, Fu, Yunfei, Ding, Zhangcan, Deng, Bailin, Lai, Yu-Kun, and Qin, Yipeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Inspired by the software industry's practice of offering different editions or versions of a product tailored to specific user groups or use cases, we propose a novel task, namely, training-free editioning, for text-to-image models. Specifically, we aim to create variations of a base text-to-image model without retraining, enabling the model to cater to the diverse needs of different user groups or to offer distinct features and functionalities. To achieve this, we propose that different editions of a given text-to-image model can be formulated as concept subspaces in the latent space of its text encoder (e.g., CLIP). In such a concept subspace, all points satisfy a specific user need (e.g., generating images of a cat lying on the grass/ground/falling leaves). Technically, we apply Principal Component Analysis (PCA) to obtain the desired concept subspaces from representative text embedding that correspond to a specific user need or requirement. Projecting the text embedding of a given prompt into these low-dimensional subspaces enables efficient model editioning without retraining. Intuitively, our proposed editioning paradigm enables a service provider to customize the base model into its "cat edition" (or other editions) that restricts image generation to cats, regardless of the user's prompt (e.g., dogs, people, etc.). This introduces a new dimension for product differentiation, targeted functionality, and pricing strategies, unlocking novel business models for text-to-image generators. Extensive experimental results demonstrate the validity of our approach and its potential to enable a wide range of customized text-to-image model editions across various domains and applications.
Published: 2024

2. SuDA: Support-based Domain Adaptation for Sim2Real Motion Capture with Flexible Sensors

Author: Fang, Jiawei, Song, Haishan, Zuo, Chengxu, Gao, Xiaoxia, Chen, Xiaowei, Guo, Shihui, and Qin, Yipeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction
Abstract: Flexible sensors hold promise for human motion capture (MoCap), offering advantages such as wearability, privacy preservation, and minimal constraints on natural movement. However, existing flexible sensor-based MoCap methods rely on deep learning and necessitate large and diverse labeled datasets for training. These data typically need to be collected in MoCap studios with specialized equipment and substantial manual labor, making them difficult and expensive to obtain at scale. Thanks to the high-linearity of flexible sensors, we address this challenge by proposing a novel Sim2Real Mocap solution based on domain adaptation, eliminating the need for labeled data yet achieving comparable accuracy to supervised learning. Our solution relies on a novel Support-based Domain Adaptation method, namely SuDA, which aligns the supports of the predictive functions rather than the instance-dependent distributions between the source and target domains. Extensive experimental results demonstrate the effectiveness of our method andits superiority over state-of-the-art distribution-based domain adaptation methods in our task., Comment: 20 pages conference, accepted ICML paper
Published: 2024

3. LATUP-Net: A Lightweight 3D Attention U-Net with Parallel Convolutions for Brain Tumor Segmentation

Author: Alwadee, Ebtihal J., Sun, Xianfang, Qin, Yipeng, and Langbein, Frank C.
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Early-stage 3D brain tumor segmentation from magnetic resonance imaging (MRI) scans is crucial for prompt and effective treatment. However, this process faces the challenge of precise delineation due to the tumors' complex heterogeneity. Moreover, energy sustainability targets and resource limitations, especially in developing countries, require efficient and accessible medical imaging solutions. The proposed architecture, a Lightweight 3D ATtention U-Net with Parallel convolutions, LATUP-Net, addresses these issues. It is specifically designed to reduce computational requirements significantly while maintaining high segmentation performance. By incorporating parallel convolutions, it enhances feature representation by capturing multi-scale information. It further integrates an attention mechanism to refine segmentation through selective feature recalibration. LATUP-Net achieves promising segmentation performance: the average Dice scores for the whole tumor, tumor core, and enhancing tumor on the BraTS2020 dataset are 88.41%, 83.82%, and 73.67%, and on the BraTS2021 dataset, they are 90.29%, 89.54%, and 83.92%, respectively. Hausdorff distance metrics further indicate its improved ability to delineate tumor boundaries. With its significantly reduced computational demand using only 3.07 M parameters, about 59 times fewer than other state-of-the-art models, and running on a single V100 GPU, LATUP-Net stands out as a promising solution for real-world clinical applications, particularly in settings with limited resources. Investigations into the model's interpretability, utilizing gradient-weighted class activation mapping and confusion matrices, reveal that while attention mechanisms enhance the segmentation of small regions, their impact is nuanced. Achieving the most accurate tumor delineation requires carefully balancing local and global features.
Published: 2024

4. NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation

Author: Chen, Jiahao, Qin, Yipeng, Liu, Lingjie, Lu, Jiangbo, and Li, Guanbin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Neural Radiance Field (NeRF) has been widely recognized for its excellence in novel view synthesis and 3D scene reconstruction. However, their effectiveness is inherently tied to the assumption of static scenes, rendering them susceptible to undesirable artifacts when confronted with transient distractors such as moving objects or shadows. In this work, we propose a novel paradigm, namely "Heuristics-Guided Segmentation" (HuGS), which significantly enhances the separation of static scenes from transient distractors by harmoniously combining the strengths of hand-crafted heuristics and state-of-the-art segmentation models, thus significantly transcending the limitations of previous solutions. Furthermore, we delve into the meticulous design of heuristics, introducing a seamless fusion of Structure-from-Motion (SfM)-based heuristics and color residual heuristics, catering to a diverse range of texture profiles. Extensive experiments demonstrate the superiority and robustness of our method in mitigating transient distractors for NeRFs trained in non-static scenes. Project page: https://cnhaox.github.io/NeRF-HuGS/., Comment: To appear in CVPR2024
Published: 2024

5. Deep Generative Model based Rate-Distortion for Image Downscaling Assessment

Author: Liang, Yuanbang, Garg, Bhavesh, Rosin, Paul L, and Qin, Yipeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: In this paper, we propose Image Downscaling Assessment by Rate-Distortion (IDA-RD), a novel measure to quantitatively evaluate image downscaling algorithms. In contrast to image-based methods that measure the quality of downscaled images, ours is process-based that draws ideas from rate-distortion theory to measure the distortion incurred during downscaling. Our main idea is that downscaling and super-resolution (SR) can be viewed as the encoding and decoding processes in the rate-distortion model, respectively, and that a downscaling algorithm that preserves more details in the resulting low-resolution (LR) images should lead to less distorted high-resolution (HR) images in SR. In other words, the distortion should increase as the downscaling algorithm deteriorates. However, it is non-trivial to measure this distortion as it requires the SR algorithm to be blind and stochastic. Our key insight is that such requirements can be met by recent SR algorithms based on deep generative models that can find all matching HR images for a given LR image on their learned image manifolds. Extensive experimental results show the effectiveness of our IDA-RD measure., Comment: Accepted at CVPR 2024
Published: 2024

6. PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsigns

Author: Ning, Shuliang, Wang, Duomin, Qin, Yipeng, Jin, Zirong, Wang, Baoyuan, and Han, Xiaoguang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we propose a novel virtual try-on from unconstrained designs (ucVTON) task to enable photorealistic synthesis of personalized composite clothing on input human images. Unlike prior arts constrained by specific input types, our method allows flexible specification of style (text or image) and texture (full garment, cropped sections, or texture patches) conditions. To address the entanglement challenge when using full garment images as conditions, we develop a two-stage pipeline with explicit disentanglement of style and texture. In the first stage, we generate a human parsing map reflecting the desired style conditioned on the input. In the second stage, we composite textures onto the parsing map areas based on the texture input. To represent complex and non-stationary textures that have never been achieved in previous fashion editing works, we first propose extracting hierarchical and balanced CLIP features and applying position encoding in VTON. Experiments demonstrate superior synthesis quality and personalization enabled by our method. The flexible control over style and texture mixing brings virtual try-on to a new level of user experience for online shopping and fashion design., Comment: Project page: https://ningshuliang.github.io/2023/Arxiv/index.html
Published: 2023

7. Exploration and Exploitation of Unlabeled Data for Open-Set Semi-supervised Learning

Author: Zhao, Ganlong, Li, Guanbin, Qin, Yipeng, Zhang, Jinjin, Chai, Zhenhua, Wei, Xiaolin, Lin, Liang, and Yu, Yizhou
Published: 2024
Full Text: View/download PDF

8. Feature Proliferation -- the 'Cancer' in StyleGAN and its Treatments

Author: Song, Shuang, Liang, Yuanbang, Wu, Jing, Lai, Yu-Kun, and Qin, Yipeng
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite the success of StyleGAN in image synthesis, the images it synthesizes are not always perfect and the well-known truncation trick has become a standard post-processing technique for StyleGAN to synthesize high-quality images. Although effective, it has long been noted that the truncation trick tends to reduce the diversity of synthesized images and unnecessarily sacrifices many distinct image features. To address this issue, in this paper, we first delve into the StyleGAN image synthesis mechanism and discover an important phenomenon, namely Feature Proliferation, which demonstrates how specific features reproduce with forward propagation. Then, we show how the occurrence of Feature Proliferation results in StyleGAN image artifacts. As an analogy, we refer to it as the" cancer" in StyleGAN from its proliferating and malignant nature. Finally, we propose a novel feature rescaling method that identifies and modulates risky features to mitigate feature proliferation. Thanks to our discovery of Feature Proliferation, the proposed feature rescaling method is less destructive and retains more useful image features than the truncation trick, as it is more fine-grained and works in a lower-level feature space rather than a high-level latent space. Experimental results justify the validity of our claims and the effectiveness of the proposed feature rescaling method. Our code is available at https://github. com/songc42/Feature-proliferation., Comment: Accepted at ICCV 2023
Published: 2023

9. Computational Design of Wiring Layout on Tight Suits with Minimal Motion Resistance

Author: Wang, Kai, Xu, Xiaoyu, Zhen, Yinping, Zhou, Da, Guo, Shihui, Qin, Yipeng, and Guo, Xiaohu
Subjects: Computer Science - Human-Computer Interaction
Abstract: An increasing number of electronics are directly embedded on the clothing to monitor human status (e.g., skeletal motion) or provide haptic feedback. A specific challenge to prototype and fabricate such a clothing is to design the wiring layout, while minimizing the intervention to human motion. We address this challenge by formulating the topological optimization problem on the clothing surface as a deformation-weighted Steiner tree problem on a 3D clothing mesh. Our method proposed an energy function for minimizing strain energy in the wiring area under different motions, regularized by its total length. We built the physical prototype to verify the effectiveness of our method and conducted user study with participants of both design experts and smart cloth users. On three types of commercial products of smart clothing, the optimized layout design reduced wire strain energy by an average of 77% among 248 actions compared to baseline design, and 18% over the expert design., Comment: This work is accepted at SIGGRAPH ASIA 2023(Conference Track)
Published: 2023

10. Improved Distribution Matching for Dataset Condensation

Author: Zhao, Ganlong, Li, Guanbin, Qin, Yipeng, and Yu, Yizhou
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Dataset Condensation aims to condense a large dataset into a smaller one while maintaining its ability to train a well-performing model, thus reducing the storage cost and training effort in deep learning applications. However, conventional dataset condensation methods are optimization-oriented and condense the dataset by performing gradient or parameter matching during model optimization, which is computationally intensive even on small datasets and models. In this paper, we propose a novel dataset condensation method based on distribution matching, which is more efficient and promising. Specifically, we identify two important shortcomings of naive distribution matching (i.e., imbalanced feature numbers and unvalidated embeddings for distance computation) and address them with three novel techniques (i.e., partitioning and expansion augmentation, efficient and enriched model sampling, and class-aware distribution regularization). Our simple yet effective method outperforms most previous optimization-oriented methods with much fewer computational resources, thereby scaling data condensation to larger datasets and models. Extensive experiments demonstrate the effectiveness of our method. Codes are available at https://github.com/uitrbn/IDM, Comment: CVPR2023
Published: 2023

11. Universal Semi-supervised Model Adaptation via Collaborative Consistency Training

Author: Yan, Zizheng, Wu, Yushuang, Qin, Yipeng, Han, Xiaoguang, Cui, Shuguang, and Li, Guanbin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we introduce a realistic and challenging domain adaptation problem called Universal Semi-supervised Model Adaptation (USMA), which i) requires only a pre-trained source model, ii) allows the source and target domain to have different label sets, i.e., they share a common label set and hold their own private label set, and iii) requires only a few labeled samples in each class of the target domain. To address USMA, we propose a collaborative consistency training framework that regularizes the prediction consistency between two models, i.e., a pre-trained source model and its variant pre-trained with target data only, and combines their complementary strengths to learn a more powerful model. The rationale of our framework stems from the observation that the source model performs better on common categories than the target-only model, while on target-private categories, the target-only model performs better. We also propose a two-perspective, i.e., sample-wise and class-wise, consistency regularization to improve the training. Experimental results demonstrate the effectiveness of our method on several benchmark datasets.
Published: 2023

12. Exploration and Exploitation of Unlabeled Data for Open-Set Semi-Supervised Learning

Author: Zhao, Ganlong, Li, Guanbin, Qin, Yipeng, Zhang, Jinjin, Chai, Zhenhua, Wei, Xiaolin, Lin, Liang, and Yu, Yizhou
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we address a complex but practical scenario in semi-supervised learning (SSL) named open-set SSL, where unlabeled data contain both in-distribution (ID) and out-of-distribution (OOD) samples. Unlike previous methods that only consider ID samples to be useful and aim to filter out OOD ones completely during training, we argue that the exploration and exploitation of both ID and OOD samples can benefit SSL. To support our claim, i) we propose a prototype-based clustering and identification algorithm that explores the inherent similarity and difference among samples at feature level and effectively cluster them around several predefined ID and OOD prototypes, thereby enhancing feature learning and facilitating ID/OOD identification; ii) we propose an importance-based sampling method that exploits the difference in importance of each ID and OOD sample to SSL, thereby reducing the sampling bias and improving the training. Our proposed method achieves state-of-the-art in several challenging benchmarks, and improves upon existing SSL methods even when ID samples are totally absent in unlabeled data.
Published: 2023

13. Parametric Implicit Face Representation for Audio-Driven Facial Reenactment

Author: Huang, Ricong, Lai, Peiwen, Qin, Yipeng, and Li, Guanbin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Audio-driven facial reenactment is a crucial technique that has a range of applications in film-making, virtual avatars and video conferences. Existing works either employ explicit intermediate face representations (e.g., 2D facial landmarks or 3D face models) or implicit ones (e.g., Neural Radiance Fields), thus suffering from the trade-offs between interpretability and expressive power, hence between controllability and quality of the results. In this work, we break these trade-offs with our novel parametric implicit face representation and propose a novel audio-driven facial reenactment framework that is both controllable and can generate high-quality talking heads. Specifically, our parametric implicit representation parameterizes the implicit representation with interpretable parameters of 3D face models, thereby taking the best of both explicit and implicit methods. In addition, we propose several new techniques to improve the three components of our framework, including i) incorporating contextual information into the audio-to-expression parameters encoding; ii) using conditional image synthesis to parameterize the implicit representation and implementing it with an innovative tri-plane structure for efficient learning; iii) formulating facial reenactment as a conditional image inpainting problem and proposing a novel data augmentation technique to improve model generalizability. Extensive experiments demonstrate that our method can generate more realistic results than previous methods with greater fidelity to the identities and talking styles of speakers., Comment: CVPR 2023
Published: 2023

14. Motion-R3: Fast and Accurate Motion Annotation via Representation-based Representativeness Ranking

Author: Yu, Jubo, Ren, Tianxiang, Guo, Shihui, Fang, Fengyi, Wang, Kai, Zeng, Zijiao, Zhang, Yazhan, Aristidou, Andreas, and Qin, Yipeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: In this paper, we follow a data-centric philosophy and propose a novel motion annotation method based on the inherent representativeness of motion data in a given dataset. Specifically, we propose a Representation-based Representativeness Ranking R3 method that ranks all motion data in a given dataset according to their representativeness in a learned motion representation space. We further propose a novel dual-level motion constrastive learning method to learn the motion representation space in a more informative way. Thanks to its high efficiency, our method is particularly responsive to frequent requirements change and enables agile development of motion annotation models. Experimental results on the HDM05 dataset against state-of-the-art methods demonstrate the superiority of our method.
Published: 2023

15. Diverse Motion In-betweening with Dual Posture Stitching

Author: Ren, Tianxiang, Yu, Jubo, Guo, Shihui, Ma, Ying, Ouyang, Yutao, Zeng, Zijiao, Zhang, Yazhan, and Qin, Yipeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Graphics
Abstract: In-betweening is a technique for generating transitions given initial and target character states. The majority of existing works require multiple (often $>$10) frames as input, which are not always accessible. Our work deals with a focused yet challenging problem: to generate the transition when given exactly two frames (only the first and last). To cope with this challenging scenario, we implement our bi-directional scheme which generates forward and backward transitions from the start and end frames with two adversarial autoregressive networks, and stitches them in the middle of the transition where there is no strict ground truth. The autoregressive networks based on conditional variational autoencoders (CVAE) are optimized by searching for a pair of optimal latent codes that minimize a novel stitching loss between their outputs. Results show that our method achieves higher motion quality and more diverse results than existing methods on both the LaFAN1 and Human3.6m datasets., Comment: 10 pages, 5 figures
Published: 2023

16. Reduced-Reference Quality Assessment of Point Clouds via Content-Oriented Saliency Projection

Author: Zhou, Wei, Yue, Guanghui, Zhang, Ruizeng, Qin, Yipeng, and Liu, Hantao
Subjects: Computer Science - Multimedia, Computer Science - Computer Vision and Pattern Recognition
Abstract: Many dense 3D point clouds have been exploited to represent visual objects instead of traditional images or videos. To evaluate the perceptual quality of various point clouds, in this letter, we propose a novel and efficient Reduced-Reference quality metric for point clouds, which is based on Content-oriented sAliency Projection (RR-CAP). Specifically, we make the first attempt to simplify reference and distorted point clouds into projected saliency maps with a downsampling operation. Through this process, we tackle the issue of transmitting large-volume original point clouds to user-ends for quality assessment. Then, motivated by the characteristics of the human visual system (HVS), the objective quality scores of distorted point clouds are produced by combining content-oriented similarity and statistical correlation measurements. Finally, extensive experiments are conducted on SJTU-PCQA and WPC databases. The experimental results demonstrate that our proposed algorithm outperforms existing reduced-reference and no-reference quality metrics, and significantly reduces the performance gap between state-of-the-art full-reference quality assessment methods. In addition, we show the performance variation of each proposed technical component by ablation tests.
Published: 2023
Full Text: View/download PDF

17. WristSketcher: Creating Dynamic Sketches in AR with a Sensing Wristband

Author: Ying, Enting, Xiong, Tianyang, Guo, Shihui, Qiu, Ming, Qin, Yipeng, and Fu, Hongbo
Subjects: Computer Science - Human-Computer Interaction
Abstract: Restricted by the limited interaction area of native AR glasses (e.g., touch bars), it is challenging to create sketches in AR glasses. Recent works have attempted to use mobile devices (e.g., tablets) or mid-air bare-hand gestures to expand the interactive spaces and can work as the 2D/3D sketching input interfaces for AR glasses. Between them, mobile devices allow for accurate sketching but are often heavy to carry, while sketching with bare hands is zero-burden but can be inaccurate due to arm instability. In addition, mid-air bare-hand sketching can easily lead to social misunderstandings and its prolonged use can cause arm fatigue. As a new attempt, in this work, we present WristSketcher, a new AR system based on a flexible sensing wristband for creating 2D dynamic sketches, featuring an almost zero-burden authoring model for accurate and comfortable sketch creation in real-world scenarios. Specifically, we have streamlined the interaction space from the mid-air to the surface of a lightweight sensing wristband, and implemented AR sketching and associated interaction commands by developing a gesture recognition method based on the sensing pressure points on the wristband. The set of interactive gestures used by our WristSketcher is determined by a heuristic study on user preferences. Moreover, we endow our WristSketcher with the ability of animation creation, allowing it to create dynamic and expressive sketches. Experimental results demonstrate that our WristSketcher i) faithfully recognizes users' gesture interactions with a high accuracy of 96.0%; ii) achieves higher sketching accuracy than Freehand sketching; iii) achieves high user satisfaction in ease of use, usability and functionality; and iv) shows innovation potentials in art creation, memory aids, and entertainment applications.
Published: 2022

18. Ultrasonic assisted electrochemical drilling and grinding of small holes on SLMed Hastelloy X with rotating abrasive tube electrode

Author: Qin, Yipeng, Liu, Yong, Guan, Wenchao, Shu, Tong, and Wang, Kan
Published: 2024
Full Text: View/download PDF

19. Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

Author: Zhao, Ganlong, Li, Guanbin, Qin, Yipeng, Liu, Feng, and Yu, Yizhou
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep models trained with noisy labels are prone to over-fitting and struggle in generalization. Most existing solutions are based on an ideal assumption that the label noise is class-conditional, i.e., instances of the same class share the same noise model, and are independent of features. While in practice, the real-world noise patterns are usually more fine-grained as instance-dependent ones, which poses a big challenge, especially in the presence of inter-class imbalance. In this paper, we propose a two-stage clean samples identification method to address the aforementioned challenge. First, we employ a class-level feature clustering procedure for the early identification of clean samples that are near the class-wise prediction centers. Notably, we address the class imbalance problem by aggregating rare classes according to their prediction entropy. Second, for the remaining clean samples that are close to the ground truth class boundary (usually mixed with the samples with instance-dependent noises), we propose a novel consistency-based classification method that identifies them using the consistency of two classifier heads: the higher the consistency, the larger the probability that a sample is clean. Extensive experiments on several challenging benchmarks demonstrate the superior performance of our method against the state-of-the-art., Comment: Accepted to ECCV2022
Published: 2022

20. Exploring and Exploiting Hubness Priors for High-Quality GAN Latent Sampling

Author: Liang, Yuanbang, Wu, Jing, Lai, Yu-Kun, and Qin, Yipeng
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Despite the extensive studies on Generative Adversarial Networks (GANs), how to reliably sample high-quality images from their latent spaces remains an under-explored topic. In this paper, we propose a novel GAN latent sampling method by exploring and exploiting the hubness priors of GAN latent distributions. Our key insight is that the high dimensionality of the GAN latent space will inevitably lead to the emergence of hub latents that usually have much larger sampling densities than other latents in the latent space. As a result, these hub latents are better trained and thus contribute more to the synthesis of high-quality images. Unlike the a posterior "cherry-picking", our method is highly efficient as it is an a priori method that identifies high-quality latents before the synthesis of images. Furthermore, we show that the well-known but purely empirical truncation trick is a naive approximation to the central clustering effect of hub latents, which not only uncovers the rationale of the truncation trick, but also indicates the superiority and fundamentality of our method. Extensive experimental results demonstrate the effectiveness of the proposed method., Comment: Accepted at ICML 2022. Our code is available at: https://github.com/Byronliang8/HubnessGANSampling
Published: 2022

21. Multi-level Consistency Learning for Semi-supervised Domain Adaptation

Author: Yan, Zizheng, Wu, Yushuang, Li, Guanbin, Qin, Yipeng, Han, Xiaoguang, and Cui, Shuguang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Semi-supervised domain adaptation (SSDA) aims to apply knowledge learned from a fully labeled source domain to a scarcely labeled target domain. In this paper, we propose a Multi-level Consistency Learning (MCL) framework for SSDA. Specifically, our MCL regularizes the consistency of different views of target domain samples at three levels: (i) at inter-domain level, we robustly and accurately align the source and target domains using a prototype-based optimal transport method that utilizes the pros and cons of different views of target samples; (ii) at intra-domain level, we facilitate the learning of both discriminative and compact target feature representations by proposing a novel class-wise contrastive clustering loss; (iii) at sample level, we follow standard practice and improve the prediction accuracy by conducting a consistency-based self-training. Empirically, we verified the effectiveness of our MCL framework on three popular SSDA benchmarks, i.e., VisDA2017, DomainNet, and Office-Home datasets, and the experimental results demonstrate that our MCL framework achieves the state-of-the-art performance., Comment: IJCAI 2022
Published: 2022

22. Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors

Author: Chen, Chaofeng, Shi, Xinyu, Qin, Yipeng, Li, Xiaoming, Han, Xiaoguang, Yang, Tao, and Guo, Shihui
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: A key challenge of real-world image super-resolution (SR) is to recover the missing details in low-resolution (LR) images with complex unknown degradations (e.g., downsampling, noise and compression). Most previous works restore such missing details in the image space. To cope with the high diversity of natural images, they either rely on the unstable GANs that are difficult to train and prone to artifacts, or resort to explicit references from high-resolution (HR) images that are usually unavailable. In this work, we propose Feature Matching SR (FeMaSR), which restores realistic HR images in a much more compact feature space. Unlike image-space methods, our FeMaSR restores HR images by matching distorted LR image {\it features} to their distortion-free HR counterparts in our pretrained HR priors, and decoding the matched features to obtain realistic HR images. Specifically, our HR priors contain a discrete feature codebook and its associated decoder, which are pretrained on HR images with a Vector Quantized Generative Adversarial Network (VQGAN). Notably, we incorporate a novel semantic regularization in VQGAN to improve the quality of reconstructed images. For the feature matching, we first extract LR features with an LR encoder consisting of several Swin Transformer blocks and then follow a simple nearest neighbour strategy to match them with the pretrained codebook. In particular, we equip the LR encoder with residual shortcut connections to the decoder, which is critical to the optimization of feature matching loss and also helps to complement the possible feature matching errors. Experimental results show that our approach produces more realistic HR images than previous methods. Codes are released at \url{https://github.com/chaofengc/FeMaSR}., Comment: Accepted to ACM MM2022
Published: 2022

23. PVSeRF: Joint Pixel-, Voxel- and Surface-Aligned Radiance Field for Single-Image Novel View Synthesis

Author: Yu, Xianggang, Tang, Jiapeng, Qin, Yipeng, Li, Chenghong, Bao, Linchao, Han, Xiaoguang, and Cui, Shuguang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We present PVSeRF, a learning framework that reconstructs neural radiance fields from single-view RGB images, for novel view synthesis. Previous solutions, such as pixelNeRF, rely only on pixel-aligned features and suffer from feature ambiguity issues. As a result, they struggle with the disentanglement of geometry and appearance, leading to implausible geometries and blurry results. To address this challenge, we propose to incorporate explicit geometry reasoning and combine it with pixel-aligned features for radiance field prediction. Specifically, in addition to pixel-aligned features, we further constrain the radiance field learning to be conditioned on i) voxel-aligned features learned from a coarse volumetric grid and ii) fine surface-aligned features extracted from a regressed point cloud. We show that the introduction of such geometry-aware features helps to achieve a better disentanglement between appearance and geometry, i.e. recovering more accurate geometries and synthesizing higher quality images of novel views. Extensive experiments against state-of-the-art methods on ShapeNet benchmarks demonstrate the superiority of our approach for single-image novel view synthesis.
Published: 2022

24. Material corrosion characteristics of heat-treated SLM-ed Hastelloy X in electrochemical machining process

Author: Qin, Yipeng, Liu, Yong, Guan, Wenchao, and Wang, Kan
Published: 2024
Full Text: View/download PDF

25. Improved StyleGAN Embedding: Where are the Good Latents?

Author: Zhu, Peihao, Abdal, Rameen, Qin, Yipeng, Femiani, John, and Wonka, Peter
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: StyleGAN is able to produce photorealistic images that are almost indistinguishable from real photos. The reverse problem of finding an embedding for a given image poses a challenge. Embeddings that reconstruct an image well are not always robust to editing operations. In this paper, we address the problem of finding an embedding that both reconstructs images and also supports image editing tasks. First, we introduce a new normalized space to analyze the diversity and the quality of the reconstructed latent codes. This space can help answer the question of where good latent codes are located in latent space. Second, we propose an improved embedding algorithm using a novel regularization method based on our analysis. Finally, we analyze the quality of different embedding algorithms. We compare our results with the current state-of-the-art methods and achieve a better trade-off between reconstruction quality and editing quality.
Published: 2020

26. A Survey of Algorithms for Geodesic Paths and Distances

Author: Crane, Keenan, Livesu, Marco, Puppo, Enrico, and Qin, Yipeng
Subjects: Computer Science - Graphics, Computer Science - Computational Geometry
Abstract: Numerical computation of shortest paths or geodesics on curved domains, as well as the associated geodesic distance, arises in a broad range of applications across digital geometry processing, scientific computing, computer graphics, and computer vision. Relative to Euclidean distance computation, these tasks are complicated by the influence of curvature on the behavior of shortest paths, as well as the fact that the representation of the domain may itself be approximate. In spite of the difficulty of this problem, recent literature has developed a wide variety of sophisticated methods that enable rapid queries of geodesic information, even on relatively large models. This survey reviews the major categories of approaches to the computation of geodesic paths and distances, highlighting common themes and opportunities for future improvement.
Published: 2020

27. SEAN: Image Synthesis with Semantic Region-Adaptive Normalization

Author: Zhu, Peihao, Abdal, Rameen, Qin, Yipeng, and Wonka, Peter
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: We propose semantic region-adaptive normalization (SEAN), a simple but effective building block for Generative Adversarial Networks conditioned on segmentation masks that describe the semantic regions in the desired output image. Using SEAN normalization, we can build a network architecture that can control the style of each semantic region individually, e.g., we can specify one style reference image per region. SEAN is better suited to encode, transfer, and synthesize style than the best previous method in terms of reconstruction quality, variability, and visual quality. We evaluate SEAN on multiple datasets and report better quantitative metrics (e.g. FID, PSNR) than the current state of the art. SEAN also pushes the frontier of interactive image editing. We can interactively edit images by changing segmentation masks or the style for any given region. We can also interpolate styles from two reference images per region., Comment: Accepted as a CVPR 2020 oral paper. The interactive demo is available at https://youtu.be/0Vbj9xFgoUw
Published: 2019
Full Text: View/download PDF

28. Image2StyleGAN++: How to Edit the Embedded Images?

Author: Abdal, Rameen, Qin, Yipeng, and Wonka, Peter
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics
Abstract: We propose Image2StyleGAN++, a flexible image editing framework with many applications. Our framework extends the recent Image2StyleGAN in three ways. First, we introduce noise optimization as a complement to the $W^+$ latent space embedding. Our noise optimization can restore high-frequency features in images and thus significantly improves the quality of reconstructed images, e.g. a big increase of PSNR from 20 dB to 45 dB. Second, we extend the global $W^+$ latent space embedding to enable local embeddings. Third, we combine embedding with activation tensor manipulation to perform high-quality local edits along with global semantic edits on images. Such edits motivate various high-quality image editing applications, e.g. image reconstruction, image inpainting, image crossover, local style transfer, image editing using scribbles, and attribute level feature transfer. Examples of the edited images are shown across the paper for visual inspection., Comment: CVPR 2020 " For the video, visit https://youtu.be/yd5WczbFt68 "
Published: 2019

29. Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space?

Author: Abdal, Rameen, Qin, Yipeng, and Wonka, Peter
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose an efficient algorithm to embed a given image into the latent space of StyleGAN. This embedding enables semantic image editing operations that can be applied to existing photographs. Taking the StyleGAN trained on the FFHQ dataset as an example, we show results for image morphing, style transfer, and expression transfer. Studying the results of the embedding algorithm provides valuable insights into the structure of the StyleGAN latent space. We propose a set of experiments to test what class of images can be embedded, how they are embedded, what latent space is suitable for embedding, and if the embedding is semantically meaningful., Comment: Accepted for oral presentation at ICCV 2019, "For videos visit https://youtu.be/RnTXLXw9o_I , https://youtu.be/zJoYY2eHAF0 and https://youtu.be/bA893L-PjbI"
Published: 2019

30. How does Lipschitz Regularization Influence GAN Training?

Author: Qin, Yipeng, Mitra, Niloy, and Wonka, Peter
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Despite the success of Lipschitz regularization in stabilizing GAN training, the exact reason of its effectiveness remains poorly understood. The direct effect of $K$-Lipschitz regularization is to restrict the $L2$-norm of the neural network gradient to be smaller than a threshold $K$ (e.g., $K=1$) such that $\|\nabla f\| \leq K$. In this work, we uncover an even more important effect of Lipschitz regularization by examining its impact on the loss function: It degenerates GAN loss functions to almost linear ones by restricting their domain and interval of attainable gradient values. Our analysis shows that loss functions are only successful if they are degenerated to almost linear ones. We also show that loss functions perform poorly if they are not degenerated and that a wide range of functions can be used as loss function as long as they are sufficiently degenerated by regularization. Basically, Lipschitz regularization ensures that all loss functions effectively work in the same way. Empirically, we verify our proposition on the MNIST, CIFAR10 and CelebA datasets., Comment: Accepted at ECCV 2020
Published: 2018

31. Double Narrowband Induced Perfect Absorption Photonic Sensor Based on Graphene–Dielectric–Gold Hybrid Metamaterial

Author: Liu, Zhimin, Zhuo, Shanshan, Zhou, Fengqi, Zhang, Xiao, Qin, Yipeng, Luo, Xin, Ji, Cheng, and Yang, Guangxin
Published: 2022
Full Text: View/download PDF

32. Microdroplets confined assembly of opal composites in dynamic borate ester-based networks

Author: Zhang, Jing, Qin, Yipeng, Pambos, Oliver J., Zhang, Jingjing, Chen, Su, Yu, Ziyi, and Abell, Chris
Published: 2021
Full Text: View/download PDF

33. How Does Lipschitz Regularization Influence GAN Training?

Author: Qin, Yipeng, Mitra, Niloy, Wonka, Peter, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Vedaldi, Andrea, editor, Bischof, Horst, editor, Brox, Thomas, editor, and Frahm, Jan-Michael, editor
Published: 2020
Full Text: View/download PDF

34. Prognostic Evaluation of Prognostic Nutrition Index for Patients with Radical Cystectomy: A Meta-analysis

Author: TANG Wenchao, LI Yuanwei, CHEN Jia, QIN Yipeng, WU Zhiying, and FU Huifeng
Subjects: bladder cancer, radical cystectomy, prognostic nutritional index, prognosis, Neoplasms. Tumors. Oncology. Including cancer and carcinogens, RC254-282
Abstract: Objective To systematically evaluate the relation between prognostic nutrition index (PNI) and prognosis of bladder cancer (BC) patients treated with radical cystectomy (RC). Methods We searched the literatures about the relation between PNI and the prognosis of patients treated with radical cystectomy published from the inception to January 30, 2021 in PubMed, Embase, Web of Science, CNKI, Wanfang, VIP and Chinese Medical Journal Database, and used RevMan5.3 software for Meta analysis. Results We included six literatures which comprise a total of 1273 patients. The results showed that there was a significant correlation between low PNI and OS of BC patients treated with RC (HR=2.0, 95%CI: 1.56-2.56), and there was a significant difference in RFS, PFS and DSS between low PNI and BC patients treated with RC (HR=1.93, 95%CI: 1.51-2.48). In the subgroup analysis, there were statistical differences in PNI and the prognosis of BC patients treated with RC between the Chinese group (HR=2.13, 95%CI: 1.62-2.81) and the Japanese group (HR=1.78, 95%CI: 1.08-2.94), and the PNI cutoff value had a good predictive effect on the prognosis of patients in the range of 46.08-51.30. Conclusion There is a significant relation between the level of PNI and OS of bladder cancer patients treated with radical cystectomy. Low PNI can be used as an effective marker to predict the prognosis of patients.
Published: 2021
Full Text: View/download PDF

35. Fast and exact geodesic computation using Edge-based Windows Grouping

Author: Qin, Yipeng
Subjects: 006.6
Abstract: Computing discrete geodesic distance over triangle meshes is one of the fundamental problems in computational geometry and computer graphics. As the “Big Data Era” arrives, a fast and accurate solution to the geodesic computation problem on large scale models with constantly increasing resolutions is desired. However, it is still challenging to deal with the speed, memory cost and accuracy of the geodesic computation at the same time. This thesis addresses the aforementioned challenge by proposing the Edge- based Windows Grouping (EWG) technique. With the local geodesic information encoded in a “window”, EWG groups the windows based on the mesh edges and processes them together. Thus, the interrelationships among the grouped windows can be utilized to improve the performance of geodesic computation on triangle meshes. Based on EWG, a novel exact geodesic algorithm is proposed in this thesis, which is fast, accurate and memory-efficient. This algorithm computes the geodesic distances at mesh vertices by propagating the geodesic information from the source over the entire mesh. Its high performance comes from its low computational redundancy and management overhead, which are both introduced by EWG. First, the redundant windows on an edge can be removed by comparing its distance with those of the other windows on the same edge. Second, the windows grouped on an edge usually have similar geodesic distances and can be propagated in batches efficiently. To the best of my knowledge, the proposed exact geodesic algorithm is the fastest and most memory-efficient one among all existing methods. In addition, the proposed exact geodesic algorithm is revised and employed to construct the geodesic-metric-based Voronoi diagram on triangle meshes. In this application, the geodesic computation is the bottleneck in both the time and memory costs. The proposed method achieves low memory cost from the key observation that the Voronoi diagram boundaries usually only cross a minority of the meshes’ triangles and most of the windows stored on edges are redundant. As a result, the proposed method resolves the memory bottleneck of the Voronoi diagram construction without sacrificing its speed.
Published: 2017

36. Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

Author: Zhao, Ganlong, primary, Li, Guanbin, additional, Qin, Yipeng, additional, Liu, Feng, additional, and Yu, Yizhou, additional
Published: 2022
Full Text: View/download PDF

37. Novel photothermal-responsive sandwich-structured mesoporous silica nanoparticles: synthesis, characterization, and application for controlled drug delivery

Author: Qin, Yipeng, Huang, Yuhan, Li, Min, Ren, Bo, Wang, Pan, Zhong, Qidi, and Liu, Chunyan
Published: 2021
Full Text: View/download PDF

38. Universal Semi-supervised Model Adaptation via Collaborative Consistency Training

Author: Yan, Zizheng, primary, Wu, Yushuang, additional, Qin, Yipeng, additional, Han, Xiaoguang, additional, Cui, Shuguang, additional, and Li, Guanbin, additional
Published: 2024
Full Text: View/download PDF

39. Diverse Motion In-betweening from Sparse Keyframes with Dual Posture Stitching

Author: Ren, Tianxiang, primary, Yu, Jubo, additional, Guo, Shihui, additional, Ma, Ying, additional, Ouyang, Yutao, additional, Zeng, Zijiao, additional, Zhang, Yazhan, additional, and Qin, Yipeng, additional
Published: 2024
Full Text: View/download PDF

40. Computational Design of Wiring Layout on Tight Suits with Minimal Motion Resistance

Author: Wang, Kai, primary, Xu, Xiaoyu, additional, Zheng, Yinping, additional, Zhou, Da, additional, Guo, Shihui, additional, Qin, Yipeng, additional, and Guo, Xiaohu, additional
Published: 2023
Full Text: View/download PDF

41. Analysis of Operation Effect of Ground Source Heat Pump System in a Low-Carbon Park Based on Measured Data

Author: Tan, Zhukui, primary, Qiao, Biao, additional, Wang, Yang, additional, Li, Jintang, additional, Sun, Zongyu, additional, Li, Ji, additional, and Qin, Yipeng, additional
Published: 2023
Full Text: View/download PDF

42. Review of Research on Evaluation Index System of Integrated Energy System in Low-Carbon Park

Author: Tan, Zhukui, primary, Qin, Yipeng, additional, Sun, Zongyu, additional, Wang, Yang, additional, Li, Ji, additional, Xu, Wei, additional, Zheng, Youzhuo, additional, and Liu, Zhenpeng, additional
Published: 2023
Full Text: View/download PDF

43. Exploration and practice of zero-carbon intra-park in the context of Carbon peaking and Carbon neutrality Goals： A case study on an intra-park

Author: Zhang Jun, Qin Yipeng, Feng Chao, Qiao Biao, Song Jialiang, Liang Shukui, Mu Lichun, and Wang Lu
Subjects: Environmental sciences, GE1-350
Abstract: Aiming at the planning of zero carbon park under the background of double carbon, tins paper takes an industrial park as an example, and analyzes and evaluates the industrial planning situation, energy resources situation and energy plaiming situation of the park based on the concept of “passive priority, active optimization”. Taking the project as the blueprint, integrating the building energy-saving measures and the advanced nature and carbon reduction ability of the integrated smart energy technology, starting from the overall optimization layout of the park, covering the industrial structure, buildings, transportation, energy and other key areas of the park, the comprehensive application of low-carbon technology forms such as building micro-environment, ultra-low energy consumption building system, efficient integrated energy system and renewable energy. Total carbon emissions as the core goal, to achieve the goal zero carbon zone. This study for the project and project time zero carbon building and future zero carbon park planning to provide the reference of the same kind of park.
Published: 2023
Full Text: View/download PDF

44. Research on Load Modeling Method for Typical Low Carbon Energy Consumption Scenarios in Border and Cross border Regions Considering Seasonal Migration Characteristics

Author: Chen Shumin, Liang Shukui, Zhang Hao, You Guangzeng, Qiao Biao, Qin Yipeng, and Wang Lu
Subjects: Environmental sciences, GE1-350
Abstract: With the process of urbanization and the ‘the Belt and Road’ initiative, the cross-border energy demand in southwest China has grown rapidly, driving the development of the energy system. The accuracy of load forecasting directly affects the application of energy systems, so it is crucial to conduct research on load forecasting for energy terminals in border and cross-border areas. However, there is a seasonal shift in the diverse energy consumption loads in border and cross-border regions, and currently, research on load forecasting and simulation of typical low-carbon energy consumption scenarios under this feature is basically in a blank state. Based on existing problems, this article conducts research on load modeling methods under the significant ‘seasonal migration’ characteristics of border and cross-border loads, conducts research on characteristic industries in border and cross-border areas, establishes typical low-carbon energy consumption scenarios and simulation models in border and cross-border areas, and uses sensitivity analysis method of dynamic simulation to analyze the impact of different influencing factors on the load of various building types, The Monte Carlo simulation prediction method is used to obtain the sensitivity probability distribution of various influencing characteristic factors, and the typical energy consumption building load model is modified. Finally, by comparing the energy consumption simulation results with statistical results, the accuracy of simulation energy consumption prediction is verified to be higher than 90%.
Published: 2023
Full Text: View/download PDF

45. Development of an integrated energy system simulation computing platform based on a typical cross-border area

Author: Chen, Shumin, primary, Qiao, Biao, additional, Zhang, Hao, additional, Liang, Shukui, additional, You, Guangzeng, additional, and Qin, Yipeng, additional
Published: 2023
Full Text: View/download PDF

46. Optimization Design and Operation Effect Verification of Large-Scale Ground Source Heat Pump System in a Low-Carbon Park Based on Whole Process Analysis

Author: Qiao, Biao, primary, Tan, Zhukui, additional, Wang, Yang, additional, Li, Ji, additional, Feng, Xiaomei, additional, Li, Jintang, additional, Qin, Yipeng, additional, and Liang, Shukui, additional
Published: 2023
Full Text: View/download PDF

47. Research on Load Forecasting Method Based on Building Load Database of Typical Low Carbon Scenarios in Border and Cross Border Areas

Author: Chen, Shumin, primary, Liang, Shukui, additional, Zhang, Hao, additional, You, Guangzeng, additional, Qiao, Biao, additional, and Qin, Yipeng, additional
Published: 2023
Full Text: View/download PDF

48. Triple plasmon-induced transparency and polarization-insensitive optical switch based on monolayer patterned graphene metamaterial

Author: Liu, Zhimin, primary, Qin, Yipeng, additional, Zhou, Fengqi, additional, Zhuo, Shanshan, additional, Ji, Cheng, additional, Yang, Guangxin, additional, Xie, Yadong, additional, Yang, Ruihan, additional, and Luo, Xin, additional
Published: 2023
Full Text: View/download PDF

49. How Does Lipschitz Regularization Influence GAN Training?

Author: Qin, Yipeng, primary, Mitra, Niloy, additional, and Wonka, Peter, additional
Published: 2020
Full Text: View/download PDF

50. Parametric Implicit Face Representation for Audio-Driven Facial Reenactment

Author: Huang, Ricong, primary, Lai, Peiwen, additional, Qin, Yipeng, additional, and Li, Guanbin, additional
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

207 results on '"Qin, Yipeng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources