Author: "Peng, Yaxin" / Search Limiters: Full Text - Searchworks@Jio Institute Digital Library Search Results

109 results on '"Peng, Yaxin"'

1. Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation

Author: Zhu, Minjie, Zhu, Yichen, Li, Jinming, Wen, Junjie, Xu, Zhiyuan, Liu, Ning, Cheng, Ran, Shen, Chaomin, Peng, Yaxin, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Robotics
Abstract: Diffusion Policy is a powerful tool for learning end-to-end visuomotor robot control. It is expected that Diffusion Policy possesses scalability, a key attribute for deep neural networks, typically suggesting that increasing model size would lead to enhanced performance. However, our observations indicate that Diffusion Policy in transformer architecture (DP-T) struggles to scale effectively; even minor additions of layers can deteriorate training outcomes. To address this issue, we introduce Scalable Diffusion Transformer Policy for visuomotor learning. Our proposed method, namely ScaleDP, introduces two modules that improve the training dynamics of Diffusion Policy and allow the network to better handle multimodal action distribution. First, we identify that DP-T suffers from large gradient issues, making the optimization of Diffusion Policy unstable. To resolve this issue, we factorize the feature embedding of observation into multiple affine layers, and integrate it into the transformer blocks. Additionally, we utilize non-causal attention, which allows the policy network to "see" future actions during prediction, helping to reduce compounding errors. We demonstrate that our proposed method successfully scales the Diffusion Policy from 10 million to 1 billion parameters. This new model, named ScaleDP, can effectively scale up the model size with improved performance and generalization. We benchmark ScaleDP across 50 different tasks from MetaWorld and find that our largest ScaleDP outperforms DP-T with an average improvement of 21.6%. Across 7 real-world robot tasks, our ScaleDP demonstrates an average improvement of 36.25% over DP-T on four single-arm tasks and 75% on three bimanual tasks. We believe our work paves the way for scaling up models for visuomotor learning. The project page is available at scaling-diffusion-policy.github.io.
Published: 2024

2. TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation

Author: Wen, Junjie, Zhu, Yichen, Li, Jinming, Zhu, Minjie, Wu, Kun, Xu, Zhiyuan, Liu, Ning, Cheng, Ran, Shen, Chaomin, Peng, Yaxin, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: Vision-Language-Action (VLA) models have shown remarkable potential in visuomotor control and instruction comprehension through end-to-end learning processes. However, current VLA models face significant challenges: they are slow during inference and require extensive pre-training on large amounts of robotic data, making real-world deployment difficult. In this paper, we introduce a new family of compact vision-language-action models, called TinyVLA, which offers two key advantages over existing VLA models: (1) faster inference speeds, and (2) improved data efficiency, eliminating the need for a pre-training stage. Our framework incorporates two essential components to build TinyVLA: (1) initializing the policy backbone with robust, high-speed multimodal models, and (2) integrating a diffusion policy decoder during fine-tuning to enable precise robot actions. We conducted extensive evaluations of TinyVLA in both simulation and on real robots, demonstrating that our approach significantly outperforms the state-of-the-art VLA model, OpenVLA, in terms of speed and data efficiency, while delivering comparable or superior performance. Additionally, TinyVLA exhibits strong generalization capabilities across various dimensions, including language instructions, novel objects, unseen positions, changes in object appearance, background variations, and environmental shifts, often matching or exceeding the performance of OpenVLA. We believe that TinyVLA offers an interesting perspective on utilizing pre-trained multimodal models for policy learning. Our project is at https://tiny-vla.github.io., Comment: add more citations
Published: 2024

3. MMRo: Are Multimodal LLMs Eligible as the Brain for In-Home Robotics?

Author: Li, Jinming, Zhu, Yichen, Xu, Zhiyuan, Gu, Jindong, Zhu, Minjie, Liu, Xin, Liu, Ning, Peng, Yaxin, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: It is fundamentally challenging for robots to serve as useful assistants in human environments because this requires addressing a spectrum of sub-problems across robotics, including perception, language understanding, reasoning, and planning. The recent advancements in Multimodal Large Language Models (MLLMs) have demonstrated their exceptional abilities in solving complex mathematical problems, mastering commonsense and abstract reasoning. This has led to the recent utilization of MLLMs as the brain in robotic systems, enabling these models to conduct high-level planning prior to triggering low-level control actions for task execution. However, it remains uncertain whether existing MLLMs are reliable in serving the brain role of robots. In this study, we introduce the first benchmark for evaluating Multimodal LLMs for Robotics (MMRo), which tests the capability of MLLMs for robot applications. Specifically, we identify four essential capabilities (perception, task planning, visual reasoning, and safety measurement) that MLLMs must possess to qualify as the robot's central processing unit. We have developed several scenarios for each capability, resulting in a total of 14 metrics for evaluation. We present experimental results for various MLLMs, including both commercial and open-source models, to assess the performance of existing systems. Our findings indicate that no single model excels in all areas, suggesting that current MLLMs are not yet trustworthy enough to serve as the cognitive core for robots. Our data can be found at https://mm-robobench.github.io/.
Published: 2024

4. Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models

Author: Zhu, Minjie, Zhu, Yichen, Liu, Xin, Liu, Ning, Xu, Zhiyuan, Shen, Chaomin, Peng, Yaxin, Ou, Zhicai, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Multimodal Large Language Models (MLLMs) have showcased impressive skills in tasks related to visual understanding and reasoning. Yet, their widespread application faces obstacles due to the high computational demands during both the training and inference phases, restricting their use to a limited audience within the research and user communities. In this paper, we investigate the design aspects of Multimodal Small Language Models (MSLMs) and propose an efficient multimodal assistant named Mipha, which is designed to create synergy among various aspects: visual representation, language models, and optimization strategies. We show that without increasing the volume of training data, our Mipha-3B outperforms the state-of-the-art large MLLMs, especially LLaVA-1.5-13B, on multiple benchmarks. Through detailed discussion, we provide insights and guidelines for developing strong MSLMs that rival the capabilities of MLLMs. Our code is available at https://github.com/zhuyiche/llava-phi.
Published: 2024

5. Language-Conditioned Robotic Manipulation with Fast and Slow Thinking

Author: Zhu, Minjie, Zhu, Yichen, Li, Jinming, Wen, Junjie, Xu, Zhiyuan, Che, Zhengping, Shen, Chaomin, Peng, Yaxin, Liu, Dong, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: Language-conditioned robotic manipulation aims to transfer natural language instructions into executable actions, from simple pick-and-place to tasks requiring intent recognition and visual reasoning. Inspired by the dual process theory in cognitive science, which suggests two parallel systems of fast and slow thinking in human decision-making, we introduce Robotics with Fast and Slow Thinking (RFST), a framework that mimics human cognitive architecture to classify tasks and make decisions on two systems based on instruction types. Our RFST consists of two key components: 1) an instruction discriminator to determine which system should be activated based on the current user instruction, and 2) a slow-thinking system that comprises a fine-tuned vision language model aligned with the policy networks, which allows the robot to recognize user intention or perform reasoning tasks. To assess our methodology, we built a dataset featuring real-world trajectories, capturing actions ranging from spontaneous impulses to tasks requiring deliberate contemplation. Our results, both in simulation and real-world scenarios, confirm that our approach adeptly manages intricate tasks that demand intent recognition and reasoning. The project is available at https://jlm-z.github.io/RSFT/, Comment: accepted to ICRA2024
Published: 2024

6. Object-Centric Instruction Augmentation for Robotic Manipulation

Author: Wen, Junjie, Zhu, Yichen, Zhu, Minjie, Li, Jinming, Xu, Zhiyuan, Che, Zhengping, Shen, Chaomin, Peng, Yaxin, Liu, Dong, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: Humans interpret scenes by recognizing both the identities and positions of objects in their observations. For a robot to perform tasks such as "pick and place", understanding both what the objects are and where they are located is crucial. While the former has been extensively discussed in the literature that uses the large language model to enrich the text descriptions, the latter remains underexplored. In this work, we introduce the Object-Centric Instruction Augmentation (OCI) framework to augment highly semantic and information-dense language instruction with position cues. We utilize a Multi-modal Large Language Model (MLLM) to weave knowledge of object locations into natural language instruction, thus aiding the policy network in mastering actions for versatile manipulation. Additionally, we present a feature reuse mechanism to integrate the vision-language features from off-the-shelf pre-trained MLLM into policy networks. Through a series of simulated and real-world robotic tasks, we demonstrate that robotic manipulator imitation policies trained with our enhanced instructions outperform those relying solely on traditional language instructions., Comment: accepted to ICRA2024
Published: 2024

7. Exploring Gradient Explosion in Generative Adversarial Imitation Learning: A Probabilistic Perspective

Author: Wang, Wanying, Zhu, Yichen, Zhou, Yirui, Shen, Chaomin, Tang, Jian, Xu, Zhiyuan, Peng, Yaxin, and Zhang, Yangchun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Generative Adversarial Imitation Learning (GAIL) stands as a cornerstone approach in imitation learning. This paper investigates the gradient explosion in two types of GAIL: GAIL with deterministic policy (DE-GAIL) and GAIL with stochastic policy (ST-GAIL). We begin with the observation that training can be highly unstable for DE-GAIL at the beginning of the training phase and end in divergence. Conversely, the ST-GAIL training trajectory remains consistent, reliably converging. To shed light on these disparities, we provide an explanation from a theoretical perspective. By establishing a probabilistic lower bound for GAIL, we demonstrate that gradient explosion is an inevitable outcome for DE-GAIL due to occasionally large expert-imitator policy disparity, whereas ST-GAIL does not have this issue. To substantiate our assertion, we illustrate how modifications in the reward function can mitigate the gradient explosion challenge. Finally, we propose CREDO, a simple yet effective strategy that clips the reward function during the training phase, allowing GAIL to enjoy high data efficiency and stable trainability., Comment: AAAI2024
Published: 2023

8. PMNN: Physical Model-driven Neural Network for solving time-fractional differential equations

Author: Ma, Zhiying, Hou, Jie, Zhu, Wenhao, Peng, Yaxin, and Li, Ying
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Mathematics - Numerical Analysis
Abstract: In this paper, an innovative Physical Model-driven Neural Network (PMNN) method is proposed to solve time-fractional differential equations. It establishes a temporal iteration scheme based on physical model-driven neural networks which effectively combines deep neural networks (DNNs) with interpolation approximation of fractional derivatives. Specifically, once the fractional differential operator is discretized, DNNs are employed as a bridge to integrate interpolation approximation techniques with differential equations. On the basis of this integration, we construct a neural-based iteration scheme. Subsequently, by training DNNs to learn this temporal iteration scheme, approximate solutions to the differential equations can be obtained. The proposed method aims to preserve the intrinsic physical information within the equations as far as possible. It fully utilizes the powerful fitting capability of neural networks while maintaining the efficiency of the difference schemes for fractional differential equations. Moreover, we validate the efficiency and accuracy of PMNN through several numerical experiments.
Published: 2023

9. Recognizable Information Bottleneck

Author: Lyu, Yilin, Liu, Xin, Song, Mingyang, Wang, Xinyue, Peng, Yaxin, Zeng, Tieyong, and Jing, Liping
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Information Bottlenecks (IBs) learn representations that generalize to unseen data by information compression. However, existing IBs are practically unable to guarantee generalization in real-world scenarios due to the vacuous generalization bound. The recent PAC-Bayes IB uses information complexity instead of information compression to establish a connection with the mutual information generalization bound. However, it requires the computation of expensive second-order curvature, which hinders its practical application. In this paper, we establish the connection between the recognizability of representations and the recent functional conditional mutual information (f-CMI) generalization bound, which is significantly easier to estimate. On this basis we propose a Recognizable Information Bottleneck (RIB) which regularizes the recognizability of representations through a recognizability critic optimized by density ratio matching under the Bregman divergence. Extensive experiments on several commonly used datasets demonstrate the effectiveness of the proposed method in regularizing the model and estimating the generalization gap., Comment: 12 pages. To appear in IJCAI 2023
Published: 2023

10. CP$^3$: Channel Pruning Plug-in for Point-based Networks

Author: Huang, Yaomin, Liu, Ning, Che, Zhengping, Xu, Zhiyuan, Shen, Chaomin, Peng, Yaxin, Zhang, Guixu, Liu, Xinmei, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Channel pruning can effectively reduce both the computational cost and memory footprint of the original network while keeping comparable accuracy. Though great success has been achieved in channel pruning for 2D image-based convolutional networks (CNNs), existing works seldom extend the channel pruning methods to 3D point-based neural networks (PNNs). Directly applying the 2D CNN channel pruning methods to PNNs undermines the performance of PNNs because of the different representations of 2D images and 3D point clouds as well as the network architecture disparity. In this paper, we propose CP$^3$, which is a Channel Pruning Plug-in for Point-based networks. CP$^3$ is elaborately designed to leverage the characteristics of point clouds and PNNs in order to enable 2D channel pruning methods for PNNs. Specifically, it presents a coordinate-enhanced channel importance metric to reflect the correlation between dimensional information and individual channel features, and it recycles the discarded points in the PNN's sampling process and reconsiders their potentially-exclusive information to enhance the robustness of channel pruning. Experiments on various PNN architectures show that CP$^3$ consistently improves state-of-the-art 2D CNN pruning approaches on different point cloud tasks. For instance, our compressed PointNeXt-S on ScanObjectNN achieves an accuracy of 88.52% with a pruning rate of 57.8%, outperforming the baseline pruning methods with an accuracy gain of 1.94%., Comment: Yaomin Huang and Ning Liu are with equal contributions. This paper has been accepted by CVPR 2023
Published: 2023

11. Label-Guided Auxiliary Training Improves 3D Object Detector

Author: Huang, Yaomin, Liu, Xinmei, Zhu, Yichen, Xu, Zhiyuan, Shen, Chaomin, Che, Zhengping, Zhang, Guixu, Peng, Yaxin, Feng, Feifei, and Tang, Jian
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Detecting 3D objects from point clouds is a practical yet challenging task that has attracted increasing attention recently. In this paper, we propose a Label-Guided auxiliary training method for 3D object detection (LG3D), which serves as an auxiliary network to enhance the feature learning of existing 3D object detectors. Specifically, we propose two novel modules: a Label-Annotation-Inducer that maps annotations and point clouds in bounding boxes to task-specific representations and a Label-Knowledge-Mapper that assists the original features to obtain detection-critical representations. The proposed auxiliary network is discarded in inference and thus has no extra computational cost at test time. We conduct extensive experiments on both indoor and outdoor datasets to verify the effectiveness of our approach. For example, our proposed LG3D improves VoteNet by 2.5% and 3.1% mAP on the SUN RGB-D and ScanNetV2 datasets, respectively., Comment: Yaomin Huang and Xinmei Liu are with equal contribution. This paper has been accepted by ECCV 2022
Published: 2022

12. Hybrid Atlas Building with Deep Registration Priors

Author: Wu, Nian, Wang, Jian, Zhang, Miaomiao, Zhang, Guixu, Peng, Yaxin, and Shen, Chaomin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Registration-based atlas building often poses computational challenges in high-dimensional image spaces. In this paper, we introduce a novel hybrid atlas building algorithm that quickly estimates an atlas from large-scale image datasets at much reduced computational cost. In contrast to previous approaches that iteratively perform registration tasks between an estimated atlas and individual images, we propose to use learned priors of registration from pre-trained neural networks. This newly developed hybrid framework offers two advantages: (i) providing an efficient way of atlas building without losing the quality of results, and (ii) offering flexibility in utilizing a wide variety of deep learning based registration methods. We demonstrate the effectiveness of this proposed model on 3D brain magnetic resonance imaging (MRI) scans.
Published: 2021

13. A tripartite rheostat controls self-regulated host plant resistance to insects

Author: Guo, Jianping, Wang, Huiying, Guan, Wei, Guo, Qin, Wang, Jing, Yang, Jing, Peng, Yaxin, Shan, Junhan, Gao, Mingyang, Shi, Shaojie, Shangguan, Xinxin, Liu, Bingfang, Jing, Shengli, Zhang, Jing, Xu, Chunxue, Huang, Jin, Rao, Weiwei, Zheng, Xiaohong, Wu, Di, Zhou, Cong, Du, Bo, Chen, Rongzhi, Zhu, Lili, Zhu, Yuxian, Walling, Linda L., Zhang, Qifa, and He, Guangcun
Published: 2023
Full Text: View/download PDF

14. A memory pool variational autoencoder framework for cross-domain recommendation

Author: Yang, Jie, Zhu, Jianxiang, Ding, Xiaofeng, Peng, Yaxin, and Zhang, Yangchun
Published: 2024
Full Text: View/download PDF

15. Defending Against Adversarial Attacks by Suppressing the Largest Eigenvalue of Fisher Information Matrix

Author: Shen, Chaomin, Peng, Yaxin, Zhang, Guixu, and Fan, Jinsong
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We propose a scheme for defending against adversarial attacks by suppressing the largest eigenvalue of the Fisher information matrix (FIM). Our starting point is one explanation of the rationale of adversarial examples. Based on the idea that the difference between a benign sample and its adversarial example is measured by the Euclidean norm, while the difference between their classification probability densities at the last (softmax) layer of the network could be measured by the Kullback-Leibler (KL) divergence, the explanation shows that the output difference is a quadratic form of the input difference. If the eigenvalue of this quadratic form (a.k.a. FIM) is large, the output difference becomes large even when the input difference is small, which explains the adversarial phenomenon. This makes adversarial defense possible by controlling the eigenvalues of the FIM. Our solution is adding one term representing the trace of the FIM to the loss function of the original network, as the largest eigenvalue is bounded by the trace. Our defensive scheme is verified by experiments using a variety of common attacking methods on typical deep neural networks, e.g. LeNet, VGG and ResNet, with datasets MNIST, CIFAR-10, and German Traffic Sign Recognition Benchmark (GTSRB). Our new network, after adopting the novel loss function and retraining, has an effective and robust defensive capability, as it decreases the fooling ratio of the generated adversarial examples, and retains the classification accuracy of the original network., Comment: 11 pages, 5 figures
Published: 2019

16. Design and preparation of resin matrix composite coating with good ablation resistance performance under high-energy laser irradiation

Author: Xue, Xinyi, Peng, Yaxin, Huang, Jiang, Li, Lixin, Ni, Yushan, Ma, Zhuang, Gao, Lihong, Chen, Wenhua, Chen, Guohua, and Ma, Chen
Published: 2023
Full Text: View/download PDF

17. The Adversarial Attack and Detection under the Fisher Information Metric

Author: Zhao, Chenxiao, Fletcher, P. Thomas, Yu, Mixue, Peng, Yaxin, Zhang, Guixu, and Shen, Chaomin
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Many deep learning models are vulnerable to the adversarial attack, i.e., imperceptible but intentionally-designed perturbations to the input can cause incorrect output of the networks. In this paper, using information geometry, we provide a reasonable explanation for the vulnerability of deep learning models. By considering the data space as a non-linear space with the Fisher information metric induced from a neural network, we first propose an adversarial attack algorithm termed one-step spectral attack (OSSA). The method is described by a constrained quadratic form of the Fisher information matrix, where the optimal adversarial perturbation is given by the first eigenvector, and the model vulnerability is reflected by the eigenvalues. The larger an eigenvalue is, the more vulnerable the model is to be attacked by the corresponding eigenvector. Taking advantage of the property, we also propose an adversarial detection method with the eigenvalues serving as characteristics. Both our attack and detection algorithms are numerically optimized to work efficiently on large datasets. Our evaluations show superior performance compared with other methods, implying that the Fisher information is a promising approach to investigate the adversarial attacks and defenses., Comment: Accepted as an AAAI-2019 oral paper
Published: 2018

18. FHL3 promotes the formation of fast glycolytic muscle fibers by interacting with YY1 and muscle glycolytic metabolism

Author: Bai, Wei, Zhang, Yunxia, Ma, Jun, Du, Mengmeng, Xu, Haiyang, Wang, Jian, Zhang, Lu, Li, Wentao, Hou, Yunqing, Liu, Xiaomeng, Zhang, Xinyue, Peng, Yaxin, Li, Jianan, Zhan, Xizhen, Jiang, Wei, Liu, Shengsi, Liu, Xiao, Li, Qinying, Miao, Yang, Sui, Mengru, Yang, Yuhan, Zhang, Shenghao, Xu, Zaiyan, and Zuo, Bo
Published: 2023
Full Text: View/download PDF

19. Lipidomic analyses reveal enhanced lipolysis in planthoppers feeding on resistant host plants

Author: Zheng, Xiaohong, Xin, Yeyun, Peng, Yaxin, Shan, Junhan, Zhang, Ning, Wu, Di, Guo, Jianping, Huang, Jin, Guan, Wei, Shi, Shaojie, Zhou, Cong, Chen, Rongzhi, Du, Bo, Zhu, Lili, Yang, Fang, Fu, Xiqin, Yuan, Longping, and He, Guangcun
Published: 2021
Full Text: View/download PDF

20. Visual Robotic Manipulation with Depth-Aware Pretraining

Author: Wang, Wanying, Li, Jinming, Zhu, Yichen, Xu, Zhiyuan, Che, Zhengping, Peng, Yaxin, Shen, Chaomin, Liu, Dong, Feng, Feifei, and Tang, Jian
Abstract: Recent work on visual representation learning has been shown to be efficient for robotic manipulation tasks. However, most existing works pretrained the visual backbone solely on 2D images or egocentric videos, ignoring the fact that robots learn to act in 3D space, which is hard to learn from 2D observation. In this paper, we examine the effectiveness of pretraining the vision backbone with publicly available large-scale 3D data to improve manipulation policy learning. Our method, namely Depth-aware Pretraining for Robotics (DPR), enables an RGB-only backbone to learn 3D scene representations from self-supervised contrastive learning, where depth information serves as auxiliary knowledge. No 3D information is necessary during manipulation policy learning and inference, making our model enjoy both efficiency and effectiveness in 3D space manipulation. Furthermore, we introduce a new way to inject robots' proprioception into the policy networks that makes the manipulation model robust and generalizable. We demonstrate in experiments that our proposed framework improves performance on unseen objects and visual environments for various robotics tasks on both simulated and real robots., Comment: submitted to ICRA2024
Published: 2024

21. CAB U-Net: An end-to-end category attention boosting algorithm for segmentation

Author: Ding, Xiaofeng, Peng, Yaxin, Shen, Chaomin, and Zeng, Tieyong
Published: 2020
Full Text: View/download PDF

22. A memory pool variational autoencoder framework for cross-domain recommendation

Author: Yang, Jie, Zhu, Jianxiang, Ding, Xiaofeng, Peng, Yaxin, and Zhang, Yangchun
Published: 2023
Full Text: View/download PDF

23. APFN: Adaptive Perspective-Based Fusion Network for 3-D Place Recognition

Author: Zhu, Jianxiang, Yang, Keni, Zhang, Yangchun, Peng, Yan, and Peng, Yaxin
Abstract: Place recognition holds a pivotal influence in the field of computer vision. Feature pyramid, an advanced architecture introduced into place recognition, aims to produce features with richer semantic content. However, the existing methods ignore the efficient utilization of low-level features. To tackle this issue, we propose a novel place recognition architecture called the adaptive perspective-based fusion network (APFN). The main benefits of APFN lie in three aspects: 1) it adaptively optimizes the appropriate perspective and assigns the appropriate perspective-based weights dynamically for the multiscale low-level feature maps by a newly designed adaptive perspective-based attention (APA) module; 2) it effectively enhances the extracted low-level features and significantly shortens the transmission distance of low-level information; and 3) it enhances global information extraction via supervising the generation of high-level features by regularization. Extensive experiments on several public datasets validate the effectiveness of our method. APFN outperforms previous baseline methods by 1.6% points in average recall at top-1% (AR@1%) and 1.2% points in average recall at top-1 (AR@1) metrics.
Published: 2024
Full Text: View/download PDF

24. Long noncoding RNA lncMREF promotes myogenic differentiation and muscle regeneration by interacting with the Smarca5/p300 complex

Author: Lv, Wei, Jiang, Wei, Luo, Hongmei, Tong, Qian, Niu, Xiaoyu, Liu, Xiao, Miao, Yang, Wang, Jingnan, Guo, Yiwen, Li, Jianan, Zhan, Xizhen, Hou, Yunqing, Peng, Yaxin, Wang, Jian, Zhao, Shuhong, Xu, Zaiyan, and Zuo, Bo
Published: 2022
Full Text: View/download PDF

25. Scale robust point matching‐Net: End‐to‐end scale point matching using Lie group

Author: Wang, Xin, Ding, Hui, Zhao, Guangwei, Peng, Yaxin, and Shen, Chaomin
Published: 2022
Full Text: View/download PDF

26. Conservative analysis of Synaptopodin‐2 intron sense‐overlapping lncRNA reveals its novel function in promoting muscle atrophy

Author: Jin, Jianjun, Du, Mengmeng, Wang, Jian, Guo, Yubo, Zhang, Jiali, Zuo, Hao, Hou, Yunqing, Wang, Shanshan, Lv, Wei, Bai, Wei, Wang, Jin, Zhan, Xizhen, Peng, Yaxin, Tong, Qian, Chai, Jin, Xu, Zaiyan, and Zuo, Bo
Published: 2022
Full Text: View/download PDF

27. SAB Net: A Semantic Attention Boosting Framework for Semantic Segmentation

Author: Ding, Xiaofeng, Shen, Chaomin, Zeng, Tieyong, and Peng, Yaxin
Published: 2022
Full Text: View/download PDF

28. Single Nucleotide Polymorphisms of Porcine lncMGPF Regulate Meat Production Traits by Affecting RNA Stability

Author: Lv, Wei, Zhao, Shiyu, Hou, Yunqing, Tong, Qian, Peng, Yaxin, Li, Jianan, Xu, Zaiyan, and Zuo, Bo
Published: 2021
Full Text: View/download PDF

29. Improved unsupervised domain adaptation network based on category attention

Author: Fan, Longhao, Gao, Shouwei, Zhu, Fan, Zhu, Zhenzhong, Zhou, Chaozheng, and Peng, Yaxin
Published: 2021
Full Text: View/download PDF

30. A modified Chan–Vese model and its theoretical proof

Author: Pi, Ling, Peng, Yaxin, Shen, Chunli, and Li, Fang
Published: 2009
Full Text: View/download PDF

31. APPLICATION OF PAINTING ART EXPRESSION IN COLLEGE STUDENTS' MENTAL AND MENTAL HEALTH EDUCATION

Author: Peng, Yaxin
Published: 2021

32. WADE-Net: Weighted Aggregation with Density Estimation for Point Cloud Place Recognition

Author: Liu, Ke, Wang, Xing, Peng, Yaxin, Ye, Zhen, and Zhou, Chaozheng
Published: 2021
Full Text: View/download PDF

33. Effects of KPNA7 gene polymorphisms on reproductive traits in France Large White pigs

Author: Zhang, Yunxia, Li, Mingyang, Liu, Jun, Peng, Yaxin, Zuo, Bo, and Xu, Zaiyan
Published: 2021
Full Text: View/download PDF

34. Semantic Constraint Based Unsupervised Domain Adaptation for Cardiac Segmentation

Author: Wang, Xin, primary, Zhu, Fan, additional, Peng, Yaxin, additional, Shen, Chaomin, additional, Ye, Zhen, additional, and Zhou, Chaozheng, additional
Published: 2021
Full Text: View/download PDF

35. Lipidomic analyses reveal enhanced lipolysis in planthoppers feeding on resistant host plants

Author: Zheng, Xiaohong, primary, Xin, Yeyun, additional, Peng, Yaxin, additional, Shan, Junhan, additional, Zhang, Ning, additional, Wu, Di, additional, Guo, Jianping, additional, Huang, Jin, additional, Guan, Wei, additional, Shi, Shaojie, additional, Zhou, Cong, additional, Chen, Rongzhi, additional, Du, Bo, additional, Zhu, Lili, additional, Yang, Fang, additional, Fu, Xiqin, additional, Yuan, Longping, additional, and He, Guangcun, additional
Published: 2020
Full Text: View/download PDF

36. Salivary Protein 1 of Brown Planthopper Is Required for Survival and Induces Immunity Response in Plants

Author: Huang, Jin, primary, Zhang, Ning, additional, Shan, Junhan, additional, Peng, Yaxin, additional, Guo, Jianping, additional, Zhou, Cong, additional, Shi, Shaojie, additional, Zheng, Xiaohong, additional, Wu, Di, additional, Guan, Wei, additional, Yang, Ke, additional, Du, Bo, additional, Zhu, Lili, additional, Yuan, Longping, additional, He, Guangcun, additional, and Chen, Rongzhi, additional
Published: 2020
Full Text: View/download PDF

37. Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor

Author: Xie, Shaorong, primary, Pan, Chao, additional, Peng, Yaxin, additional, Liu, Ke, additional, and Ying, Shihui, additional
Published: 2020
Full Text: View/download PDF

38. Consensus in Fractional-Order Multi-Agent Systems With Intermittence Sampled Data Over Directed Networks

Author: Ye, Yanyan, primary, Su, Housheng, additional, Chen, Jinhe, additional, and Peng, Yaxin, additional
Published: 2020
Full Text: View/download PDF

39. Robust Dual-Color Watermarking Based on Quaternion Singular Value Decomposition

Author: Chen, Yong, primary, Jia, Zhigang, additional, Peng, Yan, additional, and Peng, Yaxin, additional
Published: 2020
Full Text: View/download PDF

40. A Coarse-to-Fine Generalized-ICP Algorithm With Trimmed Strategy

Author: Wang, Xin, primary, Li, Yun, additional, Peng, Yaxin, additional, and Ying, Shihui, additional
Published: 2020
Full Text: View/download PDF

41. A Local-to-Global Metric Learning Framework From the Geometric Insight

Author: Peng, Yaxin, primary, Zhang, Nijing, additional, Li, Ying, additional, and Ying, Shihui, additional
Published: 2020
Full Text: View/download PDF

42. Conservative analysis of Synaptopodin‐2 intron sense‐overlapping lncRNA reveals its novel function in promoting muscle atrophy

Author: Jin, Jianjun, Du, Mengmeng, Wang, Jian, Guo, Yubo, Zhang, Jiali, Zuo, Hao, Hou, Yunqing, Wang, Shanshan, Lv, Wei, Bai, Wei, Wang, Jin, Zhan, Xizhen, Peng, Yaxin, Tong, Qian, Chai, Jin, Xu, Zaiyan, and Zuo, Bo
Abstract: Dissection of the regulatory pathways that control skeletal muscle development and atrophy is important for the treatment of muscle wasting. Long noncoding RNAs (lncRNAs) play important roles in various stages of muscle development. We previously reported that Synaptopodin‐2 (SYNPO2) intron sense‐overlapping lncRNA (SYISL) regulates myogenesis through an interaction with enhancer of zeste homologue 2 (EZH2). However, it remains unclear whether SYISL homologues exist in humans and pigs, and whether the functions and mechanisms of these homologues are conserved among species. Bioinformatics, cell fractionation, and quantitative real‐time polymerase chain reaction (qRT‐PCR) analyses were used for the identification and molecular characterization of SYISL homologues in humans and pigs. Effects on myogenesis and muscle atrophy were determined via loss‐of‐function or gain‐of‐function experiments using C2C12 myoblasts, myogenic progenitor cells, dexamethasone (DEX), and aging‐induced muscle atrophy models. RNA pulldown, RNA immunoprecipitation, dual luciferase reporting, and co‐transfection experiments were used to explore the mechanisms of SYISL interactions with proteins and miRNAs. We identified SYISL homologues in humans (designated hSYISL) and pigs (designated pSYISL). Functional experiments demonstrated that hSYISL and pSYISL regulate myogenesis through interactions with EZH2. Interestingly, we showed that SYISL functions to regulate muscle atrophy and sarcopenia through comparative analysis. SYISL is significantly up‐regulated after muscle atrophy (P < 0.01); it significantly promotes muscle atrophy in DEX‐induced muscle atrophy models (P < 0.01). SYISL knockdown or knockout alleviates muscle atrophy and sarcopenia in DEX‐induced and aged mice. The tibialis anterior (TA) muscle weight of 3‐month‐old wild‐type (WT) mice decreased by 33.24% after DEX treatment (P < 0.001), while the muscle weight loss of 3‐month‐old SYISL knockout mice was only 18.20% after DEX treatment (P < 0.001). SYISL knockout in 18‐month‐old WT mice significantly increased the weights of quadriceps (Qu), gastrocnemius (Gas), and TA muscles by 10.45% (P < 0.05), 13.95% (P < 0.01), and 24.82% (P < 0.05), respectively. Mechanistically, SYISL increases the expression levels of the muscle atrophy genes forkhead box protein O3a (FoxO3a), muscle ring finger 1 (MuRF1), and muscle atrophy‐related F‐box (Atrogin‐1) via sponging of miR‐23a‐3p/miR‐103‐3p/miR‐205‐5p and thus promotes muscle atrophy. Additionally, we verified that human SYISL overexpression in muscles of 18‐month‐old WT mice significantly decreased the weights of Gas, Qu, and TA muscles by 7.76% (P < 0.01), 12.26% (P < 0.05), and 13.44% (P < 0.01), respectively, and accelerates muscle atrophy through conserved mechanisms. Our results identify SYISL as a conserved lncRNA that modulates myogenesis in mice, pigs, and humans. We also demonstrated its previously unknown ability to promote muscle atrophy.
Published: 2022
Full Text: View/download PDF

43. The Adversarial Attack and Detection under the Fisher Information Metric

Author: Zhao, Chenxiao, primary, Fletcher, P. Thomas, additional, Yu, Mixue, additional, Peng, Yaxin, additional, Zhang, Guixu, additional, and Shen, Chaomin, additional
Published: 2019
Full Text: View/download PDF

44. Intrinsic Metric Learning With Subspace Representation

Author: Cai, Lipeng, primary, Ying, Shihui, additional, Peng, Yaxin, additional, He, Changzhou, additional, and Du, Shaoyi, additional
Published: 2019
Full Text: View/download PDF

45. Effects of KPNA7 gene polymorphisms on reproductive traits in France Large White pigs

Author: Zhang, Yunxia, Li, Mingyang, Liu, Jun, Peng, Yaxin, Zuo, Bo, and Xu, Zaiyan
Abstract: KPNA7 is an essential nuclear protein for early embryogenesis and normal fertility. The aim of our study was to determine the associations between single nucleotide polymorphisms (SNPs) in the KPNA7 gene and the reproductive traits in France Large White pigs. Six new SNPs were identified by sequencing. Sows with the CT genotype of rs81308652 had a significantly higher total number born than sows with the CC and TT genotypes in first parity. Litters from sows with the TT genotype of rs81308652 had a significantly lower number of weak births than litters from sows with the CC and CT genotypes in multi parity. Individuals with the GG genotype of rs327848277 showed higher litter weight at birth (LWB) and number of healthy births than individuals with genotypes GT and TT in first parity, and individuals with genotype GG had lower LWB compared to other genotypes in multi parity. However, we obtained no statistically significant results for association between the SNPs in the other four loci and reproductive traits in either primiparous or multiparous pigs. In conclusion, our results show that there are significant associations of the identified SNPs located in the KPNA7 gene with pig reproductive traits, and provide a theoretical basis for genetic improvement of pig reproductive traits.
Published: 2021
Full Text: View/download PDF

46. Nonlinear image registration with bidirectional metric and reciprocal regularization

Author: Ying, Shihui, primary, Li, Dan, additional, Xiao, Bin, additional, Peng, Yaxin, additional, Du, Shaoyi, additional, and Xu, Meifeng, additional
Published: 2017
Full Text: View/download PDF

47. Manifold Preserving: An Intrinsic Approach for Semisupervised Distance Metric Learning

Author: Ying, Shihui, primary, Wen, Zhijie, additional, Shi, Jun, additional, Peng, Yaxin, additional, Peng, Jigen, additional, and Qiao, Hong, additional
Published: 2017
Full Text: View/download PDF

48. Nonlinear Semi-Supervised Metric Learning Via Multiple Kernels and Local Topology.

Author: Li, Xin, Bai, Yanqin, Peng, Yaxin, Du, Shaoyi, and Ying, Shihui
Subjects: LAPLACIAN matrices, DATA analysis, MACHINE learning, CALCULUS of tensors, METRIC spaces
Abstract: Changing the metric on the data may change the data distribution, hence a good distance metric can improve the performance of a learning algorithm. In this paper, we address the semi-supervised distance metric learning (ML) problem to obtain the best nonlinear metric for the data. First, we describe the nonlinear metric by the multiple kernel representation. By this approach, we project the data into a high dimensional space, where the data can be well represented by linear ML. Then, we reformulate the linear ML as a minimization problem on the positive definite matrix group. Finally, we develop a two-step algorithm for solving this model and design an intrinsic steepest descent algorithm to learn the positive definite metric matrix. Experimental results validate that our proposed method is effective and outperforms several state-of-the-art ML methods. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

49. Manifold Preserving: An Intrinsic Approach for Semisupervised Distance Metric Learning.

Author: Ying, Shihui, Wen, Zhijie, Shi, Jun, Peng, Yaxin, Peng, Jigen, and Qiao, Hong
Subjects: MANIFOLDS (Mathematics), METRIC system, MACHINE learning
Abstract: In this paper, we address the semisupervised distance metric learning problem and its applications in classification and image retrieval. First, we formulate a semisupervised distance metric learning model by considering the metric information of inner classes and interclasses. In this model, an adaptive parameter is designed to balance the inner metrics and intermetrics by using data structure. Second, we convert the model to a minimization problem whose variable is a symmetric positive-definite matrix. Third, in implementation, we deduce an intrinsic steepest descent method, which assures that the metric matrix is strictly symmetric positive-definite at each iteration, with the manifold structure of the symmetric positive-definite matrix manifold. Finally, we test the proposed algorithm on conventional data sets, and compare it with four other representative methods. The numerical results validate that the proposed method significantly improves the classification with the same computational efficiency. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

50. Soft shape registration under Lie group frame

Author: Peng, Yaxin, primary, Lin, Wei, additional, Ying, Shihui, additional, and Peng, Jigen, additional
Published: 2013
Full Text: View/download PDF
