Author: "Cong, Peishan" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Cong, Peishan"' showing total 20 results

1. LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

Author: Cong, Peishan, Wang, Ziyi, Dou, Zhiyang, Ren, Yiming, Yin, Wei, Cheng, Kai, Sun, Yujing, Long, Xiaoxiao, Zhu, Xinge, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Language-guided scene-aware human motion generation has great significance for entertainment and robotics. In response to the limitations of existing datasets, we introduce LaserHuman, a pioneering dataset engineered to revolutionize Scene-Text-to-Motion research. LaserHuman stands out with its inclusion of genuine human motions within 3D environments, unbounded free-form natural language descriptions, a blend of indoor and outdoor scenarios, and dynamic, ever-changing scenes. Diverse modalities of capture data and rich annotations present great opportunities for the research of conditional motion generation, and can also facilitate the development of real-life applications. Moreover, to generate semantically consistent and physically plausible human motions, we propose a multi-conditional diffusion model, which is simple but effective, achieving state-of-the-art performance on existing datasets.
Published: 2024

2. Human-centric Scene Understanding for 3D Large-scale Scenarios

Author: Xu, Yiteng, Cong, Peishan, Yao, Yichen, Chen, Runnan, Hou, Yuenan, Zhu, Xinge, He, Xuming, Yu, Jingyi, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Human-centric scene understanding is significant for real-world applications, but it is extremely challenging due to the existence of diverse human poses and actions, complex human-environment interactions, severe occlusions in crowds, etc. In this paper, we present a large-scale multi-modal dataset for human-centric scene understanding, dubbed HuCenLife, which is collected in diverse daily-life scenarios with rich and fine-grained annotations. Our HuCenLife can benefit many 3D perception tasks, such as segmentation, detection, action recognition, etc., and we also provide benchmarks for these tasks to facilitate related research. In addition, we design novel modules for LiDAR-based segmentation and action recognition, which are more applicable for large-scale human-centric scenarios and achieve state-of-the-art performance.
Published: 2023

3. WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language

Author: Lin, Zhenxiang, Peng, Xidong, Cong, Peishan, Zheng, Ge, Sun, Yujin, Hou, Yuenan, Zhu, Xinge, Yang, Sibei, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We introduce the task of 3D visual grounding in large-scale dynamic scenes based on natural linguistic descriptions and online captured multi-modal visual data, including 2D images and 3D LiDAR point clouds. We present a novel method, dubbed WildRefer, for this task by fully utilizing the rich appearance information in images, the position and geometric clues in point clouds, as well as the semantic knowledge of language descriptions. Besides, we propose two novel datasets, i.e., STRefer and LifeRefer, which focus on large-scale human-centric daily-life scenarios accompanied with abundant 3D object and natural language annotations. Our datasets are significant for the research of 3D visual grounding in the wild and have huge potential to boost the development of autonomous driving and service robots. Extensive experiments and ablation studies demonstrate that our method achieves state-of-the-art performance on the proposed benchmarks. The code is provided at https://github.com/4DVLab/WildRefer.
Published: 2023

4. Weakly Supervised 3D Multi-person Pose Estimation for Large-scale Scenes based on Monocular Camera and Single LiDAR

Author: Cong, Peishan, Xu, Yiteng, Ren, Yiming, Zhang, Juze, Xu, Lan, Wang, Jingya, Yu, Jingyi, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Depth estimation is usually ill-posed and ambiguous for monocular camera-based 3D multi-person pose estimation. Since LiDAR can capture accurate depth information in long-range scenes, it can benefit both the global localization of individuals and the 3D pose estimation by providing rich geometry features. Motivated by this, we propose a monocular camera and single LiDAR-based method for 3D multi-person pose estimation in large-scale scenes, which is easy to deploy and insensitive to light. Specifically, we design an effective fusion strategy to take advantage of multi-modal input data, including images and point cloud, and make full use of temporal information to guide the network to learn natural and coherent human motions. Without relying on any 3D pose annotations, our method exploits the inherent geometry constraints of point cloud for self-supervision and utilizes 2D keypoints on images for weak supervision. Extensive experiments on public datasets and our newly collected dataset demonstrate the superiority and generalization capability of our proposed method.
Comment: Accepted by AAAI 2023
Published: 2022

5. Gait Recognition in Large-scale Free Environment via Single LiDAR

Author: Han, Xiao, Ren, Yiming, Cong, Peishan, Sun, Yujing, Wang, Jingya, Xu, Lan, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Human gait recognition is crucial in multimedia, enabling identification through walking patterns without direct interaction, enhancing the integration across various media forms in real-world applications like smart homes, healthcare and non-intrusive security. LiDAR's ability to capture depth makes it pivotal for robotic perception and holds promise for real-world gait recognition. In this paper, based on a single LiDAR, we present the Hierarchical Multi-representation Feature Interaction Network (HMRNet) for robust gait recognition. Prevailing LiDAR-based gait datasets primarily derive from controlled settings with predefined trajectories, leaving a gap with real-world scenarios. To facilitate LiDAR-based gait recognition research, we introduce FreeGait, a comprehensive gait dataset from large-scale, unconstrained settings, enriched with multi-modal and varied 2D/3D data. Notably, our approach achieves state-of-the-art performance on a prior dataset (SUSTech1K) and on FreeGait.
Comment: Accepted by ACM MM Oral 2024
Published: 2022

6. LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors

Author: Ren, Yiming, Zhao, Chengfeng, He, Yannan, Cong, Peishan, Liang, Han, Yu, Jingyi, Xu, Lan, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a multi-sensor fusion method for capturing challenging 3D human motions with accurate consecutive local poses and global trajectories in large-scale scenarios, only using single LiDAR and 4 IMUs, which are set up conveniently and worn lightly. Specifically, to fully utilize the global geometry information captured by LiDAR and local dynamic motions captured by IMUs, we design a two-stage pose estimator in a coarse-to-fine manner, where point clouds provide the coarse body shape and IMU measurements optimize the local actions. Furthermore, considering the translation deviation caused by the view-dependent partial point cloud, we propose a pose-guided translation corrector. It predicts the offset between captured points and the real root locations, which makes the consecutive movements and trajectories more precise and natural. Moreover, we collect a LiDAR-IMU multi-modal mocap dataset, LIPD, with diverse human actions in long-range scenarios. Extensive quantitative and qualitative experiments on LIPD and other open datasets all demonstrate the capability of our approach for compelling motion capture in large-scale scenarios, which outperforms other methods by an obvious margin. We will release our code and captured dataset to stimulate future research.
Published: 2022

7. STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes

Author: Cong, Peishan, Zhu, Xinge, Qiao, Feng, Ren, Yiming, Peng, Xidong, Hou, Yuenan, Xu, Lan, Yang, Ruigang, Manocha, Dinesh, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Accurately detecting and tracking pedestrians in 3D space is challenging due to large variations in rotations, poses and scales. The situation becomes even worse for dense crowds with severe occlusions. However, existing benchmarks either only provide 2D annotations, or have limited 3D annotations with low-density pedestrian distribution, making it difficult to build a reliable pedestrian perception system especially in crowded scenes. To better evaluate pedestrian perception algorithms in crowded scenarios, we introduce a large-scale multimodal dataset, STCrowd. Specifically, in STCrowd, there are a total of 219K pedestrian instances and 20 persons per frame on average, with various levels of occlusion. We provide synchronized LiDAR point clouds and camera images as well as their corresponding 3D labels and joint IDs. STCrowd can be used for various tasks, including LiDAR-only, image-only, and sensor-fusion based pedestrian detection and tracking. We provide baselines for most of the tasks. In addition, considering the property of sparse global distribution and density-varying local distribution of pedestrians, we further propose a novel method, Density-aware Hierarchical heatmap Aggregation (DHA), to enhance pedestrian perception in crowded scenes. Extensive experiments show that our new method achieves state-of-the-art performance for pedestrian detection on various datasets.
Comment: Accepted at CVPR 2022
Published: 2022

8. Self-supervised Point Cloud Completion on Real Traffic Scenes via Scene-concerned Bottom-up Mechanism

Author: Ren, Yiming, Cong, Peishan, Zhu, Xinge, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Real scans always miss partial geometries of objects due to self-occlusions, external occlusions, and limited sensor resolutions. Point cloud completion aims to infer the complete shapes for incomplete 3D scans of objects. Current deep learning-based approaches rely on large-scale complete shapes in the training process, which are usually obtained from synthetic datasets. This is not applicable to real-world scans due to the domain gap. In this paper, we propose a self-supervised point cloud completion method (TraPCC) for vehicles in real traffic scenes without any complete data. Based on the symmetry and similarity of vehicles, we make use of consecutive point cloud frames to construct a vehicle memory bank as reference. We design a bottom-up mechanism to focus on both local geometry details and global shape features of inputs. In addition, we design a scene-graph in the network to pay attention to the missing parts with the aid of neighboring vehicles. Experiments show that TraPCC achieves good performance for real-scan completion on KITTI and nuScenes traffic datasets even without any complete data in training. We also show a downstream application of 3D detection, which benefits from our completion approach.
Published: 2022

9. Input-Output Balanced Framework for Long-tailed LiDAR Semantic Segmentation

Author: Cong, Peishan, Zhu, Xinge, and Ma, Yuexin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: A thorough and holistic scene understanding is crucial for autonomous vehicles, where LiDAR semantic segmentation plays an indispensable role. However, most existing methods focus on the network design while neglecting the inherent difficulty, imbalanced data distribution in the realistic dataset (also named long-tailed distribution), which narrows down the capability of state-of-the-art methods. In this paper, we propose an input-output balanced framework to handle the issue of long-tailed distribution. Specifically, for the input space, we synthesize these tailed instances from mesh models and well simulate the position and density distribution of LiDAR scan, which enhances the input data balance and improves the data diversity. For the output space, a multi-head block is proposed to group different categories based on their shapes and instance amounts, which alleviates the biased representation of dominating category during the feature learning. We evaluate the proposed model on two large-scale datasets, SemanticKITTI and nuScenes, where state-of-the-art results demonstrate its effectiveness. The proposed new modules can also be used as a plug-and-play, and we apply them on various backbones and datasets, showing its good generalization ability.
Comment: Accepted by ICME 2021
Published: 2021

10. Risk factors for pelvic lymph node metastasis in endometrial cancer

Author: Li, Yujie, Cong, Peishan, Wang, Pan, Peng, Chong, Liu, Mingjun, and Sun, Guirong
Published: 2019

11. Weakly Supervised 3D Multi-Person Pose Estimation for Large-Scale Scenes Based on Monocular Camera and Single LiDAR

Author: Cong, Peishan, primary, Xu, Yiteng, additional, Ren, Yiming, additional, Zhang, Juze, additional, Xu, Lan, additional, Wang, Jingya, additional, Yu, Jingyi, additional, and Ma, Yuexin, additional
Published: 2023

12. LiDAR-aid Inertial Poser: Large-scale Human Motion Capture by Sparse Inertial and LiDAR Sensors

Author: Ren, Yiming, primary, Zhao, Chengfeng, additional, He, Yannan, additional, Cong, Peishan, additional, Liang, Han, additional, Yu, Jingyi, additional, Xu, Lan, additional, and Ma, Yuexin, additional
Published: 2023

13. LiCamGait: Gait Recognition in the Wild by Using LiDAR and Camera Multi-modal Visual Sensors

Author: Han, Xiao, Cong, Peishan, Xu, Lan, Wang, Jingya, Yu, Jingyi, and Ma, Yuexin
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition
Abstract: LiDAR can capture accurate depth information in large-scale scenarios without the effect of light conditions, and the captured point cloud contains gait-related 3D geometric properties and dynamic motion characteristics. We make the first attempt to leverage LiDAR to remedy the limitation of view-dependent and light-sensitive camera for more robust and accurate gait recognition. In this paper, we propose a LiDAR-camera-based gait recognition method with an effective multi-modal feature fusion strategy, which fully exploits advantages of both point clouds and images. In particular, we propose a new in-the-wild gait dataset, LiCamGait, involving multi-modal visual data and diverse 2D/3D representations. Our method achieves state-of-the-art performance on the new dataset. Code and dataset will be released when this paper is published.
Published: 2022

14. Self-Supervised Point Cloud Completion on Real Traffic Scenes Via Scene-Concerned Bottom-Up Mechanism

Author: Ren, Yiming, primary, Cong, Peishan, additional, Zhu, Xinge, additional, and Ma, Yuexin, additional
Published: 2022

15. STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes

Author: Cong, Peishan, primary, Zhu, Xinge, additional, Qiao, Feng, additional, Ren, Yiming, additional, Peng, Xidong, additional, Hou, Yuenan, additional, Xu, Lan, additional, Yang, Ruigang, additional, Manocha, Dinesh, additional, and Ma, Yuexin, additional
Published: 2022

16. Disulfiram inhibits IL-1β secretion and inflammatory cells recruitment in Aspergillus fumigatus keratitis

Author: Yan, Haijing, primary, Yang, Hua, additional, Wang, Limei, additional, Sun, Xiaoyan, additional, Han, Lin, additional, Cong, Peishan, additional, Chen, Xiaomeng, additional, Lu, Danli, additional, and Che, Chengye, additional
Published: 2022

17. Urine Culture in Hospitalized Patients during 2014-2018: An Analysis on Pathogen Distribution and Drug Sensitivity

Author: Sun, Dongkai, primary, Cong, Peishan, additional, Guan, Fengju, additional, Liu, Shuai, additional, Sun, Lijiang, additional, and Zhang, Guiming, additional
Published: 2021

18. Input-Output Balanced Framework for Long-Tailed Lidar Semantic Segmentation

Author: Cong, Peishan, primary, Zhu, Xinge, additional, and Ma, Yuexin, additional
Published: 2021

19. Effect of Fucoidan on Gut Microbiota and its Clinical Efficacy in Helicobacter Pylori Eradication: Randomised Controlled Trial

Author: Wang, Shu, primary, Tian, Zibin, additional, Chen, Jianwei, additional, Cong, Peishan, additional, Ding, Xueli, additional, Yin, Xiaoyan, additional, Mao, Tao, additional, Sun, Zhanyi, additional, Jiang, Jinju, additional, and Yu, Yanan, additional
Published: 2021

20. The Combination of the Tumor Markers Suggests the Histological Diagnosis of Lung Cancer

Author: Liu, Linjie, primary, Teng, Jinlong, additional, Zhang, Lijun, additional, Cong, Peishan, additional, Yao, Yuan, additional, Sun, Guirong, additional, Liu, Zhijun, additional, Yu, Teng, additional, and Liu, Mingjun, additional
Published: 2017
