Author: "Liu, Zhijian" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Liu, Zhijian"' showing total 1,559 results

Start Over Author "Liu, Zhijian"

1,559 results on '"Liu, Zhijian"'

1. LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Author: Xue, Fuzhao, Chen, Yukang, Li, Dacheng, Hu, Qinghao, Zhu, Ligeng, Li, Xiuyu, Fang, Yunhao, Tang, Haotian, Yang, Shang, Liu, Zhijian, He, Ethan, Yin, Hongxu, Molchanov, Pavlo, Kautz, Jan, Fan, Linxi, Zhu, Yuke, Lu, Yao, and Han, Song
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Long-context capability is critical for multi-modal foundation models, especially for long video understanding. We introduce LongVILA, a full-stack solution for long-context visual-language models \qinghao{by co-designing the algorithm and system. For model training, we upgrade existing VLMs to support long video understanding by incorporating two additional stages, {\em i.e.}, long context extension and long video supervised fine-tuning. However, training on long video is computationally and memory intensive. We introduce the long-context Multi-Modal Sequence Parallelism (MM-SP) system that efficiently parallelizes long video training and inference, enabling 2M context length training on 256 GPUs without any gradient checkpointing. LongVILA efficiently extends the number of video frames of VILA from 8 to 2048, improving the long video captioning score from 2.00 to 3.26 (out of 5), achieving 99.8% accuracy in 6,000-frame (more than 1 million tokens) video needle-in-a-haystack. LongVILA-7B demonstrates strong accuracy on the VideoMME benchmark, i.e., 61.8% with subtitle. Besides, MM-SP is 2.1x - 5.7x faster than ring style sequence parallelism and 1.1x - 1.4x faster than Megatron with a hybrid context and tensor parallelism. Moreover, it seamlessly integrates with Hugging Face Transformers., Comment: Code and models are available at https://github.com/NVlabs/VILA/blob/main/LongVILA.md
Published: 2024

2. Sparse Refinement for Efficient High-Resolution Semantic Segmentation

Author: Liu, Zhijian, Zhang, Zhuoyang, Khaki, Samir, Yang, Shang, Tang, Haotian, Xu, Chenfeng, Keutzer, Kurt, and Han, Song
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Semantic segmentation empowers numerous real-world applications, such as autonomous driving and augmented/mixed reality. These applications often operate on high-resolution images (e.g., 8 megapixels) to capture the fine details. However, this comes at the cost of considerable computational complexity, hindering the deployment in latency-sensitive scenarios. In this paper, we introduce SparseRefine, a novel approach that enhances dense low-resolution predictions with sparse high-resolution refinements. Based on coarse low-resolution outputs, SparseRefine first uses an entropy selector to identify a sparse set of pixels with high entropy. It then employs a sparse feature extractor to efficiently generate the refinements for those pixels of interest. Finally, it leverages a gated ensembler to apply these sparse refinements to the initial coarse predictions. SparseRefine can be seamlessly integrated into any existing semantic segmentation model, regardless of CNN- or ViT-based. SparseRefine achieves significant speedup: 1.5 to 3.7 times when applied to HRNet-W48, SegFormer-B5, Mask2Former-T/L and SegNeXt-L on Cityscapes, with negligible to no loss of accuracy. Our "dense+sparse" paradigm paves the way for efficient high-resolution visual computing., Comment: ECCV 2024. The first two authors contributed equally to this work. Project page: https://sparserefine.mit.edu
Published: 2024

3. LidarDM: Generative LiDAR Simulation in a Generated World

Author: Zyrianov, Vlas, Che, Henry, Liu, Zhijian, and Wang, Shenlong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics
Abstract: We present LidarDM, a novel LiDAR generative model capable of producing realistic, layout-aware, physically plausible, and temporally coherent LiDAR videos. LidarDM stands out with two unprecedented capabilities in LiDAR generative modeling: (i) LiDAR generation guided by driving scenarios, offering significant potential for autonomous driving simulations, and (ii) 4D LiDAR point cloud generation, enabling the creation of realistic and temporally coherent sequences. At the heart of our model is a novel integrated 4D world generation framework. Specifically, we employ latent diffusion models to generate the 3D scene, combine it with dynamic actors to form the underlying 4D world, and subsequently produce realistic sensory observations within this virtual environment. Our experiments indicate that our approach outperforms competing algorithms in realism, temporal coherency, and layout consistency. We additionally show that LidarDM can be used as a generative world model simulator for training and testing perception models.
Published: 2024

4. Comprehensive evaluations of the bioaerosol filtration performance of high- and medium-efficiency filters under different influencing factors

Author: Yao, Guangpeng, Liu, Zhijian, Shi, Jie, Liu, Haiyang, Ding, Mingtao, and He, Junzhou
Published: 2024
Full Text: View/download PDF

5. StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Author: Kodaira, Akio, Xu, Chenfeng, Hazama, Toshiki, Yoshimoto, Takanori, Ohno, Kohei, Mitsuhori, Shogo, Sugano, Soichi, Cho, Hanying, Liu, Zhijian, and Keutzer, Kurt
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Machine Learning
Abstract: We introduce StreamDiffusion, a real-time diffusion pipeline designed for interactive image generation. Existing diffusion models are adept at creating images from text or image prompts, yet they often fall short in real-time interaction. This limitation becomes particularly evident in scenarios involving continuous input, such as Metaverse, live video streaming, and broadcasting, where high throughput is imperative. To address this, we present a novel approach that transforms the original sequential denoising into the batching denoising process. Stream Batch eliminates the conventional wait-and-interact approach and enables fluid and high throughput streams. To handle the frequency disparity between data input and model throughput, we design a novel input-output queue for parallelizing the streaming process. Moreover, the existing diffusion pipeline uses classifier-free guidance(CFG), which requires additional U-Net computation. To mitigate the redundant computations, we propose a novel residual classifier-free guidance (RCFG) algorithm that reduces the number of negative conditional denoising steps to only one or even zero. Besides, we introduce a stochastic similarity filter(SSF) to optimize power consumption. Our Stream Batch achieves around 1.5x speedup compared to the sequential denoising method at different denoising levels. The proposed RCFG leads to speeds up to 2.05x higher than the conventional CFG. Combining the proposed strategies and existing mature acceleration tools makes the image-to-image generation achieve up-to 91.07fps on one RTX4090, improving the throughputs of AutoPipline developed by Diffusers over 59.56x. Furthermore, our proposed StreamDiffusion also significantly reduces the energy consumption by 2.39x on one RTX3060 and 1.99x on one RTX4090, respectively., Comment: tech report, the code is available at https://github.com/cumulo-autumn/StreamDiffusion
Published: 2023

6. Point Transformer V3: Simpler, Faster, Stronger

Author: Wu, Xiaoyang, Jiang, Li, Wang, Peng-Shuai, Liu, Zhijian, Liu, Xihui, Qiao, Yu, Ouyang, Wanli, He, Tong, and Zhao, Hengshuang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper is not motivated to seek innovation within the attention mechanism. Instead, it focuses on overcoming the existing trade-offs between accuracy and efficiency within the context of point cloud processing, leveraging the power of scale. Drawing inspiration from recent advances in 3D large-scale representation learning, we recognize that model performance is more influenced by scale than by intricate design. Therefore, we present Point Transformer V3 (PTv3), which prioritizes simplicity and efficiency over the accuracy of certain mechanisms that are minor to the overall performance after scaling, such as replacing the precise neighbor search by KNN with an efficient serialized neighbor mapping of point clouds organized with specific patterns. This principle enables significant scaling, expanding the receptive field from 16 to 1024 points while remaining efficient (a 3x increase in processing speed and a 10x improvement in memory efficiency compared with its predecessor, PTv2). PTv3 attains state-of-the-art results on over 20 downstream tasks that span both indoor and outdoor scenarios. Further enhanced with multi-dataset joint training, PTv3 pushes these results to a higher level., Comment: CVPR 2024, code available at Pointcept (https://github.com/Pointcept/PointTransformerV3)
Published: 2023

7. TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs

Author: Tang, Haotian, Yang, Shang, Liu, Zhijian, Hong, Ke, Yu, Zhongming, Li, Xiuyu, Dai, Guohao, Wang, Yu, and Han, Song
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Performance
Abstract: Sparse convolution plays a pivotal role in emerging workloads, including point cloud processing in AR/VR, autonomous driving, and graph understanding in recommendation systems. Since the computation pattern is sparse and irregular, specialized high-performance kernels are required. Existing GPU libraries offer two dataflow types for sparse convolution. The gather-GEMM-scatter dataflow is easy to implement but not optimal in performance, while the dataflows with overlapped computation and memory access (e.g.implicit GEMM) are highly performant but have very high engineering costs. In this paper, we introduce TorchSparse++, a new GPU library that achieves the best of both worlds. We create a highly efficient Sparse Kernel Generator that generates performant sparse convolution kernels at less than one-tenth of the engineering cost of the current state-of-the-art system. On top of this, we design the Sparse Autotuner, which extends the design space of existing sparse convolution libraries and searches for the best dataflow configurations for training and inference workloads. Consequently, TorchSparse++ achieves 2.9x, 3.3x, 2.2x and 1.7x measured end-to-end speedup on an NVIDIA A100 GPU over state-of-the-art MinkowskiEngine, SpConv 1.2, TorchSparse and SpConv v2 in inference; and is 1.2-1.3x faster than SpConv v2 in mixed precision training across seven representative autonomous driving benchmarks. It also seamlessly supports graph convolutions, achieving 2.6-7.6x faster inference speed compared with state-of-the-art graph deep learning libraries., Comment: MICRO 2023; Haotian Tang and Shang Yang contributed equally to this project
Published: 2023

8. LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Author: Chen, Yukang, Qian, Shengju, Tang, Haotian, Lai, Xin, Liu, Zhijian, Han, Song, and Jia, Jiaya
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: We present LongLoRA, an efficient fine-tuning approach that extends the context sizes of pre-trained large language models (LLMs), with limited computation cost. Typically, training LLMs with long context sizes is computationally expensive, requiring extensive training hours and GPU resources. For example, training on the context length of 8192 needs 16x computational costs in self-attention layers as that of 2048. In this paper, we speed up the context extension of LLMs in two aspects. On the one hand, although dense global attention is needed during inference, fine-tuning the model can be effectively and efficiently done by sparse local attention. The proposed shifted sparse attention effectively enables context extension, leading to non-trivial computation saving with similar performance to fine-tuning with vanilla attention. Particularly, it can be implemented with only two lines of code in training, while being optional in inference. On the other hand, we revisit the parameter-efficient fine-tuning regime for context expansion. Notably, we find that LoRA for context extension works well under the premise of trainable embedding and normalization. LongLoRA combines this improved LoRA with S^2-Attn. LongLoRA demonstrates strong empirical results on various tasks on Llama2 models from 7B/13B to 70B. LongLoRA extends Llama2 7B from 4k context to 100k, or Llama2 70B to 32k on a single 8x A100 machine. LongLoRA extends models' context while retaining their original architectures, and is compatible with most existing techniques, like Flash-Attention2. In addition, we further conduct supervised fine-tuning with LongLoRA and our long instruction-following LongAlpaca dataset., Comment: Code, models, dataset, and demo are available at https://github.com/dvlab-research/LongLoRA
Published: 2023

9. MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models

Author: Zhu, Xiyue, Zyrianov, Vlas, Liu, Zhijian, and Wang, Shenlong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics
Abstract: Despite tremendous advancements in bird's-eye view (BEV) perception, existing models fall short in generating realistic and coherent semantic map layouts, and they fail to account for uncertainties arising from partial sensor information (such as occlusion or limited coverage). In this work, we introduce MapPrior, a novel BEV perception framework that combines a traditional discriminative BEV perception model with a learned generative model for semantic map layouts. Our MapPrior delivers predictions with better accuracy, realism, and uncertainty awareness. We evaluate our model on the large-scale nuScenes benchmark. At the time of submission, MapPrior outperforms the strongest competing method, with significantly improved MMD and ECE scores in camera- and LiDAR-based BEV perception.
Published: 2023

10. CA-CentripetalNet: A novel anchor-free deep learning framework for hardhat wearing detection

Author: Liu, Zhijian, Cai, Nian, Ouyang, Wensheng, Zhang, Chengbin, Tian, Nili, and Wang, Han
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Automatic hardhat wearing detection can strengthen the safety management in construction sites, which is still challenging due to complicated video surveillance scenes. To deal with the poor generalization of previous deep learning based methods, a novel anchor-free deep learning framework called CA-CentripetalNet is proposed for hardhat wearing detection. Two novel schemes are proposed to improve the feature extraction and utilization ability of CA-CentripetalNet, which are vertical-horizontal corner pooling and bounding constrained center attention. The former is designed to realize the comprehensive utilization of marginal features and internal features. The latter is designed to enforce the backbone to pay attention to internal features, which is only used during the training rather than during the detection. Experimental results indicate that the CA-CentripetalNet achieves better performance with the 86.63% mAP (mean Average Precision) with less memory consumption at a reasonable speed than the existing deep learning based methods, especially in case of small-scale hardhats and non-worn-hardhats., Comment: It has been accepted for the journal of Signal, Image and Video Processing, which is a complete version. It is noted that it has been deleted for future publishing
Published: 2023

11. Fast and accurate DNASeq variant calling workflow composed of LUSH toolkit

Author: Wang, Taifu, Zhang, Youjin, Wang, Haoling, Zheng, Qiwen, Yang, Jiaobo, Zhang, Tiefeng, Sun, Geng, Liu, Weicong, Yin, Longhui, He, Xinqiu, You, Rui, Wang, Chu, Liu, Zhencheng, Liu, Zhijian, Wang, Jin’an, Jin, Xiangqian, and He, Zengquan
Published: 2024
Full Text: View/download PDF

12. Cytosolic Cadherin 4 promotes angiogenesis and metastasis in papillary thyroid cancer by suppressing the ubiquitination/degradation of β-catenin

Author: Wu, Luyao, Xiao, Jian, Yi, Dandan, Ding, Haoran, Wang, Ru, Duan, Zehua, Liu, Zhijian, Shi, Xianbiao, Shen, Meiping, and Sang, Jianfeng
Published: 2024
Full Text: View/download PDF

13. SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

Author: Chen, Xuanyao, Liu, Zhijian, Tang, Haotian, Yi, Li, Zhao, Hang, and Han, Song
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: High-resolution images enable neural networks to learn richer visual representations. However, this improved performance comes at the cost of growing computational complexity, hindering their usage in latency-sensitive applications. As not all pixels are equal, skipping computations for less-important regions offers a simple and effective measure to reduce the computation. This, however, is hard to be translated into actual speedup for CNNs since it breaks the regularity of the dense convolution workload. In this paper, we introduce SparseViT that revisits activation sparsity for recent window-based vision transformers (ViTs). As window attentions are naturally batched over blocks, actual speedup with window activation pruning becomes possible: i.e., ~50% latency reduction with 60% sparsity. Different layers should be assigned with different pruning ratios due to their diverse sensitivities and computational costs. We introduce sparsity-aware adaptation and apply the evolutionary search to efficiently find the optimal layerwise sparsity configuration within the vast search space. SparseViT achieves speedups of 1.5x, 1.4x, and 1.3x compared to its dense counterpart in monocular 3D object detection, 2D instance segmentation, and 2D semantic segmentation, respectively, with negligible to no loss of accuracy., Comment: CVPR 2023. The first two authors contributed equally to this work. Project page: https://sparsevit.mit.edu
Published: 2023

14. FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer

Author: Liu, Zhijian, Yang, Xinyu, Tang, Haotian, Yang, Shang, and Han, Song
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Transformer, as an alternative to CNN, has been proven effective in many modalities (e.g., texts and images). For 3D point cloud transformers, existing efforts focus primarily on pushing their accuracy to the state-of-the-art level. However, their latency lags behind sparse convolution-based models (3x slower), hindering their usage in resource-constrained, latency-sensitive applications (such as autonomous driving). This inefficiency comes from point clouds' sparse and irregular nature, whereas transformers are designed for dense, regular workloads. This paper presents FlatFormer to close this latency gap by trading spatial proximity for better computational regularity. We first flatten the point cloud with window-based sorting and partition points into groups of equal sizes rather than windows of equal shapes. This effectively avoids expensive structuring and padding overheads. We then apply self-attention within groups to extract local features, alternate sorting axis to gather features from different directions, and shift windows to exchange features across groups. FlatFormer delivers state-of-the-art accuracy on Waymo Open Dataset with 4.6x speedup over (transformer-based) SST and 1.4x speedup over (sparse convolutional) CenterPoint. This is the first point cloud transformer that achieves real-time performance on edge GPUs and is faster than sparse convolutional methods while achieving on-par or even superior accuracy on large-scale benchmarks., Comment: CVPR 2023. The first two authors contributed equally to this work. Project page: https://flatformer.mit.edu
Published: 2023

15. Analysis of Repetition Capacitor Charge System Based on Permanent Magnet Excited Homopolar Inductor Alternator

Author: Liu, Longjian, Yu, Kexun, Xie, Xianfei, Liu, Zhijian, Guo, Cheng, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

16. Analysis of Electromagnetic Performance of an AUV Wireless Charge Device with New Coil Structure

Author: Liu, Longjian, Zhang, Ming, Rao, Bo, Liu, Zhijian, Guo, Cheng, Yu, Zhongming, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

17. Analysis of Electromagnetic Performance of a Pulsed HIA with Novel Compensating Winding

Author: Liu, Longjian, Yu, Kexun, Xie, Xianfei, Liu, Zhijian, Guo, Cheng, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

18. Electromagnetic Performance Analysis of Homopolar Inductor Alternator with Radial Externally Movable Permanent Magnets

Author: Liu, Longjian, Yu, Kexun, Xie, Xianfei, Liu, Zhijian, Guo, Cheng, Zhang, Ming, Rao, Bo, Sun, Xiangfei, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

19. Operation Strategy of Battery Swapping-Charging System for Electric Vehicle Based on Multi-material Flow with Space-Time Coupling Characteristics

Author: Liu, Zhijian, Dai, Jing, Yang, Lingrui, Dong, Hang, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

20. Interval Prediction of Dynamic Line Rating of OHL Based on Improved Affine Arithmetic

Author: Li, Hanru, Liu, Zhijian, xu, Tao, Lai, Liyong, Huang, Lingyu, Xu, Bin, Liu, Ren, Bo, Tang, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Li, Yong, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Oneto, Luca, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zamboni, Walter, Series Editor, Tan, Kay Chen, Series Editor, Yang, Qingxin, editor, Li, Zewen, editor, and Luo, An, editor
Published: 2024
Full Text: View/download PDF

21. Study on the flow mechanism and frequency characteristics of rales in lower respiratory tract

Author: Jin, Yongjun, Liu, Zhijian, Hu, Chenxing, Dong, Zhijian, Rong, Rui, Liu, Haiyang, Liang, Zhenyu, Liu, Jingwei, Chen, Li, Huang, Minghua, Cui, Haihang, and Shen, Yan
Published: 2024
Full Text: View/download PDF

22. Calibration Set Optimization by Dragonfly Algorithm for Near-Infrared Modeling of Wheat Flour Protein Content

Author: HU Yunchao, LIU Zhijian, WANG Ying, HUANG Haoran, WANG Honghong, WU Cai’e, XIONG Zhixin
Subjects: dragonfly algorithm, near-infrared spectroscopy, optimization of calibration set, protein content of wheat flour, Food processing and manufacture, TP368-456
Abstract: In order to optimize the calibration set for near-infrared modeling of the protein content in wheat flour, the binary dragonfly algorithm (BDA) was used to select representative samples from the primary calibration set divided by the traditional Kennard/Stone (K/S) method. Based on the representative samples, a partial least square regression (PLSR) model for estimating the protein content in wheat flour was established, and the prediction set was employed to evaluate the stability and prediction performance of the model. The results indicated that an optimal calibration set with 30 samples was selected finally by BDA, and the proposed model exhibited a coefficient of determination of prediction (Rp2) of 0.956 4 and a root mean square errors of prediction (RMSEP) of 0.278 1, which increased by 1.87% and decreased by 15.57% compared with those (0.938 8 and 0.329 4) from K/S partition of 100 primary calibration sets, respectively. The average number of calibration sets selected from 10 BDA experiments was 30.2, and the protein content of wheat flour was predicted better by the 10 models developed than that obtained based on the primary calibration set. Therefore, BDA can select a small number of representative calibration set samples based on which a PLSR model with good robustness and high prediction accuracy for the protein content of wheat flour can be established. The proposed method can provide an efficient tool for calibration set selection in near-infrared spectroscopic analysis of the quality of wheat flour.
Published: 2024
Full Text: View/download PDF

23. BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Author: Liu, Zhijian, Tang, Haotian, Amini, Alexander, Yang, Xinyu, Mao, Huizi, Rus, Daniela, and Han, Song
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Multi-sensor fusion is essential for an accurate and reliable autonomous driving system. Recent approaches are based on point-level fusion: augmenting the LiDAR point cloud with camera features. However, the camera-to-LiDAR projection throws away the semantic density of camera features, hindering the effectiveness of such methods, especially for semantic-oriented tasks (such as 3D scene segmentation). In this paper, we break this deeply-rooted convention with BEVFusion, an efficient and generic multi-task multi-sensor fusion framework. It unifies multi-modal features in the shared bird's-eye view (BEV) representation space, which nicely preserves both geometric and semantic information. To achieve this, we diagnose and lift key efficiency bottlenecks in the view transformation with optimized BEV pooling, reducing latency by more than 40x. BEVFusion is fundamentally task-agnostic and seamlessly supports different 3D perception tasks with almost no architectural changes. It establishes the new state of the art on nuScenes, achieving 1.3% higher mAP and NDS on 3D object detection and 13.6% higher mIoU on BEV map segmentation, with 1.9x lower computation cost. Code to reproduce our results is available at https://github.com/mit-han-lab/bevfusion., Comment: ICRA 2023. The first two authors contributed equally to this work. Project page: https://bevfusion.mit.edu
Published: 2022

24. PVNAS: 3D Neural Architecture Search with Point-Voxel Convolution

Author: Liu, Zhijian, Tang, Haotian, Zhao, Shengyu, Shao, Kevin, and Han, Song
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: 3D neural networks are widely used in real-world applications (e.g., AR/VR headsets, self-driving cars). They are required to be fast and accurate; however, limited hardware resources on edge devices make these requirements rather challenging. Previous work processes 3D data using either voxel-based or point-based neural networks, but both types of 3D models are not hardware-efficient due to the large memory footprint and random memory access. In this paper, we study 3D deep learning from the efficiency perspective. We first systematically analyze the bottlenecks of previous 3D methods. We then combine the best from point-based and voxel-based models together and propose a novel hardware-efficient 3D primitive, Point-Voxel Convolution (PVConv). We further enhance this primitive with the sparse convolution to make it more effective in processing large (outdoor) scenes. Based on our designed 3D primitive, we introduce 3D Neural Architecture Search (3D-NAS) to explore the best 3D network architecture given a resource constraint. We evaluate our proposed method on six representative benchmark datasets, achieving state-of-the-art performance with 1.8-23.7x measured speedup. Furthermore, our method has been deployed to the autonomous racing vehicle of MIT Driverless, achieving larger detection range, higher accuracy and lower latency., Comment: Journal extension of arXiv:1907.03739 and arXiv:2007.16100 (IEEE TPAMI, 2021). The first two authors contributed equally to this work
Published: 2022
Full Text: View/download PDF

25. Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Author: Cai, Han, Lin, Ji, Lin, Yujun, Liu, Zhijian, Tang, Haotian, Wang, Hanrui, Zhu, Ligeng, and Han, Song
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial intelligence (AI), including computer vision, natural language processing and speech recognition. However, their superior performance comes at the considerable cost of computational complexity, which greatly hinders their applications in many resource-constrained devices, such as mobile phones and Internet of Things (IoT) devices. Therefore, methods and techniques that are able to lift the efficiency bottleneck while preserving the high accuracy of DNNs are in great demand in order to enable numerous edge AI applications. This paper provides an overview of efficient deep learning methods, systems and applications. We start from introducing popular model compression methods, including pruning, factorization, quantization as well as compact model design. To reduce the large design cost of these manual solutions, we discuss the AutoML framework for each of them, such as neural architecture search (NAS) and automated pruning and quantization. We then cover efficient on-device training to enable user customization based on the local data on mobile devices. Apart from general acceleration techniques, we also showcase several task-specific accelerations for point cloud, video and natural language processing by exploiting their spatial sparsity and temporal/token redundancy. Finally, to support all these algorithmic advancements, we introduce the efficient deep learning system design from both software and hardware perspectives., Comment: Journal preprint (ACM TODAES, 2021). The first seven authors contributed equally to this work and are listed in the alphabetical order
Published: 2022
Full Text: View/download PDF

26. TorchSparse: Efficient Point Cloud Inference Engine

Author: Tang, Haotian, Liu, Zhijian, Li, Xiuyu, Lin, Yujun, and Han, Song
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Performance
Abstract: Deep learning on point clouds has received increased attention thanks to its wide applications in AR/VR and autonomous driving. These applications require low latency and high accuracy to provide real-time user experience and ensure user safety. Unlike conventional dense workloads, the sparse and irregular nature of point clouds poses severe challenges to running sparse CNNs efficiently on the general-purpose hardware. Furthermore, existing sparse acceleration techniques for 2D images do not translate to 3D point clouds. In this paper, we introduce TorchSparse, a high-performance point cloud inference engine that accelerates the sparse convolution computation on GPUs. TorchSparse directly optimizes the two bottlenecks of sparse convolution: irregular computation and data movement. It applies adaptive matrix multiplication grouping to trade computation for better regularity, achieving 1.4-1.5x speedup for matrix multiplication. It also optimizes the data movement by adopting vectorized, quantized and fused locality-aware memory access, reducing the memory movement cost by 2.7x. Evaluated on seven representative models across three benchmark datasets, TorchSparse achieves 1.6x and 1.5x measured end-to-end speedup over the state-of-the-art MinkowskiEngine and SpConv, respectively., Comment: MLSys 2022. The first three authors contributed equally to this work. Project page: https://torchsparse.mit.edu
Published: 2022

27. Research on multi-time scale optimization of integrated energy system based on multiple energy storage

Author: Qian, Jiangbo, Guo, Yunfeng, Wu, Di, Liu, Ao, Han, Zhonghe, Liu, Zhijian, Zhang, Shicong, and Yang, Xinyan
Published: 2024
Full Text: View/download PDF

28. Evaluation of different computational methods for numerical simulation of aerosol distribution in the operating room

Author: Liu, Zhijian, Chu, Jiaqi, Li, Haochuan, Huang, Zhenzhe, Liu, Haiyang, Jiang, Chuan, Sun, Tingting, Xiao, Xia, and He, Junzhou
Published: 2024
Full Text: View/download PDF

29. Rapid prediction for the transient dispersion of leaked airborne pollutant in urban environment based on graph neural networks

Author: Shao, Xuqiang, Zhang, Siqi, Liu, Xiaofan, Liu, Zhijian, and Huang, Jiancai
Published: 2024
Full Text: View/download PDF

30. Interpenetrating LiB/Li3BN2 phases enabling stable composite lithium metal anode

Author: Qing, Piao, Huang, Shaozhen, Naren, Tuoya, Li, Quan, Huang, Haifeng, Long, Kecheng, Liu, Zhijian, Mei, Lin, Sun, Fu, Wei, Weifeng, Zhang, Yu, Ma, Jianmin, Wu, Zhibin, and Chen, Libao
Published: 2024
Full Text: View/download PDF

31. Air disinfection performance of upper-room ultraviolet germicidal irradiation (UR-UVGI) system in a multi-compartment dental clinic

Author: Yao, Guangpeng, Liu, Zhijian, Liu, Haiyang, Jiang, Chuan, Li, Yabin, Liu, Jia, and He, Junzhou
Published: 2024
Full Text: View/download PDF

32. Evaluation of upper-room ultraviolet germicidal irradiation system for air disinfection and energy conservation in multi-patient ward

Author: Liu, Zhijian, Lin, Chaofan, Shi, Jie, Ding, Mingtao, Liu, Haiyang, Chu, Jiaqi, Li, Haochuan, Huang, Zhenzhe, Xiao, Xia, Wan, Tongliang, and He, Junzhou
Published: 2024
Full Text: View/download PDF

33. Transformer fault diagnosis method based on the three-stage lightweight residual neural network

Author: Liu, Hang, Niu, Ben, Liu, Zhijian, Li, Ming, and Shi, Zhiyu
Published: 2025
Full Text: View/download PDF

34. Single-nucleus transcriptomes reveal spatiotemporal symbiotic perception and early response in Medicago

Author: Liu, Zhijian, Yang, Jun, Long, Yanping, Zhang, Chi, Wang, Dapeng, Zhang, Xiaowei, Dong, Wentao, Zhao, Li, Liu, Chengwu, Zhai, Jixian, and Wang, Ertao
Published: 2023
Full Text: View/download PDF

35. Mesenchymal stem cells alleviate autoimmune thyroiditis by modulating macrophage phenotypes and through influencing the STING pathway

Author: Ding, Haoran, Qin, Jiabo, Liu, Zhijian, Shi, Xianbiao, Guan, Wenxian, and Sang, Jianfeng
Published: 2024
Full Text: View/download PDF

36. OsCSLD4 confers salt–alkali tolerance by regulating gene expressions in photosynthesis and carbohydrate biosynthesis pathways, cell wall hemicellulose accumulation and physio-biochemical adaptability in rice

Author: Liu, Zhijian, Wu, Cuili, Li, Wenjie, Hu, Li, Fu, Xingyue, Hu, Binhua, Liao, Yongxiang, Xiang, Zufen, Jiang, Haibo, Huang, Weizao, Yang, Xiaocheng, Du, Anping, Pu, Zhigang, Tu, Shengbin, Ma, Xinrong, and Li, Hui
Published: 2024
Full Text: View/download PDF

37. VISTA 2.0: An Open, Data-driven Simulator for Multimodal Sensing and Policy Learning for Autonomous Vehicles

Author: Amini, Alexander, Wang, Tsun-Hsuan, Gilitschenski, Igor, Schwarting, Wilko, Liu, Zhijian, Han, Song, Karaman, Sertac, and Rus, Daniela
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Simulation has the potential to transform the development of robust algorithms for mobile agents deployed in safety-critical scenarios. However, the poor photorealism and lack of diverse sensor modalities of existing simulation engines remain key hurdles towards realizing this potential. Here, we present VISTA, an open source, data-driven simulator that integrates multiple types of sensors for autonomous vehicles. Using high fidelity, real-world datasets, VISTA represents and simulates RGB cameras, 3D LiDAR, and event-based cameras, enabling the rapid generation of novel viewpoints in simulation and thereby enriching the data available for policy learning with corner cases that are difficult to capture in the physical world. Using VISTA, we demonstrate the ability to train and test perception-to-control policies across each of the sensor types and showcase the power of this approach via deployment on a full scale autonomous vehicle. The policies learned in VISTA exhibit sim-to-real transfer without modification and greater robustness than those trained exclusively on real-world data., Comment: First two authors contributed equally. Code and project website is available here: https://vista.csail.mit.edu
Published: 2021

38. Investigation into the effects of background particles on Serratia marcescens bioaerosol

Author: Liu, Zhijian, Li, Haochuan, Chu, Jiaqi, Huang, Zhenzhe, Xiao, Xia, Lin, Chaofan, Shi, Jie, and He, Junzhou
Published: 2024
Full Text: View/download PDF

39. Interfacial exchange of airflow and bacteria-carrying particles induced by door opening and foot traffic in an operating room

Author: Liu, Haiyang, Liu, Zhijian, He, Junzhou, Hu, Chenxing, and Rong, Rui
Published: 2024
Full Text: View/download PDF

40. Spatial distribution of bioaerosols and evaluation of four ventilation method on controlling their diffusion in a typical enhanced biosafety level 2 laboratory

Author: Liu, Zhijian, Wang, Yongxin, Jiang, Chuan, He, Junzhou, Rong, Rui, Li, Shiyue, and Liang, Zhenyu
Published: 2024
Full Text: View/download PDF

41. Integrated assessment of bioaerosol dispersion patterns and infection risk in a typical urban environment: Implications for urban biosecurity management

Author: Dong, Zhijian, Liu, Zhijian, Hu, Chenxing, Shao, Xuqiang, Yang, Haokang, Jin, Yongjun, and Rong, Rui
Published: 2024
Full Text: View/download PDF

42. Assessment of the partial partition effect on the control of bioaerosol transmission in a one bed ward

Author: Wang, Yongxin, Liu, Zhijian, Liu, Haiyang, Xiao, Xia, He, Junzhou, Li, Shiyue, and Liang, Zhenyu
Published: 2024
Full Text: View/download PDF

43. Advances in recycling LiFePO4 from spent lithium batteries: A critical review

Author: Ding, Yunji, Fu, Jiayi, Zhang, Shengen, He, Xuefeng, Zhao, Baohuai, Ren, Jing, Zhong, Jiahao, and Liu, Zhijian
Published: 2024
Full Text: View/download PDF

44. Telomere-to-telomere genome assemblies of two Chinese Baijiu-brewing sorghum landraces

Author: Bao, Jiandong, Zhang, Heng, Wang, Fulin, Li, Lin, Zhu, Xueming, Xu, Junfeng, Wang, Yao, Liu, Zhijian, Zhai, Guowei, Xu, Heng, Lin, Fucheng, and Zhu, Ying
Published: 2024
Full Text: View/download PDF

45. Modeling the infection risk and emergency evacuation from bioaerosol leakage around an urban vaccine factory

Author: Liu, Zhijian, Cao, Hongwei, Hu, Chenxing, Wu, Minnan, Zhang, Siqi, He, Junzhou, and Jiang, Chuan
Published: 2023
Full Text: View/download PDF

46. LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision

Author: Liu, Zhijian, Stent, Simon, Li, Jie, Gideon, John, and Han, Song
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Computer vision tasks such as object detection and semantic/instance segmentation rely on the painstaking annotation of large training datasets. In this paper, we propose LocTex that takes advantage of the low-cost localized textual annotations (i.e., captions and synchronized mouse-over gestures) to reduce the annotation effort. We introduce a contrastive pre-training framework between images and captions and propose to supervise the cross-modal attention map with rendered mouse traces to provide coarse localization signals. Our learned visual features capture rich semantics (from free-form captions) and accurate localization (from mouse traces), which are very effective when transferred to various downstream vision tasks. Compared with ImageNet supervised pre-training, LocTex can reduce the size of the pre-training dataset by 10x or the target dataset by 2x while achieving comparable or even improved performance on COCO instance segmentation. When provided with the same amount of annotations, LocTex achieves around 4% higher accuracy than the previous state-of-the-art "vision+language" pre-training approach on the task of PASCAL VOC image classification., Comment: ICCV 2021. Project page: https://loctex.mit.edu/
Published: 2021

47. The severity and infection of acute pancreatitis may increase the risk of bleeding in patients undergoing EUS-guided drainage and endoscopic necrosectomy: a large retrospective cohort

Author: Peng, Siyang, Yao, Qian, Fu, Yunfeng, Xu, Xuan, Chen, SiHai, Ke, Huajing, Hu, Yang, Xiong, Huifang, He, Wenhua, Zhu, Yong, Xia, Liang, Wu, Yao, Shu, Xu, Liu, Zhijian, Zhu, Yin, Chen, Youxiang, Lu, Nonghua, and Liu, Pi
Published: 2023
Full Text: View/download PDF

48. Effect of aging temperature on microstructure and softening property of the Cu-Cr-Zr-Nb alloy

Author: Miao, Yupeng, Gan, Chunlei, Jin, Wei, Wang, Ming, Chen, Yuanxing, Liu, Zhijian, and Zhang, Zhibo
Published: 2024
Full Text: View/download PDF

49. Modeling the long-range transmission of airborne bioaerosol releases in a high-density urban environment

Author: Dong, Zhijian, Liu, Zhijian, Hu, Chenxing, Jin, Yongjun, Ye, Chenglin, He, Yuchen, and Rong, Rui
Published: 2024
Full Text: View/download PDF

50. Investigation of pathogen-laden aerosol aerodynamics and ventilation rates on their long-range transmission: A study on the Fangcang shelter hospital

Author: Jiang, Chuan, Liu, Zhijian, Wang, Yongxin, Yao, Guangpeng, Li, Shiyue, Rong, Rui, Liang, Zhenyu, Liu, Haiyang, Liu, Jingwei, Wan, Tongliang, and He, Junzhou
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

1,559 results on '"Liu, Zhijian"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources