Author: "Chen, ZiJian" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chen, ZiJian"' showing total 648 results

Start Over Author "Chen, ZiJian"

648 results on '"Chen, ZiJian"'

1. OBI-Bench: Can LMMs Aid in Study of Ancient Script on Oracle Bones?

Author: Chen, Zijian, Chen, Tingzhu, Zhang, Wenjun, and Zhai, Guangtao
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: We introduce OBI-Bench, a holistic benchmark crafted to systematically evaluate large multi-modal models (LMMs) on whole-process oracle bone inscriptions (OBI) processing tasks demanding expert-level domain knowledge and deliberate cognition. OBI-Bench includes 5,523 meticulously collected diverse-sourced images, covering five key domain problems: recognition, rejoining, classification, retrieval, and deciphering. These images span centuries of archaeological findings and years of research by front-line scholars, comprising multi-stage font appearances from excavation to synthesis, such as original oracle bone, inked rubbings, oracle bone fragments, cropped single character, and handprinted character. Unlike existing benchmarks, OBI-Bench focuses on advanced visual perception and reasoning with OBI-specific knowledge, challenging LMMs to perform tasks akin to those faced by experts. The evaluation of 6 proprietary LMMs as well as 17 open-source LMMs highlights the substantial challenges and demands posed by OBI-Bench. Even the latest versions of GPT-4o, Gemini 1.5 Pro, and Qwen-VL-Max are still far from public-level humans in some fine-grained perception tasks. However, they perform at a level comparable to untrained humans in deciphering task, indicating remarkable capabilities in offering new interpretative perspectives and generating creative guesses. We hope OBI-Bench can facilitate the community to develop domain-specific multi-modal foundation models towards ancient language research and delve deeper to discover and enhance these untapped potentials of LMMs., Comment: 31 pages, 18 figures
Published: 2024

2. An Early FIRST Reproduction and Improvements to Single-Token Decoding for Fast Listwise Reranking

Author: Chen, Zijian, Pradeep, Ronak, and Lin, Jimmy
Subjects: Computer Science - Information Retrieval, Computer Science - Computation and Language
Abstract: Recent advances have demonstrated that large language models (LLMs) excel as listwise rerankers, but their high computational demands remain a barrier to widespread adoption. Further, the traditional language modeling (LM) objective is not ideally suited for reranking tasks. FIRST is a novel approach that addresses these challenges by integrating a learning-to-rank objective and leveraging the logits of only the first generated token, thereby significantly reducing inference latency compared to traditional LLM rerankers. In this study, we extend the evaluation of FIRST to the TREC Deep Learning datasets (DL19-22), validating its robustness across diverse domains. We investigate the influence of different first-stage retrievers on FIRST rerankers, observing diminishing returns and patterns consistent with traditional LLM rerankers. Through applying the FIRST objective to a broader range of backbone models, we achieve effectiveness surpassing the original implementation. Our experiments confirm that fast reranking with single-token logits does not compromise out-of-domain reranking quality. To better quantify the computational savings in the original study, we measure and compare latency to find a 21%-42% gain across various models and benchmarks. Moreover, while LM training implicitly improves zero-shot single-token reranking, our experiments also raise questions about whether LM pre-training may hinder subsequent fine-tuning with the FIRST objective. These findings pave the way for more efficient and effective listwise reranking in future applications.
Published: 2024

3. Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

Author: Zhang, Zicheng, Jia, Ziheng, Wu, Haoning, Li, Chunyi, Chen, Zijian, Zhou, Yingjie, Sun, Wei, Liu, Xiaohong, Min, Xiongkuo, Lin, Weisi, and Zhai, Guangtao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: With the rising interest in research on Large Multi-modal Models (LMMs) for video understanding, many studies have emphasized general video comprehension capabilities, neglecting the systematic exploration into video quality understanding. To address this oversight, we introduce Q-Bench-Video in this paper, a new benchmark specifically designed to evaluate LMMs' proficiency in discerning video quality. a) To ensure video source diversity, Q-Bench-Video encompasses videos from natural scenes, AI-generated Content (AIGC), and Computer Graphics (CG). b) Building on the traditional multiple-choice questions format with the Yes-or-No and What-How categories, we include Open-ended questions to better evaluate complex scenarios. Additionally, we incorporate the video pair quality comparison question to enhance comprehensiveness. c) Beyond the traditional Technical, Aesthetic, and Temporal distortions, we have expanded our evaluation aspects to include the dimension of AIGC distortions, which addresses the increasing demand for video generation. Finally, we collect a total of 2,378 question-answer pairs and test them on 12 open-source & 5 proprietary LMMs. Our findings indicate that while LMMs have a foundational understanding of video quality, their performance remains incomplete and imprecise, with a notable discrepancy compared to human performance. Through Q-Bench-Video, we seek to catalyze community interest, stimulate further research, and unlock the untapped potential of LMMs to close the gap in video quality understanding.
Published: 2024

4. QID$^2$: An Image-Conditioned Diffusion Model for Q-space Up-sampling of DWI Data

Author: Chen, Zijian, Wang, Jueqi, and Venkataraman, Archana
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: We propose an image-conditioned diffusion model to estimate high angular resolution diffusion weighted imaging (DWI) from a low angular resolution acquisition. Our model, which we call QID$^2$, takes as input a set of low angular resolution DWI data and uses this information to estimate the DWI data associated with a target gradient direction. We leverage a U-Net architecture with cross-attention to preserve the positional information of the reference images, further guiding the target image generation. We train and evaluate QID$^2$ on single-shell DWI samples curated from the Human Connectome Project (HCP) dataset. Specifically, we sub-sample the HCP gradient directions to produce low angular resolution DWI data and train QID$^2$ to reconstruct the missing high angular resolution samples. We compare QID$^2$ with two state-of-the-art GAN models. Our results demonstrate that QID$^2$ not only achieves higher-quality generated images, but it consistently outperforms the GAN models in downstream tensor estimation across multiple metrics. Taken together, this study highlights the potential of diffusion models, and QID$^2$ in particular, for q-space up-sampling, thus offering a promising toolkit for clinical and research applications., Comment: Accepted at MICCAI 2024 International Workshop on Computational Diffusion MRI. Zijian Chen and Jueqi Wang contributed equally to this work
Published: 2024

5. A Lesion-aware Edge-based Graph Neural Network for Predicting Language Ability in Patients with Post-stroke Aphasia

Author: Chen, Zijian, Varkanitsa, Maria, Ishwar, Prakash, Konrad, Janusz, Betke, Margrit, Kiran, Swathi, and Venkataraman, Archana
Subjects: Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing, Quantitative Biology - Neurons and Cognition
Abstract: We propose a lesion-aware graph neural network (LEGNet) to predict language ability from resting-state fMRI (rs-fMRI) connectivity in patients with post-stroke aphasia. Our model integrates three components: an edge-based learning module that encodes functional connectivity between brain regions, a lesion encoding module, and a subgraph learning module that leverages functional similarities for prediction. We use synthetic data derived from the Human Connectome Project (HCP) for hyperparameter tuning and model pretraining. We then evaluate the performance using repeated 10-fold cross-validation on an in-house neuroimaging dataset of post-stroke aphasia. Our results demonstrate that LEGNet outperforms baseline deep learning methods in predicting language ability. LEGNet also exhibits superior generalization ability when tested on a second in-house dataset that was acquired under a slightly different neuroimaging protocol. Taken together, the results of this study highlight the potential of LEGNet in effectively learning the relationships between rs-fMRI connectivity and language ability in a patient cohort with brain lesions for improved post-stroke aphasia evaluation., Comment: Accepted at MICCAI 2024 International Workshop on Machine Learning in Clinical Neuroimaging (MLCN)
Published: 2024

6. Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency

Author: Sun, Wei, Zhang, Weixia, Cao, Yuqin, Cao, Linhan, Jia, Jun, Chen, Zijian, Zhang, Zicheng, Min, Xiongkuo, and Zhai, Guangtao
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: UHD images, typically with resolutions equal to or higher than 4K, pose a significant challenge for efficient image quality assessment (IQA) algorithms, as adopting full-resolution images as inputs leads to overwhelming computational complexity and commonly used pre-processing methods like resizing or cropping may cause substantial loss of detail. To address this problem, we design a multi-branch deep neural network (DNN) to assess the quality of UHD images from three perspectives: global aesthetic characteristics, local technical distortions, and salient content perception. Specifically, aesthetic features are extracted from low-resolution images downsampled from the UHD ones, which lose high-frequency texture information but still preserve the global aesthetics characteristics. Technical distortions are measured using a fragment image composed of mini-patches cropped from UHD images based on the grid mini-patch sampling strategy. The salient content of UHD images is detected and cropped to extract quality-aware features from the salient regions. We adopt the Swin Transformer Tiny as the backbone networks to extract features from these three perspectives. The extracted features are concatenated and regressed into quality scores by a two-layer multi-layer perceptron (MLP) network. We employ the mean square error (MSE) loss to optimize prediction accuracy and the fidelity loss to optimize prediction monotonicity. Experimental results show that the proposed model achieves the best performance on the UHD-IQA dataset while maintaining the lowest computational complexity, demonstrating its effectiveness and efficiency. Moreover, the proposed model won first prize in ECCV AIM 2024 UHD-IQA Challenge. The code is available at https://github.com/sunwei925/UIQA., Comment: The proposed model won first prize in ECCV AIM 2024 Pushing the Boundaries of Blind Photo Quality Assessment Challenge
Published: 2024

7. SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression

Author: Cao, Linhan, Sun, Wei, Min, Xiongkuo, Jia, Jun, Zhang, Zicheng, Chen, Zijian, Zhu, Yucheng, Liu, Lizhou, Chen, Qiubo, Chen, Jing, and Zhai, Guangtao
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Just noticeable distortion (JND), representing the threshold of distortion in an image that is minimally perceptible to the human visual system (HVS), is crucial for image compression algorithms to achieve a trade-off between transmission bit rate and image quality. However, traditional JND prediction methods only rely on pixel-level or sub-band level features, lacking the ability to capture the impact of image content on JND. To bridge this gap, we propose a Semantic-Guided JND (SG-JND) network to leverage semantic information for JND prediction. In particular, SG-JND consists of three essential modules: the image preprocessing module extracts semantic-level patches from images, the feature extraction module extracts multi-layer features by utilizing the cross-scale attention layers, and the JND prediction module regresses the extracted features into the final JND value. Experimental results show that SG-JND achieves the state-of-the-art performance on two publicly available JND datasets, which demonstrates the effectiveness of SG-JND and highlight the significance of incorporating semantic information in JND assessment., Comment: Accepted by ICIP 2024
Published: 2024

8. Benchmarking AIGC Video Quality Assessment: A Dataset and Unified Model

Author: Zhang, Zhichao, Li, Xinyue, Sun, Wei, Jia, Jun, Min, Xiongkuo, Zhang, Zicheng, Li, Chunyi, Chen, Zijian, Wang, Puyi, Ji, Zhongpeng, Sun, Fengyu, Jui, Shangling, and Zhai, Guangtao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent years, artificial intelligence (AI) driven video generation has garnered significant attention due to advancements in stable diffusion and large language model techniques. Thus, there is a great demand for accurate video quality assessment (VQA) models to measure the perceptual quality of AI-generated content (AIGC) videos as well as optimize video generation techniques. However, assessing the quality of AIGC videos is quite challenging due to the highly complex distortions they exhibit (e.g., unnatural action, irrational objects, etc.). Therefore, in this paper, we try to systemically investigate the AIGC-VQA problem from both subjective and objective quality assessment perspectives. For the subjective perspective, we construct a Large-scale Generated Vdeo Quality assessment (LGVQ) dataset, consisting of 2,808 AIGC videos generated by 6 video generation models using 468 carefully selected text prompts. Unlike previous subjective VQA experiments, we evaluate the perceptual quality of AIGC videos from three dimensions: spatial quality, temporal quality, and text-to-video alignment, which hold utmost importance for current video generation techniques. For the objective perspective, we establish a benchmark for evaluating existing quality assessment metrics on the LGVQ dataset, which reveals that current metrics perform poorly on the LGVQ dataset. Thus, we propose a Unify Generated Video Quality assessment (UGVQ) model to comprehensively and accurately evaluate the quality of AIGC videos across three aspects using a unified model, which uses visual, textual and motion features of video and corresponding prompt, and integrates key features to enhance feature expression. We hope that our benchmark can promote the development of quality evaluation metrics for AIGC videos. The LGVQ dataset and the UGVQ metric will be publicly released.
Published: 2024

9. GAIA: Rethinking Action Quality Assessment for AI-Generated Videos

Author: Chen, Zijian, Sun, Wei, Tian, Yuan, Jia, Jun, Zhang, Zicheng, Wang, Jiarui, Huang, Ru, Min, Xiongkuo, Zhai, Guangtao, and Zhang, Wenjun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Assessing action quality is both imperative and challenging due to its significant impact on the quality of AI-generated videos, further complicated by the inherently ambiguous nature of actions within AI-generated video (AIGV). Current action quality assessment (AQA) algorithms predominantly focus on actions from real specific scenarios and are pre-trained with normative action features, thus rendering them inapplicable in AIGVs. To address these problems, we construct GAIA, a Generic AI-generated Action dataset, by conducting a large-scale subjective evaluation from a novel causal reasoning-based perspective, resulting in 971,244 ratings among 9,180 video-action pairs. Based on GAIA, we evaluate a suite of popular text-to-video (T2V) models on their ability to generate visually rational actions, revealing their pros and cons on different categories of actions. We also extend GAIA as a testbed to benchmark the AQA capacity of existing automatic evaluation methods. Results show that traditional AQA methods, action-related metrics in recent T2V benchmarks, and mainstream video quality methods perform poorly with an average SRCC of 0.454, 0.191, and 0.519, respectively, indicating a sizable gap between current models and human action perception patterns in AIGVs. Our findings underscore the significance of action quality as a unique perspective for studying AIGVs and can catalyze progress towards methods with enhanced capacities for AQA in AIGVs., Comment: Accepted by NeurIPS2024 Dataset and Benchmark Track as Spotlight. 33 pages, 15 figures
Published: 2024

10. Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking

Author: Zhang, Jiyao, Huang, Weiyao, Peng, Bo, Wu, Mingdong, Hu, Fei, Chen, Zijian, Zhao, Bo, and Dong, Hao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: 6D Object Pose Estimation is a crucial yet challenging task in computer vision, suffering from a significant lack of large-scale datasets. This scarcity impedes comprehensive evaluation of model performance, limiting research advancements. Furthermore, the restricted number of available instances or categories curtails its applications. To address these issues, this paper introduces Omni6DPose, a substantial dataset characterized by its diversity in object categories, large scale, and variety in object materials. Omni6DPose is divided into three main components: ROPE (Real 6D Object Pose Estimation Dataset), which includes 332K images annotated with over 1.5M annotations across 581 instances in 149 categories; SOPE(Simulated 6D Object Pose Estimation Dataset), consisting of 475K images created in a mixed reality setting with depth simulation, annotated with over 5M annotations across 4162 instances in the same 149 categories; and the manually aligned real scanned objects used in both ROPE and SOPE. Omni6DPose is inherently challenging due to the substantial variations and ambiguities. To address this challenge, we introduce GenPose++, an enhanced version of the SOTA category-level pose estimation framework, incorporating two pivotal improvements: Semantic-aware feature extraction and Clustering-based aggregation. Moreover, we provide a comprehensive benchmarking analysis to evaluate the performance of previous methods on this large-scale dataset in the realms of 6D object pose estimation and pose tracking.
Published: 2024

11. A-Bench: Are LMMs Masters at Evaluating AI-generated Images?

Author: Zhang, Zicheng, Wu, Haoning, Li, Chunyi, Zhou, Yingjie, Sun, Wei, Min, Xiongkuo, Chen, Zijian, Liu, Xiaohong, Lin, Weisi, and Zhai, Guangtao
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: How to accurately and efficiently assess AI-generated images (AIGIs) remains a critical challenge for generative models. Given the high costs and extensive time commitments required for user studies, many researchers have turned towards employing large multi-modal models (LMMs) as AIGI evaluators, the precision and validity of which are still questionable. Furthermore, traditional benchmarks often utilize mostly natural-captured content rather than AIGIs to test the abilities of LMMs, leading to a noticeable gap for AIGIs. Therefore, we introduce A-Bench in this paper, a benchmark designed to diagnose whether LMMs are masters at evaluating AIGIs. Specifically, A-Bench is organized under two key principles: 1) Emphasizing both high-level semantic understanding and low-level visual quality perception to address the intricate demands of AIGIs. 2) Various generative models are utilized for AIGI creation, and various LMMs are employed for evaluation, which ensures a comprehensive validation scope. Ultimately, 2,864 AIGIs from 16 text-to-image models are sampled, each paired with question-answers annotated by human experts, and tested across 18 leading LMMs. We hope that A-Bench will significantly enhance the evaluation process and promote the generation quality for AIGIs. The benchmark is available at https://github.com/Q-Future/A-Bench.
Published: 2024

12. AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

Author: Conde, Marcos V., Zadtootaghaj, Saman, Barman, Nabajeet, Timofte, Radu, He, Chenlong, Zheng, Qi, Zhu, Ruoxi, Tu, Zhengzhong, Wang, Haiqiang, Chen, Xiangguang, Meng, Wenhui, Pan, Xiang, Shi, Huiying, Zhu, Han, Xu, Xiaozhong, Sun, Lei, Chen, Zhenzhong, Liu, Shan, Zhang, Zicheng, Wu, Haoning, Zhou, Yingjie, Li, Chunyi, Liu, Xiaohong, Lin, Weisi, Zhai, Guangtao, Sun, Wei, Cao, Yuqin, Jiang, Yanwei, Jia, Jun, Zhang, Zhichao, Chen, Zijian, Zhang, Weixia, Min, Xiongkuo, Göring, Steve, Qi, Zihao, and Feng, Chen
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: This paper reviews the AIS 2024 Video Quality Assessment (VQA) Challenge, focused on User-Generated Content (UGC). The aim of this challenge is to gather deep learning-based methods capable of estimating the perceptual quality of UGC videos. The user-generated videos from the YouTube UGC Dataset include diverse content (sports, games, lyrics, anime, etc.), quality and resolutions. The proposed methods must process 30 FHD frames under 1 second. In the challenge, a total of 102 participants registered, and 15 submitted code and models. The performance of the top-5 submissions is reviewed and provided here as a survey of diverse deep models for efficient video quality assessment of user-generated content., Comment: CVPR 2024 Workshop -- AI for Streaming (AIS) Video Quality Assessment Challenge
Published: 2024

13. Synergistic Improvement of Mechanical and Corrosion Properties of Mg-9.1Y-1.8Zn Alloys by Hot Extrusion

Author: Lu, Xianzheng, Chen, Zijian, Zou, Xianjun, Zhang, Jian, Tu, Yu, Zhou, Xiaojie, Chen, Xiaomin, Lai, Chiping, Chan, Luenchow, and Zeng, Gang
Published: 2024
Full Text: View/download PDF

14. Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis

Author: Chen, Zijian, Wang, Mei, Deng, Weihong, Shi, Hongzhi, Wen, Dongchao, Zhang, Yingjie, Cui, Xingchen, and Zhao, Jian
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: 2D face recognition encounters challenges in unconstrained environments due to varying illumination, occlusion, and pose. Recent studies focus on RGB-D face recognition to improve robustness by incorporating depth information. However, collecting sufficient paired RGB-D training data is expensive and time-consuming, hindering wide deployment. In this work, we first construct a diverse depth dataset generated by 3D Morphable Models for depth model pre-training. Then, we propose a domain-independent pre-training framework that utilizes readily available pre-trained RGB and depth models to separately perform face recognition without needing additional paired data for retraining. To seamlessly integrate the two distinct networks and harness the complementary benefits of RGB and depth information for improved accuracy, we propose an innovative Adaptive Confidence Weighting (ACW). This mechanism is designed to learn confidence estimates for each modality to achieve modality fusion at the score level. Our method is simple and lightweight, only requiring ACW training beyond the backbone models. Experiments on multiple public RGB-D face recognition benchmarks demonstrate state-of-the-art performance surpassing previous methods based on depth estimation and feature fusion, validating the efficacy of our approach., Comment: 9 pages, 5 figures
Published: 2024

15. A Lesion-Aware Edge-Based Graph Neural Network for Predicting Language Ability in Patients with Post-stroke Aphasia

Author: Chen, Zijian, Varkanitsa, Maria, Ishwar, Prakash, Konrad, Janusz, Betke, Margrit, Kiran, Swathi, Venkataraman, Archana, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Bathula, Deepti R., editor, Benet Nirmala, Anoop, editor, Dvornek, Nicha C., editor, Govindarajan, Sindhuja T., editor, Habes, Mohamad, editor, Kumar, Vinod, editor, Nebli, Ahmed, editor, Wolfers, Thomas, editor, and Xiao, Yiming, editor
Published: 2025
Full Text: View/download PDF

16. Investigation of Al/CeO2 interfacial relationships for epitaxial growth of Al on CeO2 substrates: first-principles calculation

Author: Ling, Ying, Zou, Xiuliang, Chen, Zijian, and Yan, Hong
Published: 2024
Full Text: View/download PDF

17. Exploring the Naturalness of AI-Generated Images

Author: Chen, Zijian, Sun, Wei, Wu, Haoning, Zhang, Zicheng, Jia, Jun, Ji, Zhongpeng, Sun, Fengyu, Jui, Shangling, Min, Xiongkuo, Zhai, Guangtao, and Zhang, Wenjun
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The proliferation of Artificial Intelligence-Generated Images (AGIs) has greatly expanded the Image Naturalness Assessment (INA) problem. Different from early definitions that mainly focus on tone-mapped images with limited distortions (e.g., exposure, contrast, and color reproduction), INA on AI-generated images is especially challenging as it has more diverse contents and could be affected by factors from multiple perspectives, including low-level technical distortions and high-level rationality distortions. In this paper, we take the first step to benchmark and assess the visual naturalness of AI-generated images. First, we construct the AI-Generated Image Naturalness (AGIN) database by conducting a large-scale subjective study to collect human opinions on the overall naturalness as well as perceptions from technical and rationality perspectives. AGIN verifies that naturalness is universally and disparately affected by technical and rationality distortions. Second, we propose the Joint Objective Image Naturalness evaluaTor (JOINT), to automatically predict the naturalness of AGIs that aligns human ratings. Specifically, JOINT imitates human reasoning in naturalness evaluation by jointly learning both technical and rationality features. We demonstrate that JOINT significantly outperforms baselines for providing more subjectively consistent results on naturalness assessment., Comment: 33 pages
Published: 2023

18. Joint Location Sensing and Channel Estimation for IRS-Aided mmWave ISAC Systems

Author: Chen, Zijian, Zhao, Ming-Min, Li, Min, Xu, Fan, Wu, Qingqing, and Zhao, Min-Jian
Subjects: Electrical Engineering and Systems Science - Signal Processing
Abstract: In this paper, we investigate a self-sensing intelligent reflecting surface (IRS) aided millimeter wave (mmWave) integrated sensing and communication (ISAC) system. Unlike the conventional purely passive IRS, the self-sensing IRS can effectively reduce the path loss of sensing-related links, thus rendering it advantageous in ISAC systems. Aiming to jointly sense the target/scatterer/user positions as well as estimate the sensing and communication (SAC) channels in the considered system, we propose a two-phase transmission scheme, where the coarse and refined sensing/channel estimation (CE) results are respectively obtained in the first phase (using scanning-based IRS reflection coefficients) and second phase (using optimized IRS reflection coefficients). For each phase, an angle-based sensing turbo variational Bayesian inference (AS-TVBI) algorithm, which combines the VBI, messaging passing and expectation-maximization (EM) methods, is developed to solve the considered joint location sensing and CE problem. The proposed algorithm effectively exploits the partial overlapping structured (POS) sparsity and 2-dimensional (2D) block sparsity inherent in the SAC channels to enhance the overall performance. Based on the estimation results from the first phase, we formulate a Cram\'{e}r-Rao bound (CRB) minimization problem for optimizing IRS reflection coefficients, and through proper reformulations, a low-complexity manifold-based optimization algorithm is proposed to solve this problem. Simulation results are provided to verify the superiority of the proposed transmission scheme and associated algorithms.
Published: 2023

19. Correction: SDF-1 promotes metastasis of NSCLC by enhancing chemoattraction of megakaryocytes through the PI3K/Akt signaling pathway

Author: Ai, Yiguo, Wan, Changhong, Chen, Zijian, Wang, Yansheng, Zhao, Wen, and Huang, Weizhe
Published: 2024
Full Text: View/download PDF

20. Impact of surgical approaches on stem position and hidden blood loss in total hip arthroplasty: minimally invasive vs. posterolateral

Author: Yuan, Gongwu, Xiao, Yaoguang, Li, Zhigang, Chen, Zijian, and Liu, Ximing
Published: 2024
Full Text: View/download PDF

21. Assessing the Impact of an Artificial Intelligence-Based Model for Intracranial Aneurysm Detection in CT Angiography on Patient Diagnosis and Outcomes (IDEAL Study)—a protocol for a multicenter, double-blinded randomized controlled trial

Author: Shi, Zhao, Hu, Bin, Lu, Mengjie, Chen, Zijian, Zhang, Manting, Yu, Yizhou, Zhou, Changsheng, Zhong, Jian, Wu, Bingqian, Zhang, Xueming, Wei, Yongyue, and Zhang, Long Jiang
Published: 2024
Full Text: View/download PDF

22. Well-defined in-textile photolithography towards permeable textile electronics

Author: Wang, Pengwei, Ma, Xiaohao, Lin, Zhiqiang, Chen, Fan, Chen, Zijian, Hu, Hong, Xu, Hailong, Zhang, Xinyi, Shi, Yuqing, Huang, Qiyao, Lin, Yuanjing, and Zheng, Zijian
Published: 2024
Full Text: View/download PDF

23. Sulcal Pattern Matching with the Wasserstein Distance

Author: Chen, Zijian, Das, Soumya, and Chung, Moo K.
Subjects: Quantitative Biology - Neurons and Cognition, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: We present the unified computational framework for modeling the sulcal patterns of human brain obtained from the magnetic resonance images. The Wasserstein distance is used to align the sulcal patterns nonlinearly. These patterns are topologically different across subjects making the pattern matching a challenge. We work out the mathematical details and develop the gradient descent algorithms for estimating the deformation field. We further quantify the image registration performance. This method is applied in identifying the differences between male and female sulcal patterns., Comment: In press in IEEE ISBI
Published: 2023

24. Optimizing vehicle edge computing task offloading at intersections: a fuzzy decision-making approach

Author: Zhang, Lei, Wang, Miao, Wang, Liqiang, Chen, Zijian, and Zhang, Hong
Published: 2025
Full Text: View/download PDF

25. SDF-1 promotes metastasis of NSCLC by enhancing chemoattraction of megakaryocytes through the PI3K/Akt signaling pathway

Author: Ai, Yiguo, Wan, Changhong, Chen, Zijian, Wang, Yansheng, Zhao, Wen, and Huang, Weizhe
Published: 2024
Full Text: View/download PDF

26. WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment

Author: Yao, Lin, Song, Jianfei, Xu, Ruizhuo, Yang, Yingfang, Chen, Zijian, and Deng, Yafeng
Subjects: Computer Science - Computation and Language, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Historically lower-level tasks such as automatic speech recognition (ASR) and speaker identification are the main focus in the speech field. Interest has been growing in higher-level spoken language understanding (SLU) tasks recently, like sentiment analysis (SA). However, improving performances on SLU tasks remains a big challenge. Basically, there are two main methods for SLU tasks: (1) Two-stage method, which uses a speech model to transfer speech to text, then uses a language model to get the results of downstream tasks; (2) One-stage method, which just fine-tunes a pre-trained speech model to fit in the downstream tasks. The first method loses emotional cues such as intonation, and causes recognition errors during ASR process, and the second one lacks necessary language knowledge. In this paper, we propose the Wave BERT (WaBERT), a novel end-to-end model combining the speech model and the language model for SLU tasks. WaBERT is based on the pre-trained speech and language model, hence training from scratch is not needed. We also set most parameters of WaBERT frozen during training. By introducing WaBERT, audio-specific information and language knowledge are integrated in the short-time and low-resource training process to improve results on the dev dataset of SLUE SA tasks by 1.15% of recall score and 0.82% of F1 score. Additionally, we modify the serial Continuous Integrate-and-Fire (CIF) mechanism to achieve the monotonic alignment between the speech and text modalities.
Published: 2022

27. Embedding of Functional Human Brain Networks on a Sphere

Author: Chung, Moo K. and Chen, Zijian
Subjects: Quantitative Biology - Other Quantitative Biology
Abstract: Human brain activity is often measured using the blood-oxygen-level dependent (BOLD) signals obtained through functional magnetic resonance imaging (fMRI). The strength of connectivity between brain regions is then measured as a Pearson correlation matrix. As the number of brain regions increases, the dimension of matrix increases. It becomes extremely cumbersome to even visualize and quantify such weighted complete networks. To remedy the problem, we propose to embed brain networks onto a sphere, which is a Riemannian manifold with constant positive curvature. The Matlab code for the spherical embedding is given in https://github.com/laplcebeltrami/sphericalMDS.
Published: 2022

28. Axial compression behaviour of circular concrete-filled stainless-clad bimetallic steel tubular stub columns

Author: Ban, Huiyong, Zeng, Zhuo, Chen, Zijian, Shi, Yongjiu, and Wang, Yuanqing
Published: 2024
Full Text: View/download PDF

29. Axial compression behaviour of square concrete-filled stainless-clad bimetallic steel tubular stub columns

Author: Zeng, Zhuo, Dai, Peng, Chen, Zijian, Shi, Yongjiu, and Ban, Huiyong
Published: 2024
Full Text: View/download PDF

30. Molecular representation learning based on Transformer with fixed-length padding method

Author: Wu, Yichu, Yang, Yang, Zhang, Ruimeng, Chen, Zijian, Jin, Meichen, Zou, Yi, Wang, Zhonghua, and Wu, Fanhong
Published: 2025
Full Text: View/download PDF

31. Digital manufacturing of perovskite materials and solar cells

Author: Wang, Zixuan, Chen, Zijian, Wang, Boyuan, Wu, Chuang, Zhou, Chao, Peng, Yang, Zhang, Xinyu, Ni, Zongming, Chung, Chi-yung, Chan, Ching-chuen, Yang, Jian, and Zhao, Haitao
Published: 2025
Full Text: View/download PDF

32. Fast insulation recovery characteristics of induced trigger gas gap switch

Author: DONG Bingbing, TAO Lei, LI Kang, and CHEN Zijian
Subjects: gas gap switch, induced breakdown, double pulse method, insulation recovery characteristics, gas breakdown voltage, strong electronegative gas sf6, Applications of electric power, TK4001-4102
Abstract: Gas gap switch has good application prospects in power systems, due to its quick response and simple structure. However, there is still little research on insulation recovery characteristic. Therefore, the double pulse method is used to study the influence of switch gap distance, trigger medium pressure and trigger medium type on the insulation recovery characteristics of gas switch. The experimental results show that the insulation recovery characteristics of induced trigger gas gap switch experience three stages: transition period, fast recovery period and saturation period. The duration of the saturation period is much longer than the sum of the previous two stages, and there was no 'platform phenomenon' in the rapid recovery period. With the decrease of gap distance, the insulation recovery rate of gas switch increases gradually, and the basic recovery time of gap insulation (insulation recovery coefficient RU > 90%) can be reduced by 50%. The influence of trigger medium pressure on the insulation recovery of gas switch is significant, and the influence characteristics on the insulation recovery process are different. Increasing the trigger medium pressure will slow down the insulation recovery process of gas switch. In 0.1~0.3 MPa compressed dry air, the basic recovery time of gas gap switch insulation corresponds to 11~40 ms. The strong electronegative gas SF6 has a significant effect on the insulation recovery rate of gas switches, and its insulation recovery rate is close to 4 times that in air. The research results provide theoretical guidance for the rapid insulation recovery of gas gap switch.
Published: 2024
Full Text: View/download PDF

33. Understand how machine learning impact lung cancer research from 2010 to 2021: A bibliometric analysis

Author: Chen Zijian, Liu Yangqi, Lin Zeying, and Huang Weizhe
Subjects: lung cancer, machine learning, bibliometric analysis, global trend, collaboration, burstiness, Medicine
Abstract: Advances in lung cancer research applying machine learning (ML) technology have generated many relevant literature. However, there is absence of bibliometric analysis review that aids a comprehensive understanding of this field and its progress. Present article for the first time performed a bibliometric analysis to clarify research status and focus from 2010 to 2021. In the analysis, a total of 2,312 relevant literature were searched and retrieved from the Web of Science Core Collection database. We conducted a bibliometric analysis and further visualization. During that time, exponentially growing annual publication and our model have shown a flourishing research prospect. Annual citation reached the peak in 2017. Researchers from United States and China have produced most of the relevant literature and strongest partnership between them. Medical image analysis and Nature appeared to bring more attention to the public. The computer-aided diagnosis, precision medicine, and survival prediction were the focus of research, reflecting the development trend at that period. ML did make a big difference in lung cancer research in the past decade.
Published: 2024
Full Text: View/download PDF

34. Synthesis and antifeedant activity of 3H-indole derived oxime esters and oxime ethers against cotton bollworm

Author: Yang, Yang, Wu, Yichu, Zhang, Ruimeng, Jin, Meichen, Chen, Zijian, Li, Chunyu, Xu, Shibo, Song, Lixing, Kai, Zhenpeng, Wang, Zhonghua, and Wu, Fanhong
Published: 2024
Full Text: View/download PDF

35. Sulfur release behavior and sulfur fixation mechanism during biomass microwave co-pyrolysis of Ascophyllum and rice straw

Author: Xu, Qing, Chen, Zijian, Xian, Shengxian, Wu, Yujian, and Li, Ming
Published: 2024
Full Text: View/download PDF

36. Plasmonic hybrid modes in a multifunctional ZIF-8 layer for high performance volatile organic compounds sensing

Author: Chen, Zijian, Jao, Chih-Yu, Hu, Kaiqiang, Luo, Yecheng, Ma, Churong, Jiang, Ruifen, Guo, Tuan, and Chen, Kai
Published: 2024
Full Text: View/download PDF

37. Iron supplementation and iron accumulation promote adipocyte thermogenesis through PGC1α-ATGL–mediated lipolysis

Author: Mai, Xudong, Liu, Yifan, Fan, Jigang, Xiao, Lanling, Liao, Miaomiao, Huang, Zhipeng, Chen, Zijian, Huang, Shaojun, Sun, Rui, Jiang, Xiaowan, Huang, Liujing, Sun, Jia, Xie, Liwei, and Chen, Hong
Published: 2024
Full Text: View/download PDF

38. Research on the thermo-hydro-mechanical coupling simulation and deformation spatiotemporal evolution for the entire process of oil shale in-situ mining

Author: Song, Shengyuan, Mei, Shidi, Hu, Ying, Li, Qiang, Chen, Zijian, and Zhang, Shuo
Published: 2024
Full Text: View/download PDF

39. HIST1H2BK predicts neoadjuvant-chemotherapy response and mediates 5-fluorouracil resistance of gastric cancer cells

Author: Chen, Zijian, Tang, Xiaocheng, Li, Weiyao, Li, Tuoyang, Huang, Jintuan, Jiang, Yingming, Qiu, Jun, Huang, Zhenze, Tan, Rongchang, Ji, Xiang, Lv, Li, Yang, Zuli, and Chen, Hao
Published: 2024
Full Text: View/download PDF

40. Efficient recycling of sewage water in a polyester integrated industry: A case study

Author: Xu, Dong, Wu, Shuangxia, Yan, Ailan, Chen, Zijian, Xu, Jiancai, Gu, Chaoguang, Qi, Yiting, and Wu, Shuyun
Published: 2024
Full Text: View/download PDF

41. Origins of formaldehyde in a mountainous background atmosphere of southern China

Author: Li, Qinqin, Gong, Daocheng, Chen, Zijian, Li, Jiangyong, Wu, Gengchen, Deng, Shuo, Wang, Hao, He, Lingyan, and Wang, Boguang
Published: 2024
Full Text: View/download PDF

42. Effect of rotational-die ECAP parameters on microstructure and mechanical properties of Mg97Y2Zn alloys

Author: Zhou, Xiaojie, Xiao, Songke, Li, Miao, Wang, Yanan, Lu, Xianzheng, Chen, Zijian, Guo, Zihang, Xiao, Hongchao, and Guo, Jing
Published: 2024
Full Text: View/download PDF

43. Exploring catalytic carbonization of MXene-encased fiber coatings for exceptionally flame-retarded flexible polyurethane foams

Author: Wang, Dingding, Chen, Zijian, Jiang, Zhikun, An, Yingying, Yu, Shaoyu, Zhang, Heng, Yang, Wei, Lu, Hongdian, Wei, Chunxiang, and Mao, Lei
Published: 2024
Full Text: View/download PDF

44. A robotic platform for the synthesis of colloidal nanocrystals

Author: Zhao, Haitao, Chen, Wei, Huang, Hao, Sun, Zhehao, Chen, Zijian, Wu, Lingjun, Zhang, Baicheng, Lai, Fuming, Wang, Zhuo, Adam, Mukhtar Lawan, Pang, Cheng Heng, Chu, Paul K., Lu, Yang, Wu, Tao, Jiang, Jun, Yin, Zongyou, and Yu, Xue-Feng
Published: 2023
Full Text: View/download PDF

45. Intercellular adhesion molecule 2 as a novel prospective tumor suppressor induced by ERG promotes ubiquitination-mediated radixin degradation to inhibit gastric cancer tumorigenicity and metastasis

Author: Tang, Xiaocheng, Huang, Jintuan, Jiang, Yingming, Qiu, Jun, Li, Tuoyang, Li, Weiyao, Chen, Zijian, Huang, Zhenze, Yu, Xihu, Yang, Tao, Ji, Xiang, Tan, Rongchang, lv, Li, Yang, Zuli, and Chen, Hao
Published: 2023
Full Text: View/download PDF

46. LAD1 promotes malignant progression by diminishing ubiquitin-dependent degradation of vimentin in gastric cancer

Author: Jiang, Yingming, Feng, Yanchun, Huang, Jintuan, Huang, Zhenze, Tan, Rongchang, Li, Tuoyang, Chen, Zijian, Tang, Xiaocheng, Qiu, Jun, Li, Chujun, Chen, Hao, and Yang, Zuli
Published: 2023
Full Text: View/download PDF

47. Confining donor conformation distributions for efficient thermally activated delayed fluorescence with fast spin-flipping

Author: Qiu, Weidong, Liu, Denghui, Li, Mengke, Cai, Xinyi, Chen, Zijian, He, Yanmei, Liang, Baoyan, Peng, Xiaomei, Qiao, Zhenyang, Chen, Jiting, Li, Wei, Pu, Junrong, Xie, Wentao, Wang, Zhiheng, Li, Deli, Gan, Yiyang, Jiao, Yihang, Gu, Qing, and Su, Shi-Jian
Published: 2023
Full Text: View/download PDF

48. Rapid Detection of Procymidone in Vegetables by Nanobody-Based Colloidal Gold Immunochromatography Assay

Author: HE Xiaoting, CHEN Zijian, HUANG Song, XIAO Zemiao, LIU Jia, ZHONG Min, WANG Hong, SHEN Yudong, XU Zhenlin
Subjects: nanobodies, procymidone, colloidal gold immunochromatography, rapid detection, Food processing and manufacture, TP368-456
Abstract: To explore the feasibility of applying nanobodies in colloidal gold immunochromatography assay (GICA) for small-molecule harmful substances, this study systematically investigated the effects of colloidal gold labeling parameters, colloidal gold working buffer and sample pretreatment methods on GICA using the pesticide procymidone as a model. The results showed that the size of colloidal gold particles, labeling pH and the amount of antibody used were the key factors to ensure the stability of the nanobody-gold-labeled probes. Under the optimal conditions, the limit of detection, half-maximal inhibitory concentration (IC50) and visual detection limit of GICA for procymidone were 0.44, 6.29 and 200 μg/L, respectively. The sensitivity was 5.1-fold higher than that of conventional monoclonal antibody-based GICA. The quick, easy, cheap, effective, rugged and safe (QuEChERS) procedure was used for the pretreatment of Chinese leek, cucumber and tomato samples. Based on the excellent organic solvent tolerance of nanobodies, the organic solvent blow-drying procedure was omitted. The average recoveries of the spiked samples ranged from 80.1% to 109.6%, and the results for the actual samples were consistent with those of gas chromatography-mass spectrometry (GC-MS). This study showed that nanobodies are a potential alternative to conventional antibodies for immunoassays for the rapid detection of small molecules. The proposed nanobody-based GICA can be used for the rapid screening of procymidone in vegetables with the advantages of high sensitivity, high accuracy, rapidity and simplicity.
Published: 2023
Full Text: View/download PDF

49. Facile spinning of tough and conductive eutectogel fibers via Li+-induced dense hydrogen-bond networks

Author: Fang, Lingtao, Zhang, Chi, Ge, Wenjiao, Rong, Mingming, Chen, Fan, Chen, Zijian, Wang, Xiaohui, Zheng, Zijian, and Huang, Qiyao
Published: 2023
Full Text: View/download PDF

50. Machine learning and robot-assisted synthesis of diverse gold nanorods via seedless approach

Author: Moses, Oyawale Adetunji, Adam, Mukhtar Lawan, Chen, Zijian, Ezeh, Collins Izuchukwu, Huang, Hao, Wang, Zhuo, Wang, Zixuan, Wang, Boyuan, Li, Wentao, Wang, Chensu, Yin, Zongyou, Lu, Yang, Yu, Xue-Feng, and Zhao, Haitao
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

648 results on '"Chen, ZiJian"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources