397 results on '"Li, Xiang-An"'
Search Results
52. Quality analysis in metal additive manufacturing with deep learning
- Author
-
Li, Xiang, Jia, Xiaodong, Yang, Qibo, and Lee, Jay
- Published
- 2020
- Full Text
- View/download PDF
53. Deep learning-based cross-sensor domain adaptation for fault diagnosis of electro-mechanical actuators
- Author
-
Siahpour, Shahin, Li, Xiang, and Lee, Jay
- Published
- 2020
- Full Text
- View/download PDF
54. Explaining Concept Drift of Deep Learning Models
- Author
-
Wang, Xiaolu, Wang, Zhi, Shao, Wei, Jia, Chunfu, Li, Xiang, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Vaidya, Jaideep, editor, Zhang, Xiao, editor, and Li, Jin, editor
- Published
- 2019
- Full Text
- View/download PDF
55. Machine learning applications in vadose zone hydrology: A review.
- Author
-
Li, Xiang, Nieber, John L., and Kumar, Vipin
- Subjects
ARTIFICIAL neural networks ,MACHINE learning ,DEEP learning ,HYDROLOGY ,RANDOM forest algorithms - Abstract
Machine learning (ML) has been broadly applied for vadose zone applications in recent years. This article provides a comprehensive review of such developments. ML applications for variables corresponding to different complex vadose zone processes are summarized mostly in a prediction context. By analyzing and assessing these applications, we discovered extensive usages of classic ML models with relatively limited applications of deep learning (DL) approaches in general. We also recognized a lack of benchmark datasets for soil property research as well as limited integration of physics‐based vadose zone principles into the ML approaches. To facilitate this interdisciplinary research of ML in vadose zone characterization and processes, a paradigm of knowledge‐guided machine learning is suggested along with other data‐driven and ML model‐based research suggestions to advance future research. Core Ideas: Random forest and artificial neural network are two widely applied machine learning options for predicting vadose zone studies.A benchmark dataset is missing in soil property studies.We suggest vadose zone scientists explore more deep learning options and expand knowledge‐guided machine learning implementations. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
56. Deep learning‐based target decomposition for markerless lung tumor tracking in radiotherapy.
- Author
-
Fu, Yabo, Zhang, Pengpeng, Fan, Qiyong, Cai, Weixing, Pham, Hai, Rimner, Andreas, Cuaron, John, Cervino, Laura, Moran, Jean M., Li, Tianfang, and Li, Xiang
- Subjects
DEEP learning ,LUNGS ,LUNG tumors ,GENERATIVE adversarial networks ,GROUND motion ,LINEAR accelerators - Abstract
Background: In radiotherapy, real‐time tumor tracking can verify tumor position during beam delivery, guide the radiation beam to target the tumor, and reduce the chance of a geometric miss. Markerless kV x‐ray image‐based tumor tracking is challenging due to the low tumor visibility caused by tumor‐obscuring structures. Developing a new method to enhance tumor visibility for real‐time tumor tracking is essential. Purpose: To introduce a novel method for markerless kV image‐based tracking of lung tumors via deep learning‐based target decomposition. Methods: We utilized a conditional Generative Adversarial Network (cGAN), known as Pix2Pix, to build a patient‐specific model and generate the synthetic decomposed target image (sDTI) to enhance tumor visibility on the real‐time kV projection images acquired by the onboard kV imager equipped on modern linear accelerators. We used 4DCT simulation images to generate the digitally reconstructed radiograph (DRR) and DTI image pairs for model training. We augmented the training dataset by randomly shifting the 4DCT in the superior‐inferior, anterior‐posterior, and left‐right directions during the DRR and DTI generation process. We performed real‐time 2D tumor tracking via template matching between the DTI generated from the CT simulation and the sDTI generated from the real‐time kV projection images. We validated the proposed method using nine patients' datasets with implanted beacons near the tumor. Results: The sDTI can effectively improve the image contrast around the lung tumors on the kV projection images for the nine patients. With the beacon motion as ground truth, the tracking errors were on average 0.8 ± 0.7 mm in the superior‐inferior (SI) direction and 0.9 ± 0.8 mm in the in‐plane left‐right (IPLR) direction. The percentage of successful tracking, defined as a tracking error less than 2 mm in the SI direction, is 92.2% on the 4312 tested images. The patient‐specific model took approximately 12 h to train. During testing, it took approximately 35 ms to generate one sDTI, and 13 ms to perform the tumor tracking using template matching. Conclusions: Our method offers the potential solution for nearly real‐time markerless lung tumor tracking. It achieved a high level of accuracy and an impressive tracking rate. Further development of 3D lung tumor tracking is warranted. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
57. Fire video image detection based on convolutional neural network
- Author
-
Zhang Jie, Sui Yang, Li Qiang, Li Xiang, and Dong Wei
- Subjects
deep learning ,fire identification ,Caffe framework ,convolutional neural network ,generalization ability ,Electronics ,TK7800-8360 - Abstract
With the development of computer technology, fire image processing technology combining computer vision, machine learning, deep learning and other technologies has been widely studied and applied. Aiming at the complex preprocessing process and high false positive rate of traditional image processing methods, this paper proposes a method based on deep convolutional neural network model for fire detection, which reduces complex preprocessing links and integrates the whole fire identification process into one single depth neural network for easy training and optimization. In view of the problem of fire detection caused by similar fire scenes in the identification process, this paper uses the motion characteristics of fire to innovatively propose the combination of fire frame position changes before and after the fire video to eliminate the interference of lights and other similar fire scenes. After comparing many open learning open source frameworks, this paper chooses Caffe framework for training and testing. The experimental results show that the method realizes the recognition and localization of fire images. This method is suitable for different fire scenarios and has good generalization ability and anti-interference ability.
- Published
- 2019
- Full Text
- View/download PDF
58. Teach to Hash: A Deep Supervised Hashing Framework with Data Selection
- Author
-
Li, Xiang, Ma, Chao, Yang, Jie, Qiao, Yu, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Cheng, Long, editor, Leung, Andrew Chi Sing, editor, and Ozawa, Seiichi, editor
- Published
- 2018
- Full Text
- View/download PDF
59. Emotion Analysis for the Upcoming Response in Open-Domain Human-Computer Conversation
- Author
-
Li, Xiang, Zhang, Ming, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, U, Leong Hou, editor, and Xie, Haoran, editor
- Published
- 2018
- Full Text
- View/download PDF
60. Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation
- Author
-
Zhang, Zhenyu, Cui, Zhen, Xu, Chunyan, Jie, Zequn, Li, Xiang, Yang, Jian, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Ferrari, Vittorio, editor, Hebert, Martial, editor, Sminchisescu, Cristian, editor, and Weiss, Yair, editor
- Published
- 2018
- Full Text
- View/download PDF
61. Modeling 4D fMRI Data via Spatio-Temporal Convolutional Neural Networks (ST-CNN)
- Author
-
Zhao, Yu, Li, Xiang, Zhang, Wei, Zhao, Shijie, Makkie, Milad, Zhang, Mo, Li, Quanzheng, Liu, Tianming, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Frangi, Alejandro F., editor, Schnabel, Julia A., editor, Davatzikos, Christos, editor, Alberola-López, Carlos, editor, and Fichtinger, Gabor, editor
- Published
- 2018
- Full Text
- View/download PDF
62. A Novel Reference-Based and Gradient-Guided Deep Learning Model for Daily Precipitation Downscaling
- Author
-
Li Xiang, Jie Xiang, Jiping Guan, Fuhan Zhang, Yanling Zhao, and Lifeng Zhang
- Subjects
precipitation downscaling ,deep learning ,super-resolution ,Meteorology. Climatology ,QC851-999 - Abstract
The spatial resolution of precipitation predicted by general circulation models is too coarse to meet current research and operational needs. Downscaling is one way to provide finer resolution data at local scales. The single-image super-resolution method in the computer vision field has made great strides lately and has been applied in various fields. In this article, we propose a novel reference-based and gradient-guided deep learning model (RBGGM) to downscale daily precipitation considering the discontinuity of precipitation and ill-posed nature of downscaling. Global Precipitation Measurement Mission (GPM) precipitation data, variables in ERA5 re-analysis data, and topographic data are selected to perform the downscaling, and a residual dense attention block is constructed to extract features of them. By exploring the discontinuous feature of precipitation, we introduce gradient feature to reconstruct precipitation distribution. We also extract the feature of high-resolution monthly precipitation as a reference feature to resolve the ill-posed nature of downscaling. Extensive experimental results on benchmark data sets demonstrate that our proposed model performs better than other baseline methods. Furthermore, we construct a daily precipitation downscaling data set based on GPM precipitation data, ERA5 re-analysis data and topographic data.
- Published
- 2022
- Full Text
- View/download PDF
63. Self-paced Convolutional Neural Network for Computer Aided Detection in Medical Imaging Analysis
- Author
-
Li, Xiang, Zhong, Aoxiao, Lin, Ming, Guo, Ning, Sun, Mu, Sitek, Arkadiusz, Ye, Jieping, Thrall, James, Li, Quanzheng, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Wang, Qian, editor, Shi, Yinghuan, editor, Suk, Heung-Il, editor, and Suzuki, Kenji, editor
- Published
- 2017
- Full Text
- View/download PDF
64. Joint Emoji Classification and Embedding Learning
- Author
-
Li, Xiang, Yan, Rui, Zhang, Ming, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Chen, Lei, editor, Jensen, Christian S., editor, Shahabi, Cyrus, editor, Yang, Xiaochun, editor, and Lian, Xiang, editor
- Published
- 2017
- Full Text
- View/download PDF
65. Thyroid Ultrasound Image Database and Marker Mask Inpainting Method for Research and Development.
- Author
-
Li, Xiang, Fu, Chong, Xu, Sen, and Sham, Chiu-Wing
- Subjects
- *
ULTRASONIC imaging , *IMAGE databases , *THYROID gland , *COMPUTER-aided diagnosis , *INPAINTING , *THYROID diseases , *THYROID cancer - Abstract
The main objective of this study was to build a rich and high-quality thyroid ultrasound image database (TUD) for computer-aided diagnosis (CAD) systems to support accurate diagnosis and prognostic modeling of thyroid disorders. Because most of the raw thyroid ultrasound images contain artificial markers, which seriously affect the robustness of CAD systems because of their strong prior location information, we propose a marker mask inpainting (MMI) method to erase artificial markers and improve image quality. First, a set of thyroid ultrasound images were collected from the General Hospital of the Northern Theater Command. Then, two modules were designed in MMI, namely, the marker detection (MD) module and marker erasure (ME) module. The MD module detects all markers in the image and stores them in a binary mask. According to the binary mask, the ME module erases the markers and generates an unmarked image. Finally, a new TUD based on the marked images and unmarked images was built. The TUD is carefully annotated and statistically analyzed by professional physicians to ensure accuracy and consistency. Moreover, several normal thyroid gland images and some ancillary information on benign and malignant nodules are provided. Several typical segmentation models were evaluated on the TUD. The experimental results revealed that our TUD can facilitate the development of more accurate CAD systems for the analysis of thyroid nodule-related lesions in ultrasound images. The effectiveness of our MMI method was determined in quantitative experiments. The rich and high-quality resource TUD promotes the development of more effective diagnostic and treatment methods for thyroid diseases. Furthermore, MMI for erasing artificial markers and generating unmarked images is proposed to improve the quality of thyroid ultrasound images. Our TUD database is available at https://github.com/NEU-LX/TUD-Datebase. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
66. Graph network based deep learning of bandgaps.
- Author
-
Li, Xiang-Guo, Blaiszik, Ben, Schwarting, Marcus Emory, Jacobs, Ryan, Scourtas, Aristana, Schmidt, K. J., Voyles, Paul M., and Morgan, Dane
- Subjects
- *
MACHINE learning , *CRYSTAL symmetry , *DENSITY functional theory , *DEEP learning , *INFORMATION modeling , *DATA structures , *PREDICTION models - Abstract
Recent machine learning models for bandgap prediction that explicitly encode the structure information to the model feature set significantly improve the model accuracy compared to both traditional machine learning and non-graph-based deep learning methods. The ongoing rapid growth of open-access bandgap databases can benefit such model construction not only by expanding their domain of applicability but also by requiring constant updating of the model. Here, we build a new state-of-the-art multi-fidelity graph network model for bandgap prediction of crystalline compounds from a large bandgap database of experimental and density functional theory (DFT) computed bandgaps with over 806 600 entries (1500 experimental, 775 700 low-fidelity DFT, and 29 400 high-fidelity DFT). The model predicts bandgaps with a 0.23 eV mean absolute error in cross validation for high-fidelity data, and including the mixed data from all different fidelities improves the prediction of the high-fidelity data. The prediction error is smaller for high-symmetry crystals than for low symmetry crystals. Our data are published through a new cloud-based computing environment, called the "Foundry," which supports easy creation and revision of standardized data structures and will enable cloud accessible containerized models, allowing for continuous model development and data accumulation in the future. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
67. A Data-Driven Approach to Generating Stochastic Mesoscale 3D Shale Volume Elements From 2D SEM Images and Predicting the Equivalent Modulus.
- Author
-
Hong, Yang, Li, Xiang, Gao, Yue, Liu, Zhanli, Yan, Ziming, and Zhuang, Zhuo
- Subjects
DEEP learning ,SHALE ,SCANNING electron microscopy ,ARTIFICIAL neural networks ,SHALE oils ,CONVOLUTIONAL neural networks ,PATTERN recognition systems - Published
- 2023
- Full Text
- View/download PDF
68. Image Composition Method Based on a Spatial Position Analysis Network.
- Author
-
Li, Xiang, Teng, Guowei, An, Ping, and Yao, Haiyan
- Subjects
RESEARCH personnel ,GENERALIZATION - Abstract
Realistic image composition aims to composite new images by fusing a source object into a target image. It is a challenging problem due to the complex multi-task framework, including sensible object placement, appearance consistency, shadow generation, etc. Most existing researchers attempt to address one of the issues. Especially before compositing, there is no matching assignment between the source object and target image, which often leads to unreasonable results. To address the issues above, we consider image composition as an image generation problem and propose a deep adversarial learning network via spatial position analysis. We target the analysis network segment and classify the objects in target images. A spatial alignment network matches the segmented objects with the source objects, and predicts a sensible placement position, and an adversarial network generates a realistic composite image with the shadow and reflection of the source object. Furthermore, we use the classification information of target objects to filter out unreasonable image compositing. Moreover, we introduce a new test set to evaluate the network generalization for our multi-task image composition dataset. Extensive experimental results of the SHU (Shanghai University) dataset demonstrate that our deep spatial position analysis network remarkably enhances the compositing performance in realistic, shadow, and reflection generations. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
69. Deep learning‐enabled discovery and characterization of HKT genes in Spartina alterniflora.
- Author
-
Yang, Maogeng, Chen, Shoukun, Huang, Zhangping, Gao, Shang, Yu, Tingxi, Du, Tingting, Zhang, Hao, Li, Xiang, Liu, Chun‐Ming, Chen, Shihua, and Li, Huihui
- Subjects
SPARTINA alterniflora ,SALT-tolerant crops ,CULTIVARS ,GENES ,DEEP learning ,RED rice ,PHRAGMITES - Abstract
SUMMARY: Spartina alterniflora is a halophyte that can survive in high‐salinity environments, and it is phylogenetically close to important cereal crops, such as maize and rice. It is of scientific interest to understand why S. alterniflora can live under such extremely stressful conditions. The molecular mechanism underlying its high‐saline tolerance is still largely unknown. Here we investigated the possibility that high‐affinity K+ transporters (HKTs), which function in salt tolerance and maintenance of ion homeostasis in plants, are responsible for salt tolerance in S. alterniflora. To overcome the imprecision and unstable of the gene screening method caused by the conventional sequence alignment, we used a deep learning method, DeepGOPlus, to automatically extract sequence and protein characteristics from our newly assemble S. alterniflora genome to identify SaHKTs. Results showed that a total of 16 HKT genes were identified. The number of S. alterniflora HKTs (SaHKTs) is larger than that in all other investigated plant species except wheat. Phylogenetically related SaHKT members had similar gene structures, conserved protein domains and cis‐elements. Expression profiling showed that most SaHKT genes are expressed in specific tissues and are differentially expressed under salt stress. Yeast complementation expression analysis showed that type I members SaHKT1;2, SaHKT1;3 and SaHKT1;8 and type II members SaHKT2;1, SaHKT2;3 and SaHKT2;4 had low‐affinity K+ uptake ability and that type II members showed stronger K+ affinity than rice and Arabidopsis HKTs, as well as most SaHKTs showed preference for Na+ transport. We believe the deep learning‐based methods are powerful approaches to uncovering new functional genes, and the SaHKT genes identified are important resources for breeding new varieties of salt‐tolerant crops. Significance Statement: The great potential of deep learning‐based methods for uncovering the HKT genes in S. alterniflora was demonstrated, which opens the door for utilizing SaHKTs in a wide range of species to increase the stress resilience of staple crops. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
70. APF-GAN: Exploring asymmetric pre-training and fine-tuning strategy for conditional generative adversarial network.
- Author
-
Li, Yuxuan, Yang, Lingfeng, and Li, Xiang
- Subjects
GENERATIVE adversarial networks ,PATTERN recognition systems ,INFORMATION storage & retrieval systems ,OBJECT recognition (Computer vision) ,LANDSCAPE assessment ,COMPUTER vision ,DEEP learning - Published
- 2023
- Full Text
- View/download PDF
71. Deep Learning to Predict the Cell Proliferation and Prognosis of Non-Small Cell Lung Cancer Based on FDG-PET/CT Images.
- Author
-
Hu, Dehua, Li, Xiang, Lin, Chao, Wu, Yonggang, and Jiang, Hao
- Subjects
- *
NON-small-cell lung carcinoma , *DEEP learning , *CELL proliferation , *PROGNOSIS , *POSITRON emission tomography - Abstract
(1) Background: Cell proliferation (Ki-67) has important clinical value in the treatment and prognosis of non-small cell lung cancer (NSCLC). However, current detection methods for Ki-67 are invasive and can lead to incorrect results. This study aimed to explore a deep learning classification model for the prediction of Ki-67 and the prognosis of NSCLC based on FDG-PET/CT images. (2) Methods: The FDG-PET/CT scan results of 159 patients with NSCLC confirmed via pathology were analyzed retrospectively, and the prediction models for the Ki-67 expression level based on PET images, CT images and PET/CT combined images were constructed using Densenet201. Based on a Ki-67 high expression score (HES) obtained from the prediction model, the survival rate of patients with NSCLC was analyzed using Kaplan–Meier and univariate Cox regression. (3) Results: The statistical analysis showed that Ki-67 expression was significantly correlated with clinical features of NSCLC, including age, gender, differentiation state and histopathological type. After a comparison of the three models (i.e., the PET model, the CT model, and the FDG-PET/CT combined model), the combined model was found to have the greatest advantage in Ki-67 prediction in terms of AUC (0.891), accuracy (0.822), precision (0.776) and specificity (0.902). Meanwhile, our results indicated that HES was a risk factor for prognosis and could be used for the survival prediction of NSCLC patients. (4) Conclusions: The deep-learning-based FDG-PET/CT radiomics classifier provided a novel non-invasive strategy with which to evaluate the malignancy and prognosis of NSCLC. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
72. Multimodel Collaboration to Combat Malicious Domain Fluxing.
- Author
-
Nie, Yuanping, Liu, Shuangshuang, Qian, Cheng, Deng, Congyi, Li, Xiang, Wang, Zhi, and Kuang, Xiaohui
- Subjects
ARTIFICIAL neural networks ,DEEP learning ,STATISTICAL learning ,REINFORCEMENT learning ,MACHINE learning - Abstract
This paper proposes a novel domain-generation-algorithm detection framework based on statistical learning that integrates the detection capabilities of multiple heterogeneous models. The framework includes both traditional machine learning methods based on artificial features and deep learning methods, comprehensively analyzing 34 artificial features and advanced features extracted from deep neural networks. Additionally, the framework evaluates the predictions of the base models based on the fit of the samples to each type of sample set and a predefined significance level. The predictions of the base models are statistically analyzed, and the final decision is made using strategies such as voting, confidence, and credibility. Experimental results demonstrate that the DGA detection framework based on statistical learning achieves a higher detection rate compared to the underlying base models, with accuracy, precision, recall, and F1 scores reaching 0.979, 0.977, 0.981, and 0.979, respectively. The framework also exhibits a stronger adaptability to unknown domains and a certain level of robustness against concept drift attacks. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
73. A Study of Contrastive Learning Algorithms for Sentence Representation Based on Simple Data Augmentation.
- Author
-
Liu, Xiaodong, Gong, Wenyin, Li, Yuxin, Li, Yanchi, and Li, Xiang
- Subjects
MACHINE learning ,DATA augmentation ,DEEP learning ,LANGUAGE models ,ENGLISH language ,ALGORITHMS - Abstract
In the era of deep learning, representational text-matching algorithms based on BERT and its variant models have become mainstream and are limited by the sentence vectors generated by the BERT model, and the SimCSE algorithm proposed in 2021 has improved the sentence vector quality to a certain extent. In this paper, to address the problem that the SimCSE algorithm has—that the greater the difference in sentence length, the smaller the probability that the sentence pairs are similar—an EdaCSE algorithm is proposed to perturb the sentence length using a simple data enhancement method without affecting the semantics of the sentences. The perturbation is applied to the sentence length by adding meaningless English punctuation marks to the original sentence so that the model no longer tends to recognise sentences of similar length as similar sentences. Based on the BERT series of models, experiments were conducted on five different datasets, and the experiments proved that the EdaCSE method improves an average of 1.67, 0.84, and 1.08 on the five datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
74. Patient specific prior cross attention for kV decomposition in paraspinal motion tracking.
- Author
-
He, Xiuxiu, Cai, Weixing, Li, Feifei, Fan, Qiyong, Zhang, Pengpeng, Cuaron, John J., Cerviño, Laura I., Moran, Jean M., Li, Xiang, and Li, Tianfang
- Subjects
DEEP learning ,MACHINE learning ,GENERATIVE adversarial networks ,X-ray imaging ,CONE beam computed tomography ,IMAGE-guided radiation therapy - Abstract
Background: X‐ray image quality is critical for accurate intrafraction motion tracking in radiation therapy. Purpose: This study aims to develop a deep‐learning algorithm to improve kV image contrast by decomposing the image into bony and soft tissue components. In particular, we designed a priori attention mechanism in the neural network framework for optimal decomposition. We show that a patient‐specific prior cross‐attention (PCAT) mechanism can boost the performance of kV image decomposition. We demonstrate its use in paraspinal SBRT motion tracking with online kV imaging. Methods: Online 2D kV projections were acquired during paraspinal SBRT for patient motion monitoring. The patient‐specific prior images were generated by randomly shifting and rotating spine‐only DRR created from the setup CBCT, simulating potential motions. The latent features of the prior images were incorporated into the PCAT using multi‐head cross attention. The neural network aimed to learn to selectively amplify the transmission of the projection image features that correlate with features of the priori. The PCAT network structure consisted of (1) a dual‐branch generator that separates the spine and soft tissue component of the kV projection image and (2) a dual‐function discriminator (DFD) that provides the realness score of the predicted images. For supervision, we used a loss combining mean absolute error loss, discriminator loss, perceptual loss, total variation, and mean squared error loss for soft tissues. The proposed PCAT approach was benchmarked against previous work using the ResNet generative adversarial network (ResNetGAN) without prior information. Results: The trained PCAT had improved performance in effectively retaining and preserving the spine structure and texture information while suppressing the soft tissues from the kV projection images. The decomposed spine‐only x‐ray images had the submillimeter matching accuracy at all beam angles. The decomposed spine‐only x‐ray significantly reduced the maximum errors to 0.44 mm (<2 pixels) in comparison to 0.92 mm (∼4 pixels) of ResNetGAN. The PCAT decomposed spine images also had higher PSNR and SSIM (p‐value < 0.001). Conclusion: The PCAT selectively learned the important latent features by incorporating the patient‐specific prior knowledge into the deep learning algorithm, significantly improving the robustness of the kV projection image decomposition, and leading to improved motion tracking accuracy in paraspinal SBRT. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
75. Survey on Explainable AI: From Approaches, Limitations and Applications Aspects.
- Author
-
Yang, Wenli, Wei, Yuchen, Wei, Hanyu, Chen, Yanyu, Huang, Guan, Li, Xiang, Li, Renjie, Yao, Naimeng, Wang, Xinyi, Gu, Xiaotong, Amin, Muhammad Bilal, and Kang, Byeong
- Subjects
ARTIFICIAL intelligence ,DEEP learning ,MACHINE learning ,ARTIFICIAL neural networks ,DECISION making - Abstract
In recent years, artificial intelligence (AI) technology has been used in most if not all domains and has greatly benefited our lives. While AI can accurately extract critical features and valuable information from large amounts of data to help people complete tasks faster, there are growing concerns about the non-transparency of AI in the decision-making process. The emergence of explainable AI (XAI) has allowed humans to better understand and control AI systems, which is motivated to provide transparent explanations for the decisions made by AI. This article aims to present a comprehensive overview of recent research on XAI approaches from three well-defined taxonomies. We offer an in-depth analysis and summary of the status and prospects of XAI applications in several key areas where reliable explanations are urgently needed to avoid mistakes in decision-making. We conclude by discussing XAI's limitations and future research directions. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
76. Large depth-of-field ultra-compact microscope by progressive optimization and deep learning.
- Author
-
Zhang, Yuanlong, Song, Xiaofei, Xie, Jiachen, Hu, Jing, Chen, Jiawei, Li, Xiang, Zhang, Haiyu, Zhou, Qiqun, Yuan, Lekang, Kong, Chui, Shen, Yibing, Wu, Jiamin, Fang, Lu, and Dai, Qionghai
- Subjects
DEEP learning ,IMAGING systems ,OPTICAL microscopes ,DIFFRACTIVE optical elements ,CELL phones ,MICROSCOPES - Abstract
The optical microscope is customarily an instrument of substantial size and expense but limited performance. Here we report an integrated microscope that achieves optical performance beyond a commercial microscope with a 5×, NA 0.1 objective but only at 0.15 cm
3 and 0.5 g, whose size is five orders of magnitude smaller than that of a conventional microscope. To achieve this, a progressive optimization pipeline is proposed which systematically optimizes both aspherical lenses and diffractive optical elements with over 30 times memory reduction compared to the end-to-end optimization. By designing a simulation-supervision deep neural network for spatially varying deconvolution during optical design, we accomplish over 10 times improvement in the depth-of-field compared to traditional microscopes with great generalization in a wide variety of samples. To show the unique advantages, the integrated microscope is equipped in a cell phone without any accessories for the application of portable diagnostics. We believe our method provides a new framework for the design of miniaturized high-performance imaging systems by integrating aspherical optics, computational optics, and deep learning. Traditional optical microscope, while bulky, often fails to deliver optimal performance. Here, the authors have engineered an integrated microscope of 0.15 cm3 in volume and a weight of 0.5 g, which outperforms a commercial microscope and can be seamlessly integrated with a smartphone. [ABSTRACT FROM AUTHOR]- Published
- 2023
- Full Text
- View/download PDF
77. Towards optimal deep fusion of imaging and clinical data via a model‐based description of fusion quality.
- Author
-
Wang, Yuqi, Li, Xiang, Konanur, Meghana, Konkel, Brandon, Seyferth, Elisabeth, Brajer, Nathan, Liu, Jian‐Guo, Bashir, Mustafa R., and Lafata, Kyle J.
- Subjects
- *
IMAGE fusion , *CONVOLUTIONAL neural networks , *DIAGNOSTIC imaging , *PROBABILITY density function , *DATA structures , *GAUSSIAN processes , *DATA fusion (Statistics) - Abstract
Background: Due to intrinsic differences in data formatting, data structure, and underlying semantic information, the integration of imaging data with clinical data can be non‐trivial. Optimal integration requires robust data fusion, that is, the process of integrating multiple data sources to produce more useful information than captured by individual data sources. Here, we introduce the concept of fusion quality for deep learning problems involving imaging and clinical data. We first provide a general theoretical framework and numerical validation of our technique. To demonstrate real‐world applicability, we then apply our technique to optimize the fusion of CT imaging and hepatic blood markers to estimate portal venous hypertension, which is linked to prognosis in patients with cirrhosis of the liver. Purpose: To develop a measurement method of optimal data fusion quality deep learning problems utilizing both imaging data and clinical data. Methods: Our approach is based on modeling the fully connected layer (FCL) of a convolutional neural network (CNN) as a potential function, whose distribution takes the form of the classical Gibbs measure. The features of the FCL are then modeled as random variables governed by state functions, which are interpreted as the different data sources to be fused. The probability density of each source, relative to the probability density of the FCL, represents a quantitative measure of source‐bias. To minimize this source‐bias and optimize CNN performance, we implement a vector‐growing encoding scheme called positional encoding, where low‐dimensional clinical data are transcribed into a rich feature space that complements high‐dimensional imaging features. We first provide a numerical validation of our approach based on simulated Gaussian processes. We then applied our approach to patient data, where we optimized the fusion of CT images with blood markers to predict portal venous hypertension in patients with cirrhosis of the liver. This patient study was based on a modified ResNet‐152 model that incorporates both images and blood markers as input. These two data sources were processed in parallel, fused into a single FCL, and optimized based on our fusion quality framework. Results: Numerical validation of our approach confirmed that the probability density function of a fused feature space converges to a source‐specific probability density function when source data are improperly fused. Our numerical results demonstrate that this phenomenon can be quantified as a measure of fusion quality. On patient data, the fused model consisting of both imaging data and positionally encoded blood markers at the theoretically optimal fusion quality metric achieved an AUC of 0.74 and an accuracy of 0.71. This model was statistically better than the imaging‐only model (AUC = 0.60; accuracy = 0.62), the blood marker‐only model (AUC = 0.58; accuracy = 0.60), and a variety of purposely sub‐optimized fusion models (AUC = 0.61–0.70; accuracy = 0.58–0.69). Conclusions: We introduced the concept of data fusion quality for multi‐source deep learning problems involving both imaging and clinical data. We provided a theoretical framework, numerical validation, and real‐world application in abdominal radiology. Our data suggests that CT imaging and hepatic blood markers provide complementary diagnostic information when appropriately fused. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
78. Research on Intelligent Wheelchair Attitude-Based Adjustment Method Based on Action Intention Recognition.
- Author
-
Cui, Jianwei, Huang, Zizheng, Li, Xiang, Cui, Linwei, Shang, Yucheng, and Tong, Liyan
- Subjects
ELECTRIC wheelchairs ,WHEELCHAIRS ,ELECTRIC actuators ,INTENTION ,DEEP learning ,HUMAN body - Abstract
At present, research on intelligent wheelchairs mostly focuses on motion control, while research on attitude-based adjustment is relatively insufficient. The existing methods for adjusting wheelchair posture generally lack collaborative control and good human–machine collaboration. This article proposes an intelligent wheelchair posture-adjustment method based on action intention recognition by studying the relationship between the force changes on the contact surface between the human body and the wheelchair and the action intention. This method is applied to a multi-part adjustable electric wheelchair, which is equipped with multiple force sensors to collect pressure information from various parts of the passenger's body. The upper level of the system converts the pressure data into the form of a pressure distribution map, extracts the shape features using the VIT deep learning model, identifies and classifies them, and ultimately identifies the action intentions of the passengers. Based on different action intentions, the electric actuator is controlled to adjust the wheelchair posture. After testing, this method can effectively collect the body pressure data of passengers, with an accuracy of over 95% for the three common intentions of lying down, sitting up, and standing up. The wheelchair can adjust its posture based on the recognition results. By adjusting the wheelchair posture through this method, users do not need to wear additional equipment and are less affected by the external environment. The target function can be achieved with simple learning, which has good human–machine collaboration and can solve the problem of some people having difficulty adjusting the wheelchair posture independently during wheelchair use. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
79. An Intelligent Boosting and Decision-Tree-Regression-Based Score Prediction (BDTR-SP) Method in the Reform of Tertiary Education Teaching.
- Author
-
Zhu, Ling, Liu, Guangyu, Lv, Shuang, Chen, Dongjie, Chen, Zhihong, and Li, Xiang
- Subjects
EDUCATIONAL change ,POSTSECONDARY education ,INFORMATION technology ,INTELLIGENT tutoring systems ,LEARNING strategies ,ARTIFICIAL intelligence ,DEEP learning - Abstract
The reform of tertiary education teaching promotes teachers to adjust timely teaching plans based on students' learning feedback in order to improve teaching performance. Thefore, learning score prediction is a key issue in process of the reform of tertiary education teaching. With the development of information and management technologies, a lot of teaching data are generated as the scale of online and offline education expands. However, a teacher or educator does not have a comprehensive dataset in practice, which challenges his/her ability to predict the students' learning performance from the individual's viewpoint. How to overcome the drawbacks of small samples is an open issue. To this end, it is desirable that an effective artificial intelligent tool is designed to help teachers or educators predict students' scores well. We propose a boosting and decision-tree-regression-based score prediction (BDTR-SP) model, which relies on an ensemble learning structure with base learners of decision tree regression (DTR) to improve the prediction accuracy. Experiments on small samples are conducted to examine the important features that affect students' scores. The results show that the proposed model has advantages over its peer in terms of prediction correctness. Moreover, the predicted results are consistent with the actual facts implied in the original dataset. The proposed BDTR-SP method aids teachers and students to predict students' performance in the on-going courses in order to adjust the teaching and learning strategies, plans and practices in advance, enhancing the teaching and learning quality. Therefore, the integration of information technology and artificial intelligence into teaching and learning practices is able to push forward the reform of tertiary education teaching. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
80. Applied Computing and Artificial Intelligence.
- Author
-
Li, Xiang, Zhang, Shuo, and Zhang, Wei
- Subjects
- *
ARTIFICIAL intelligence , *DEEP learning , *RANDOM forest algorithms , *DIFFERENTIAL evolution , *CONVOLUTIONAL neural networks , *REMAINING useful life , *DRIVER assistance systems - Abstract
Applied computing and artificial intelligence methods have been attracting growing interest in recent years due to their effectiveness in solving technical problems. Hao et al. [[14]] present an unsupervised fault diagnosis methodology to leverage the generated MPCs of different working conditions to diagnose the actual unlabeled MPCs. The paper by Ainapure et al. [[17]] proposes a new cross-domain fault diagnosis method with enhanced robustness. The paper by Saeed et al. [[1]] proposes an approach to building an AutoML data-dependent CNN model (DeepPCANet) customized for DR screening automatically. [Extracted from the article]
- Published
- 2023
- Full Text
- View/download PDF
81. Transform paper-based cadastral data into digital systems using GIS and end-to-end deep learning techniques.
- Author
-
Mango, Joseph, Wang, Moyang, Mu, Senlin, Zhang, Di, Ngondo, Jamila, Valerian-Peter, Regina, Claramunt, Christophe, and Li, Xiang
- Subjects
DEEP learning ,GEOGRAPHIC information systems ,VECTOR data ,INTERNET stores ,ARTIFICIAL neural networks - Abstract
Digital systems storing cadastral data in vector format are considered effective due to their ability of offering interactive services to citizens and other land-related systems. The adoption of such systems is ubiquitous, but when adopted, they create two non-compatible systems with paper-based cadastral systems whose information needs to be digitised. This study proposes a new approach that is fast and accurate for transforming paper-based cadastral data into digital systems. The proposed method involves deep-learning techniques of the LCNN and ResNet-50 for detecting cadastral parcels and their numbers, respectively, from the cadastral plans. It also contains four functions defined to speed up transformations and compilations of the cadastral plan's data in digital systems. The LCNN is trained and validated with 968 samples. The ResNet-50 is trained and validated with 106,000 samples. The Structural-Average-Precision ( sAP 10 ) achieved with the LCNN was 0.9057. The Precision, Recall and F1-Score achieved with the ResNet-50 were 0.9650, 0.9648 and 0.9649, respectively. These results confirmed that the new method is accurate enough for implementation, and we tested it with a huge set of data from Tanzania. Its performance from the experimented data shows that the proposed method could effectively transform paper-based cadastral data into digital systems. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
82. MIU-Net: MIX-Attention and Inception U-Net for Histopathology Image Nuclei Segmentation.
- Author
-
Li, Jiangqi and Li, Xiang
- Subjects
CELL nuclei ,DEEP learning ,IMAGE segmentation ,HISTOPATHOLOGY ,PATHOLOGY ,HEMATOXYLIN & eosin staining ,NETWORK performance - Abstract
In the medical field, hematoxylin and eosin (H&E)-stained histopathology images of cell nuclei analysis represent an important measure for cancer diagnosis. The most valuable aspect of the nuclei analysis is the segmentation of the different nuclei morphologies of different organs and subsequent diagnosis of the type and severity of the disease based on pathology. In recent years, deep learning techniques have been widely used in digital histopathology analysis. Automated nuclear segmentation technology enables the rapid and efficient segmentation of tens of thousands of complex and variable nuclei in histopathology images. However, a challenging problem during nuclei segmentation is the blocking of cell nuclei, overlapping, and background complexity of the tissue fraction. To address this challenge, we present MIU-net, an efficient deep learning network structure for the nuclei segmentation of histopathology images. Our proposed structure includes two blocks with modified inception module and attention module. The advantage of the modified inception module is to balance the computation and network performance of the deeper layers of the network, combined with the convolutional layer using different sizes of kernels to learn effective features in a fast and efficient manner to complete kernel segmentation. The attention module allows us to extract small and fine irregular boundary features from the images, which can better segment cancer cells that appear disorganized and fragmented. We test our methodology on public kumar datasets and achieve the highest AUC score of 0.92. The experimental results show that the proposed method achieves better performance than other state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
83. Clinical Evaluation of the Automatic Coronary Artery Disease Reporting and Data System (CAD-RADS) in Coronary Computed Tomography Angiography Using Convolutional Neural Networks.
- Author
-
Huang, Zengfa, Xiao, Jianwei, Wang, Xi, Li, Zuoqin, Guo, Ning, Hu, Yun, Li, Xiang, and Wang, Xiang
- Abstract
The coronary artery disease reporting and data system (CAD-RADS™) was recently introduced to standardise reporting. We aimed to evaluate the utility of an automatic postprocessing and reporting system based on CAD-RADS™ in suspected coronary artery disease (CAD) patients. Clinical evaluation was performed in 346 patients who underwent coronary computed tomography angiography (CCTA). We compared deep learning (DL)-based CCTA with human readers for evaluation of CAD-RADS™ with commercially-available automated segmentation and manual postprocessing in a retrospective validation cohort. Compared with invasive coronary angiography, the sensitivity, specificity, positive predictive value, negative predictive value, and accuracy of the DL model for diagnosis of CAD were 79.02%, 86.52%, 89.50%, 73.94%, and 82.08%, respectively. There was no significant difference between the DL-based and the reader-based CAD-RADS™ grading of CCTA results. Consistency testing showed that the Kappa value between the model and the readers was 0.775 (95% confidence interval [CI] : 0.728–0.823, p < 0.001), 0.802 (95% CI : 0.756–0.847, p < 0.001), and 0.796 (95% CI : 0.750–0.843, p < 0.001), respectively. This system reduces the time taken from 14.97 ± 1.80 min to 5.02 ± 0.8 min (p < 0.001). The standardised reporting of DL-based CAD-RADS™ in CCTA can accurately and rapidly evaluate suspected CAD patients, and has good consistency with grading by radiologists. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
84. A decomposition-ensemble-integration framework for carbon price forecasting.
- Author
-
Li, Xiang, Zhang, Yongqi, Chen, Lei, Li, Jia, and Chu, Xiaowen
- Subjects
- *
HILBERT-Huang transform , *CARBON pricing , *DEEP learning , *EMISSIONS trading , *TIME series analysis , *CARBON offsetting - Abstract
Accurate carbon price forecasting is crucial for effective carbon trading policies that mitigate climate change. The volatility and uncertainty of carbon prices pose significant challenges. Traditional frameworks that use a single model for all intrinsic mode functions (IMFs) fail to leverage the diverse strengths of different models. In this paper, we propose DecEnsInt, a novel framework that utilizes an ensemble of models tailored to distinct frequency domains. By using Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) to decompose historical prices into IMFs and optimizing ensemble parameters through Sequential Least Squares Programming (SLSQP) for each frequency domain, DecEnsInt enhances forecasting performance. Extensive experiments on four Emissions Trading System (ETS) datasets, evaluated using RMSE, MAPE, MAE and R 2 , demonstrate DecEnsInt's superiority, achieving relative improvements in key metrics by 10.58% to 17.72% over runner-up ensemble methods. DecEnsInt also achieves better robustness and stability across different forecasting steps, proving its effectiveness in handling the complexities of carbon price forecasting. • Different models show distinct forecasting ability in different frequency domains. • Ensemble technique with Intrinsic Mode Function boosts overall predictive accuracy. • Optimizing ensemble parameters by solving individual optimization problems. • The framework significantly enhances carbon price forecasting in four data sets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
85. AUV-Aided Optical—Acoustic Hybrid Data Collection Based on Deep Reinforcement Learning.
- Author
-
Bu, Fanfeng, Luo, Hanjiang, Ma, Saisai, Li, Xiang, Ruby, Rukhsana, and Han, Guangjie
- Subjects
REINFORCEMENT learning ,DEEP learning ,ACQUISITION of data ,AUTONOMOUS underwater vehicles ,WIRELESS sensor networks ,DATA packeting ,SUBMERSIBLES ,UNDERWATER navigation - Abstract
Autonomous underwater vehicles (AUVs)-assisted mobile data collection in underwater wireless sensor networks (UWSNs) has received significant attention because of their mobility and flexibility. To satisfy the increasing demand of diverse application requirements for underwater data collection, such as time-sensitive data freshness, emergency event security as well as energy efficiency, in this paper, we propose a novel multi-modal AUV-assisted data collection scheme which integrates both acoustic and optical technologies and takes advantage of their complementary strengths in terms of communication distance and data rate. In this scheme, we consider the age of information (AoI) of the data packet, node transmission energy as well as energy consumption of the AUV movement, and we make a trade-off between them to retrieve data in a timely and reliable manner. To optimize these, we leverage a deep reinforcement learning (DRL) approach to find the optimal motion trajectory of AUV by selecting the suitable communication options. In addition to that, we also design an optimal angle steering algorithm for AUV navigation under different communication scenarios to reduce energy consumption further. We conduct extensive simulations to verify the effectiveness of the proposed scheme, and the results show that the proposed scheme can significantly reduce the weighted sum of AoI as well as energy consumption. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
86. Ensemble of convolutional neural networks and multilayer perceptron for the diagnosis of mild cognitive impairment and Alzheimer's disease.
- Author
-
Li, Minglei, Jiang, Yuchen, Li, Xiang, Yin, Shen, and Luo, Hao
- Subjects
DEEP learning ,ALZHEIMER'S disease ,CONVOLUTIONAL neural networks ,MILD cognitive impairment ,MAGNETIC resonance imaging ,COMPUTER-aided diagnosis - Abstract
Background: Structural magnetic resonance imaging (sMRI) can provide morphological information about the structure and function of the brain in the same scanning process. It has been widely used in the diagnosis of Alzheimer's disease (AD) and mild cognitive impairment (MCI). Purpose: To capture the anatomical changes in the brain caused by AD/MCI, deep learning‐based MRI image analysis methods have been proposed in recent years. However, it is observed that the performance of most existing methods is limited as they only construct a single type of deep network and ignore the significance of other clinical information. Methods: To make up for these defects, an ensemble framework that incorporates three types of dedicatedly‐designed convolutional neural networks (CNNs) and a multilayer perceptron (MLP) network is proposed, where three CNNs with entropy‐based multi‐instance learning pooling layers have more reliable feature selection abilities. The dedicatedly‐designed base classifiers can make use of the heterogeneous data, and empower the framework with enhanced diversity and robustness. In particular, to consider the interactions among the base classifiers, a novel multi‐head self‐attention voting scheme is designed. Moreover, considering the chance that MCI can be transformed to AD, the proposed framework is designed to diagnose AD and predict MCI conversion simultaneously, with the aid of the transfer learning technique. Results: For performance evaluation and comparison, extensive experiments are conducted on the public dataset of the Alzheimer's Disease Neuroimaging Initiative (ADNI). The results show that the proposed ensemble framework provides superior performance under most of the evaluation metrics. Especially, the proposed framework achieves state‐of‐the‐art diagnostic accuracy (98.61% for the AD diagnosis task, and 84.49% for the MCI conversion prediction task). Conclusions: These promising results demonstrate the proposed ensemble framework can accurately diagnose AD patients and predict the conversion of MCI patients, which has the potential of clinical practice for diagnosing AD and MCI. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
87. Using Whole Slide Gray Value Map to Predict HER2 Expression and FISH Status in Breast Cancer.
- Author
-
Yao, Qian, Hou, Wei, Wu, Kaiyuan, Bai, Yanhua, Long, Mengping, Diao, Xinting, Jia, Ling, Niu, Dongfeng, and Li, Xiang
- Subjects
DEEP learning ,ONCOGENES ,IMMUNOHISTOCHEMISTRY ,GENE expression ,CANCER patients ,FLUORESCENCE in situ hybridization ,INTRACLASS correlation ,RESEARCH funding ,ARTIFICIAL neural networks ,BREAST tumors - Abstract
Simple Summary: HER2 expression is important for target therapy in breast cancer patients, however, accurate evaluation of HER2 expression is challenging for pathologists owing to the ambiguities and subjectivities of manual scoring. We proposed a deep learning framework using a Whole Slide gray value map and convolutional neural network model to predict HER2 expression level on immunohistochemistry (IHC) assay and predict HER2 gene status on fluorescence in situ hybridization (FISH) assay. Our results indicated that the proposed model is feasible for predicting HER2 expression and gene amplification and achieved high consistency with the experienced pathologists' assessment. This unique HER2 scoring model did not rely on challenging manual intervention and proved to be a simple and robust tool for pathologists to improve the accuracy of HER2 interpretation and provided a clinical aid to target therapy in breast cancer patients. Accurate detection of HER2 expression through immunohistochemistry (IHC) is of great clinical significance in the treatment of breast cancer. However, manual interpretation of HER2 is challenging, due to the interobserver variability among pathologists. We sought to explore a deep learning method to predict HER2 expression level and gene status based on a Whole Slide Image (WSI) of the HER2 IHC section. When applied to 228 invasive breast carcinoma of no special type (IBC-NST) DAB-stained slides, our GrayMap+ convolutional neural network (CNN) model accurately classified HER2 IHC level with mean accuracy 0.952 ± 0.029 and predicted HER2 FISH status with mean accuracy 0.921 ± 0.029. Our result also demonstrated strong consistency in HER2 expression score between our system and experienced pathologists (intraclass correlation coefficient (ICC) = 0.903, Cohen's κ = 0.875). The discordant cases were found to be largely caused by high intra-tumor staining heterogeneity in the HER2 IHC group and low copy number in the HER2 FISH group. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
88. A short-term building energy consumption prediction and diagnosis using deep learning algorithms.
- Author
-
Li, Xiang, Yu, Junqi, Wang, Qian, Dong, Fangnan, Cheng, Renyin, and Feng, Chunyong
- Subjects
- *
ENERGY consumption of buildings , *MACHINE learning , *ENERGY consumption , *DEEP learning , *ENERGY management , *SUPERVISED learning , *CONSUMPTION (Economics) , *BUSINESS hours - Abstract
Short-term energy consumption prediction of buildings is crucial for developing model-based predictive control, fault detection, and diagnosis methods. This study takes a university library in Xi'an as the research object. First, a time-by-time energy consumption prediction model is established under the supervised learning approach, which uses a long short-term memory (LSTM) network and a Multi-Input Multi-Output (MIMO) strategy. The experimental results validate the model's validity, which is close enough to physical reality for engineering purposes. Second, the potential of the people flows factor in energy consumption prediction models is explored. The results show that people flow has great potential in predicting building energy consumption and can effectively improve the prediction model performance. Third, a diagnostic method, which can recognize abnormal energy consumption data is used to diagnose the unreasonable use of the building during each hour of operation. The method is based on differences between actual and predicted energy consumption data derived from a short-term energy consumption prediction model. Based on actual building operation data, this work is enlightening and can serve as a reference for building energy efficiency management and operation. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
89. A sub‐regional compression method for greenhouse images based on CNN image quality assessment.
- Author
-
Ma, Diankun, Wen, Haojie, Li, Xinxing, Xie, Tianhua, and Li, Xiang
- Subjects
GREENHOUSES ,DEEP learning ,INTERNET of things ,JPEG (Image coding standard) ,FRUIT ,CUCUMBERS ,GREENHOUSE gardening - Abstract
As a major Internet of Things information carrier within the greenhouse, image quality and image file size need to be balanced. To reduce image file size while maintaining image quality, a sub‐regional compression (SRC) method is proposed, and its optimal compression quality of the background regions is selected using image quality assessments such as SSIM, BRISQUE, NIQE, and PIQE. In addition, a convolutional neural network‐based NIMA model to measure image quality is also being introduced. In the end, the SRC method is compared with JPEG compression default mode and JPEG compression with quality 30 in NIMA score, compression time, and compression efficiency, proving the effectiveness of the method. Practical applications: Certain vegetables and fruits, such as cucumbers, tomatoes, etc., in most cases are processed directly into the hands of consumers, so the loss is directly attributable to their growth, especially pests and diseases. Monitoring for the above situation is a great improvement, but the increase in monitoring accuracy brings video/key frame to becoming larger. Therefore, it is better to reduce the file size as well as to ensure clarity in regions of interest, viewing the files faster and detecting the losses by humans or machines as easily as possible. This paper investigates the overall process and innovatively introduces deep learning into image quality assessment to better simulate human perception of images, and the research results of this paper can be applied to different models of surveillance cameras to achieve the same results. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
90. Deep-Learning-Based Information Fusion Methodology for Oil Film Coefficient Identification of Squeeze Film Dampers.
- Author
-
Zhang, Wei, Zhang, Jiaxuan, and Li, Xiang
- Abstract
Squeeze film damper (SFD) is one of the key components in the sophisticated rotating machines in the high-end aerospace industry. Rolling bearings in the aeroengine spindle system are often used in combination with elastic supports and SFDs to reduce rotor vibrations. Accurate coefficient identification of oil films in SFDs is critical to ensure safe and reliable operations of the complex equipment. However, this task remains a challenging issue in the past decades due to the difficulties in real-time measurement and computation during operations. The highly nonlinear relationship between the measurement and oil film coefficient cannot be effectively captured using traditional methods. This article proposes a novel deep-learning-based information fusion methodology for oil film coefficient identification of SFDs. An end-to-end deep neural network model structure is established, which explores heterogeneous SFD data simultaneously with fusion architecture and directly outputs the identification results. The high-dimensional data can be automatically processed, and high identification accuracy can be achieved. Experiments on the SFD test rig are carried out for validation. The experimental results demonstrate that the proposed method can well identify the oil film coefficients in an intelligent manner and is robust against environmental noise and different parameter settings. The proposed methodology is, thus, promising for applications in real industries. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
91. Representation Learning in Multi-view Clustering: A Literature Review.
- Author
-
Chen, Man-Sheng, Lin, Jia-Qi, Li, Xiang-Long, Liu, Bao-Yu, Wang, Chang-Dong, Huang, Dong, and Lai, Jian-Huang
- Subjects
DEEP learning ,DATA structures ,LITERATURE reviews ,SOURCE code - Abstract
Multi-view clustering (MVC) has attracted more and more attention in the recent few years by making full use of complementary and consensus information between multiple views to cluster objects into different partitions. Although there have been two existing works for MVC survey, neither of them jointly takes the recent popular deep learning-based methods into consideration. Therefore, in this paper, we conduct a comprehensive survey of MVC from the perspective of representation learning. It covers a quantity of multi-view clustering methods including the deep learning-based models, providing a novel taxonomy of the MVC algorithms. Furthermore, the representation learning-based MVC methods can be mainly divided into two categories, i.e., shallow representation learning-based MVC and deep representation learning-based MVC, where the deep learning-based models are capable of handling more complex data structure as well as showing better expression. In the shallow category, according to the means of representation learning, we further split it into two groups, i.e., multi-view graph clustering and multi-view subspace clustering. To be more comprehensive, basic research materials of MVC are provided for readers, containing introductions of the commonly used multi-view datasets with the download link and the open source code library. In the end, some open problems are pointed out for further investigation and development. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
92. Multiscale unsupervised domain adaptation for automatic pancreas segmentation in CT volumes using adversarial learning.
- Author
-
Zhu, Yan, Hu, Peijun, Li, Xiang, Tian, Yu, Bai, Xueli, Liang, Tingbo, and Li, Jingsong
- Subjects
PANCREAS ,PANCREATIC diseases ,COMPUTED tomography ,LEARNING strategies ,NETWORK performance - Abstract
Purpose: Computer‐aided automatic pancreas segmentation is essential for early diagnosis and treatment of pancreatic diseases. However, the annotation of pancreas images requires professional doctors and considerable expenditure. Due to imaging differences among various institution population, scanning devices, imaging protocols, and so on, significant degradation in the performance of model inference results is prone to occur when models trained with domain‐specific (usually institution‐specific) datasets are directly applied to new (other centers/institutions) domain data. In this paper, we propose a novel unsupervised domain adaptation method based on adversarial learning to address pancreas segmentation challenges with the lack of annotations and domain shift interference. Methods: A 3D semantic segmentation model with attention module and residual module is designed as the backbone pancreas segmentation model. In both segmentation model and domain adaptation discriminator network, a multiscale progressively weighted structure is introduced to acquire different field of views. Features of labeled data and unlabeled data are fed in pairs into the proposed multiscale discriminator to learn domain‐specific characteristics. Then the unlabeled data features with pseudodomain label are fed to the discriminator to acquire domain‐ambiguous information. With this adversarial learning strategy, the performance of the segmentation network is enhanced to segment unseen unlabeled data. Results: Experiments were conducted on two public annotated datasets as source datasets, respectively, and one private dataset as target dataset, where annotations were not used for the training process but only for evaluation. The 3D segmentation model achieves comparative performance with state‐of‐the‐art pancreas segmentation methods on source domain. After implementing our domain adaptation architecture, the average dice similarity coefficient (DSC) of the segmentation model trained on the NIH‐TCIA source dataset increases from 58.79% to 72.73% on the local hospital dataset, while the performance of the target domain segmentation model transferred from the medical segmentation decathlon (MSD) source dataset rises from 62.34% to 71.17%. Conclusions: Correlations of features across data domains are utilized to train the pancreas segmentation model on unlabeled data domain, improving the generalization of the model. Our results demonstrate that the proposed method enables the segmentation model to make meaningful segmentation for unseen data of the training set. In the future, the proposed method has the potential to apply segmentation model trained on public dataset to clinical unannotated CT images from local hospital, effectively assisting radiologists in clinical practice. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
93. Federated Multi-Agent Deep Reinforcement Learning for Resource Allocation of Vehicle-to-Vehicle Communications.
- Author
-
Li, Xiang, Lu, Lingyun, Ni, Wei, Jamalipour, Abbas, Zhang, Dalin, and Du, Haifeng
- Subjects
- *
RESOURCE allocation , *DEEP learning , *REWARD (Psychology) , *MULTIAGENT systems , *REINFORCEMENT learning , *CO-channel interference - Abstract
Dynamic topology, fast-changing channels and the time sensitivity of safety-related services present challenges to the status quo of resource allocation for cellular-underlaying vehicle-to-vehicle (V2V) communications. In this paper, we investigate a novel federated multi-agent deep reinforcement learning (FedMARL) approach for the decentralized joint optimization of channel selection and power control for V2V communication. The approach takes advantage of both deep reinforcement learning (DRL) and federated learning (FL), satisfying the reliability and delay requirements of V2V communication while maximizing the transmit rates of cellular links. Specifically, we elaborately construct individual V2V agent implement by the dueling double deep Q-network (D3QN), and design the reward function to train V2V agents collaboratively. As a result, each agent individually optimizes channel selection and power level based on its local observations, including the instantaneous channel state information (CSI) of corresponding V2V link, the instantaneous co-channel interference from the cellular link, the previous channels selections of nearby V2V pairs, and the queue backlog at the V2V transmitter. Another important aspect is that we incorporate FL to alleviate the training instability problem induced by cooperative multi-agent environment. The local DRL models of different V2V agents are federated periodically, addressing the limitations of partial observability on the entire network status for individual agent, and accelerating the training process of multi-agent learning. Validated via simulations, the proposed FedMARL scheme shows superiority to the baselines in terms of the cellular sum-rate and the V2V packet delivery rate. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
94. Regionalization in a Global Hydrologic Deep Learning Model: From Physical Descriptors to Random Vectors.
- Author
-
Li, Xiang, Khandelwal, Ankush, Jia, Xiaowei, Cutler, Kelly, Ghosh, Rahul, Renganathan, Arvind, Xu, Shaoming, Tayal, Kshitij, Nieber, John, Duffy, Christopher, Steinbach, Michael, and Kumar, Vipin
- Subjects
DEEP learning ,BUILDING performance ,WEATHERING ,HYDROLOGIC models ,STREAMFLOW ,GAGING - Abstract
Streamflow prediction is a long‐standing hydrologic problem. Development of models for streamflow prediction often requires incorporation of catchment physical descriptors to characterize the associated complex hydrological processes. Across different scales of catchments, these physical descriptors also allow models to extrapolate hydrologic information from one catchment to others, a process referred to as "regionalization". Recently, in gauged basin scenarios, deep learning models have been shown to achieve state of the art regionalization performance by building a global hydrologic model. These models predict streamflow given catchment physical descriptors and weather forcing data. However, these physical descriptors are by their nature uncertain, sometimes incomplete, or even unavailable in certain cases, which limits the applicability of this approach. In this paper, we show that by assigning a vector of random values as a surrogate for catchment physical descriptors, we can achieve robust regionalization performance under a gauged prediction scenario. Our results show that the deep learning model using our proposed random vector approach achieves a predictive performance comparable to that of the model using actual physical descriptors. The random vector approach yields robust performance under different data sparsity scenarios and deep learning model selections. Furthermore, based on the use of random vectors, high‐dimensional characterization improves regionalization performance in gauged basin scenario when physical descriptors are uncertain, or insufficient. Key Points: We propose a random vector strategy to serve as a surrogate to physical descriptors for regionalization under gauged scenariosThe deep learning model using random vectors yields comparable performance to the model using physical descriptors in gauged scenariosHigh‐dimensional characterization helps distinguish catchments and thus helps catchments learn from each other [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
95. Automatic stent recognition using perceptual attention U‐net for quantitative intrafraction motion monitoring in pancreatic cancer radiotherapy.
- Author
-
He, Xiuxiu, Cai, Weixing, Li, Feifei, Zhang, Pengpeng, Reyngold, Marsha, Cuaron, John J., Cerviño, Laura I., Li, Tianfang, and Li, Xiang
- Subjects
DEEP learning ,PANCREATIC cancer ,CANCER radiotherapy ,ARTIFICIAL neural networks ,PETRI nets ,NOMOGRAPHY (Mathematics) ,COMPUTED tomography - Abstract
Purpose: Stent has often been used as an internal surrogate to monitor intrafraction tumor motion during pancreatic cancer radiotherapy. Based on the stent contours generated from planning CT images, the current intrafraction motion review (IMR) system on Varian TrueBeam only provides a tool to verify the stent motion visually but lacks quantitative information. The purpose of this study is to develop an automatic stent recognition method for quantitative intrafraction tumor motion monitoring in pancreatic cancer treatment. Methods: A total of 535 IMR images from 14 pancreatic cancer patients were retrospectively selected in this study, with the manual contour of the stent on each image serving as the ground truth. We developed a deep learning–based approach that integrates two mechanisms that focus on the features of the segmentation target. The objective attention modeling was integrated into the U‐net framework to deal with the optimization difficulties when training a deep network with 2D IMR images and limited training data. A perceptual loss was combined with the binary cross‐entropy loss and a Dice loss for supervision. The deep neural network was trained to capture more contextual information to predict binary stent masks. A random‐split test was performed, with images of ten patients (71%, 380 images) randomly selected for training, whereas the rest of four patients (29%, 155 images) were used for testing. Sevenfold cross‐validation of the proposed PAUnet on the 14 patients was performed for further evaluation. Results: Our stent segmentation results were compared with the manually segmented contours. For the random‐split test, the trained model achieved a mean (±standard deviation) stent Dice similarity coefficient (DSC), 95% Hausdorff distance (HD95), the center‐of‐mass distance (CMD), and volume difference Voldiff$Vo{l_{diff}}$ were 0.96 (±0.01), 1.01 (±0.55) mm, 0.66 (±0.46) mm, and 3.07% (±2.37%), respectively. The sevenfold cross‐validation of the proposed PAUnet had the mean (±standard deviation) of 0.96 (±0.02), 0.72 (±0.49) mm, 0.85 (±0.96) mm, and 3.47% (±3.27%) for the DSC, HD95, CMD, and Voldiff$Vo{l_{diff}}$. Conclusion: We developed a novel deep learning–based approach to automatically segment the stent from IMR images, demonstrated its clinical feasibility, and validated its accuracy compared to manual segmentation. The proposed technique could be a useful tool for quantitative intrafraction motion monitoring in pancreatic cancer radiotherapy. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
96. Underwater Object Detection using Transfer Learning with Deep Learning
- Author
-
Zhu Kaiyan, Li Xiang, and Song Weibo
- Subjects
Computer science ,business.industry ,media_common.quotation_subject ,Deep learning ,Detector ,02 engineering and technology ,Pascal (programming language) ,010501 environmental sciences ,Object (computer science) ,01 natural sciences ,Object detection ,0202 electrical engineering, electronic engineering, information engineering ,Contrast (vision) ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,Underwater ,Transfer of learning ,business ,computer ,0105 earth and related environmental sciences ,computer.programming_language ,media_common - Abstract
In recent years, Deep learning based methods have proven their excellent performance in generic object detection. However, underwater object detection is still a challenge, especially from underwater optical images. In contrast to generic datasets, underwater images usually have color shift and low contrast, underwater detection datasets are scarce and the objects in the available underwater datasets and real applications are usually small. To address these issues, we propose a underwater object detection algorithm which choose Yolov3-tiny network as backbone and pre-train on Pascal VOC datasets. Experiments show that our proposed method improves the performance of region-based object detectors on CHINAMM2019 datasets, compared with SSD algorithm and Yolov3.
- Published
- 2020
- Full Text
- View/download PDF
97. Representation of Traffic Congestion Data for Urban Road Traffic Networks Based on Pooling Operations
- Author
-
Li Xiang, Sen Zhang, Yong Yao, and Shaobo Li
- Subjects
lcsh:T55.4-60.8 ,Computer science ,Pooling ,02 engineering and technology ,Overfitting ,computer.software_genre ,Convolutional neural network ,lcsh:QA75.5-76.95 ,Theoretical Computer Science ,0502 economics and business ,0202 electrical engineering, electronic engineering, information engineering ,lcsh:Industrial engineering. Management engineering ,traffic congestion ,road network ,representation method ,data compression ,050210 logistics & transportation ,Numerical Analysis ,Artificial neural network ,business.industry ,Deep learning ,05 social sciences ,ComputerSystemsOrganization_COMPUTER-COMMUNICATIONNETWORKS ,short-term traffic prediction ,deep learning ,Grid ,Computational Mathematics ,Computational Theory and Mathematics ,Traffic congestion ,020201 artificial intelligence & image processing ,Data mining ,Artificial intelligence ,lcsh:Electronic computers. Computer science ,business ,computer ,Data compression - Abstract
In order to improve the efficiency of transportation networks, it is critical to forecast traffic congestion. Large-scale traffic congestion data have become available and accessible, yet they need to be properly represented in order to avoid overfitting, reduce the requirements of computational resources, and be utilized effectively by various methodologies and models. Inspired by pooling operations in deep learning, we propose a representation framework for traffic congestion data in urban road traffic networks. This framework consists of grid-based partition of urban road traffic networks and a pooling operation to reduce multiple values into an aggregated one. We also propose using a pooling operation to calculate the maximum value in each grid (MAV). Raw snapshots of traffic congestion maps are transformed and represented as a series of matrices which are used as inputs to a spatiotemporal congestion prediction network (STCN) to evaluate the effectiveness of representation when predicting traffic congestion. STCN combines convolutional neural networks (CNNs) and long short-term memory neural network (LSTMs) for their spatiotemporal capability. CNNs can extract spatial features and dependencies of traffic congestion between roads, and LSTMs can learn their temporal evolution patterns and correlations. An empirical experiment on an urban road traffic network shows that when incorporated into our proposed representation framework, MAV outperforms other pooling operations in the effectiveness of the representation of traffic congestion data for traffic congestion prediction, and that the framework is cost-efficient in terms of computational resources.
- Published
- 2020
- Full Text
- View/download PDF
98. Critical Temperature Prediction of Superconductors Based on Atomic Vectors and Deep Learning
- Author
-
Jianjun Hu, Dan Yabo, Rongzhi Dong, Li Xiang, Tiantian Hu, Shaobo Li, and Zhuo Cao
- Subjects
Physics and Astronomy (miscellaneous) ,Computer science ,General Mathematics ,Materials informatics ,02 engineering and technology ,010402 general chemistry ,01 natural sciences ,Convolutional neural network ,lstm ,materials informatics ,Hybrid neural network ,Matrix (mathematics) ,Singular value decomposition ,Computer Science (miscellaneous) ,Representation (mathematics) ,cnn ,Artificial neural network ,business.industry ,Deep learning ,superconductivity ,lcsh:Mathematics ,021001 nanoscience & nanotechnology ,lcsh:QA1-939 ,0104 chemical sciences ,machine learning ,Chemistry (miscellaneous) ,Artificial intelligence ,0210 nano-technology ,business ,Algorithm - Abstract
In this paper, a hybrid neural network (HNN) that combines a convolutional neural network (CNN) and long short-term memory neural network (LSTM) is proposed to extract the high-level characteristics of materials for critical temperature (Tc) prediction of superconductors. Firstly, by obtaining 73,452 inorganic compounds from the Materials Project (MP) database and building an atomic environment matrix, we obtained a vector representation (atomic vector) of 87 atoms by singular value decomposition (SVD) of the atomic environment matrix. Then, the obtained atom vector was used to implement the coded representation of the superconductors in the order of the atoms in the chemical formula of the superconductor. The experimental results of the HNN model trained with 12,413 superconductors were compared with three benchmark neural network algorithms and multiple machine learning algorithms using two commonly used material characterization methods. The experimental results show that the HNN method proposed in this paper can effectively extract the characteristic relationships between the atoms of superconductors, and it has high accuracy in predicting the Tc.
- Published
- 2020
99. Developing a Deep Learning network "MSCP-Net" to generate stalk anatomical traits related with crop lodging and yield in maize.
- Author
-
Zhou, Haiyu, Li, Xiang, Jiang, Yufeng, Zhu, Xiaoying, Fu, Taiming, Yang, Mingchong, Cheng, Weidong, Xie, Xiaodong, Chen, Yan, and Wang, Lingqiang
- Subjects
- *
CONVOLUTIONAL neural networks , *PLANT breeding , *DEEP learning , *PLANT stems , *CROP yields , *CORN - Abstract
Plant stem is essential for the delivery of resources and has a great impact on plant lodging resistance and yield. However, how to accurately and efficiently extract structural information from crop stems is a big headache. In this study, we first established a Maize Stalk Cross-section Phenotype (MSCP) dataset containing anatomical information of 990 images from hand-cut transections of stalks. Then, to large-scale measure the stalk anatomy features, we developed a Maize Stalk Cross-section Phenotyping Network (MSCP-Net) which integrated a convolutional neural network and the methods of instance segmentation and key point detection. A total of 14 stalk anatomical parameters (traits) can be automatically produced with high mAP@.5 (0.907) for the parameter "vascular bundles segmentation" and high DICE (0.864) for the parameter "functional zones segmentation". The cross-validation with the MSCP dataset indicated the good performance of MSCP-Net in predicting anatomical traits. On this basis, the correlation analysis across 14 anatomical traits and 12 agronomic importance traits in 110 maize inbred-lines was conducted and revealed that the stalk related traits (stem cross-section, large vascular bundles, fiber contents, and aerial roots) are key indicators for lodging resistance and grain yield of maize. In addition, the maize inbred-lines were classified into two groups, and the higher value of group II compared with group I in breeding hybrid varieties was discussed. The results demonstrated that the MSCP-Net is expected to be a useful tool to rapidly obtain stem anatomical traits which are agronomic important in maize genetic improvement. [Display omitted] • A high-quality MSCP dataset containing anatomical information of 990 images from hand-cut transections of maize stalks was established. • A high-performance MSCP-Net was developed to automatically and accurately produce 14 maize stalk anatomical traits. • It was demonstrated that stem anatomical traits are agronomic important in germplasm evaluation and crop breeding. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
100. Data privacy preserving federated transfer learning in machinery fault diagnostics using prior distributions.
- Author
-
Zhang, Wei and Li, Xiang
- Subjects
FAULT diagnosis ,FEATURE extraction ,PRIVACY ,MACHINERY - Abstract
Federated learning has been receiving increasing attention in the recent years, which improves model performance with data privacy among different clients. The intelligent fault diagnostic problems can be largely benefited from this emerging technology since the private data generally cannot leave local storage in the real industries. While promising federated learning performance has been achieved in the literature, most studies assume data from different clients are independent and identically distributed. In the real industrial scenarios, due to variations in machines and operating conditions, the data distributions are generally different across different clients, that significantly deteriorates the performance of federated learning. To address this issue, a federated transfer learning method is proposed in this article for machinery fault diagnostics. Under the condition that data from different clients cannot be communicated, prior distributions are proposed to indirectly bridge the domain gap. In this way, client-invariant features can be extracted for diagnostics while the data privacy is preserved. Experiments on two rotating machinery datasets are implemented for validation, and the results suggest the proposed method offers an effective and promising approach for federated transfer learning in fault diagnostic problems. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.