Author: "Ding, Kai" / Publication Year Range: Last 10 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ding, Kai"' showing total 2,685 results

Start Over Author "Ding, Kai" Publication Year Range Last 10 years

2,685 results on '"Ding, Kai"'

1. Semantic Refocused Tuning for Open-Vocabulary Panoptic Segmentation

Author: Chng, Yong Xien, Qiu, Xuchong, Han, Yizeng, Ding, Kai, Ding, Wan, and Huang, Gao
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Open-vocabulary panoptic segmentation is an emerging task aiming to accurately segment the image into semantically meaningful masks based on a set of texts. Despite existing efforts, it remains challenging to develop a high-performing method that generalizes effectively across new domains and requires minimal training resources. Our in-depth analysis of current methods reveals a crucial insight: mask classification is the main performance bottleneck for open-vocab. panoptic segmentation. Based on this, we propose Semantic Refocused Tuning (SMART), a novel framework that greatly enhances open-vocab. panoptic segmentation by improving mask classification through two key innovations. First, SMART adopts a multimodal Semantic-guided Mask Attention mechanism that injects task-awareness into the regional information extraction process. This enables the model to capture task-specific and contextually relevant information for more effective mask classification. Second, it incorporates Query Projection Tuning, which strategically fine-tunes the query projection layers within the Vision Language Model (VLM) used for mask classification. This adjustment allows the model to adapt the image focus of mask tokens to new distributions with minimal training resources, while preserving the VLM's pre-trained knowledge. Extensive ablation studies confirm the superiority of our approach. Notably, SMART sets new state-of-the-art results, demonstrating improvements of up to +1.3 PQ and +5.4 mIoU across representative benchmarks, while reducing training costs by nearly 10x compared to the previous best method. Our code and data will be released., Comment: 9 pages, 6 figures
Published: 2024

2. GOPT: Generalizable Online 3D Bin Packing via Transformer-based Deep Reinforcement Learning

Author: Xiong, Heng, Guo, Changrong, Peng, Jian, Ding, Kai, Chen, Wenjie, Qiu, Xuchong, Bai, Long, and Xu, Jianfeng
Subjects: Computer Science - Robotics
Abstract: Robotic object packing has broad practical applications in the logistics and automation industry, often formulated by researchers as the online 3D Bin Packing Problem (3D-BPP). However, existing DRL-based methods primarily focus on enhancing performance in limited packing environments while neglecting the ability to generalize across multiple environments characterized by different bin dimensions. To this end, we propose GOPT, a generalizable online 3D Bin Packing approach via Transformer-based deep reinforcement learning (DRL). First, we design a Placement Generator module to yield finite subspaces as placement candidates and the representation of the bin. Second, we propose a Packing Transformer, which fuses the features of the items and bin, to identify the spatial correlation between the item to be packed and available sub-spaces within the bin. Coupling these two components enables GOPT's ability to perform inference on bins of varying dimensions. We conduct extensive experiments and demonstrate that GOPT not only achieves superior performance against the baselines, but also exhibits excellent generalization capabilities. Furthermore, the deployment with a robot showcases the practical applicability of our method in the real world. The source code will be publicly available at https://github.com/Xiong5Heng/GOPT., Comment: 8 pages, 6 figures. This paper has been accepted by IEEE Robotics and Automation Letters
Published: 2024
Full Text: View/download PDF

3. Nonlinear magneto-optical response across van Hove singularity in a non-centrosymmetric magnetic Weyl semimetal

Author: Li, Jian, Ding, Kai-He, and Tang, Lijun
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: We investigate the nonlinear magneto-optical response in non-centrosymmetric magnetic Weyl semimetals featuring a quadratic tilt, focusing particularly on the influence of the van Hove singularity (VHS). In the absence of a magnetic field, the second-order nonlinear Drude conductivity components exhibit inflection or dip behavior across the VHS. In contrast, the second-order nonlinear anomalous Hall conductivity, primarily governed by the Berry curvature dipole, manifests a subtle plateau-like structure. As the tilt strength increases, the VHS energy escalates, thereby amplifying the VHS-induced characteristics within these second-order conductivity components. However, in the presence of a magnetic field, we show that the resultant magnetic moment suppresses nonlinear electron transport while enhancing nonlinear hole transport. %both suppresses and notably enhances nonlinear magnetic-optical transport in the electron and hole regions, respectively. This effect serves to mitigate the impact of the VHS, resulting specifically in an asymmetric peak or a kinked-like structure in the magnetic field-induced contribution to the second-order nonlinear conductivity near the Weyl nodes. These findings provide new insights into the intricate interplay among the VHS, Berry curvature, and magnetic moment in nonlinear magneto-optical transport through non-centrosymmetric magnetic Weyl semimetals., Comment: 10 pages, 5 figures. Accepted for publication in Phys. Rev. B
Published: 2024

4. TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models

Author: Cao, Jiahuan, Peng, Dezhi, Zhang, Peirong, Shi, Yongxin, Liu, Yang, Ding, Kai, and Jin, Lianwen
Subjects: Computer Science - Computation and Language
Abstract: Classical Chinese is a gateway to the rich heritage and wisdom of ancient China, yet its complexities pose formidable comprehension barriers for most modern people without specialized knowledge. While Large Language Models (LLMs) have shown remarkable capabilities in Natural Language Processing (NLP), they struggle with Classical Chinese Understanding (CCU), especially in data-demanding and knowledge-intensive tasks. In response to this dilemma, we propose \textbf{TongGu} (mean understanding ancient and modern), the first CCU-specific LLM, underpinned by three core contributions. First, we construct a two-stage instruction-tuning dataset ACCN-INS derived from rich classical Chinese corpora, aiming to unlock the full CCU potential of LLMs. Second, we propose Redundancy-Aware Tuning (RAT) to prevent catastrophic forgetting, enabling TongGu to acquire new capabilities while preserving its foundational knowledge. Third, we present a CCU Retrieval-Augmented Generation (CCU-RAG) technique to reduce hallucinations based on knowledge-grounding. Extensive experiments across 24 diverse CCU tasks validate TongGu's superior ability, underscoring the effectiveness of RAT and CCU-RAG. The model and dataset are available at \url{https://github.com/SCUT-DLVCLab/TongGu-LLM}.
Published: 2024

5. Scaling Laws for Fact Memorization of Large Language Models

Author: Lu, Xingyu, Li, Xiaonan, Cheng, Qinyuan, Ding, Kai, Huang, Xuanjing, and Qiu, Xipeng
Subjects: Computer Science - Computation and Language
Abstract: Fact knowledge memorization is crucial for Large Language Models (LLM) to generate factual and reliable responses. However, the behaviors of LLM fact memorization remain under-explored. In this paper, we analyze the scaling laws for LLM's fact knowledge and LLMs' behaviors of memorizing different types of facts. We find that LLMs' fact knowledge capacity has a linear and negative exponential law relationship with model size and training epochs, respectively. Estimated by the built scaling law, memorizing the whole Wikidata's facts requires training an LLM with 1000B non-embed parameters for 100 epochs, suggesting that using LLMs to memorize all public facts is almost implausible for a general pre-training setting. Meanwhile, we find that LLMs can generalize on unseen fact knowledge and its scaling law is similar to general pre-training. Additionally, we analyze the compatibility and preference of LLMs' fact memorization. For compatibility, we find LLMs struggle with memorizing redundant facts in a unified way. Only when correlated facts have the same direction and structure, the LLM can compatibly memorize them. This shows the inefficiency of LLM memorization for redundant facts. For preference, the LLM pays more attention to memorizing more frequent and difficult facts, and the subsequent facts can overwrite prior facts' memorization, which significantly hinders low-frequency facts memorization. Our findings reveal the capacity and characteristics of LLMs' fact knowledge learning, which provide directions for LLMs' fact knowledge augmentation.
Published: 2024

6. Investigation on high-aspect-ratio silicon carbide ceramic microchannel by using waterjet-assisted laser micromachining

Author: Han, Jinjin, Tong, Linpeng, He, Bin, Kong, Linglei, Li, Qilin, Wang, Denglong, Ding, Kai, and Lei, Weining
Published: 2024
Full Text: View/download PDF

7. Advancements in drugs restructured with nanomedicines for multiple myeloma treatment

Author: Liu, Zhaoyun, Shen, Hongli, Liu, Hui, Ding, Kai, Song, Jia, Zhang, Jingtian, Ding, Dan, and Fu, Rong
Published: 2024
Full Text: View/download PDF

8. Small world but large differences: cultivar-specific secondary metabolite-mediated phyllosphere fungal homeostasis in tea plant (Camellia sinensis)

Author: Ding, Kai, Lv, Wuyun, Ren, Hengze, Xiong, Fei, Zhang, Yuting, Zhang, Junhong, Tong, Zaikang, Wang, Xinchao, and Wang, Yuchun
Published: 2024
Full Text: View/download PDF

9. Datasets for Large Language Models: A Comprehensive Survey

Author: Liu, Yang, Cao, Jiahuan, Liu, Chongyu, Ding, Kai, and Jin, Lianwen
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: This paper embarks on an exploration into the Large Language Model (LLM) datasets, which play a crucial role in the remarkable advancements of LLMs. The datasets serve as the foundational infrastructure analogous to a root system that sustains and nurtures the development of LLMs. Consequently, examination of these datasets emerges as a critical topic in research. In order to address the current lack of a comprehensive overview and thorough analysis of LLM datasets, and to gain insights into their current status and future trends, this survey consolidates and categorizes the fundamental aspects of LLM datasets from five perspectives: (1) Pre-training Corpora; (2) Instruction Fine-tuning Datasets; (3) Preference Datasets; (4) Evaluation Datasets; (5) Traditional Natural Language Processing (NLP) Datasets. The survey sheds light on the prevailing challenges and points out potential avenues for future investigation. Additionally, a comprehensive review of the existing available dataset resources is also provided, including statistics from 444 datasets, covering 8 language categories and spanning 32 domains. Information from 20 dimensions is incorporated into the dataset statistics. The total data size surveyed surpasses 774.5 TB for pre-training corpora and 700M instances for other datasets. We aim to present the entire landscape of LLM text datasets, serving as a comprehensive reference for researchers in this field and contributing to future studies. Related resources are available at: https://github.com/lmmlzn/Awesome-LLMs-Datasets., Comment: 181 pages, 21 figures
Published: 2024

10. EXACT-Net:EHR-guided lung tumor auto-segmentation for non-small cell lung cancer radiotherapy

Author: Hooshangnejad, Hamed, Feng, Xue, Huang, Gaofeng, Zhang, Rui, Kelly, Katelyn, Chen, Quan, and Ding, Kai
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Physics - Medical Physics
Abstract: Lung cancer is a devastating disease with the highest mortality rate among cancer types. Over 60% of non-small cell lung cancer (NSCLC) patients, which accounts for 87% of diagnoses, require radiation therapy. Rapid treatment initiation significantly increases the patient's survival rate and reduces the mortality rate. Accurate tumor segmentation is a critical step in the diagnosis and treatment of NSCLC. Manual segmentation is time and labor-consuming and causes delays in treatment initiation. Although many lung nodule detection methods, including deep learning-based models, have been proposed, there is still a long-standing problem of high false positives (FPs) with most of these methods. Here, we developed an electronic health record (EHR) guided lung tumor auto-segmentation called EXACT-Net (EHR-enhanced eXACtitude in Tumor segmentation), where the extracted information from EHRs using a pre-trained large language model (LLM), was used to remove the FPs and keep the TP nodules only. The auto-segmentation model was trained on NSCLC patients' computed tomography (CT), and the pre-trained LLM was used with the zero-shot learning approach. Our approach resulted in a 250% boost in successful nodule detection using the data from ten NSCLC patients treated in our institution.
Published: 2024

11. Van Hove singularity-induced negative magnetoresistance in Dirac semimetals

Author: Ding, Kai-He and Zhu, Zhen-Gang
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Negative magnetoresistance (NMR) is a marked feature of Dirac semimetals, and may be caused by multiple mechanisms, such as the chiral anomaly, the Zeeman energy, the quantum interference effect, and the orbital moment. Recently, an experiment on Dirac semimetal Cd$_3$As$_2$ thin films revealed a new NMR feature that depends strongly on the thickness of the sample [T. Schumann, \emph{et al}., Phys. Rev. B 95, 241113(R) (2017)]. Here, we introduce a new mechanism of inducing NMR via the presence of the van Hove singularity (VHS) in the density of states. Theoretical fitting of the experimental data on magnetoconductivity and magnetoresistance shows good agreement, indicating that the observed NMR in thin films of Cd$_3$As$_2$ can be attributed to the VHS. This work provides new insights into the underlying of Dirac semimetals., Comment: 6 pages, 5 figures
Published: 2023

12. UPOCR: Towards Unified Pixel-Level OCR Interface

Author: Peng, Dezhi, Yang, Zhenhua, Zhang, Jiaxin, Liu, Chongyu, Shi, Yongxin, Ding, Kai, Guo, Fengjun, and Jin, Lianwen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In recent years, the optical character recognition (OCR) field has been proliferating with plentiful cutting-edge approaches for a wide spectrum of tasks. However, these approaches are task-specifically designed with divergent paradigms, architectures, and training strategies, which significantly increases the complexity of research and maintenance and hinders the fast deployment in applications. To this end, we propose UPOCR, a simple-yet-effective generalist model for Unified Pixel-level OCR interface. Specifically, the UPOCR unifies the paradigm of diverse OCR tasks as image-to-image transformation and the architecture as a vision Transformer (ViT)-based encoder-decoder. Learnable task prompts are introduced to push the general feature representations extracted by the encoder toward task-specific spaces, endowing the decoder with task awareness. Moreover, the model training is uniformly aimed at minimizing the discrepancy between the generated and ground-truth images regardless of the inhomogeneity among tasks. Experiments are conducted on three pixel-level OCR tasks including text removal, text segmentation, and tampered text detection. Without bells and whistles, the experimental results showcase that the proposed method can simultaneously achieve state-of-the-art performance on three tasks with a unified single model, which provides valuable strategies and insights for future research on generalist OCR models. Code will be publicly available.
Published: 2023

13. Differential Settlement of Cement Fly-Ash Gravel and Cement–Soil Compacted Piles

Author: Cheng, Xuansheng, Ding, Kai, Gong, Lijun, and Sun, Haodong
Published: 2024
Full Text: View/download PDF

14. Optimal Claim-Dependent Proportional Reinsurance Under a Self-Exciting Claim Model

Author: Wu, Fan, Shen, Yang, Zhang, Xin, and Ding, Kai
Published: 2024
Full Text: View/download PDF

15. Highly efficient capture approach for the identification of diverse inherited retinal disorders.

Author: Kao, Hsiao-Jung, Lin, Ting-Yi, Hsieh, Feng-Jen, Chien, Jia-Ying, Yeh, Erh-Chan, Lin, Wan-Jia, Chen, Yi-Hua, Ding, Kai-Hsuan, Yang, Yu, Chi, Sheng-Chu, Tsai, Ping-Hsing, Hsu, Chih-Chien, Hwang, De-Kuang, Tsai, Hsien-Yang, Peng, Mei-Ling, Lee, Shi-Huang, Chau, Siu-Fung, Chen, Chen, Cheang, Wai-Man, Chen, Shih-Jen, Chiou, Shih-Hwa, Lu, Mei-Yeh, Huang, Shun-Ping, and Kwok, Pui-Yan
Abstract: Our study presents a 319-gene panel targeting inherited retinal dystrophy (IRD) genes. Through a multi-center retrospective cohort study, we validated the assays effectiveness and clinical utility and characterized the mutation spectrum of Taiwanese IRD patients. Between January 2018 and May 2022, 493 patients in 425 unrelated families, all initially suspected of having IRD without prior genetic diagnoses, underwent detailed ophthalmic and physical examinations (with extra-ocular features recorded) and genetic testing with our customized panel. Disease-causing variants were identified by segregation analysis and clinical interpretation, with validation via Sanger sequencing. We achieved a read depth of >200× for 94.2% of the targeted 1.2 Mb region. 68.5% (291/425) of the probands received molecular diagnoses, with 53.9% (229/425) resolved cases. Retinitis pigmentosa (RP) is the most prevalent initial clinical impression (64.2%), and 90.8% of the cohort have the five most prevalent phenotypes (RP, cone-rod syndrome, Ushers syndrome, Lebers congenital amaurosis, Bietti crystalline dystrophy). The most commonly mutated genes of probands that received molecular diagnosis are USH2A (13.7% of the cohort), EYS (11.3%), CYP4V2 (4.8%), ABCA4 (4.5%), RPGR (3.4%), and RP1 (3.1%), collectively accounted for 40.8% of diagnoses. We identify 87 unique unreported variants previously not associated with IRD and refine clinical diagnoses for 21 patients (7.22% of positive cases). We developed a customized gene panel and tested it on the largest Taiwanese cohort, showing that it provides excellent coverage for diverse IRD phenotypes.
Published: 2024

16. The application of matching pursuit spectral blueing in post-stack seismic frequency enhancement

Author: Jin Xuebin, Li Bingxi, Zhang Zhenguo, Lei Maosheng, An Lishuang, and Ding Kai
Subjects: Spectral blueing, Matching pursuit, Poststack seismic, Optimize the spectral blueing operator, Medicine, Science
Abstract: Abstract The calculation of the spectral blueing operator in the traditional spectral blueing method has singularity, which leads to poor performance in post-stack seimic frequency expansion. To this end, a frequency spreading technique based on matching pursuit (MP) and spectral blueing is proposed. Through time–frequency analysis processing, it is shown that the seismic signal extracted by matching tracking method has good stability and higher resolution. The specific process of the method in this paper firstly uses the matching tracking method to accurately divide the post-stack seismic data into multiple frequency-division seismic bodies; then, in the process of calculating the spectral blueing ization operators for each frequency band, the weighting idea is used to calculate the weights of the optimized spectral blueing ization operators for each frequency band based on the differences in energy in different frequency bands; finally, the optimized spectral blueing operator is convolved with seismic reflection coefficients to obtain high-resolution seismic data. The actual test results of poststack seismic data have proven that the frequency raising method proposed in this paper is superior to the traditional spectral blueing ization algorithm, greatly improving the high-frequency component information of poststack seismic data. After frequency extension, there are more seismic events and higher resolution. Finally, the practicability and rationality of the seismic data after frequency extraction are verified by a series of operations such as attribute extraction, well seismic calibration and inversion.
Published: 2024
Full Text: View/download PDF

17. DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

Author: Zhang, Jiaxin, Chen, Bangdong, Cheng, Hiuyi, Guo, Fengjun, Ding, Kai, and Jin, Lianwen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Recently, there has been a growing interest in research concerning document image analysis and recognition in photographic scenarios. However, the lack of labeled datasets for this emerging challenge poses a significant obstacle, as manual annotation can be time-consuming and impractical. To tackle this issue, we present DocAligner, a novel method that streamlines the manual annotation process to a simple step of taking pictures. DocAligner achieves this by establishing dense correspondence between photographic document images and their clean counterparts. It enables the automatic transfer of existing annotations in clean document images to photographic ones and helps to automatically acquire labels that are unavailable through manual labeling. Considering the distinctive characteristics of document images, DocAligner incorporates several innovative features. First, we propose a non-rigid pre-alignment technique based on the document's edges, which effectively eliminates interference caused by significant global shifts and repetitive patterns present in document images. Second, to handle large shifts and ensure high accuracy, we introduce a hierarchical aligning approach that combines global and local correlation layers. Furthermore, considering the importance of fine-grained elements in document images, we present a details recurrent refinement module to enhance the output in a high-resolution space. To train DocAligner, we construct a synthetic dataset and introduce a self-supervised learning approach to enhance its robustness for real-world data. Through extensive experiments, we demonstrate the effectiveness of DocAligner and the acquired dataset. Datasets and codes will be publicly available.
Published: 2023

18. A Mathematical Prediction Model of the Grinding Force in Ultrasonic-Assisted Grinding of ZrO2 Ceramics with Experimental Validation

Author: Liu, Sheng, Ding, Kai, Su, Honghua, Zhuang, Bailiang, Li, Qilin, Lei, Weining, Zhou, Zongchen, and Han, Xiao
Published: 2024
Full Text: View/download PDF

19. Adaptive data augmentation for mandarin automatic speech recognition

Author: Ding, Kai, Li, Ruixuan, Xu, Yuelin, Du, Xingyue, and Deng, Bin
Published: 2024
Full Text: View/download PDF

20. Residual Stress Induced by Phase Transformation and its Role in the Delayed Cracking Performance of 22MnB5 Hot Roll Bending Pipes

Author: Ding, Kai, Zhu, Ping, Hu, Tianhan, Dong, Wufeng, Sun, Yu, Zhou, Jiayi, Zhao, Bingge, Shi, Lei, and Gao, Yulai
Published: 2024
Full Text: View/download PDF

21. M$^{6}$Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis

Author: Cheng, Hiuyi, Zhang, Peirong, Wu, Sihang, Zhang, Jiaxin, Zhu, Qiyuan, Xie, Zecheng, Li, Jing, Ding, Kai, and Jin, Lianwen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Document layout analysis is a crucial prerequisite for document understanding, including document retrieval and conversion. Most public datasets currently contain only PDF documents and lack realistic documents. Models trained on these datasets may not generalize well to real-world scenarios. Therefore, this paper introduces a large and diverse document layout analysis dataset called $M^{6}Doc$. The $M^6$ designation represents six properties: (1) Multi-Format (including scanned, photographed, and PDF documents); (2) Multi-Type (such as scientific articles, textbooks, books, test papers, magazines, newspapers, and notes); (3) Multi-Layout (rectangular, Manhattan, non-Manhattan, and multi-column Manhattan); (4) Multi-Language (Chinese and English); (5) Multi-Annotation Category (74 types of annotation labels with 237,116 annotation instances in 9,080 manually annotated pages); and (6) Modern documents. Additionally, we propose a transformer-based document layout analysis method called TransDLANet, which leverages an adaptive element matching mechanism that enables query embedding to better match ground truth to improve recall, and constructs a segmentation branch for more precise document image instance segmentation. We conduct a comprehensive evaluation of $M^{6}Doc$ with various layout analysis methods and demonstrate its effectiveness. TransDLANet achieves state-of-the-art performance on $M^{6}Doc$ with 64.5% mAP. The $M^{6}Doc$ dataset will be available at https://github.com/HCIILAB/M6Doc., Comment: Accepted by CVPR 2023
Published: 2023

22. deepPERFECT: Novel Deep Learning CT Synthesis Method for Expeditious Pancreatic Cancer Radiotherapy

Author: Hooshangnejad, Hamed, Chen, Quan, Feng, Xue, Zhang, Rui, and Ding, Kai
Subjects: Physics - Medical Physics
Abstract: Pancreatic cancer with more than 60,000 new cases each year has less than 10 percent 5-year overall survival. Radiation therapy (RT) is an effective treatment for Locally advanced pancreatic cancer (LAPC). The current clinical RT workflow is lengthy and involves separate image acquisition for diagnostic CT (dCT) and planning CT (pCT). Studies have shown a reduction in mortality rate from expeditious radiotherapy treatment. dCT and pCT are acquired separately because of the differences in the image acquisition setup and patient body. We are presenting deepPERFECT: deep learning-based model to adapt the shape of the patient body on dCT to the treatment delivery setup. Our method expedites the treatment course by allowing the design of the initial RT planning before the pCT acquisition. Thus, the physicians can evaluate the potential RT prognosis ahead of time, verify the plan on the treatment day-one CT and apply any online adaptation if needed. We used the data from 25 pancreatic cancer patients. The model was trained on 15 cases and tested on the remaining ten cases. We evaluated the performance of four different deep-learning architectures for this task. The synthesized CT (sCT) and regions of interest (ROIs) were compared with ground truth (pCT) using Dice similarity coefficient (DSC) and Hausdorff distance (HD). We found that the three-dimensional Generative Adversarial Network (GAN) model trained on large patches has the best performance. The average DSC and HD for body contours were 0.93, and 4.6 mm. We found no statistically significant difference between the synthesized CT plans and the ground truth. We showed that employing deepPERFECT shortens the current lengthy clinical workflow by at least one week and improves the effectiveness of treatment and the quality of life of pancreatic cancer patients., Comment: paper is under review by medical physics
Published: 2023
Full Text: View/download PDF

23. WBGT Index Forecast Using Time Series Models in Smart Cities

Author: Ding, Kai, Huang, Yidu, Tao, Ming, Xie, Renping, Li, Xueqiang, Zhong, Xuefeng, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Tari, Zahir, editor, Li, Keqiu, editor, and Wu, Hongyi, editor
Published: 2024
Full Text: View/download PDF

24. Effect of Hydrogen Concentration and Residual Stress on the Delayed Cracking Performance of the 22MnB5 Hot Roll Bending Pipe

Author: Zhu, Ping, Hu, Tianhan, Zhou, Jiayi, Sun, Yu, Dong, Wufeng, Ding, Kai, Gao, Yulai, and The Minerals, Metals & Materials Society
Published: 2024
Full Text: View/download PDF

25. Preparation of Cu Powders with Electrical Explosion of Wires and Their Size-Dependent Mechanical Properties

Author: Wang, Chenhui, Zhang, Luojia, Wu, Bingjia, Ding, Kai, Gao, Yulai, Zhao, Bingge, and The Minerals, Metals & Materials Society
Published: 2024
Full Text: View/download PDF

26. Effect of the Welding Current on the Liquid Metal Embrittlement in the Resistance Spot Welded Galvanized DP1180 Advanced High Strength Steel

Author: Zhou, Jiayi, Sun, Yu, Wu, Bingjia, Hu, Tianhan, Lei, Ming, Ding, Kai, Gao, Yulai, and The Minerals, Metals & Materials Society
Published: 2024
Full Text: View/download PDF

27. Nucleation of One Single Sn Droplet on Al Thin Film Explored by Nanocalorimetry

Author: Wu, Bingjia, Wang, Chenhui, Zhou, Jiayi, Ding, Kai, Zhao, Bingge, Gao, Yulai, Peng, Zhiwei, editor, Zhang, Mingming, editor, Li, Jian, editor, Li, Bowen, editor, Monteiro, Sergio Neves, editor, Soman, Rajiv, editor, Hwang, Jiann-Yang, editor, Kalay, Yunus Eren, editor, Escobedo-Diaz, Juan P., editor, Carpenter, John S., editor, Brown, Andrew D., editor, and Ikhmayies, Shadia, editor
Published: 2024
Full Text: View/download PDF

28. Gallium-doped Zinc Oxide: Nonlinear Reflection and Transmission Measurements and Modeling in the ENZ Region

Author: Ball, Adam, Secondo, Ray, Diroll, Benjamin T., Fomra, Dhruv, Ding, Kai, Avrutin, Vitaly, Ozgur, Umit, and Kinsey, Nathaniel
Subjects: Physics - Optics
Abstract: Strong nonlinear materials have been sought after for decades for applications in telecommunications, sensing, and quantum optics. Gallium-doped zinc oxide is a II-VI transparent conducting oxide that shows promising nonlinearities similar to indium tin oxide and aluminum-doped zinc oxide for the telecommunications band. Here we explore its nonlinearities in the epsilon near zero (ENZ) region and show n2,eff values on the order of 4.5x10-3 cm2GW-1 for IR pumping on 200-300 nm thin films. Measuring nonlinear changes in transmission and reflection with a white light source probe in the near-IR while exciting in the near-IR provides data in both time and wavelength. Three films varying in thickness, optical loss, and ENZ crossover wavelength are numerically modeled and compared to experimental data showing agreement for both dispersion and temporal relaxation. In addition, we discuss optimal excitation and probing wavelengths occur around ENZ for thick films but are red-shifted for thin films where our model provides an additional degree of freedom to explore. Obtaining accurate nonlinear measurements is a difficult and time-consuming task where our method in this paper provides experimental and modeled data to the community for an ENZ material of interest., Comment: 18 pages, 10 figures
Published: 2022

29. Effects of single and multiple imputation strategies on addressing over-fitting issues caused by imbalanced data from various scenarios

Author: Yang, Jiaxi, Wang, Yihan, Yang, Ye, Ding, Kai, Na, Chongning, and Yang, Yao
Published: 2024
Full Text: View/download PDF

30. Dry eye in the upper blepharoplasty patient: a study comparing orbicularis-sparing versus orbicularis-excising techniques

Author: Mian, Osamah T., Lippe, Christina M., Khan, Asher, Bugg, Victoria A., Bryant, Juliana C., Riaz, Kamran M., Dvorak, Justin D., Ding, Kai, and Moreau, Annie
Published: 2023
Full Text: View/download PDF

31. Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild

Author: Zhang, Jiaxin, Luo, Canjie, Jin, Lianwen, Guo, Fengjun, and Ding, Kai
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Camera-captured document images usually suffer from perspective and geometric deformations. It is of great value to rectify them when considering poor visual aesthetics and the deteriorated performance of OCR systems. Recent learning-based methods intensively focus on the accurately cropped document image. However, this might not be sufficient for overcoming practical challenges, including document images either with large marginal regions or without margins. Due to this impracticality, users struggle to crop documents precisely when they encounter large marginal regions. Simultaneously, dewarping images without margins is still an insurmountable problem. To the best of our knowledge, there is still no complete and effective pipeline for rectifying document images in the wild. To address this issue, we propose a novel approach called Marior (Margin Removal and \Iterative Content Rectification). Marior follows a progressive strategy to iteratively improve the dewarping quality and readability in a coarse-to-fine manner. Specifically, we divide the pipeline into two modules: margin removal module (MRM) and iterative content rectification module (ICRM). First, we predict the segmentation mask of the input image to remove the margin, thereby obtaining a preliminary result. Then we refine the image further by producing dense displacement flows to achieve content-aware rectification. We determine the number of refinement iterations adaptively. Experiments demonstrate the state-of-the-art performance of our method on public benchmarks. The resources are available at https://github.com/ZZZHANG-jx/Marior for further comparison., Comment: This paper has been accepted by ACM Multimedia 2022
Published: 2022

32. Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context

Author: Liu, Chongyu, Jin, Lianwen, Liu, Yuliang, Luo, Canjie, Chen, Bangdong, Guo, Fengjun, and Ding, Kai
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Text removal has attracted increasingly attention due to its various applications on privacy protection, document restoration, and text editing. It has shown significant progress with deep neural network. However, most of the existing methods often generate inconsistent results for complex background. To address this issue, we propose a Contextual-guided Text Removal Network, termed as CTRNet. CTRNet explores both low-level structure and high-level discriminative context feature as prior knowledge to guide the process of background restoration. We further propose a Local-global Content Modeling (LGCM) block with CNNs and Transformer-Encoder to capture local features and establish the long-term relationship among pixels globally. Finally, we incorporate LGCM with context guidance for feature modeling and decoding. Experiments on benchmark datasets, SCUT-EnsText and SCUT-Syn show that CTRNet significantly outperforms the existing state-of-the-art methods. Furthermore, a qualitative experiment on examination papers also demonstrates the generalization ability of our method. The codes and supplement materials are available at https://github.com/lcy0604/CTRNet.
Published: 2022

33. SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

Author: Huang, Mingxin, Liu, Yuliang, Peng, Zhenghao, Liu, Chongyu, Lin, Dahua, Zhu, Shenggao, Yuan, Nicholas, Ding, Kai, and Jin, Lianwen
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: End-to-end scene text spotting has attracted great attention in recent years due to the success of excavating the intrinsic synergy of the scene text detection and recognition. However, recent state-of-the-art methods usually incorporate detection and recognition simply by sharing the backbone, which does not directly take advantage of the feature interaction between the two tasks. In this paper, we propose a new end-to-end scene text spotting framework termed SwinTextSpotter. Using a transformer encoder with dynamic head as the detector, we unify the two tasks with a novel Recognition Conversion mechanism to explicitly guide text localization through recognition loss. The straightforward design results in a concise framework that requires neither additional rectification module nor character-level annotation for the arbitrarily-shaped text. Qualitative and quantitative experiments on multi-oriented datasets RoIC13 and ICDAR 2015, arbitrarily-shaped datasets Total-Text and CTW1500, and multi-lingual datasets ReCTS (Chinese) and VinText (Vietnamese) demonstrate SwinTextSpotter significantly outperforms existing methods. Code is available at https://github.com/mxin262/SwinTextSpotter., Comment: Accepted to be appeared in CVPR 2022
Published: 2022

34. Asymptomatic multicentric Castleman disease: a potential early stage of idiopathic MCD

Author: Zhang, Lu, Liu, Qin-hua, Zhou, Hui, Zhang, Hui-lai, Dong, Yu-jun, Wang, Xiao-bo, Wang, Lu-qun, Su, Li-ping, Yan, Xiao-jing, Li, Yan, Zhang, Ming-zhi, Ding, Kai-yang, Wang, Hui-han, Peng, Hong-ling, Zhong, Li-ye, Yang, Lin, Bi, Lin-tao, Gao, Da, Gao, Guang-xun, Huang, Liang, Sun, Chun-yan, Song, Jia, Qian, Wen-bin, Huang, Wen-rong, Li, Zhen-ling, Liu, Yao, and Li, Jian
Published: 2024
Full Text: View/download PDF

35. Investigation of large-aspect ratio microgrooves on silicon nitride ceramic by WJALM

Author: Su, Hai, Han, Jinjin, He, Bin, Ahmad, Wasim, Khan, Aqib Mashood, Ma, Rui, Ding, Kai, Kong, Linglei, Li, Qilin, and Lei, Weining
Published: 2024
Full Text: View/download PDF

36. S. glabra exerts anti-lung cancer effects by inducing ferroptosis and anticancer immunity

Author: Liu, Songyu, Zhang, Lu, Ding, Kai, Zeng, Bin, Li, Bo, Zhou, Jinyi, Li, Jv, Wang, Junliang, Zhang, Huijun, Sun, Ruifen, and Su, Xiaosan
Published: 2024
Full Text: View/download PDF

37. Shock-induced nanoscale pore collapse and hotspot in cyclotetramethylene tetranitramine (HMX)

Author: Ding, Kai, Wang, XinJie, and Huang, FengLei
Published: 2024
Full Text: View/download PDF

38. The safety and efficacy of NOACs versus LMWH for thromboprophylaxis after THA or TKA: A systemic review and meta-analysis

Author: Ding, Kai, Yan, Wei, Zhang, Yifan, Li, Jiaxing, Li, Congxin, and Liang, Chunhui
Published: 2024
Full Text: View/download PDF

39. A multi-stage approach for desired part grasping under complex backgrounds in human-robot collaborative assembly

Author: Hui, Jizhuang, Zhang, Yaqian, Ding, Kai, Guo, Lei, Chen, Chun-Hsien, and Wang, Lihui
Published: 2024
Full Text: View/download PDF

40. Dynamic response of deep-buried circular loess tunnel under P-wave action

Author: Cheng, Xuansheng, Sun, Haodong, Zhang, Shanglong, Ding, Kai, and Xia, Peiyan
Published: 2024
Full Text: View/download PDF

41. Long-term cover crops boost multi-nutrient cycling and subsurface soil carbon sequestration by alleviating microbial carbon limitation in a subtropical forest

Author: Ding, Kai, Chen, Liyao, Zhang, Yuting, Ge, Siyu, Zhang, Yiman, Lu, Meng, Shen, Zhenming, Tong, Zaikang, and Zhang, Junhong
Published: 2024
Full Text: View/download PDF

42. ACT001 inhibits primary central nervous system lymphoma tumor growth by enhancing the anti-tumor effect of T cells

Author: Liu, Zhaoyun, Wang, Guanrou, Liu, Hui, Ding, Kai, Song, Jia, and Fu, Rong
Published: 2024
Full Text: View/download PDF

43. Immune persistence after different polio sequential immunization schedules in Chinese infants

Author: Ting Zhao, Jing Li, Teng Huang, Zhi-Fang Ying, Yan-Chun Che, Zhi-Mei Zhao, Yu-Ting Fu, Jun-Hui Tao, Qing-Hai Yang, Ding-Kai Wei, Guo-Liang Li, Li Yi, Yu-Ping Zhao, Hong-Bo Chen, Jian-Feng Wang, Rui-Ju Jiang, Lei Yu, Wei Cai, Wei Yang, Ming-Xue Xie, Qiong-Zhou Yin, Jing Pu, Li Shi, Chao Hong, Yan Deng, Lu-Kui Cai, Jian Zhou, Yu Wen, Hong-Sen Li, Wei Huang, Zhao-Jun Mo, Chang-Gui Li, Qi-Han Li, and Jing-Si Yang
Subjects: Immunologic diseases. Allergy, RC581-607, Neoplasms. Tumors. Oncology. Including cancer and carcinogens, RC254-282
Abstract: Abstract Trivalent oral poliovirus vaccine (tOPV) has been withdrawn and instead an inactivated poliovirus vaccine (IPV) and bivalent type 1 and type 3 OPV (bOPV) sequential immunization schedule has been implemented since 2016, but no immune persistence data are available for this polio vaccination strategy. This study aimed to assess immune persistence following different polio sequential immunization schedules. Venous blood was collected at 24, 36, and 48 months of age from participants who had completed sequential schedules of combined IPV and OPV in phase III clinical trials. The serum neutralizing antibody titers against poliovirus were determined, and the poliovirus-specific antibody-positive rates were evaluated. A total of 1104 participants were enrolled in this study. The positive rates of poliovirus type 1- and type 3-specific antibodies among the sequential immunization groups showed no significant difference at 24, 36, or 48 months of age. The positive rates of poliovirus type 2-specific antibody in the IPV-IPV-tOPV group at all time points were nearly 100%, which was significantly higher than the corresponding rates in other immunization groups (IPV-bOPV-bOPV and IPV-IPV-bOPV). Immunization schedules involving one or two doses of IPV followed by bOPV failed to maintain a high positive rate for poliovirus type 2-specific antibody.
Published: 2024
Full Text: View/download PDF

44. LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

Author: Wang, Jiapeng, Jin, Lianwen, and Ding, Kai
Subjects: Computer Science - Computation and Language
Abstract: Structured document understanding has attracted considerable attention and made significant progress recently, owing to its crucial role in intelligent document processing. However, most existing related models can only deal with the document data of specific language(s) (typically English) included in the pre-training collection, which is extremely limited. To address this issue, we propose a simple yet effective Language-independent Layout Transformer (LiLT) for structured document understanding. LiLT can be pre-trained on the structured documents of a single language and then directly fine-tuned on other languages with the corresponding off-the-shelf monolingual/multilingual pre-trained textual models. Experimental results on eight languages have shown that LiLT can achieve competitive or even superior performance on diverse widely-used downstream benchmarks, which enables language-independent benefit from the pre-training of document layout structure. Code and model are publicly available at https://github.com/jpWang/LiLT., Comment: ACL 2022 Main conference
Published: 2022

45. Machine vision-based recognition of elastic abrasive tool wear and its influence on machining performance

Author: Guo, Lei, Duan, Zhengcong, Guo, Wanjin, Ding, Kai, Lee, Chul-Hee, and Chan, Felix T. S.
Published: 2023
Full Text: View/download PDF

46. Understory vegetation restoration improves soil physicochemical properties, enzymatic activity, and changes diazotrophic communities in Cunninghamia lanceolata plantations but depends on site history

Author: Ding, Kai, Zhang, Yuting, Yang, Anna, Zhang, Yiman, Lu, Meng, Ge, Siyu, Qiu, Yongbin, Zhang, Junhong, and Tong, Zaikang
Published: 2023
Full Text: View/download PDF

47. Novel 1,2,4-triazoles derived from Ibuprofen: synthesis and in vitro evaluation of their mPGES-1 inhibitory and antiproliferative activity

Author: Bülbül, Bahadır, Ding, Kai, Zhan, Chang-Guo, Çiftçi, Gamze, Yelekçi, Kemal, Gürboğa, Merve, Özakpınar, Özlem Bingöl, Aydemir, Esra, Baybağ, Deniz, Şahin, Fikrettin, Kulabaş, Necla, Helvacıoğlu, Sinem, Charehsaz, Mohammad, Tatar, Esra, Özbey, Süheyla, and Küçükgüzel, İlkay
Published: 2023
Full Text: View/download PDF

48. ASTT: acoustic spatial-temporal transformer for short utterance speaker recognition

Author: Wu, Xing, Li, Ruixuan, Deng, Bin, Zhao, Ming, Du, Xingyue, Wang, Jianjia, and Ding, Kai
Published: 2023
Full Text: View/download PDF

49. A hollow magnetic soft robot consisting of rod-shaped nanofiber actuators

Author: Zheng, Xiaotong, Zheng, Zhiwen, Zhang, Zhao, Zhong, Run, Dong, Chunxiu, Wang, Qingyi, Kong, Degang, Ding, Kai, Luo, Yi, Li, Xiaohong, and Weng, Jie
Published: 2024
Full Text: View/download PDF

50. Interfacial microstructure characteristics and failure mechanism of the laser welding-brazing steel-Al joints with various welding parameters

Author: Li, Bolong, Hu, Tianhan, Zhou, Jiayi, Pan, Hua, Ding, Kai, Wu, Tianhai, and Gao, Yulai
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

2,685 results on '"Ding, Kai"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources