Author: "Li, Yongmin" / Publication Type: Reports - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Li, Yongmin"' showing total 26 results

1. Showing LLM-Generated Code Selectively Based on Confidence of LLMs

Author: Li, Jia, Zhu, Yuqi, Li, Yongmin, Li, Ge, and Jin, Zhi
Subjects: Computer Science - Software Engineering, Computer Science - Computation and Language
Abstract: Large Language Models (LLMs) have shown impressive abilities in code generation, but they may generate erroneous programs. Reading a program takes ten times longer than writing it. Showing these erroneous programs to developers will waste developers' energies and introduce security risks to software. To address the above limitations, we propose HonestCoder, a novel LLM-based code generation approach. HonestCoder selectively shows the generated programs to developers based on LLMs' confidence. The confidence provides valuable insights into the correctness of generated programs. To achieve this goal, we propose a novel approach to estimate LLMs' confidence in code generation. It estimates confidence by measuring the multi-modal similarity between LLMs-generated programs. We collect and release a multilingual benchmark named TruthCodeBench, which consists of 2,265 samples and covers two popular programming languages (i.e., Python and Java). We apply HonestCoder to four popular LLMs (e.g., DeepSeek-Coder and Code Llama) and evaluate it on TruthCodeBench. Based on the experiments, we obtain the following insights. (1) HonestCoder can effectively estimate LLMs' confidence and accurately determine the correctness of generated programs. For example, HonestCoder outperforms the state-of-the-art baseline by 27.79% in AUROC and 63.74% in AUCPR. (2) HonestCoder can decrease the number of erroneous programs shown to developers. Compared to eight baselines, it can show more correct programs and fewer erroneous programs to developers. (3) Compared to showing code indiscriminately, HonestCoder only adds slight time overhead (approximately 0.4 seconds per requirement). (4) We discuss future directions to facilitate the application of LLMs in software development. We hope this work can motivate broad discussions about measuring the reliability of LLMs' outputs in performing code-related tasks.
Published: 2024
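
The abstract above describes estimating confidence from the similarity between several LLM-generated programs and showing code only when that confidence is high. The Python sketch below illustrates the general idea only and is not HonestCoder's implementation: plain textual similarity from `difflib` stands in for the paper's multi-modal similarity, and the function names, the sample programs, and the 0.8 threshold are all hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations
from typing import List, Optional


def estimate_confidence(programs: List[str]) -> float:
    """Mean pairwise textual similarity of sampled programs (0.0 to 1.0)."""
    if len(programs) < 2:
        return 0.0  # a single sample gives no agreement signal
    pairs = list(combinations(programs, 2))
    total = sum(SequenceMatcher(None, a, b).ratio() for a, b in pairs)
    return total / len(pairs)


def select_program(programs: List[str], threshold: float = 0.8) -> Optional[str]:
    """Return the first sample if confidence reaches the threshold, else None."""
    # The threshold is an illustrative assumption, not a value from the paper.
    if estimate_confidence(programs) >= threshold:
        return programs[0]
    return None


# Hypothetical samples: three identical programs versus three unrelated ones.
agreeing = ["def add(a, b):\n    return a + b"] * 3
diverging = [
    "def add(a, b):\n    return a + b",
    "print('hello')",
    "x = [i * i for i in range(9)]",
]
print(select_program(agreeing) is not None)
print(select_program(diverging) is None)
```

Returning `None` models the "do not show" decision; a real system would need a similarity measure that reflects program behaviour rather than surface text.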

2. Optomechanical sensor network with fiber Bragg gratings

Author: Yang, Shiwei, Zhang, Qiang, Yang, Linrun, Liu, Hanghua, Wang, Quansen, Zhang, Pengfei, Shen, Heng, and Li, Yongmin
Subjects: Physics - Optics, Physics - Applied Physics
Abstract: Cavity optomechanics offers a versatile platform for both fundamental physics and ultrasensitive sensing. Importantly, resonant enhancement in both optical and mechanical responses enables the highly sensitive optical detection of small forces, displacements, vibrations, and magnetic fields, making it a promising candidate for the next generation of ultrasensitive sensor networks. However, this is impeded by the fiber optic-incompatibility and intrinsic nature of existing optomechanical sensors. Here, we report the first demonstration of an optomechanical sensor network in terms of magnetic field detection, wherein multiple fiber-optic optomechanical sensors are connected into a standard single mode fiber. Building upon commercially available fiber Bragg gratings, we realize a robust low-loss, low-noise, and polarization-insensitive coupling with light sources in a way compatible with fiber optics. This enables our optomechanical sensor to fulfill the requirements for ultrasensitive sensor networks. Furthermore, in this sensor network we demonstrate a sensitivity of 8.73 pm/Gs for DC magnetic fields and 537 fT/Hz^(1/2) for AC magnetic fields in a magnetically unshielded environment at ambient temperature and pressure, better than the reported values in previous optomechanical magnetometers. Our work sheds light on exploiting cavity optomechanics in practical applications and ultrasensitive sensor networks.
Published: 2024

3. Segmenting Medical Images: From UNet to Res-UNet and nnUNet

Author: Huang, Lina, Miron, Alina, Hone, Kate, and Li, Yongmin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: This study provides a comparative analysis of deep learning models including UNet, Res-UNet, Attention Res-UNet, and nnUNet, and evaluates their performance in brain tumour, polyp, and multi-class heart segmentation tasks. The analysis focuses on precision, accuracy, recall, Dice Similarity Coefficient (DSC), and Intersection over Union (IoU) to assess their clinical applicability. In brain tumour segmentation, Res-UNet and nnUNet significantly outperformed UNet, with Res-UNet leading in DSC and IoU scores, indicating superior accuracy in tumour delineation. Meanwhile, nnUNet excelled in recall and accuracy, which are crucial for reliable tumour detection in clinical diagnosis and planning. In polyp detection, nnUNet was the most effective, achieving the highest metrics across all categories and proving itself as a reliable diagnostic tool in endoscopy. In the complex task of heart segmentation, Res-UNet and Attention Res-UNet were outstanding in delineating the left ventricle, with Res-UNet also leading in right ventricle segmentation. nnUNet was unmatched in myocardium segmentation, achieving top scores in precision, recall, DSC, and IoU. The conclusion notes that although Res-UNet occasionally outperforms nnUNet in specific metrics, the differences are quite small. Moreover, nnUNet consistently shows superior overall performance across the experiments. Particularly noted for its high recall and accuracy, which are crucial in clinical settings to minimize misdiagnosis and ensure timely treatment, nnUNet's robust performance in crucial metrics across all tested categories establishes it as the most effective model for these varied and complex segmentation tasks.
Comment: 7 pages, 3 figures
Published: 2024
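
The abstract above ranks models by the Dice Similarity Coefficient (DSC) and Intersection over Union (IoU). As a reminder of how these two overlap metrics relate, here is a minimal Python sketch using their standard definitions for binary masks. The function name and the toy masks are hypothetical, and the study's own evaluation code may differ (for example in how empty masks or multiple classes are handled).

```python
from typing import Sequence, Tuple


def dice_and_iou(pred: Sequence[int], target: Sequence[int]) -> Tuple[float, float]:
    """Compute DSC and IoU for two flattened binary masks of equal length."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    pred_area = sum(1 for p in pred if p)
    target_area = sum(1 for t in target if t)
    union = pred_area + target_area - intersection
    # Convention assumed here: two empty masks count as a perfect match.
    if union == 0:
        return 1.0, 1.0
    dice = 2 * intersection / (pred_area + target_area)
    iou = intersection / union
    return dice, iou


# Toy 6-pixel masks: 2 pixels overlap, each mask has 3 foreground pixels.
pred = [1, 1, 0, 0, 1, 0]
target = [1, 0, 0, 0, 1, 1]
dice, iou = dice_and_iou(pred, target)
print(f"DSC={dice:.3f} IoU={iou:.3f}")
```

For a single pair of masks the two metrics are monotonically related (DSC = 2·IoU / (1 + IoU)), so they rank one prediction against another identically; averages over many images can still order models differently.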

4. DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

Author: Li, Jia, Li, Ge, Zhao, Yunfei, Li, Yongmin, Liu, Huanyu, Zhu, Hao, Wang, Lecheng, Liu, Kaibo, Fang, Zheng, Wang, Lanshen, Ding, Jiazheng, Zhang, Xuanming, Zhu, Yuqi, Dong, Yihong, Jin, Zhi, Li, Binhua, Huang, Fei, and Li, Yongbin
Subjects: Computer Science - Computation and Language, Computer Science - Software Engineering
Abstract: How to evaluate the coding abilities of Large Language Models (LLMs) remains an open question. We find that existing benchmarks are poorly aligned with real-world code repositories and are insufficient to evaluate the coding abilities of LLMs. To address the knowledge gap, we propose a new benchmark named DevEval, which has three advances. (1) DevEval aligns with real-world repositories in multiple dimensions, e.g., code distributions and dependency distributions. (2) DevEval is annotated by 13 developers and contains comprehensive annotations (e.g., requirements, original repositories, reference code, and reference dependencies). (3) DevEval comprises 1,874 testing samples from 117 repositories, covering 10 popular domains (e.g., Internet, Database). Based on DevEval, we propose repository-level code generation and evaluate 8 popular LLMs on DevEval (e.g., gpt-4, gpt-3.5, StarCoder 2, DeepSeek Coder, CodeLLaMa). Our experiments reveal these LLMs' coding abilities in real-world code repositories. For example, in our experiments, the highest Pass@1 of gpt-4-turbo is only 53.04%. We also analyze LLMs' failed cases and summarize their shortcomings. We hope DevEval can facilitate the development of LLMs in real code repositories. DevEval, prompts, and LLMs' predictions have been released.
Comment: Accepted by the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024). arXiv admin note: substantial text overlap with arXiv:2404.00599, arXiv:2401.06401
Published: 2024
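
The abstract above reports results as Pass@1. It does not say how the metric was computed, so the following Python sketch is an assumption: it uses the commonly cited unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), where n samples are generated per task and c of them pass the tests. The per-task counts are made up for illustration.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one task from n samples, c of which pass the tests."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the probability that a random
    # size-k subset of the n samples contains no passing sample.
    return 1.0 - comb(n - c, k) / comb(n, k)


# Hypothetical per-task results: (samples generated, samples passing).
results = [(10, 3), (10, 0), (10, 10)]
score = sum(pass_at_k(n, c, 1) for n, c in results) / len(results)
print(f"Pass@1 = {score:.4f}")
```

With k = 1 the estimator reduces to c / n per task, so the benchmark-level Pass@1 is simply the mean fraction of passing samples.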

5. Integrated quantum communication network and vibration sensing in optical fibers

Author: Liu, Shuaishuai, Tian, Yan, Zhang, Yu, Lu, Zhenguo, Wang, Xuyang, and Li, Yongmin
Subjects: Quantum Physics, Physics - Optics
Abstract: Communication and sensing technologies play a significant role in various aspects of modern society. A seamless combination of communication and sensing systems is desired and has attracted great interest in recent years. Here, we propose and demonstrate a network architecture that integrates the downstream quantum access network (DQAN) and vibration sensing in optical fibers. By encoding the key information of eight users simultaneously on the sidemode quantum states of a single laser source and successively separating them by a filter network, we achieve a secure and efficient DQAN with an average key rate of 1.88 × 10^4 bits per second over an 80 km single-mode fiber. Meanwhile, the vibration location with spatial resolution of 120 m, 24 m, and 8 m at vibration frequencies of 100 Hz, 1 kHz, and 10 kHz, respectively, is implemented with the existing infrastructure of the DQAN system. Our integrated architecture provides a viable and cost-effective solution for building a secure quantum communication sensor network, and opens the way for expanding the functionality of quantum communication networks.
Published: 2024

6. Multi-Center Fetal Brain Tissue Annotation (FeTA) Challenge 2022 Results

Author: Payette, Kelly, Steger, Céline, Licandro, Roxane, de Dumast, Priscille, Li, Hongwei Bran, Barkovich, Matthew, Li, Liu, Dannecker, Maik, Chen, Chen, Ouyang, Cheng, McConnell, Niccolò, Miron, Alina, Li, Yongmin, Uus, Alena, Grigorescu, Irina, Gilliland, Paula Ramirez, Siddiquee, Md Mahfuzur Rahman, Xu, Daguang, Myronenko, Andriy, Wang, Haoyu, Huang, Ziyan, Ye, Jin, Alenyà, Mireia, Comte, Valentin, Camara, Oscar, Masson, Jean-Baptiste, Nilsson, Astrid, Godard, Charlotte, Mazher, Moona, Qayyum, Abdul, Gao, Yibo, Zhou, Hangqi, Gao, Shangqi, Fu, Jia, Dong, Guiming, Wang, Guotai, Rieu, ZunHyan, Yang, HyeonSik, Lee, Minwoo, Płotka, Szymon, Grzeszczyk, Michal K., Sitek, Arkadiusz, Daza, Luisa Vargas, Usma, Santiago, Arbelaez, Pablo, Lu, Wenying, Zhang, Wenhao, Liang, Jing, Valabregue, Romain, Joshi, Anand A., Nayak, Krishna N., Leahy, Richard M., Wilhelmi, Luca, Dändliker, Aline, Ji, Hui, Gennari, Antonio G., Jakovčić, Anton, Klaić, Melita, Adžić, Ana, Marković, Pavel, Grabarić, Gracia, Kasprian, Gregor, Dovjak, Gregor, Rados, Milan, Vasung, Lana, Cuadra, Meritxell Bach, and Jakab, Andras
Subjects: Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Segmentation is a critical step in analyzing the developing human fetal brain. There have been vast improvements in automatic segmentation methods in the past several years, and the Fetal Brain Tissue Annotation (FeTA) Challenge 2021 helped to establish an excellent standard of fetal brain segmentation. However, FeTA 2021 was a single center study, and the generalizability of algorithms across different imaging centers remains unsolved, limiting real-world clinical applicability. The multi-center FeTA Challenge 2022 focuses on advancing the generalizability of fetal brain segmentation algorithms for magnetic resonance imaging (MRI). In FeTA 2022, the training dataset contained images and corresponding manually annotated multi-class labels from two imaging centers, and the testing data contained images from these two imaging centers as well as two additional unseen centers. The data from different centers varied in many aspects, including scanners used, imaging parameters, and fetal brain super-resolution algorithms applied. 16 teams participated in the challenge, and 17 algorithms were evaluated. Here, a detailed overview and analysis of the challenge results are provided, focusing on the generalizability of the submissions. Both in- and out of domain, the white matter and ventricles were segmented with the highest accuracy, while the most challenging structure remains the cerebral cortex due to anatomical complexity. The FeTA Challenge 2022 was able to successfully evaluate and advance generalizability of multi-class fetal brain tissue segmentation algorithms for MRI and it continues to benchmark new algorithms. The resulting new methods contribute to improving the analysis of brain development in utero.
Comment: Results from FeTA Challenge 2022, held at MICCAI; Manuscript submitted. Supplementary Info (including submission methods descriptions) available here: https://zenodo.org/records/10628648
Published: 2024

7. Compact quantum random number generator based on a laser diode and silicon photonics integrated hybrid chip

Author: Wang, Xuyang, Zheng, Tao, Jia, Yanxiang, Zhao, Qianru, Zhang, Yu, Shi, Yuqi, Wang, Ning, Lu, Zhenguo, Zou, Jun, and Li, Yongmin
Subjects: Quantum Physics
Abstract: In this study, a compact and low-power-consumption quantum random number generator (QRNG) based on a laser diode and silicon photonics integrated hybrid chip is proposed and verified experimentally. The hybrid chip's size is 8.8 × 2.6 × 1 mm^3, and the power of the entropy source is 80 mW. A common mode rejection ratio greater than 40 dB was achieved using an optimized 1 × 2 multimode interferometer structure. A method for optimizing the quantum-to-classical noise ratio is presented. A quantum-to-classical noise ratio of approximately 9 dB was achieved when the photoelectron current is 1 microampere using a balanced homodyne detector with a high dark current GeSi photodiode. The proposed QRNG has the potential for use in scenarios of moderate MHz random number generation speed, with low power, small volume, and low cost prioritized.
Comment: 15 pages, 10 figures
Published: 2024

8. DevEval: Evaluating Code Generation in Practical Software Projects

Author: Li, Jia, Li, Ge, Zhao, Yunfei, Li, Yongmin, Jin, Zhi, Zhu, Hao, Liu, Huanyu, Liu, Kaibo, Wang, Lecheng, Fang, Zheng, Wang, Lanshen, Ding, Jiazheng, Zhang, Xuanming, Dong, Yihong, Zhu, Yuqi, Gu, Bin, and Yang, Mengfei
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: How to evaluate Large Language Models (LLMs) in code generation is an open question. Many benchmarks have been proposed but are inconsistent with practical software projects, e.g., unreal program distributions, insufficient dependencies, and small-scale project contexts. Thus, the capabilities of LLMs in practical projects are still unclear. In this paper, we propose a new benchmark named DevEval, aligned with Developers' experiences in practical projects. DevEval is collected through a rigorous pipeline, containing 2,690 samples from 119 practical projects and covering 10 domains. Compared to previous benchmarks, DevEval aligns to practical projects in multiple dimensions, e.g., real program distributions, sufficient dependencies, and enough-scale project contexts. We assess five popular LLMs on DevEval (e.g., gpt-4, gpt-3.5-turbo, CodeLLaMa, and StarCoder) and reveal their actual abilities in code generation. For instance, the highest Pass@1 of gpt-3.5-turbo is only 42 in our experiments. We also discuss the challenges and future directions of code generation in practical projects. We open-source DevEval and hope it can facilitate the development of code generation in practical projects.
Comment: We are re-checking this benchmark and repeating related experiments. New versions of DevEval will be released later
Published: 2024

9. SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

Author: Luo, Xiangde, Fu, Jia, Zhong, Yunxin, Liu, Shuolin, Han, Bing, Astaraki, Mehdi, Bendazzoli, Simone, Toma-Dasu, Iuliana, Ye, Yiwen, Chen, Ziyang, Xia, Yong, Su, Yanzhou, Ye, Jin, He, Junjun, Xing, Zhaohu, Wang, Hongqiu, Zhu, Lei, Yang, Kaixiang, Fang, Xin, Wang, Zhiwei, Lee, Chan Woong, Park, Sang Joon, Chun, Jaehee, Ulrich, Constantin, Maier-Hein, Klaus H., Ndipenoch, Nchongmaje, Miron, Alina, Li, Yongmin, Zhang, Yimeng, Chen, Yu, Bai, Lu, Huang, Jinlong, An, Chengyang, Wang, Lisheng, Huang, Kaiwen, Gu, Yunqi, Zhou, Tao, Zhou, Mu, Zhang, Shichuan, Liao, Wenjun, Wang, Guotai, and Zhang, Shaoting
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results in many medical image segmentation tasks. However, for NPC OARs and GTVs segmentation, few public datasets are available for model development and evaluation. To alleviate this problem, the SegRap2023 challenge was organized in conjunction with MICCAI2023 and presented a large-scale benchmark for OAR and GTV segmentation with 400 Computed Tomography (CT) scans from 200 NPC patients, each with a pair of pre-aligned non-contrast and contrast-enhanced CT scans. The challenge's goal was to segment 45 OARs and 2 GTVs from the paired CT scans. In this paper, we detail the challenge and analyze the solutions of all participants. The average Dice similarity coefficient scores for all submissions ranged from 76.68% to 86.70%, and 70.42% to 73.44% for OARs and GTVs, respectively. We conclude that the segmentation of large-size OARs is well-addressed, and more efforts are needed for GTVs and small-size or thin-structure OARs. The benchmark will remain publicly available here: https://segrap2023.grand-challenge.org
Comment: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)
Published: 2023

10. FashionFlow: Leveraging Diffusion Models for Dynamic Fashion Video Synthesis from Static Imagery

Author: Islam, Tasin, Miron, Alina, Liu, XiaoHui, and Li, Yongmin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Our study introduces a new image-to-video generator called FashionFlow to generate fashion videos. By utilising a diffusion model, we are able to create short videos from still fashion images. Our approach involves developing and connecting relevant components with the diffusion model, which results in the creation of high-fidelity videos that are aligned with the conditional image. The components include the use of pseudo-3D convolutional layers to generate videos efficiently. VAE and CLIP encoders capture vital characteristics from still images to condition the diffusion model at a global level. Our research demonstrates a successful synthesis of fashion videos featuring models posing from various angles, showcasing the fit and appearance of the garment. Our findings hold great promise for improving and enhancing the shopping experience for the online fashion industry.
Published: 2023

11. Performance Analysis of UNet and Variants for Medical Image Segmentation

Author: Ehab, Walid and Li, Yongmin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Medical imaging plays a crucial role in modern healthcare by providing non-invasive visualisation of internal structures and abnormalities, enabling early disease detection, accurate diagnosis, and treatment planning. This study aims to explore the application of deep learning models, particularly focusing on the UNet architecture and its variants, in medical image segmentation. We seek to evaluate the performance of these models across various challenging medical image segmentation tasks, addressing issues such as image normalization, resizing, architecture choices, loss function design, and hyperparameter tuning. The findings reveal that the standard UNet, when extended with a deep network layer, is a proficient medical image segmentation model, while the Res-UNet and Attention Res-UNet architectures demonstrate smoother convergence and superior performance, particularly when handling fine image details. The study also addresses the challenge of high class imbalance through careful preprocessing and loss function definitions. We anticipate that the results of this study will provide useful insights for researchers seeking to apply these models to new medical imaging problems and offer guidance and best practices for their implementation.
Published: 2023

12. EditSum: A Retrieve-and-Edit Framework for Source Code Summarization

Author: Li, Jia, Li, Yongmin, Li, Ge, Hu, Xing, Xia, Xin, and Jin, Zhi
Subjects: Computer Science - Software Engineering, Computer Science - Computation and Language
Abstract: Existing studies show that code summaries help developers understand and maintain source code. Unfortunately, these summaries are often missing or outdated in software projects. Code summarization aims to generate natural language descriptions automatically for source code. Code summaries are highly structured and have repetitive patterns. Besides the patternized words, a code summary also contains important keywords, which are the key to reflecting the functionality of the code. However, the state-of-the-art approaches perform poorly on predicting the keywords, which leads to the generated summaries suffering a loss in informativeness. To alleviate this problem, this paper proposes a novel retrieve-and-edit approach named EditSum for code summarization. Specifically, EditSum first retrieves a similar code snippet from a pre-defined corpus and treats its summary as a prototype summary to learn the pattern. Then, EditSum edits the prototype automatically to combine the pattern in the prototype with the semantic information of input code. Our motivation is that the retrieved prototype provides a good start-point for post-generation because the summaries of similar code snippets often have the same pattern. The post-editing process further reuses the patternized words in the prototype and generates keywords based on the semantic information of input code. We conduct experiments on a large-scale Java corpus and experimental results demonstrate that EditSum outperforms the state-of-the-art approaches by a substantial margin. The human evaluation also proves the summaries generated by EditSum are more informative and useful. We also verify that EditSum performs well on predicting the patternized words and keywords.
Comment: Accepted by the 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021)
Published: 2023

13. Skin Lesion Diagnosis Using Convolutional Neural Networks

Author: Nunez, Daniel Alonso Villanueva and Li, Yongmin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Cancerous skin lesions are one of the most common malignancies detected in humans, and if not detected at an early stage, they can lead to death. Therefore, it is crucial to have access to accurate results early on to optimize the chances of survival. Unfortunately, accurate results are typically obtained by highly trained dermatologists, who may not be accessible to many people, particularly in low-income and middle-income countries. Artificial Intelligence (AI) appears to be a potential solution to this problem, as it has proven to provide equal or even better diagnoses than healthcare professionals. This project aims to address the issue by collecting state-of-the-art techniques for image classification from various fields and implementing them. Some of these techniques include mixup, presizing, and test-time augmentation, among others. Three architectures were used for the implementation: DenseNet121, VGG16 with batch normalization, and ResNet50. The models were designed with two main purposes. First, to classify images into seven categories, including melanocytic nevus, melanoma, benign keratosis-like lesions, basal cell carcinoma, actinic keratoses and intraepithelial carcinoma, vascular lesions, and dermatofibroma. Second, to classify images into benign or malignant. The models were trained using a dataset of 8012 images, and their performance was evaluated using 2003 images. It's worth noting that this model is trained end-to-end, directly from the image to the labels, without the need for handcrafted feature extraction.
Published: 2023

14. Structured Chain-of-Thought Prompting for Code Generation

Author: Li, Jia, Li, Ge, Li, Yongmin, and Jin, Zhi
Subjects: Computer Science - Software Engineering, Computer Science - Computation and Language
Abstract: Large Language Models (LLMs) (e.g., ChatGPT) have shown impressive performance in code generation. LLMs take prompts as inputs, and Chain-of-Thought (CoT) prompting is the state-of-the-art prompting technique. CoT prompting asks LLMs first to generate CoTs (i.e., intermediate natural language reasoning steps) and then output the code. However, CoT prompting is designed for natural language generation and has low accuracy in code generation. In this paper, we propose Structured CoTs (SCoTs) and present a novel prompting technique for code generation, named SCoT prompting. Our motivation is that source code contains rich structural information and any code can be composed of three program structures (i.e., sequence, branch, and loop structures). Intuitively, structured intermediate reasoning steps make for structured source code. Thus, we ask LLMs to use program structures to build CoTs, obtaining SCoTs. Then, LLMs generate the final code based on SCoTs. Compared to CoT prompting, SCoT prompting explicitly constrains LLMs to think about how to solve requirements from the view of source code and further the performance of LLMs in code generation. We apply SCoT prompting to two LLMs (i.e., ChatGPT and Codex) and evaluate it on three benchmarks (i.e., HumanEval, MBPP, and MBCPP). (1) SCoT prompting outperforms the state-of-the-art baseline, CoT prompting, by up to 13.79% in Pass@1. (2) Human evaluation shows human developers prefer programs from SCoT prompting. (3) SCoT prompting is robust to examples and achieves substantial improvements.
Comment: arXiv admin note: text overlap with arXiv:2303.17780
Published: 2023

15. Silicon photonics-integrated time-domain balanced homodyne detector in continuous-variable quantum key distribution

Author: Jia, Yanxiang, Wang, Xuyang, Hu, Xiao, Hua, Xin, Zhang, Yu, Guo, Xubo, Zhang, Shengxiang, Xiao, Xi, Yu, Shaohua, Zou, Jun, and Li, Yongmin
Subjects: Quantum Physics
Abstract: We designed and experimentally demonstrated a silicon photonics-integrated time-domain balanced homodyne detector (TBHD), whose optical part has dimensions of 1.5 mm × 0.4 mm. To automatically and accurately balance the detector, new variable optical attenuators were used, and a common mode rejection ratio of 86.9 dB could be achieved. In the quantum tomography experiment, the density matrix and Wigner function of a coherent state were reconstructed with 99.97% fidelity. The feasibility of this TBHD in a continuous-variable quantum key distribution (CVQKD) system was also demonstrated. This facilitates the integration of the optical circuits of the CVQKD system based on the GG02 protocol on the silicon photonics chip using TBHD.
Published: 2023

16. AceCoder: Utilizing Existing Code to Enhance Code Generation

Author: Li, Jia, Zhao, Yunfei, Li, Yongmin, Li, Ge, and Jin, Zhi
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: Large Language Models (LLMs) have shown great success in code generation. LLMs take a prompt as input and output code. A key question is how to construct prompts (i.e., prompting techniques). Existing prompting techniques are designed for natural language generation and have low accuracy in code generation. In this paper, we propose a new prompting technique named AceCoder. Our motivation is that code generation meets two unique challenges (i.e., requirement understanding and code implementation). AceCoder contains two novel mechanisms (i.e., guided code generation and example retrieval) to solve these challenges. (1) Guided code generation asks LLMs first to analyze requirements and output an intermediate preliminary (e.g., test cases). The preliminary is used to clarify requirements and tell LLMs "what to write". (2) Example retrieval selects similar programs as examples in prompts, which provide lots of relevant content (e.g., algorithms, APIs) and teach LLMs "how to write". We apply AceCoder to three LLMs (e.g., Codex) and evaluate it on three public benchmarks using the Pass@k metric. Results show that AceCoder can significantly improve the performance of LLMs on code generation. (1) In terms of Pass@1, AceCoder outperforms the state-of-the-art baseline by up to 56.4% in MBPP, 70.7% in MBJP, and 88.4% in MBJSP. (2) AceCoder is effective in LLMs with different sizes (i.e., 6B to 13B) and different languages (i.e., Python, Java, and JavaScript). (3) Human evaluation shows human developers prefer programs from AceCoder.
Published: 2023
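
The abstract above describes example retrieval: selecting programs whose requirements resemble the new requirement and placing them in the prompt. The Python sketch below shows one simple way this could work. It is an illustration, not AceCoder's retriever: the Jaccard token overlap, the prompt layout, the function names, and the three-entry corpus are all hypothetical.

```python
import re
from typing import List, Set, Tuple

Example = Tuple[str, str]  # (requirement text, program text)


def tokens(text: str) -> Set[str]:
    """Lowercased alphanumeric word tokens of a requirement."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve_examples(requirement: str, corpus: List[Example], top_k: int = 2) -> List[Example]:
    """Return the top_k corpus entries whose requirement best overlaps the query."""
    query = tokens(requirement)

    def jaccard(entry: Example) -> float:
        other = tokens(entry[0])
        union = query | other
        return len(query & other) / len(union) if union else 0.0

    return sorted(corpus, key=jaccard, reverse=True)[:top_k]


def build_prompt(requirement: str, examples: List[Example]) -> str:
    """Place retrieved examples before the new requirement (assumed layout)."""
    parts = [f"# Requirement: {req}\n{code}\n" for req, code in examples]
    parts.append(f"# Requirement: {requirement}\n")
    return "\n".join(parts)


# Hypothetical retrieval corpus.
corpus = [
    ("Return the sum of a list of numbers", "def total(xs):\n    return sum(xs)"),
    ("Reverse a string", "def rev(s):\n    return s[::-1]"),
    ("Return the maximum of a list of numbers", "def biggest(xs):\n    return max(xs)"),
]
query = "Return the minimum of a list of numbers"
prompt = build_prompt(query, retrieve_examples(query, corpus))
print(prompt)
```

Here the two list-aggregation examples are retrieved and the string-reversal example is left out, since it shares only one token with the query.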

17. Segmentation of Retinal Blood Vessels Using Deep Learning

Author: Anene, Ifeyinwa Linda and Li, Yongmin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: The morphology of retinal blood vessels can indicate various diseases in the human body, and researchers have been working on automatic scanning and segmentation of retinal images to aid diagnosis. This project compares the performance of four neural network architectures, namely UNet, DR-VNet, UNet-ResNet and UNet-VGG, in segmenting retinal images, using a combined dataset from different databases.
Published: 2023

18. Retinal Image Segmentation with Small Datasets

Author: Ndipenoch, Nchongmaje, Miron, Alina, Wang, Zidong, and Li, Yongmin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Many eye diseases like Diabetic Macular Edema (DME), Age-related Macular Degeneration (AMD), and Glaucoma manifest in the retina and can cause irreversible blindness or severely impair central vision. Optical Coherence Tomography (OCT), a 3D scan of the retina with high qualitative information about the retinal morphology, can be used to diagnose and monitor changes in the retinal anatomy. Many Deep Learning (DL) methods have shared the success of developing an automated tool to monitor pathological changes in the retina. However, the success of these methods depends mainly on large datasets. To address the challenge from very small and limited datasets, we proposed a DL architecture termed CoNet (Coherent Network) for joint segmentation of layers and fluids in retinal OCT images on very small datasets (fewer than a hundred training samples). The proposed model was evaluated on the publicly available Duke DME dataset consisting of 110 B-Scans from 10 patients suffering from DME. Experimental results show that the proposed model outperformed both the human experts' annotation and the current state-of-the-art architectures by a clear margin with a mean Dice Score of 88% when trained on 55 images without any data augmentation.
Comment: Submitted to Bioimaging 2023
Published: 2023

19. nnUNet RASPP for Retinal OCT Fluid Detection, Segmentation and Generalisation over Variations of Data Sources

Author: Ndipenoch, Nchongmaje, Miron, Alina, Wang, Zidong, and Li, Yongmin
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Retinal Optical Coherence Tomography (OCT), a noninvasive cross-sectional scan of the eye with qualitative 3D visualization of the retinal anatomy, is used to study the retinal structure and the presence of pathogens. The advent of retinal OCT has transformed ophthalmology, and it is currently paramount for the diagnosis, monitoring and treatment of many eye pathogens, including Macular Edema, which impairs vision severely, and Glaucoma, which can cause irreversible blindness. However, the quality of retinal OCT images varies among device manufacturers. Deep Learning methods have had their success in the medical image segmentation community, but it is still not clear whether that level of success can be generalised across OCT images collected from different device vendors. In this work we propose two variants of the nnUNet [8]: the standard nnUNet and an enhanced version called nnUnet_RASPP (nnU-Net with residual and Atrous Spatial Pyramid Pooling), both of which are robust and generalise with consistently high performance across images from multiple device vendors. The algorithm was validated on the MICCAI 2017 RETOUCH challenge dataset [1], acquired from 3 device vendors across 3 medical centers from patients suffering from 2 retinal disease types. Experimental results show that our algorithms outperform the current state-of-the-art algorithms by a clear margin for segmentation, obtaining a mean Dice Score (DS) of 82.3% for the 3 retinal fluids, scoring 84.0%, 80.0%, and 83.0% for Intraretinal Fluid (IRF), Subretinal Fluid (SRF), and Pigment Epithelium Detachments (PED) respectively on the testing dataset. We also obtained a perfect Area Under the Curve (AUC) score of 100% for the detection of the presence of fluid for all 3 fluid classes on the testing dataset.
Comment: 25 pages, 14 figures and 5 tables
Published: 2023

20. Experimental Demonstration of Sequential Multiparty Quantum Secret Sharing and Quantum Conference Key Agreement

Author: Liu, Shuaishuai, Lu, Zhengguo, Wang, Pu, Tian, Yan, Lu, Qing, Wang, Xuyang, and Li, Yongmin
Subjects: Physics - Optics, Quantum Physics
Abstract: Quantum secret sharing (QSS) and quantum conference key agreement (QCKA) provide efficient encryption approaches for realizing multi-party secure communication, which are essential components of future quantum networks. We present three practical, scalable, verifiable (k, n) threshold QSS protocols that are secure against eavesdroppers and dishonest players. The proposed QSS protocols eliminate the need for each player to prepare a laser source and for laser phase locking across all players. The dealer can implement the parameter evaluation and get the secret information of each player without the cooperation of other players. We consider the practical security of the proposed QSS systems under Trojan-horse attacks, untrusted source intensity fluctuations, and untrusted noisy sources. Our QSS systems are versatile: they can support the QCKA protocol by only modifying the classical post-processing, requiring no changes to the underlying hardware architecture. We experimentally implement the QSS and QCKA protocols with five parties over 25 km (55 km) single mode fibers, and achieve a key rate of 0.0061 (7.14×10^-4) bits per pulse. Our work paves the way for the practical applications of future QSS and QCKA.
Published: 2023
Full Text: View/download PDF

21. SkCoder: A Sketch-based Approach for Automatic Code Generation

Author: Li, Jia, Li, Yongmin, Li, Ge, Jin, Zhi, Hao, Yiyang, and Hu, Xing
Subjects: Computer Science - Software Engineering
Abstract: Recently, deep learning techniques have shown great success in automatic code generation. Inspired by code reuse, some researchers propose copy-based approaches that can copy the content from similar code snippets to obtain better performance. Practically, human developers recognize the content in the similar code that is relevant to their needs, which can be viewed as a code sketch. The sketch is further edited into the desired code. However, existing copy-based approaches ignore the code sketches and tend to repeat the similar code without necessary modifications, which leads to generating wrong results. In this paper, we propose a sketch-based code generation approach named SkCoder to mimic developers' code reuse behavior. Given a natural language requirement, SkCoder retrieves a similar code snippet, extracts relevant parts as a code sketch, and edits the sketch into the desired code. Our motivations are that the extracted sketch provides a well-formed pattern for telling models "how to write". The post-editing further adds requirement-specific details to the sketch and outputs the complete code. We conduct experiments on two public datasets and a new dataset collected by this work. We compare our approach to 20 baselines using 5 widely used metrics. Experimental results show that (1) SkCoder can generate more correct programs, and outperforms the state-of-the-art CodeT5-base by 30.30%, 35.39%, and 29.62% on three datasets. (2) Our approach is effective for multiple code generation models and improves them by up to 120.1% in Pass@1. (3) We investigate three plausible code sketches and discuss the importance of sketches. (4) We manually evaluate the generated code and prove the superiority of our SkCoder in three aspects.
Comment: Accepted by the 45th IEEE/ACM International Conference on Software Engineering (ICSE 2023)
Published: 2023

22. Discrete-modulation continuous-variable quantum key distribution with high key rate

Author: Wang, Pu, Liu, Jianqiang, Lu, Zhenguo, Wang, Xuyang, and Li, Yongmin
Subjects: Quantum Physics
Abstract: Discrete-modulation continuous-variable quantum key distribution has the potential for large-scale deployment in secure quantum communication networks due to its low implementation complexity and compatibility with current telecom systems. The security proof for the four-coherent-state phase-shift keying (4-PSK) protocol has recently been established by applying numerical methods. However, the achievable key rate is relatively low compared with the optimal Gaussian modulation scheme. To enhance the key rate of the discrete-modulation protocol, we first show that 8-PSK increases the key rate by about 60% in comparison to 4-PSK, whereas the key rate shows no significant improvement from 8-PSK to 12-PSK. We then expand the 12-PSK to a two-ring constellation structure with four states in the inner ring and eight states in the outer ring, which significantly improves the key rate to 2.4 times that of 4-PSK. The key rate of the two-ring constellation structure can reach 70% of the key rate achieved by Gaussian modulation in long-distance transmissions, making this protocol an attractive alternative for high-rate and low-cost applications in secure quantum communication networks.
Comment: Welcome comments
Published: 2021
Full Text: View/download PDF

23. Silicon photonics integrated dynamic polarization controller for continuous-variable quantum key distribution

Author: Wang, Xuyang, Jia, Yanxiang, Guo, Xubo, Liu, Jianqiang, Wang, Shaofeng, Liu, Wenyuan, Sun, Fangyuan, Zou, Jun, and Li, Yongmin
Subjects: Quantum Physics
Abstract: We designed and experimentally demonstrated a silicon photonics integrated dynamic polarization controller, which is a crucial component of a continuous-variable quantum key distribution system. By using a variable step simulated annealing approach, we achieve a dynamic polarization extinction ratio greater than 25 dB. The dynamic polarization controller can be utilized in silicon photonics integrated continuous-variable quantum key distribution systems to further minimize their size and decrease their cost.
Published: 2021
Full Text: View/download PDF

24. Retrieve and Refine: Exemplar-based Neural Comment Generation

Author: Wei, Bolin, Li, Yongmin, Li, Ge, Xia, Xin, and Jin, Zhi
Subjects: Computer Science - Software Engineering, Computer Science - Artificial Intelligence
Abstract: Code comment generation, which aims to automatically generate natural language descriptions for source code, is a crucial task in the field of automatic software development. Traditional comment generation methods use manually-crafted templates or information retrieval (IR) techniques to generate summaries for source code. In recent years, neural network-based methods, which leverage the acclaimed encoder-decoder deep learning framework to learn comment generation patterns from a large-scale parallel code corpus, have achieved impressive results. However, these emerging methods only take code-related information as input. Software reuse is common in the process of software development, meaning that comments of similar code snippets are helpful for comment generation. Inspired by the IR-based and template-based approaches, in this paper, we propose a neural comment generation approach where we use the existing comments of similar code snippets as exemplars to guide comment generation. Specifically, given a piece of code, we first use an IR technique to retrieve a similar code snippet and treat its comment as an exemplar. Then we design a novel seq2seq neural network that takes the given code, its AST, its similar code, and its exemplar as input, and leverages the information from the exemplar to assist in the target comment generation based on the semantic similarity between the source code and the similar code. We evaluate our approach on a large-scale Java corpus, which contains about 2M samples, and experimental results demonstrate that our model outperforms the state-of-the-art methods by a substantial margin.
Comment: to be published in the 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020) (ASE'20)
Published: 2020

25. Strong quantum entanglement via a controllable four wave mixing mechanism in an optomechanical system

Author: You, Xiang and Li, Yongmin
Subjects: Quantum Physics
Abstract: We propose an approach to generate strong quantum entanglement by the controllable four wave mixing mechanism in a single cavity, weak coupling optomechanical system. The optomechanical system is driven by a strong two tone pump field and a weak signal field, simultaneously. The two tone pump field consists of a lower and an upper sideband, which couple with the optical cavity and mechanical resonator, and generate the beam splitter and two mode squeezing interactions under the rotating wave approximation. This interaction mechanism modifies the effective susceptibility of the optomechanical cavity and optomechanically induces a four wave mixing process. Strong quantum entanglement can be generated between the signal and four wave mixing fields with an entangled degree over 16 dB in realistic optomechanical systems. The generation scheme of the quantum entanglement is quite robust against thermal mechanical noise, and entanglement above 3 dB can persist at room temperature in the weak coupling regime.
Comment: 18 pages, 7 figures
Published: 2019
Full Text: View/download PDF

26. Advantages of the coherent state compared with squeeze state in unidimensional continuous variable quantum key distribution

Author: Wang, Xuyang, Cao, Yanxia, Wang, Pu, and Li, Yongmin
Subjects: Quantum Physics
Abstract: In this work, a comparison study between unidimensional (UD) coherent-state and UD squeeze-state protocols is performed in the continuous variable quantum key distribution domain. First, the UD squeeze-state protocol is proposed and the equivalence between the prepare-and-measure and entanglement-based schemes of the UD squeeze-state protocol is proved. Then, the security of the UD squeeze-state protocol under collective attack in realistic conditions is analyzed. Lastly, the performances of the two UD protocols are analyzed. Based on the uniform expressions established in our study, the squeeze-state and coherent-state protocols can be analyzed simultaneously. Our results show that the UD squeeze-state protocols are quite different from the two-dimensional protocols in that the UD squeeze-state protocols have a poorer performance compared with UD coherent-state protocols, which is the opposite of the case for two-dimensional protocols.
Comment: 12 pages, 8 figures
Published: 2018
Full Text: View/download PDF
