1. OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection
- Authors
Fan Cui, Chenyang Yin, Kexing Zhou, Youwei Xiao, Guangyu Sun, Qiang Xu, Qipeng Guo, Demin Song, Dahua Lin, Xingcheng Zhang, and Yun Liang
- Subjects
Computer Science - Hardware Architecture, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
- Abstract
Recent studies have demonstrated the significant potential of Large Language Models (LLMs) in generating Register Transfer Level (RTL) code, with notable advancements showcased by commercial models such as GPT-4 and Claude3-Opus. However, these proprietary LLMs often raise concerns regarding privacy and security. While open-source LLMs offer solutions to these concerns, they typically underperform commercial models in RTL code generation tasks, primarily due to the scarcity of high-quality open-source RTL datasets. To address this challenge, we introduce OriGen, a fully open-source framework that incorporates self-reflection capabilities and a novel dataset augmentation methodology for generating high-quality, large-scale RTL code. Our approach employs a code-to-code augmentation technique to enhance the quality of open-source RTL code datasets. Furthermore, OriGen can rectify syntactic errors through a self-reflection process that leverages compiler feedback. Experimental results demonstrate that OriGen significantly outperforms other open-source alternatives in RTL code generation. It surpasses the previous best-performing open-source LLM by 12.8% and even exceeds GPT-4 Turbo in the pass@1 metric on the VerilogEval-Human benchmark. Moreover, OriGen exhibits superior capabilities in self-reflection and error correction, outperforming GPT-4 by 19.9% on a benchmark designed to evaluate self-reflection capabilities.
- Published
2024
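
The abstract describes a self-reflection loop in which the model repairs syntactic errors in its own RTL output using compiler diagnostics. The sketch below illustrates that general idea only; it is not the paper's implementation. It assumes Icarus Verilog (`iverilog`) as a stand-in compiler and a placeholder `llm_generate` callable in place of the fine-tuned model.

```python
import os
import subprocess
import tempfile


def compile_feedback(verilog_code: str) -> str:
    """Compile Verilog with iverilog and return its error output, or "" if it compiles cleanly."""
    with tempfile.NamedTemporaryFile(mode="w", suffix=".v", delete=False) as f:
        f.write(verilog_code)
        path = f.name
    try:
        result = subprocess.run(
            ["iverilog", "-o", os.devnull, path],
            capture_output=True, text=True,
        )
        return "" if result.returncode == 0 else result.stderr
    finally:
        os.remove(path)


def self_reflect(llm_generate, spec: str, max_rounds: int = 3) -> str:
    """Generate RTL for a spec, then iteratively ask the model to repair it using compiler errors."""
    code = llm_generate(f"Write Verilog RTL for the following specification:\n{spec}")
    for _ in range(max_rounds):
        errors = compile_feedback(code)
        if not errors:
            break  # syntactically clean; stop reflecting
        # Feed the compiler diagnostics back to the model for correction.
        code = llm_generate(
            "The following Verilog fails to compile.\n"
            f"Compiler errors:\n{errors}\n"
            f"Code:\n{code}\n"
            "Return a corrected version of the code."
        )
    return code
```

Bounding the number of reflection rounds keeps the loop from cycling indefinitely when the model cannot resolve a reported error.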