Author: "Wang, Zhefeng" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wang, Zhefeng"' showing total 302 results

Start Over Author "Wang, Zhefeng"

302 results on '"Wang, Zhefeng"'

1. Beware of Calibration Data for Pruning Large Language Models

Author: Ji, Yixin, Xiang, Yang, Li, Juntao, Xia, Qingrong, Li, Ping, Duan, Xinyu, Wang, Zhefeng, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: As large language models (LLMs) are widely applied across various fields, model compression has become increasingly crucial for reducing costs and improving inference efficiency. Post-training pruning is a promising method that does not require resource-intensive iterative training and only needs a small amount of calibration data to assess the importance of parameters. Previous research has primarily focused on designing advanced pruning methods, while different calibration data's impact on pruning performance still lacks systematical exploration. We fill this blank and surprisingly observe that the effects of calibration data even value more than designing advanced pruning strategies, especially for high sparsity. Our preliminary exploration also discloses that using calibration data similar to the training data can yield better performance. As pre-training data is usually inaccessible for advanced LLMs, we further provide a self-generating calibration data synthesis strategy to construct feasible calibration data. We conduct experiments on the recent strong open-source LLMs (e.g., DCLM, and LLaMA-3), and the results show that the proposed method outperforms commonly used calibration data and can effectively enhance strong pruning methods (e.g., Wanda, OWL)., Comment: under review
Published: 2024

2. A Convex-optimization-based Layer-wise Post-training Pruner for Large Language Models

Author: Zhao, Pengxiang, Hu, Hanyu, Li, Ping, Zheng, Yi, Wang, Zhefeng, and Yuan, Xiaoming
Subjects: Computer Science - Machine Learning, Mathematics - Optimization and Control
Abstract: Pruning is a critical strategy for compressing trained large language models (LLMs), aiming at substantial memory conservation and computational acceleration without compromising performance. However, existing pruning methods often necessitate inefficient retraining for billion-scale LLMs or rely on heuristic methods such as the optimal brain surgeon framework, which degrade performance. In this paper, we introduce FISTAPruner, the first post-training pruner based on convex optimization models and algorithms. Specifically, we propose a convex optimization model incorporating $\ell_1$ norm to induce sparsity and utilize the FISTA solver for optimization. FISTAPruner incorporates an intra-layer cumulative error correction mechanism and supports parallel pruning. We comprehensively evaluate FISTAPruner on models such as OPT, LLaMA, LLaMA-2, and LLaMA-3 with 125M to 70B parameters under unstructured and 2:4 semi-structured sparsity, demonstrating superior performance over existing state-of-the-art methods across various language benchmarks.
Published: 2024

3. MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis

Author: Yang, Qian, Zuo, Jialong, Su, Zhe, Jiang, Ziyue, Li, Mingze, Zhao, Zhou, Chen, Feiyang, Wang, Zhefeng, and Huai, Baoxing
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Sound
Abstract: We introduce an open source high-quality Mandarin TTS dataset MSceneSpeech (Multiple Scene Speech Dataset), which is intended to provide resources for expressive speech synthesis. MSceneSpeech comprises numerous audio recordings and texts performed and recorded according to daily life scenarios. Each scenario includes multiple speakers and a diverse range of prosodic styles, making it suitable for speech synthesis that entails multi-speaker style and prosody modeling. We have established a robust baseline, through the prompting mechanism, that can effectively synthesize speech characterized by both user-specific timbre and scene-specific prosody with arbitrary text input. The open source MSceneSpeech Dataset and audio samples of our baseline are available at https://speechai-demo.github.io/MSceneSpeech/., Comment: Accepted by INTERSPEECH 2024
Published: 2024

4. OPT-Tree: Speculative Decoding with Adaptive Draft Tree Structure

Author: Wang, Jikai, Su, Yi, Li, Juntao, Xia, Qingrong, Ye, Zi, Duan, Xinyu, Wang, Zhefeng, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: Autoregressive language models demonstrate excellent performance in various scenarios. However, the inference efficiency is limited by its one-step-one-word generation mode, which has become a pressing problem recently as the models become increasingly larger. Speculative decoding employs a "draft and then verify" mechanism to allow multiple tokens to be generated in one step, realizing lossless acceleration. Existing methods mainly adopt fixed heuristic draft structures, which fail to adapt to different situations to maximize the acceptance length during verification. To alleviate this dilemma, we proposed OPT-Tree, an algorithm to construct adaptive and scalable draft trees. It searches the optimal tree structure that maximizes the mathematical expectation of the acceptance length in each decoding step. Experimental results reveal that OPT-Tree outperforms the existing draft structures and achieves a speed-up ratio of up to 3.2 compared with autoregressive decoding. If the draft model is powerful enough and the node budget is sufficient, it can generate more than ten tokens in a single step. Our code is available at https://github.com/Jikai0Wang/OPT-Tree.
Published: 2024

5. Optimizing Large Model Training through Overlapped Activation Recomputation

Author: Chen, Ping, Zhang, Wenjie, He, Shuibing, Gu, Yingjie, Peng, Zhuwei, Huang, Kexin, Zhan, Xuan, Chen, Weijian, Zheng, Yi, Wang, Zhefeng, Yin, Yanlong, and Chen, Gang
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Machine Learning
Abstract: Large model training has been using recomputation to alleviate the memory pressure and pipelining to exploit the parallelism of data, tensor, and devices. The existing recomputation approaches may incur up to 40% overhead when training real-world models, e.g., the GPT model with 22B parameters. This is because they are executed on demand in the critical training path. In this paper, we design a new recomputation framework, Lynx, to reduce the overhead by overlapping the recomputation with communication occurring in training pipelines. It consists of an optimal scheduling algorithm (OPT) and a heuristic-based scheduling algorithm (HEU). OPT achieves a global optimum but suffers from a long search time. HEU was designed based on our observation that there are identical structures in large DNN models so that we can apply the same scheduling policy to all identical structures. HEU achieves a local optimum but reduces the search time by 99% compared to OPT. Our comprehensive evaluation using GPT models with 1.3B-20B parameters shows that both OPT and HEU outperform the state-of-the-art recomputation approaches (e.g., Megatron-LM and Checkmake) by 1.02-1.53x. HEU achieves a similar performance as OPT with a search time of 0.16s on average., Comment: 13 pages
Published: 2024

6. Adapprox: Adaptive Approximation in Adam Optimization via Randomized Low-Rank Matrices

Author: Zhao, Pengxiang, Li, Ping, Gu, Yingjie, Zheng, Yi, Kölker, Stephan Ludger, Wang, Zhefeng, and Yuan, Xiaoming
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Mathematics - Optimization and Control
Abstract: As deep learning models exponentially increase in size, optimizers such as Adam encounter significant memory consumption challenges due to the storage of first and second moment data. Current memory-efficient methods like Adafactor and CAME often compromise accuracy with their matrix factorization techniques. Addressing this, we introduce Adapprox, a novel approach that employs randomized low-rank matrix approximation for a more effective and accurate approximation of Adam's second moment. Adapprox features an adaptive rank selection mechanism, finely balancing accuracy and memory efficiency, and includes an optional cosine similarity guidance strategy to enhance stability and expedite convergence. In GPT-2 training and downstream tasks, Adapprox surpasses AdamW by achieving 34.5% to 49.9% and 33.8% to 49.9% memory savings for the 117M and 345M models, respectively, with the first moment enabled, and further increases these savings without the first moment. Besides, it enhances convergence speed and improves downstream task performance relative to its counterparts.
Published: 2024

7. Shai: A large language model for asset management

Author: Guo, Zhongyang, Jiang, Guanran, Zhang, Zhongdan, Li, Peng, Wang, Zhefeng, and Wang, Yinchun
Subjects: Quantitative Finance - Portfolio Management, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: This paper introduces "Shai" a 10B level large language model specifically designed for the asset management industry, built upon an open-source foundational model. With continuous pre-training and fine-tuning using a targeted corpus, Shai demonstrates enhanced performance in tasks relevant to its domain, outperforming baseline models. Our research includes the development of an innovative evaluation framework, which integrates professional qualification exams, tailored tasks, open-ended question answering, and safety assessments, to comprehensively assess Shai's capabilities. Furthermore, we discuss the challenges and implications of utilizing large language models like GPT-4 for performance assessment in asset management, suggesting a combination of automated evaluation and human judgment. Shai's development, showcasing the potential and versatility of 10B-level large language models in the financial sector with significant performance and modest computational requirements, hopes to provide practical insights and methodologies to assist industry peers in their similar endeavors.
Published: 2023

8. Reliability and Validity of the Chinese Mandarin Version of the Dysphagia Handicap Index

Author: Zeng, Hongji, Zeng, Jing, Zhao, Weijia, Luo, Siyu, Luo, Pengchao, Wang, Zhefeng, Tian, Qingfeng, and Zeng, Xi
Published: 2024
Full Text: View/download PDF

9. How Well Do Large Language Models Understand Syntax? An Evaluation by Asking Natural Language Questions

Author: Zhou, Houquan, Hou, Yang, Li, Zhenghua, Wang, Xuebin, Wang, Zhefeng, Duan, Xinyu, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: While recent advancements in large language models (LLMs) bring us closer to achieving artificial general intelligence, the question persists: Do LLMs truly understand language, or do they merely mimic comprehension through pattern recognition? This study seeks to explore this question through the lens of syntax, a crucial component of sentence comprehension. Adopting a natural language question-answering (Q&A) scheme, we craft questions targeting nine syntactic knowledge points that are most closely related to sentence comprehension. Experiments conducted on 24 LLMs suggest that most have a limited grasp of syntactic knowledge, exhibiting notable discrepancies across different syntactic knowledge points. In particular, questions involving prepositional phrase attachment pose the greatest challenge, whereas those concerning adjectival modifier and indirect object are relatively easier for LLMs to handle. Furthermore, a case study on the training dynamics of the LLMs reveals that the majority of syntactic knowledge is learned during the initial stages of training, hinting that simply increasing the number of training tokens may not be the `silver bullet' for improving the comprehension ability of LLMs., Comment: 20 pages, 6 figures
Published: 2023

10. Mirror: A Universal Framework for Various Information Extraction Tasks

Author: Zhu, Tong, Ren, Junfei, Yu, Zijian, Wu, Mengsong, Zhang, Guoliang, Qu, Xiaoye, Chen, Wenliang, Wang, Zhefeng, Huai, Baoxing, and Zhang, Min
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Sharing knowledge between information extraction tasks has always been a challenge due to the diverse data formats and task variations. Meanwhile, this divergence leads to information waste and increases difficulties in building complex applications in real scenarios. Recent studies often formulate IE tasks as a triplet extraction problem. However, such a paradigm does not support multi-span and n-ary extraction, leading to weak versatility. To this end, we reorganize IE problems into unified multi-slot tuples and propose a universal framework for various IE tasks, namely Mirror. Specifically, we recast existing IE tasks as a multi-span cyclic graph extraction problem and devise a non-autoregressive graph decoding algorithm to extract all spans in a single step. It is worth noting that this graph structure is incredibly versatile, and it supports not only complex IE tasks, but also machine reading comprehension and classification tasks. We manually construct a corpus containing 57 datasets for model pretraining, and conduct experiments on 30 datasets across 8 downstream tasks. The experimental results demonstrate that our model has decent compatibility and outperforms or reaches competitive performance with SOTA systems under few-shot and zero-shot settings. The code, model weights, and pretraining corpus are available at https://github.com/Spico197/Mirror ., Comment: Accepted to EMNLP23 main conference
Published: 2023

11. High-order Joint Constituency and Dependency Parsing

Author: Gu, Yanggan, Hou, Yang, Wang, Zhefeng, Duan, Xinyu, and Li, Zhenghua
Subjects: Computer Science - Computation and Language
Abstract: This work revisits the topic of jointly parsing constituency and dependency trees, i.e., to produce compatible constituency and dependency trees simultaneously for input sentences, which is attractive considering that the two types of trees are complementary in representing syntax. The original work of Zhou and Zhao (2019) performs joint parsing only at the inference phase. They train two separate parsers under the multi-task learning framework (i.e., one shared encoder and two independent decoders). They design an ad-hoc dynamic programming-based decoding algorithm of $O(n^5)$ time complexity for finding optimal compatible tree pairs. Compared to their work, we make progress in three aspects: (1) adopting a much more efficient decoding algorithm of $O(n^4)$ time complexity, (2) exploring joint modeling at the training phase, instead of only at the inference phase, (3) proposing high-order scoring components to promote constituent-dependency interaction. We conduct experiments and analysis on seven languages, covering both rich-resource and low-resource scenarios. Results and analysis show that joint modeling leads to a modest overall performance boost over separate modeling, but substantially improves the complete matching ratio of whole trees, thanks to the explicit modeling of tree compatibility., Comment: LREC-COLING 2024
Published: 2023

12. Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation

Author: Wu, Likang, Li, Zhi, Zhao, Hongke, Wang, Zhefeng, Liu, Qi, Huai, Baoxing, Yuan, Nicholas Jing, and Chen, Enhong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Zero-Shot Learning (ZSL), which aims at automatically recognizing unseen objects, is a promising learning paradigm to understand new real-world knowledge for machines continuously. Recently, the Knowledge Graph (KG) has been proven as an effective scheme for handling the zero-shot task with large-scale and non-attribute data. Prior studies always embed relationships of seen and unseen objects into visual information from existing knowledge graphs to promote the cognitive ability of the unseen data. Actually, real-world knowledge is naturally formed by multimodal facts. Compared with ordinary structural knowledge from a graph perspective, multimodal KG can provide cognitive systems with fine-grained knowledge. For example, the text description and visual content can depict more critical details of a fact than only depending on knowledge triplets. Unfortunately, this multimodal fine-grained knowledge is largely unexploited due to the bottleneck of feature alignment between different modalities. To that end, we propose a multimodal intensive ZSL framework that matches regions of images with corresponding semantic embeddings via a designed dense attention module and self-calibration loss. It makes the semantic transfer process of our ZSL framework learns more differentiated knowledge between entities. Our model also gets rid of the performance limitation of only using rough global features. We conduct extensive experiments and evaluate our model on large-scale real-world data. The experimental results clearly demonstrate the effectiveness of the proposed model in standard zero-shot classification tasks., Comment: arXiv admin note: text overlap with arXiv:1805.11724 by other authors
Published: 2023
Full Text: View/download PDF

13. AraMUS: Pushing the Limits of Data and Model Scale for Arabic Natural Language Processing

Author: Alghamdi, Asaad, Duan, Xinyu, Jiang, Wei, Wang, Zhenhai, Wu, Yimeng, Xia, Qingrong, Wang, Zhefeng, Zheng, Yi, Rezagholizadeh, Mehdi, Huai, Baoxing, Cheng, Peilun, and Ghaddar, Abbas
Subjects: Computer Science - Computation and Language
Abstract: Developing monolingual large Pre-trained Language Models (PLMs) is shown to be very successful in handling different tasks in Natural Language Processing (NLP). In this work, we present AraMUS, the largest Arabic PLM with 11B parameters trained on 529GB of high-quality Arabic textual data. AraMUS achieves state-of-the-art performances on a diverse set of Arabic classification and generative tasks. Moreover, AraMUS shows impressive few-shot learning abilities compared with the best existing Arabic PLMs.
Published: 2023

14. Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation Framework

Author: Gao, Mingqi, Wan, Xiaojun, Su, Jia, Wang, Zhefeng, and Huai, Baoxing
Subjects: Computer Science - Computation and Language
Abstract: Factuality is important to dialogue summarization. Factual error correction (FEC) of model-generated summaries is one way to improve factuality. Current FEC evaluation that relies on factuality metrics is not reliable and detailed enough. To address this problem, we are the first to manually annotate a FEC dataset for dialogue summarization containing 4000 items and propose FERRANTI, a fine-grained evaluation framework based on reference correction that automatically evaluates the performance of FEC models on different error categories. Using this evaluation framework, we conduct sufficient experiments with FEC approaches under a variety of settings and find the best training modes and significant differences in the performance of the existing approaches on different factual error categories., Comment: Accepted to ACL 2023 Main Conference
Published: 2023

15. Plasma-derived exosomal long noncoding RNAs of pancreatic cancer patients as novel blood-based biomarkers of disease

Author: He, Xiaomeng, Chen, Litian, Di, Yang, Li, Wenyang, Zhang, Xin, Bai, Zhihui, Wang, Zhefeng, Liu, Shanshan, Corpe, Christopher, and Wang, Jin
Published: 2024
Full Text: View/download PDF

16. Direct transmission of severe fever with thrombocytopenia syndrome virus from farm-raised fur animals to workers in Weihai, China

Author: Li, Jizhao, Wang, Chunping, Li, Xiang, Zhang, Guoying, Sun, Shunzeng, Wang, Zhefeng, Zhao, Jian, Xiu, Linqing, Jiang, Nianchen, Zhang, Huajiang, Yang, Zhenghui, and Zhang, Jinbo
Published: 2024
Full Text: View/download PDF

17. CopyNE: Better Contextual ASR by Copying Named Entities

Author: Zhou, Shilin, Li, Zhenghua, Hong, Yu, Zhang, Min, Wang, Zhefeng, and Huai, Baoxing
Subjects: Computer Science - Computation and Language
Abstract: End-to-end automatic speech recognition (ASR) systems have made significant progress in general scenarios. However, it remains challenging to transcribe contextual named entities (NEs) in the contextual ASR scenario. Previous approaches have attempted to address this by utilizing the NE dictionary. These approaches treat entities as individual tokens and generate them token-by-token, which may result in incomplete transcriptions of entities. In this paper, we treat entities as indivisible wholes and introduce the idea of copying into ASR. We design a systematic mechanism called CopyNE, which can copy entities from the NE dictionary. By copying all tokens of an entity at once, we can reduce errors during entity transcription, ensuring the completeness of the entity. Experiments demonstrate that CopyNE consistently improves the accuracy of transcribing entities compared to previous approaches. Even when based on the strong Whisper, CopyNE still achieves notable improvements., Comment: ACL 2024
Published: 2023

18. CED: Catalog Extraction from Documents

Author: Zhu, Tong, Zhang, Guoliang, Li, Zechang, Yu, Zijian, Ren, Junfei, Wu, Mengsong, Wang, Zhefeng, Huai, Baoxing, Chao, Pingfu, and Chen, Wenliang
Subjects: Computer Science - Computation and Language
Abstract: Sentence-by-sentence information extraction from long documents is an exhausting and error-prone task. As the indicator of document skeleton, catalogs naturally chunk documents into segments and provide informative cascade semantics, which can help to reduce the search space. Despite their usefulness, catalogs are hard to be extracted without the assist from external knowledge. For documents that adhere to a specific template, regular expressions are practical to extract catalogs. However, handcrafted heuristics are not applicable when processing documents from different sources with diverse formats. To address this problem, we build a large manually annotated corpus, which is the first dataset for the Catalog Extraction from Documents (CED) task. Based on this corpus, we propose a transition-based framework for parsing documents into catalog trees. The experimental results demonstrate that our proposed method outperforms baseline systems and shows a good ability to transfer. We believe the CED task could fill the gap between raw text segments and information extraction tasks on extremely long documents. Data and code are available at \url{https://github.com/Spico197/CatalogExtraction}
Published: 2023

19. A Survey on Arabic Named Entity Recognition: Past, Recent Advances, and Future Trends

Author: Qu, Xiaoye, Gu, Yingjie, Xia, Qingrong, Li, Zechang, Wang, Zhefeng, and Huai, Baoxing
Subjects: Computer Science - Computation and Language
Abstract: As more and more Arabic texts emerged on the Internet, extracting important information from these Arabic texts is especially useful. As a fundamental technology, Named entity recognition (NER) serves as the core component in information extraction technology, while also playing a critical role in many other Natural Language Processing (NLP) systems, such as question answering and knowledge graph building. In this paper, we provide a comprehensive review of the development of Arabic NER, especially the recent advances in deep learning and pre-trained language model. Specifically, we first introduce the background of Arabic NER, including the characteristics of Arabic and existing resources for Arabic NER. Then, we systematically review the development of Arabic NER methods. Traditional Arabic NER systems focus on feature engineering and designing domain-specific rules. In recent years, deep learning methods achieve significant progress by representing texts via continuous vector representations. With the growth of pre-trained language model, Arabic NER yields better performance. Finally, we conclude the method gap between Arabic NER and NER methods from other languages, which helps outline future directions for Arabic NER., Comment: Accepted by IEEE TKDE
Published: 2023

20. Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-grained Student Ensemble

Author: Qu, Xiaoye, Zeng, Jun, Liu, Daizong, Wang, Zhefeng, Huai, Baoxing, and Zhou, Pan
Subjects: Computer Science - Computation and Language
Abstract: Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the data scarcity problem in NER by automatically generating training samples. Unfortunately, the distant supervision may induce noisy labels, thus undermining the robustness of the learned models and restricting the practical application. To relieve this problem, recent works adopt self-training teacher-student frameworks to gradually refine the training labels and improve the generalization ability of NER models. However, we argue that the performance of the current self-training frameworks for DS-NER is severely underestimated by their plain designs, including both inadequate student learning and coarse-grained teacher updating. Therefore, in this paper, we make the first attempt to alleviate these issues by proposing: (1) adaptive teacher learning comprised of joint training of two teacher-student networks and considering both consistent and inconsistent predictions between two teachers, thus promoting comprehensive student learning. (2) fine-grained student ensemble that updates each fragment of the teacher model with a temporal moving average of the corresponding fragment of the student, which enhances consistent predictions on each model fragment against noise. To verify the effectiveness of our proposed method, we conduct experiments on four DS-NER datasets. The experimental results demonstrate that our method significantly surpasses previous SOTA methods., Comment: Accepted at AAAI 2023
Published: 2022

21. Mining Word Boundaries in Speech as Naturally Annotated Word Segmentation Data

Author: Zhang, Lei, Li, Zhenghua, Zhou, Shilin, Gong, Chen, Wang, Zhefeng, Huai, Baoxing, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: Inspired by early research on exploring naturally annotated data for Chinese word segmentation (CWS), and also by recent research on integration of speech and text processing, this work for the first time proposes to mine word boundaries from parallel speech/text data. First we collect parallel speech/text data from two Internet sources that are related with CWS data used in our experiments. Then, we obtain character-level alignments and design simple heuristic rules for determining word boundaries according to pause duration between adjacent characters. Finally, we present an effective complete-then-train strategy that can better utilize extra naturally annotated data for model training. Experiments demonstrate our approach can significantly boost CWS performance in both cross-domain and low-resource scenarios., Comment: latest version
Published: 2022

22. Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding

Author: Ghaddar, Abbas, Wu, Yimeng, Bagga, Sunyam, Rashid, Ahmad, Bibi, Khalil, Rezagholizadeh, Mehdi, Xing, Chao, Wang, Yasheng, Xinyu, Duan, Wang, Zhefeng, Huai, Baoxing, Jiang, Xin, Liu, Qun, and Langlais, Philippe
Subjects: Computer Science - Computation and Language
Abstract: There is a growing body of work in recent years to develop pre-trained language models (PLMs) for the Arabic language. This work concerns addressing two major problems in existing Arabic PLMs which constraint progress of the Arabic NLU and NLG fields.First, existing Arabic PLMs are not well-explored and their pre-trainig can be improved significantly using a more methodical approach. Second, there is a lack of systematic and reproducible evaluation of these models in the literature. In this work, we revisit both the pre-training and evaluation of Arabic PLMs. In terms of pre-training, we explore improving Arabic LMs from three perspectives: quality of the pre-training data, size of the model, and incorporating character-level information. As a result, we release three new Arabic BERT-style models ( JABER, Char-JABER, and SABER), and two T5-style models (AT5S and AT5B). In terms of evaluation, we conduct a comprehensive empirical study to systematically evaluate the performance of existing state-of-the-art models on ALUE that is a leaderboard-powered benchmark for Arabic NLU tasks, and on a subset of the ARGEN benchmark for Arabic NLG tasks. We show that our models significantly outperform existing Arabic PLMs and achieve a new state-of-the-art performance on discriminative and generative Arabic NLU and NLG tasks. Our models and source code to reproduce of results will be made available shortly.
Published: 2022

23. Delving Deep into Regularity: A Simple but Effective Method for Chinese Named Entity Recognition

Author: Gu, Yingjie, Qu, Xiaoye, Wang, Zhefeng, Zheng, Yi, Huai, Baoxing, and Yuan, Nicholas Jing
Subjects: Computer Science - Computation and Language
Abstract: Recent years have witnessed the improving performance of Chinese Named Entity Recognition (NER) from proposing new frameworks or incorporating word lexicons. However, the inner composition of entity mentions in character-level Chinese NER has been rarely studied. Actually, most mentions of regular types have strong name regularity. For example, entities end with indicator words such as "company" or "bank" usually belong to organization. In this paper, we propose a simple but effective method for investigating the regularity of entity spans in Chinese NER, dubbed as Regularity-Inspired reCOgnition Network (RICON). Specifically, the proposed model consists of two branches: a regularity-aware module and a regularityagnostic module. The regularity-aware module captures the internal regularity of each span for better entity type prediction, while the regularity-agnostic module is employed to locate the boundary of entities and relieve the excessive attention to span regularity. An orthogonality space is further constructed to encourage two modules to extract different aspects of regularity features. To verify the effectiveness of our method, we conduct extensive experiments on three benchmark datasets and a practical medical dataset. The experimental results show that our RICON significantly outperforms previous state-of-the-art methods, including various lexicon-based methods., Comment: Accepted at NAACL 2022 Findings
Published: 2022

24. Efficient Document-level Event Extraction via Pseudo-Trigger-aware Pruned Complete Graph

Author: Zhu, Tong, Qu, Xiaoye, Chen, Wenliang, Wang, Zhefeng, Huai, Baoxing, Yuan, Nicholas Jing, and Zhang, Min
Subjects: Computer Science - Computation and Language
Abstract: Most previous studies of document-level event extraction mainly focus on building argument chains in an autoregressive way, which achieves a certain success but is inefficient in both training and inference. In contrast to the previous studies, we propose a fast and lightweight model named as PTPCG. In our model, we design a novel strategy for event argument combination together with a non-autoregressive decoding algorithm via pruned complete graphs, which are constructed under the guidance of the automatically selected pseudo triggers. Compared to the previous systems, our system achieves competitive results with 19.8\% of parameters and much lower resource consumption, taking only 3.8\% GPU hours for training and up to 8.5 times faster for inference. Besides, our model shows superior compatibility for the datasets with (or without) triggers and the pseudo triggers can be the supplements for annotated triggers to make further improvements. Codes are available at https://github.com/Spico197/DocEE ., Comment: Accepted to IJCAI'2022
Published: 2021

25. JABER and SABER: Junior and Senior Arabic BERt

Author: Ghaddar, Abbas, Wu, Yimeng, Rashid, Ahmad, Bibi, Khalil, Rezagholizadeh, Mehdi, Xing, Chao, Wang, Yasheng, Xinyu, Duan, Wang, Zhefeng, Huai, Baoxing, Jiang, Xin, Liu, Qun, and Langlais, Philippe
Subjects: Computer Science - Computation and Language
Abstract: Language-specific pre-trained models have proven to be more accurate than multilingual ones in a monolingual evaluation setting, Arabic is no exception. However, we found that previously released Arabic BERT models were significantly under-trained. In this technical report, we present JABER and SABER, Junior and Senior Arabic BERt respectively, our pre-trained language model prototypes dedicated for Arabic. We conduct an empirical study to systematically evaluate the performance of models across a diverse set of existing Arabic NLU tasks. Experimental results show that JABER and SABER achieve state-of-the-art performances on ALUE, a new benchmark for Arabic Language Understanding Evaluation, as well as on a well-established NER benchmark., Comment: Technical Report; v2: add SABER and CAMeLBERT evaluation; v3: fix minor typos and grammatical errors
Published: 2021

26. SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation

Author: Huang, Rongjie, Cui, Chenye, Chen, Feiyang, Ren, Yi, Liu, Jinglin, Zhao, Zhou, Huai, Baoxing, and Wang, Zhefeng
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Multimedia, Computer Science - Sound
Abstract: Deep generative models have achieved significant progress in speech synthesis to date, while high-fidelity singing voice synthesis is still an open problem for its long continuous pronunciation, rich high-frequency parts, and strong expressiveness. Existing neural vocoders designed for text-to-speech cannot directly be applied to singing voice synthesis because they result in glitches and poor high-frequency reconstruction. In this work, we propose SingGAN, a generative adversarial network designed for high-fidelity singing voice synthesis. Specifically, 1) to alleviate the glitch problem in the generated samples, we propose source excitation with the adaptive feature learning filters to expand the receptive field patterns and stabilize long continuous signal generation; and 2) SingGAN introduces global and local discriminators at different scales to enrich low-frequency details and promote high-frequency reconstruction; and 3) To improve the training efficiency, SingGAN includes auxiliary spectrogram losses and sub-band feature matching penalty loss. To the best of our knowledge, SingGAN is the first work designed toward high-fidelity singing voice vocoding. Our evaluation of SingGAN demonstrates the state-of-the-art results with higher-quality (MOS 4.05) samples. Also, SingGAN enables a sample speed of 50x faster than real-time on a single NVIDIA 2080Ti GPU. We further show that SingGAN generalizes well to the mel-spectrogram inversion of unseen singers, and the end-to-end singing voice synthesis system SingGAN-SVS enjoys a two-stage pipeline to transform the music scores into expressive singing voices. Audio samples are available at \url{https://SingGAN.github.io/}, Comment: Accepted by ACM Multimedia 2022
Published: 2021

27. Optimization of impeller contour and diversion blade of a moist air turbo expander with spontaneous condensation

Author: Yang, Xiaoling, Chen, Liang, Wang, Zhefeng, Song, Shujian, Zhang, Ze, Chen, Shuangtao, and Hou, Yu
Published: 2024
Full Text: View/download PDF

28. An In-depth Study on Internal Structure of Chinese Words

Author: Gong, Chen, Huang, Saihao, Zhou, Houquan, Li, Zhenghua, Zhang, Min, Wang, Zhefeng, Huai, Baoxing, and Yuan, Nicholas Jing
Subjects: Computer Science - Computation and Language
Abstract: Unlike English letters, Chinese characters have rich and specific meanings. Usually, the meaning of a word can be derived from its constituent characters in some way. Several previous works on syntactic parsing propose to annotate shallow word-internal structures for better utilizing character-level information. This work proposes to model the deep internal structures of Chinese words as dependency trees with 11 labels for distinguishing syntactic relationships. First, based on newly compiled annotation guidelines, we manually annotate a word-internal structure treebank (WIST) consisting of over 30K multi-char words from Chinese Penn Treebank. To guarantee quality, each word is independently annotated by two annotators and inconsistencies are handled by a third senior annotator. Second, we present detailed and interesting analysis on WIST to reveal insights on Chinese word formation. Third, we propose word-internal structure parsing as a new task, and conduct benchmark experiments using a competitive dependency parser. Finally, we present two simple ways to encode word-internal structures, leading to promising gains on the sentence-level syntactic parsing task., Comment: Accepted by ACL-IJCNLP 2021 (long paper)
Published: 2021

29. Increased NAD+ levels protect female mouse reproductive system against zearalenone-impaired glycolysis, lipid metabolism, antioxidant capacity and inflammation

Author: Dai, Chao, Hou, Mengqian, Yang, Xudong, Wang, Zhefeng, Sun, Changpo, Wu, Xin, and Wang, Shujin
Published: 2024
Full Text: View/download PDF

30. Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking

Author: Gu, Yingjie, Qu, Xiaoye, Wang, Zhefeng, Huai, Baoxing, Yuan, Nicholas Jing, and Gui, Xiaolin
Subjects: Computer Science - Computation and Language
Abstract: Entity linking (EL) for the rapidly growing short text (e.g. search queries and news titles) is critical to industrial applications. Most existing approaches relying on adequate context for long text EL are not effective for the concise and sparse short text. In this paper, we propose a novel framework called Multi-turn Multiple-choice Machine reading comprehension (M3}) to solve the short text EL from a new perspective: a query is generated for each ambiguous mention exploiting its surrounding context, and an option selection module is employed to identify the golden entity from candidates using the query. In this way, M3 framework sufficiently interacts limited context with candidate entities during the encoding process, as well as implicitly considers the dissimilarities inside the candidate bunch in the selection stage. In addition, we design a two-stage verifier incorporated into M3 to address the commonly existed unlinkable problem in short text. To further consider the topical coherence and interdependence among referred entities, M3 leverages a multi-turn fashion to deal with mentions in a sequence manner by retrospecting historical cues. Evaluation shows that our M3 framework achieves the state-of-the-art performance on five Chinese and English datasets for the real-world short text EL., Comment: Accepted at AAAI 2021
Published: 2021

31. Identification of the transcriptome signatures and immune-inflammatory responses in postmenopausal osteoporosis

Author: Gao, Pan, Pan, Xiaoguang, Wang, Shang, Guo, Sijia, Dong, Zhanying, Wang, Zhefeng, Liang, Xue, Chen, Yan, Fang, Fang, Yang, Ling, Huang, Jinrong, Zhang, Chenxi, Li, Conghui, Luo, Yonglun, Peng, Songlin, and Xu, Fengping
Published: 2024
Full Text: View/download PDF

32. Finding Route Hotspots in Large Labeled Networks

Author: Lei, Mingtao, Zhang, Xi, Chu, Lingyang, Wang, Zhefeng, Yu, Philip S., and Fang, Binxing
Subjects: Computer Science - Data Structures and Algorithms
Abstract: In many advanced network analysis applications, like social networks, e-commerce, and network security, hotspots are generally considered as a group of vertices that are tightly connected owing to the similar characteristics, such as common habits and location proximity. In this paper, we investigate the formation of hotspots from an alternative perspective that considers the routes along the network paths as the auxiliary information, and attempt to find the route hotspots in large labeled networks. A route hotspot is a cohesive subgraph that is covered by a set of routes, and these routes correspond to the same sequential pattern consisting of vertices' labels. To the best of our knowledge, the problem of Finding Route Hotspots in Large Labeled Networks has not been tackled in the literature. However, it is challenging as counting the number of hotspots in a network is #P-hard. Inspired by the observation that the sizes of hotspots decrease with the increasing lengths of patterns, we prove several anti-monotonicity properties of hotspots, and then develop a scalable algorithm called FastRH that can use these properties to effectively prune the patterns that cannot form any hotspots. In addition, to avoid the duplicate computation overhead, we judiciously design an effective index structure called RH-Index for storing the hotspot and pattern information collectively, which also enables incremental updating and efficient query processing. Our experimental results on real-world datasets clearly demonstrate the effectiveness and scalability of our proposed methods.
Published: 2019

33. Occurrence forms and environmental characteristics of phosphorus in water column and sediment of urban waterbodies replenished by reclaimed water

Author: Zhu, Zheng, Wang, Zhefeng, Yu, Yanan, Tan, Liqiong, Suo, Siyu, Zhu, Ting, Xu, Chongxuan, Liu, Chuang, Qin, Yiming, and Liang, Wenyan
Published: 2023
Full Text: View/download PDF

34. Distribution, source, risk and phytoremediation of polycyclic aromatic hydrocarbons (PAHs) in typical urban landscape waters recharged by reclaimed water

Author: Zhu, Zheng, Li, Lanxin, Yu, Yanan, Tan, Liqiong, Wang, Zhefeng, Suo, Siyu, Liu, Chuang, Qin, Yiming, Peng, Xianchun, Lu, Haoran, and Liang, Wenyan
Published: 2023
Full Text: View/download PDF

35. Study on the coupled characteristics of high-speed centrifugal compressor and turboexpander of a reverse Brayton air refrigerator

Author: Yang, Xiaoling, Chen, Liang, Wang, Zhefeng, Chen, Shuangtao, and Hou, Yu
Published: 2023
Full Text: View/download PDF

36. CED: Catalog Extraction from Documents

Author: Zhu, Tong, primary, Zhang, Guoliang, additional, Li, Zechang, additional, Yu, Zijian, additional, Ren, Junfei, additional, Wu, Mengsong, additional, Wang, Zhefeng, additional, Huai, Baoxing, additional, Chao, Pingfu, additional, and Chen, Wenliang, additional
Published: 2023
Full Text: View/download PDF

37. Chemokine-like receptor 1 deficiency impedes macrophage phenotypic transformation and cardiac repair after myocardial infarction

Author: Wang, Caiping, Zhang, Min, Yan, Jianlong, Wang, Rongning, Wang, Zhefeng, Sun, Xin, and Dong, Shaohong
Published: 2023
Full Text: View/download PDF

38. Creation of a ready-to-use brexpiprazole suspension and the inflammation-mediated pharmacokinetics by intramuscular administration

Author: Wang, Junji, Liu, Junfeng, Ding, Jingwen, Li, Qin, Zhao, Yuan, Gao, Dongxu, Su, Keyi, Yang, Yani, Wang, Zhefeng, and He, Jun
Published: 2023
Full Text: View/download PDF

39. Adenosine kinase promotes post-infarction cardiac repair by epigenetically maintaining reparative macrophage phenotype

Author: Zhang, Min, Wang, Caiping, Wang, Rongning, Xu, Jiean, Wang, Zhefeng, Yan, Jianlong, Cai, Yongfeng, Li, Liangping, Huo, Yuqing, and Dong, Shaohong
Published: 2023
Full Text: View/download PDF

40. Mining Top-k Sequential Patterns in Database Graphs:A New Challenging Problem and a Sampling-based Approach

Author: Lei, Mingtao, Chu, Lingyang, and Wang, Zhefeng
Subjects: Computer Science - Databases
Abstract: In many real world networks, a vertex is usually associated with a transaction database that comprehensively describes the behaviour of the vertex. A typical example is the social network, where the behaviour of every user is depicted by a transaction database that stores his daily posted contents. A transaction database is a set of transactions, where a transaction is a set of items. Every path of the network is a sequence of vertices that induces multiple sequences of transactions. The sequences of transactions induced by all of the paths in the network forms an extremely large sequence database. Finding frequent sequential patterns from such sequence database discovers interesting subsequences that frequently appear in many paths of the network. However, it is a challenging task, since the sequence database induced by a database graph is too large to be explicitly induced and stored. In this paper, we propose the novel notion of database graph, which naturally models a wide spectrum of real world networks by associating each vertex with a transaction database. Our goal is to find the top-k frequent sequential patterns in the sequence database induced from a database graph. We prove that this problem is #P-hard. To tackle this problem, we propose an efficient two-step sampling algorithm that approximates the top-k frequent sequential patterns with provable quality guarantee. Extensive experimental results on synthetic and real-world data sets demonstrate the effectiveness and efficiency of our method.
Published: 2018

41. Tracking Top-K Influential Vertices in Dynamic Networks

Author: Yang, Yu, Wang, Zhefeng, Jin, Tianyuan, Pei, Jian, and Chen, Enhong
Subjects: Computer Science - Social and Information Networks
Abstract: Influence propagation in networks has enjoyed fruitful applications and has been extensively studied in literature. However, only very limited preliminary studies tackled the challenges in handling highly dynamic changes in real networks. In this paper, we tackle the problem of tracking top-$k$ influential vertices in dynamic networks, where the dynamic changes are modeled as a stream of edge weight updates. Under the popularly adopted linear threshold (LT) model and the independent cascade (IC) model, we address two essential versions of the problem: tracking the top-$k$ influential individuals and finding the best $k$-seed set to maximize the influence spread (Influence Maximization). We adopt the polling-based method and maintain a sample of random RR sets so that we can approximate the influence of vertices with provable quality guarantees. It is known that updating RR sets over dynamic changes of a network can be easily done by a reservoir sampling method, so the key challenge is to efficiently decide how many RR sets are needed to achieve good quality guarantees. We use two simple signals, which both can be accessed in $O(1)$ time, to decide a proper number of RR sets. We prove the effectiveness of our methods. For both tasks the error incurred in our method is only a multiplicative factor to the ground truth. For influence maximization, we also propose an efficient query algorithm for finding the $k$ seeds, which is one order of magnitude faster than the state-of-the-art query algorithm in practice. In addition to the thorough theoretical results, our experimental results on large real networks clearly demonstrate the effectiveness and efficiency of our algorithms.
Published: 2018

42. Mining Density Contrast Subgraphs

Author: Yang, Yu, Chu, Lingyang, Zhang, Yanyan, Wang, Zhefeng, Pei, Jian, and Chen, Enhong
Subjects: Computer Science - Social and Information Networks
Abstract: Dense subgraph discovery is a key primitive in many graph mining applications, such as detecting communities in social networks and mining gene correlation from biological data. Most studies on dense subgraph mining only deal with one graph. However, in many applications, we have more than one graph describing relations among a same group of entities. In this paper, given two graphs sharing the same set of vertices, we investigate the problem of detecting subgraphs that contrast the most with respect to density. We call such subgraphs Density Contrast Subgraphs, or DCS in short. Two widely used graph density measures, average degree and graph affinity, are considered. For both density measures, mining DCS is equivalent to mining the densest subgraph from a "difference" graph, which may have both positive and negative edge weights. Due to the existence of negative edge weights, existing dense subgraph detection algorithms cannot identify the subgraph we need. We prove the computational hardness of mining DCS under the two graph density measures and develop efficient algorithms to find DCS. We also conduct extensive experiments on several real-world datasets to evaluate our algorithms. The experimental results show that our algorithms are both effective and efficient., Comment: Full version of an ICDE'18 paper
Published: 2018

43. Robust Estimation of Similarity Transformation for Visual Object Tracking

Author: Li, Yang, Zhu, Jianke, Hoi, Steven C. H., Song, Wenjie, Wang, Zhefeng, and Liu, Hantang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Most of existing correlation filter-based tracking approaches only estimate simple axis-aligned bounding boxes, and very few of them is capable of recovering the underlying similarity transformation. To tackle this challenging problem, in this paper, we propose a new correlation filter-based tracker with a novel robust estimation of similarity transformation on the large displacements. In order to efficiently search in such a large 4-DoF space in real-time, we formulate the problem into two 2-DoF sub-problems and apply an efficient Block Coordinates Descent solver to optimize the estimation result. Specifically, we employ an efficient phase correlation scheme to deal with both scale and rotation changes simultaneously in log-polar coordinates. Moreover, a variant of correlation filter is used to predict the translational motion individually. Our experimental results demonstrate that the proposed tracker achieves very promising prediction performance compared with the state-of-the-art visual object tracking methods while still retaining the advantages of high efficiency and simplicity in conventional correlation filter-based tracking methods., Comment: Accepted by AAAI 2019
Published: 2017

44. Beneficial effect of the short-chain fatty acid propionate on vascular calcification through intestinal microbiota remodelling

Author: Yan, Jianlong, Pan, Yanbin, Shao, Wenming, Wang, Caiping, Wang, Rongning, He, Yaqiong, Zhang, Min, Wang, Yongshun, Li, Tangzhiming, Wang, Zhefeng, Liu, Wenxing, Wang, Zhenmin, Sun, Xin, and Dong, Shaohong
Published: 2022
Full Text: View/download PDF

45. Finding Theme Communities from Database Networks

Author: Chu, Lingyang, Wang, Zhefeng, Pei, Jian, Zhang, Yanyan, Yang, Yu, and Chen, Enhong
Subjects: Computer Science - Databases, Computer Science - Information Retrieval
Abstract: Given a database network where each vertex is associated with a transaction database, we are interested in finding theme communities. Here, a theme community is a cohesive subgraph such that a common pattern is frequent in all transaction databases associated with the vertices in the subgraph. Finding all theme communities from a database network enjoys many novel applications. However, it is challenging since even counting the number of all theme communities in a database network is #P-hard. Inspired by the observation that a theme community shrinks when the length of the pattern increases, we investigate several properties of theme communities and develop TCFI, a scalable algorithm that uses these properties to effectively prune the patterns that cannot form any theme community. We also design TC-Tree, a scalable algorithm that decomposes and indexes theme communities efficiently. Retrieving a ranked list of theme communities from a TC-Tree of hundreds of millions of theme communities takes less than 1 second. Extensive experiments and a case study demonstrate the effectiveness and scalability of TCFI and TC-Tree in discovering and querying meaningful theme communities from large database networks.
Published: 2017

46. Multimodal Dialogue Systems via Capturing Context-aware Dependencies and Ordinal Information of Semantic Elements

Author: He, Weidong, primary, Li, Zhi, additional, Wang, Hao, additional, Xu, Tong, additional, Wang, Zhefeng, additional, Huai, Baoxing, additional, Yuan, Nicholas Jing, additional, and Chen, Enhong, additional
Published: 2024
Full Text: View/download PDF

47. MMEA: Entity Alignment for Multi-modal Knowledge Graph

Author: Chen, Liyi, Li, Zhi, Wang, Yijun, Xu, Tong, Wang, Zhefeng, Chen, Enhong, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Li, Gang, editor, Shen, Heng Tao, editor, Yuan, Ye, editor, Wang, Xiaoyang, editor, Liu, Huawen, editor, and Zhao, Xiang, editor
Published: 2020
Full Text: View/download PDF

48. Sustainable remediation of lube oil-contaminated soil by low temperature indirect thermal desorption: Removal behaviors of contaminants, physicochemical properties change and microbial community recolonization in soils

Author: Sang, Yimin, Yu, Wang, He, Liao, Wang, Zhefeng, Ma, Fujun, Jiao, Wentao, and Gu, Qingbao
Published: 2021
Full Text: View/download PDF

49. Activity Maximization by Effective Information Diffusion in Social Networks

Author: Wang, Zhefeng, Yang, Yu, Pei, Jian, and Chen, Enhong
Subjects: Computer Science - Social and Information Networks, Computer Science - Databases
Abstract: In a social network, even about the same information the excitements between different pairs of users are different. If you want to spread a piece of new information and maximize the expected total amount of excitements, which seed users should you choose? This problem indeed is substantially different from the renowned influence maximization problem and cannot be tackled using the existing approaches. In this paper, motivated by the demand in a few interesting applications, we model the novel problem of activity maximization. We tackle the problem systematically. We first analyze the complexity and the approximability of the problem. We develop an upper bound and a lower bound that are submodular so that the Sandwich framework can be applied. We then devise a polling-based randomized algorithm that guarantees a data dependent approximation factor. Our experiments on three real data sets clearly verify the effectiveness and scalability of our method, as well as the advantage of our method against the other heuristic methods.
Published: 2016

50. Tracking Influential Nodes in Dynamic Networks

Author: Yang, Yu, Wang, Zhefeng, Pei, Jian, and Chen, Enhong
Subjects: Computer Science - Social and Information Networks, Physics - Physics and Society
Abstract: In this paper, we tackle a challenging problem inherent in a series of applications: tracking the influential nodes in dynamic networks. Specifically, we model a dynamic network as a stream of edge weight updates. This general model embraces many practical scenarios as special cases, such as edge and node insertions, deletions as well as evolving weighted graphs. Under the popularly adopted linear threshold model and independent cascade model, we consider two essential versions of the problem: finding the nodes whose influences passing a user specified threshold and finding the top-$k$ most influential nodes. Our key idea is to use the polling-based methods and maintain a sample of random RR sets so that we can approximate the influence of nodes with provable quality guarantees. We develop an efficient algorithm that incrementally updates the sample random RR sets against network changes. We also design methods to determine the proper sample sizes for the two versions of the problem so that we can provide strong quality guarantees and, at the same time, be efficient in both space and time. In addition to the thorough theoretical results, our experimental results on $5$ real network data sets clearly demonstrate the effectiveness and efficiency of our algorithms.
Published: 2016
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

302 results on '"Wang, Zhefeng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources