Author: "Wu, Zhaofeng" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wu, Zhaofeng"' showing total 467 results

1. The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities

Author: Wu, Zhaofeng, Yu, Xinyan Velocity, Yogatama, Dani, Lu, Jiasen, and Kim, Yoon
Subjects: Computer Science - Computation and Language
Abstract: Modern language models can process inputs across diverse languages and modalities. We hypothesize that models acquire this capability through learning a shared representation space across heterogeneous data types (e.g., different languages and modalities), which places semantically similar inputs near one another, even if they are from different modalities/languages. We term this the semantic hub hypothesis, following the hub-and-spoke model from neuroscience (Patterson et al., 2007) which posits that semantic knowledge in the human brain is organized through a transmodal semantic "hub" which integrates information from various modality-specific "spoke" regions. We first show that model representations for semantically equivalent inputs in different languages are similar in the intermediate layers, and that this space can be interpreted using the model's dominant pretraining language via the logit lens. This tendency extends to other data types, including arithmetic expressions, code, and visual/audio inputs. Interventions in the shared representation space in one data type also predictably affect model outputs in other data types, suggesting that this shared representation space is not simply a vestigial byproduct of large-scale training on broad data, but something that is actively utilized by the model during input processing.
Published: 2024

2. Sparkle: Mastering Basic Spatial Capabilities in Vision Language Models Elicits Generalization to Composite Spatial Reasoning

Author: Tang, Yihong, Qu, Ao, Wang, Zhaokai, Zhuang, Dingyi, Wu, Zhaofeng, Ma, Wei, Wang, Shenhao, Zheng, Yunhan, Zhao, Zhan, and Zhao, Jinhua
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Vision language models (VLMs) have demonstrated impressive performance across a wide range of downstream tasks. However, their proficiency in spatial reasoning remains limited, despite its crucial role in tasks involving navigation and interaction with physical environments. Specifically, most of these tasks rely on the core spatial reasoning capabilities in two-dimensional (2D) environments, and our evaluation reveals that state-of-the-art VLMs frequently generate implausible and incorrect responses to composite spatial reasoning problems, including simple pathfinding tasks that humans can solve effortlessly at a glance. To address this, we explore an effective approach to enhance 2D spatial reasoning within VLMs by training the model solely on basic spatial capabilities. We begin by disentangling the key components of 2D spatial reasoning: direction comprehension, distance estimation, and localization. Our central hypothesis is that mastering these basic spatial capabilities can significantly enhance a model's performance on composite spatial tasks requiring advanced spatial understanding and combinatorial problem-solving, with generalized improvements in visual-spatial tasks. To investigate this hypothesis, we introduce Sparkle, a framework that fine-tunes VLMs on these three basic spatial capabilities by synthetic data generation and targeted supervision to form an instruction dataset for each capability. Our experiments demonstrate that VLMs fine-tuned with Sparkle achieve significant performance gains, not only in the basic tasks themselves but also in generalizing to composite and out-of-distribution spatial reasoning tasks. These findings underscore the effectiveness of mastering basic spatial capabilities in enhancing composite spatial problem-solving, offering insights into systematic strategies for improving VLMs' spatial reasoning capabilities.
Published: 2024

3. Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment

Author: Wu, Zhaofeng, Balashankar, Ananth, Kim, Yoon, Eisenstein, Jacob, and Beirami, Ahmad
Subjects: Computer Science - Computation and Language
Abstract: Aligning language models (LMs) based on human-annotated preference data is a crucial step in obtaining practical and performant LM-based systems. However, multilingual human preference data are difficult to obtain at scale, making it challenging to extend this framework to diverse languages. In this work, we evaluate a simple approach for zero-shot cross-lingual alignment, where a reward model is trained on preference data in one source language and directly applied to other target languages. On summarization and open-ended dialog generation, we show that this method is consistently successful under comprehensive evaluation settings, including human evaluation: cross-lingually aligned models are preferred by humans over unaligned models on up to >70% of evaluation instances. We moreover find that a different-language reward model sometimes yields better aligned models than a same-language reward model. We also identify best practices when there is no language-specific data for even supervised finetuning, another component in alignment.
Comment: EMNLP 2024
Published: 2024

4. A Taxonomy of Ambiguity Types for NLP

Author: Li, Margaret Y., Liu, Alisa, Wu, Zhaofeng, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: Ambiguity is a critical component of language that allows for more effective communication between speakers, but is often ignored in NLP. Recent work suggests that NLP systems may struggle to grasp certain elements of human language understanding because they may not handle ambiguities at the level that humans naturally do in communication. Additionally, different types of ambiguity may serve different purposes and require different approaches for resolution, and we aim to investigate how language models' abilities vary across types. We propose a taxonomy of ambiguity types as seen in English to facilitate NLP analysis. Our taxonomy can help make meaningful splits in language ambiguity data, allowing for more fine-grained assessments of both datasets and model performance.
Comment: To appear at the UnImplicit workshop at EACL 2024
Published: 2024

5. Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment

Author: Merrill, William, Wu, Zhaofeng, Naka, Norihito, Kim, Yoon, and Linzen, Tal
Subjects: Computer Science - Computation and Language
Abstract: Do LMs infer the semantics of text from co-occurrence patterns in their training data? Merrill et al. (2022) argue that, in theory, sentence co-occurrence probabilities predicted by an optimal LM should reflect the entailment relationship of the constituent sentences, but it is unclear whether probabilities predicted by neural LMs encode entailment in this way because of strong assumptions made by Merrill et al. (namely, that humans always avoid redundancy). In this work, we investigate whether their theory can be used to decode entailment relations from neural LMs. We find that a test similar to theirs can decode entailment relations between natural sentences, well above random chance, though not perfectly, across many datasets and LMs. This suggests LMs implicitly model aspects of semantics to predict semantic effects on sentence co-occurrence patterns. However, we find the test that predicts entailment in practice works in the opposite direction to the theoretical test. We thus revisit the assumptions underlying the original test, finding its derivation did not adequately account for redundancy in human-written text. We argue that better accounting for redundancy related to explanations might derive the observed flipped test and, more generally, improve computational models of speakers in linguistics.
Published: 2024

6. ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning

Author: Tang, Yihong, Wang, Zhaokai, Qu, Ao, Yan, Yihao, Wu, Zhaofeng, Zhuang, Dingyi, Kai, Jushi, Hou, Kebing, Guo, Xiaotong, Zhao, Jinhua, Zhao, Zhan, and Ma, Wei
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Citywalk, a recently popular form of urban travel, requires genuine personalization and understanding of fine-grained requests compared to traditional itinerary planning. In this paper, we introduce the novel task of Open-domain Urban Itinerary Planning (OUIP), which generates personalized urban itineraries from user requests in natural language. We then present ITINERA, an OUIP system that integrates spatial optimization with large language models to provide customized urban itineraries based on user needs. This involves decomposing user requests, selecting candidate points of interest (POIs), ordering the POIs based on cluster-aware spatial optimization, and generating the itinerary. Experiments on real-world datasets and the performance of the deployed system demonstrate our system's capacity to deliver personalized and spatially coherent itineraries compared to current solutions. Source codes of ITINERA are available at https://github.com/YihongT/ITINERA.
Published: 2024

7. Preparation and performance optimization of HfAlO hybrid gate dielectric thin films driven by solution

Author: Zhang, Renjia, Tuokedaerhan, Kamale, Wu, Zhaofeng, Du, Hongguo, Ibraimov, Margulan, and Sagidolda, Yerulan
Published: 2024

8. Universal Deoxidation of Semiconductor Substrates Assisted by Machine-Learning and Real-Time-Feedback-Control

Author: Shen, Chao, Zhan, Wenkang, Tang, Jian, Wu, Zhaofeng, Xu, Bo, Zhao, Chao, and Wang, Zhanguo
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing, Electrical Engineering and Systems Science - Systems and Control
Abstract: Thin film deposition is an essential step in the semiconductor process. During preparation or loading, the substrate is exposed to the air unavoidably, which has motivated studies of the process control to remove the surface oxide before thin film deposition. Optimizing the deoxidation process in molecular beam epitaxy (MBE) for a random substrate is a multidimensional challenge and sometimes controversial. Due to variations in semiconductor materials and growth processes, the determination of substrate deoxidation temperature is highly dependent on the grower's expertise; the same substrate may yield inconsistent results when evaluated by different growers. Here, we employ a machine learning (ML) hybrid convolution and vision transformer (CNN-ViT) model. This model utilizes reflection high-energy electron diffraction (RHEED) video as input to determine the deoxidation status of the substrate as output, enabling automated substrate deoxidation under a controlled architecture. This also extends to the successful application of deoxidation processes on other substrates. Furthermore, we showcase the potential of models trained on data from a single MBE equipment to achieve high-accuracy deployment on other equipment. In contrast to traditional methods, our approach holds exceptional practical value. It standardizes deoxidation temperatures across various equipment and substrate materials, advancing the standardization research process in semiconductor preparation, a significant milestone in thin film growth technology. The concepts and methods demonstrated in this work are anticipated to revolutionize semiconductor manufacturing in optoelectronics and microelectronics industries by applying them to diverse material growth processes.
Comment: 5 figures
Published: 2023

9. Machine-learning-assisted and real-time-feedback-controlled growth of InAs/GaAs quantum dots

Author: Shen, Chao, Zhan, Wenkang, Xin, Kaiyao, Li, Manyang, Sun, Zhenyu, Cong, Hui, Xu, Chi, Tang, Jian, Wu, Zhaofeng, Xu, Bo, Wei, Zhongming, Xue, Chunlai, Zhao, Chao, and Wang, Zhanguo
Published: 2024

10. Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks

Author: Wu, Zhaofeng, Qiu, Linlu, Ross, Alexis, Akyürek, Ekin, Chen, Boyuan, Wang, Bailin, Kim, Najoung, Andreas, Jacob, and Kim, Yoon
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: The impressive performance of recent language models across a wide range of tasks suggests that they possess a degree of abstract reasoning skills. Are these skills general and transferable, or specialized to specific tasks seen during pretraining? To disentangle these effects, we propose an evaluation framework based on "counterfactual" task variants that deviate from the default assumptions underlying standard tasks. Across a suite of 11 tasks, we observe nontrivial performance on the counterfactual variants, but nevertheless find that performance substantially and consistently degrades compared to the default conditions. This suggests that while current LMs may possess abstract task-solving skills to an extent, they often also rely on narrow, non-transferable procedures for task-solving. These results motivate a more careful interpretation of language model performance that teases apart these aspects of behavior.
Comment: NAACL 2024
Published: 2023

11. Machine-Learning-Assisted and Real-Time-Feedback-Controlled Growth of InAs/GaAs Quantum Dots

Author: Shen, Chao, Zhan, Wenkang, Xin, Kaiyao, Li, Manyang, Sun, Zhenyu, Cong, Hui, Xu, Chi, Tang, Jian, Wu, Zhaofeng, Xu, Bo, Wei, Zhongming, Xue, Chunlai, Zhao, Chao, and Wang, Zhanguo
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: Self-assembled InAs/GaAs quantum dots (QDs) have properties highly valuable for developing various optoelectronic devices such as QD lasers and single photon sources. The applications strongly rely on the density and quality of these dots, which has motivated studies of the growth process control to realize high-quality epi-wafers and devices. Establishing the process parameters in molecular beam epitaxy (MBE) for a specific density of QDs is a multidimensional optimization challenge, usually addressed through time-consuming and iterative trial-and-error. Here, we report a real-time feedback control method to realize the growth of QDs with arbitrary density, which is fully automated and intelligent. We developed a machine learning (ML) model named 3D ResNet 50 trained using reflection high-energy electron diffraction (RHEED) videos as input instead of static images and providing real-time feedback on surface morphologies for process control. As a result, we demonstrated that ML from previous growth could predict the post-growth density of QDs, by successfully tuning the QD densities in near-real time from 1.5E10 cm-2 down to 3.8E8 cm-2 or up to 1.4E11 cm-2. Compared to traditional methods, our approach, with in situ tuning capabilities and excellent reliability, can dramatically expedite the material optimization process and improve the reproducibility of MBE, constituting significant progress for thin film growth techniques. The concepts and methodologies proved feasible in this work are promising to be applied to a variety of material growth processes, which will revolutionize semiconductor manufacturing for optoelectronic and microelectronic industries.
Comment: 5 figures
Published: 2023

12. We're Afraid Language Models Aren't Modeling Ambiguity

Author: Liu, Alisa, Wu, Zhaofeng, Michael, Julian, Suhr, Alane, West, Peter, Koller, Alexander, Swayamdipta, Swabha, Smith, Noah A., and Choi, Yejin
Subjects: Computer Science - Computation and Language
Abstract: Ambiguity is an intrinsic feature of natural language. Managing ambiguity is a key part of human language understanding, allowing us to anticipate misunderstanding as communicators and revise our interpretations as listeners. As language models (LMs) are increasingly employed as dialogue interfaces and writing aids, handling ambiguous language is critical to their success. We characterize ambiguity in a sentence by its effect on entailment relations with another sentence, and collect AmbiEnt, a linguist-annotated benchmark of 1,645 examples with diverse kinds of ambiguity. We design a suite of tests based on AmbiEnt, presenting the first evaluation of pretrained LMs to recognize ambiguity and disentangle possible meanings. We find that the task remains extremely challenging, including for GPT-4, whose generated disambiguations are considered correct only 32% of the time in human evaluation, compared to 90% for disambiguations in our dataset. Finally, to illustrate the value of ambiguity-sensitive tools, we show that a multilabel NLI model can flag political claims in the wild that are misleading due to ambiguity. We encourage the field to rediscover the importance of ambiguity for NLP.
Comment: EMNLP 2023 camera-ready
Published: 2023

13. Metal Mesh-Based Infrared Transparent Electromagnetic Shielding Windows with Balanced Shielding Properties over a Wide Frequency Range

Author: Liang, Yuanlong, Wen, Kui, Wu, Zhaofeng, Pan, Jisheng, Liu, Wencong, Yao, Lixiang, Liu, Peiguo, and Huang, Xianjun
Subjects: Physics - Optics, Physics - Applied Physics
Abstract: With the increasing complexity of the electromagnetic environment, electromagnetic interference has become an important problem for optoelectronic systems to reckon with. The metal mesh film is a widely used electromagnetic shielding material with both visible and infrared transparency. However, the shielding performance of previously reported mesh materials is frequency dependent. Here, we report an infrared-transparent electromagnetic shielding window based on metal mesh with irregular patterns. The mesh coatings are prepared on a sapphire substrate using ultraviolet photolithography, and provide electromagnetic shielding effectiveness of more than 20 dB in the wide frequency range of 1.7~18 GHz while maintaining high infrared optical transparency. In addition, there is no distinct variation in shielding effectiveness between the low and high frequency ranges, exhibiting a balanced shielding characteristic throughout a broad frequency band. This work could be significant in protecting infrared optoelectronic devices from electromagnetic interference.
Comment: 17 pages, 5 figures
Published: 2022

14. Continued Pretraining for Better Zero- and Few-Shot Promptability

Author: Wu, Zhaofeng, Logan IV, Robert L., Walsh, Pete, Bhagia, Akshita, Groeneveld, Dirk, Singh, Sameer, and Beltagy, Iz
Subjects: Computer Science - Computation and Language
Abstract: Recently introduced language model prompting methods can achieve high accuracy in zero- and few-shot settings while requiring few to no learned task-specific parameters. Nevertheless, these methods still often trail behind full model finetuning. In this work, we investigate if a dedicated continued pretraining stage could improve "promptability", i.e., zero-shot performance with natural language prompts or few-shot performance with prompt tuning. We reveal settings where existing continued pretraining methods lack promptability. We also identify current methodological gaps, which we fill with thorough large-scale experiments. We demonstrate that a simple recipe, continued pretraining that incorporates a trainable prompt during multi-task learning, leads to improved promptability in both zero- and few-shot settings compared to existing methods, up to 31% relative. On the other hand, we find that continued pretraining using MAML-style meta-learning, a method that directly optimizes few-shot promptability, yields subpar performance. We validate our findings with two prompt tuning methods, and, based on our results, we provide concrete recommendations to optimize promptability for different use cases.
Comment: EMNLP 2022
Published: 2022

15. Modeling Context With Linear Attention for Scalable Document-Level Translation

Author: Wu, Zhaofeng, Peng, Hao, Pappas, Nikolaos, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: Document-level machine translation leverages inter-sentence dependencies to produce more coherent and consistent translations. However, these models, predominantly based on transformers, are difficult to scale to long documents as their attention layers have quadratic complexity in the sequence length. Recent efforts on efficient attention improve scalability, but their effect on document translation remains unexplored. In this work, we investigate the efficacy of a recent linear attention model by Peng et al. (2021) on document translation and augment it with a sentential gate to promote a recency inductive bias. We evaluate the model on IWSLT 2015 and OpenSubtitles 2018 against the transformer, demonstrating substantially increased decoding speed on long sequences with similar or better BLEU scores. We show that sentential gating further improves translation quality on IWSLT.
Comment: Findings of EMNLP 2022
Published: 2022

16. Transparency Helps Reveal When Language Models Learn Meaning

Author: Wu, Zhaofeng, Merrill, William, Peng, Hao, Beltagy, Iz, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: Many current NLP systems are built from language models trained to optimize unsupervised objectives on large amounts of raw text. Under what conditions might such a procedure acquire meaning? Our systematic experiments with synthetic data reveal that, with languages where all expressions have context-independent denotations (i.e., languages with strong transparency), both autoregressive and masked language models successfully learn to emulate semantic relations between expressions. However, when denotations are changed to be context-dependent with the language otherwise unmodified, this ability degrades. Turning to natural language, our experiments with a specific phenomenon -- referential opacity -- add to the growing body of evidence that current language models do not represent natural language semantics well. We show this failure relates to the context-dependent nature of natural language form-meaning mappings.
Comment: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL), 2023. Author's final version (pre-MIT Press publication)
Published: 2022

17. Green Synthesis of Carbon Quantum dots Derived from Lycium barbarum for Effective Fluorescence Detection of Cr (VI) Sensing

Author: Xie, Jierong, Wu, Zhaofeng, Sun, Jun, Lv, Changwu, and Sun, Qihua
Published: 2024

18. Synthesis of covalent organic framework materials and their application in the field of sensing

Author: Zhang, Weiyu, Liu, Shiwei, Sun, Qihua, Tian, Ning, and Wu, Zhaofeng
Published: 2024

19. Self-doped Na-carbon materials derived from a lyocell fiber for a high-performance trimethylamine gas sensor at room temperature

Author: Ran, Bo, Hu, Ping, Sun, Jun, Fang, Jialu, Sun, Qihua, Wang, Juqi, Zhu, Yuqing, Tian, Ning, Wu, Zhaofeng, and Duan, Haiming
Published: 2024

20. Novel sulfur-doped ZnSn(OH)6 nanocubes with induced oxygen vacancies for enhanced humidity sensing

Author: Jia, Hongguang, Zhang, Min, Wu, Zhaofeng, Duan, Haiming, and Han, Yi
Published: 2024

21. Learning with Latent Structures in Natural Language Processing: A Survey

Author: Wu, Zhaofeng
Subjects: Computer Science - Computation and Language
Abstract: While end-to-end learning with fully differentiable models has enabled tremendous success in natural language processing (NLP) and machine learning, there has been significant recent interest in learning with latent discrete structures to incorporate better inductive biases for improved end-task performance and better interpretability. This paradigm, however, is not straightforwardly amenable to the mainstream gradient-based optimization methods. This work surveys three main families of methods to learn such models: surrogate gradients, continuous relaxation, and marginal likelihood maximization via sampling. We conclude with a review of applications of these methods and an inspection of the learned latent structure that they induce.
Published: 2022

22. Investigation of the optimal indocyanine green dose in real-time fluorescent cholangiography during laparoscopic cholecystectomy with an ultra-high-definition 4K fluorescent system: a randomized controlled trial

Author: Liu, Hui, Kuang, Jiao, Xu, Yujie, Li, Tianyang, Li, Peilin, Huang, Zisheng, Zhang, Shuai, Weng, Jiefeng, Lai, Yueyuan, Wu, Zhaofeng, Lin, Fan, Gu, Weili, and Huang, Yu
Published: 2023

23. ABC: Attention with Bounded-memory Control

Author: Peng, Hao, Kasai, Jungo, Pappas, Nikolaos, Yogatama, Dani, Wu, Zhaofeng, Kong, Lingpeng, Schwartz, Roy, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: Transformer architectures have achieved state-of-the-art results on a variety of sequence modeling tasks. However, their attention mechanism comes with a quadratic complexity in sequence lengths, making the computational overhead prohibitive, especially for long sequences. Attention context can be seen as a random-access memory with each token taking a slot. Under this perspective, the memory size grows linearly with the sequence length, and so does the overhead of reading from it. One way to improve the efficiency is to bound the memory size. We show that disparate approaches can be subsumed into one abstraction, attention with bounded-memory control (ABC), and they vary in their organization of the memory. ABC reveals new, unexplored possibilities. First, it connects several efficient attention variants that would otherwise seem apart. Second, this abstraction gives new insights--an established approach (Wang et al., 2020b) previously thought to be not applicable in causal attention, actually is. Last, we present a new instance of ABC, which draws inspiration from existing ABC approaches, but replaces their heuristic memory-organizing functions with a learned, contextualized one. Our experiments on language modeling, machine translation, and masked language model finetuning show that our approach outperforms previous efficient attention models; compared to the strong transformer baselines, it significantly improves the inference time and space efficiency with no or negligible accuracy loss.
Published: 2021

24. One-step fabrication of P-Co5.47N/Co9S8@NPC heterojunction derived from saccharomycetes cerevisiae as environment-friendly bifunctional high-efficiency electrocatalysts

Author: Jing, Xiaoyun, Lv, Changwu, Zhang, Hongmei, Chen, Chu, and Wu, Zhaofeng
Published: 2024

25. Construction of the VOBiOBr/VSZnIn2S4 heterojunction for photocatalytic hydrogen production and dye removal under simulated sunlight

Author: Kuang, Xuanyu, Jin, Xuekun, Chen, Fengjuan, Pan, Anqiang, Duan, Haiming, Wu, Zhaofeng, and Cao, Biaobing
Published: 2024

26. Bioinspired Bacterial Cellulose Carbon Nanofibers/AgO Composite for Sensitive and Selective Detection of H2O2 Vapor at Room Temperature

Author: Zhai, Xinwang, Wu, Zhaofeng, Sun, Qihua, Sun, Jun, Chen, Fengjuan, Zhang, Min, and Duan, Haiming
Published: 2023

27. Synergistic effect of oxygen vacancies and doped sulfur over BiOBr for efficient visible photocatalytic removal of dyes

Author: Yu, Yang, Kuang, Xuanyu, Jin, Xuekun, Chen, Fengjuan, Min, Junyong, Duan, Haiming, Li, Jin, Wu, Zhaofeng, and Cao, Biaobing
Published: 2024

28. Highly sensitive detection toward aniline vapor promoted by heterojunction between covalent organic frameworks and biocarbon

Author: Zhang, Weiyu, Wang, Weijin, Liu, Shiwei, Tian, Ning, Sun, Qihua, and Wu, Zhaofeng
Published: 2024

29. Impalement-resistant and robust superhydrophobic umbrella fabric enabled by a similar “pole erecting” strategy

Author: Tian, Ning, Li, Bucheng, Wei, Jinfei, Wang, Weijin, Hu, Ping, Liu, Shiwei, Zhu, Yuqing, Ran, Bo, Wu, Zhaofeng, and Zhang, Junping
Published: 2024

30. LiASi2O5 (A = K, Rb): Effects of cations on crystal structure and optical properties

Author: Sun, Jun, Ran, Bo, Hu, Ping, Wu, Zhaofeng, Lee, Ming-Hsien, and Duan, Haiming
Published: 2024

31. Infusing Finetuning with Semantic Dependencies

Author: Wu, Zhaofeng, Peng, Hao, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: For natural language processing systems, two kinds of evidence support the use of text representations from neural language models "pretrained" on large unannotated corpora: performance on application-inspired benchmarks (Peters et al., 2018, inter alia), and the emergence of syntactic abstractions in those representations (Tenney et al., 2019, inter alia). On the other hand, the lack of grounded supervision calls into question how well these representations can ever capture meaning (Bender and Koller, 2020). We apply novel probes to recent language models -- specifically focusing on predicate-argument structure as operationalized by semantic dependencies (Ivanova et al., 2012) -- and find that, unlike syntax, semantics is not brought to the surface by today's pretrained models. We then use convolutional graph encoders to explicitly incorporate semantic parses into task-specific finetuning, yielding benefits to natural language understanding (NLU) tasks in the GLUE benchmark. This approach demonstrates the potential for general-purpose (rather than task-specific) linguistic supervision, above and beyond conventional pretraining and finetuning. Several diagnostics help to localize the benefits of our approach.
Comment: TACL 2021
Published: 2020

32. Understanding Mention Detector-Linker Interaction in Neural Coreference Resolution

Author: Wu, Zhaofeng and Gardner, Matt
Subjects: Computer Science - Computation and Language
Abstract: Despite significant recent progress in coreference resolution, the quality of current state-of-the-art systems still considerably trails behind human-level performance. Using the CoNLL-2012 and PreCo datasets, we dissect the best instantiation of the mainstream end-to-end coreference resolution model that underlies most current best-performing coreference systems, and empirically analyze the behavior of its two components: mention detector and mention linker. While the detector traditionally focuses heavily on recall as a design decision, we demonstrate the importance of precision, calling for their balance. However, we point out the difficulty in building a precise detector due to its inability to make important anaphoricity decisions. We also highlight the enormous room for improving the linker and show that the rest of its errors mainly involve pronoun resolution. We propose promising next steps and hope our findings will help future research in coreference resolution.
Comment: CRAC @ EMNLP 2021
Published: 2020

33. Dynamic Sparsity Neural Networks for Automatic Speech Recognition

Author: Wu, Zhaofeng, Zhao, Ding, Liang, Qiao, Yu, Jiahui, Gulati, Anmol, and Pang, Ruoming
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Computation and Language, Computer Science - Machine Learning, Computer Science - Sound
Abstract: In automatic speech recognition (ASR), model pruning is a widely adopted technique that reduces model size and latency to deploy neural network models on edge devices with resource constraints. However, multiple models with different sparsity levels usually need to be separately trained and deployed to heterogeneous target hardware with different resource specifications and for applications that have various latency requirements. In this paper, we present Dynamic Sparsity Neural Networks (DSNN) that, once trained, can instantly switch to any predefined sparsity configuration at run-time. We demonstrate the effectiveness and flexibility of DSNN using experiments on internal production datasets with Google Voice Search data, and show that the performance of a DSNN model is on par with that of individually trained single sparsity networks. Our trained DSNN model, therefore, can greatly ease the training process and simplify deployment in diverse scenarios with resource constraints.
Comment: ICASSP 2021. (c) 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Published: 2020

34. Modulation of gas sensing properties of chitin-based carbon fibers and Fe3O4/carbon fibers by controlling carbonization temperature

Author: Zhang, Yuzhi, Wu, Zhaofeng, Sun, Jun, Sun, Qihua, Chen, Fengjuan, Liu, Lixiang, Zhang, Min, and Duan, Haiming
Published: 2023
Full Text: View/download PDF

35. Real-time fluorescent cholangiography with indocyanine green in laparoscopic cholecystectomy: a randomized controlled trial to establish the optimal indocyanine green dose within 30 min preoperatively

Author: Huang, Yu, Chen, Qiangxing, Kuang, Jiao, Zhang, Shuai, Weng, Jiefeng, Lai, Yueyuan, Liu, Hui, Wu, Zhaofeng, Huang, Di, Lin, Fan, Zhu, Guanghui, Cao, Tiansheng, and Gu, Weili
Published: 2023
Full Text: View/download PDF

36. Dog nose-inspired high-performance ammonia sensor based on biochar/SnO2 composite

Author: Qin, Zhangjie, Wu, Zhaofeng, Sun, Qihua, Sun, Jun, Zhang, Min, Chen, Fengjuan, Zhang, Dongzhi, Lv, Changwu, and Duan, Haiming
Published: 2023
Full Text: View/download PDF

37. Biomimetic gas sensor derived from disposable bamboo chopsticks for highly sensitive and selective detection of NH3

Author: Qin, Zhangjie, Wu, Zhaofeng, Sun, Qihua, Sun, Jun, Zhang, Min, Shaymurat, Talgar, Lv, Changwu, and Duan, Haiming
Published: 2023
Full Text: View/download PDF

38. Synergistic effect of Mn3+ and oxygen vacancy on the bifunctional oxygen electrocatalytic performance of MnOX/CNTs composites

Author: He, Minghua, Jin, Xuekun, Chen, Fengjuan, Chen, Jianjun, Min, Junyong, Duan, Haiming, Kuang, Xuanyu, Li, Jin, Wu, Zhaofeng, and Li, Junhua
Published: 2023
Full Text: View/download PDF

39. Flexible all-biomass gas sensor based on doped carbon quantum dots/nonwoven cotton with discriminative function

Author: Wu, Zhaofeng, Zhang, Min, Cao, Shuai, Wang, Long, Qin, Zhangjie, Zhong, Furu, and Duan, Haiming
Published: 2022
Full Text: View/download PDF

40. A Au nanoparticle and polydopamine co-modified biosensor: A strategy for in situ and label-free surface plasmon resonance immunoassays

Author: Du, Bin, Mu, Xihui, Xu, Jianjie, Liu, Shuai, Liu, Zhiwei, Tong, Zhaoyang, Wu, Zhaofeng, and Qi, Zhi-Mei
Published: 2022
Full Text: View/download PDF

41. Preparation of CeVO4 with VO2 as precursor performing high selectivity and sensitivity to ammonia

Author: Liu, Yanan, Zhang, Min, Yang, Linyu, Wu, Zhaofeng, and Li, Zhenjiang
Published: 2022
Full Text: View/download PDF

42. Highly selective and sensitive phenol sensor based on cattail carbon fibre operating at room temperature

Author: Yang, Xingwen, Wu, Zhaofeng, Qin, Zhangjie, Chen, Fengjuan, Lv, Changwu, Zhang, Min, Zhong, Furu, Shaymurat, Talgar, and Duan, Haiming
Published: 2022
Full Text: View/download PDF

43. Controlled Sensor Derived from COF Materials for the Effective Detection of N‑Methylpyrrolidone

Author: Liu, Shiwei, Zhang, Weiyu, Zhang, Guojie, Sun, Jun, Tian, Ning, Sun, Qihua, and Wu, Zhaofeng
Published: 2024
Full Text: View/download PDF

44. One-pot synthesis of flower-like SnS2/SnO2 heterojunction with enhanced visible light photocatalytic performance

Author: Zhao, Wenbo, He, Minghua, Chen, Fengjuan, Jin, Xuekun, Duan, Haiming, Long, Mengqiu, Wu, Zhaofeng, Cao, Biaobing, and Yu, Yang
Published: 2022
Full Text: View/download PDF

45. High-Performance Trimethylamine Sensor Based on an Imine Covalent Organic Framework

Author: Zhang, Weiyu, Sun, Qihua, Zhu, Yuqing, Sun, Jun, Wu, Zhaofeng, and Tian, Ning
Published: 2024
Full Text: View/download PDF

46. Synthesis of two‐dimensional nonlayered α‐Nb2O5 nanosheets by the growth promoter of sulfur and alkali halides

Author: Zhang, Bo, Niu, Chengyang, Chu, Wenlong, Guo, Xuehao, Zhou, Xilong, Li, Cheng, Fan, Xiulian, Zou, Luwei, Wu, Zhaofeng, Lu, Yunzhang, OuYang, Fangping, Zhou, Yu, and Zhang, Hongyan
Published: 2024
Full Text: View/download PDF

47. Mechanical, thermal and gas sensing properties of flexible multi-walled carbon nanotubes/waterborne polyurethane composite film

Author: Wu, Zhaofeng, Cao, Shuai, Sun, Qihua, Zhong, Furu, Zhang, Min, and Duan, Haiming
Published: 2021
Full Text: View/download PDF

48. Extrahepatic biliary tract visualization using near-infrared fluorescence imaging with indocyanine green: optimization of dose and dosing time

Author: Chen, Qiangxing, Zhou, Rou, Weng, Jiefeng, Lai, Yueyuan, Liu, Hui, Kuang, Jiao, Zhang, Shuai, Wu, Zhaofeng, Wang, Wen, and Gu, Weili
Published: 2021
Full Text: View/download PDF

49. Construction of flexible and wearable 3D TiO2 NTs@Ti mesh for physiological detection based on sweat

Author: Yu, Jianwei, Zhang, Peng, Chen, Tengyu, Lv, Qiuchen, Gao, Li, Liu, Bingxin, Duan, Junyuan, Wu, Zhaofeng, and Li, Jian
Published: 2021
Full Text: View/download PDF

50. Gas sensing properties of cotton-based carbon fibers and ZnO/carbon fibers regulated by changing carbonization temperatures

Author: Cao, Shuai, Wu, Zhaofeng, Sun, Qihua, Zhang, Weiyu, Beysen, Sadeh, Wang, Shuying, Shaymurat, Talgar, Zhang, Min, and Duan, Haiming
Published: 2021
Full Text: View/download PDF
