Author: "Zhang, Yanzhe" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zhang, Yanzhe"' showing total 237 results



1. Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping

Author: Li, Ryan, Zhang, Yanzhe, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Sketches are a natural and accessible medium for UI designers to conceptualize early-stage ideas. However, existing research on UI/UX automation often requires high-fidelity inputs like Figma designs or detailed screenshots, limiting accessibility and impeding efficient design iteration. To bridge this gap, we introduce Sketch2Code, a benchmark that evaluates state-of-the-art Vision Language Models (VLMs) on automating the conversion of rudimentary sketches into webpage prototypes. Beyond end-to-end benchmarking, Sketch2Code supports interactive agent evaluation that mimics real-world design workflows, where a VLM-based agent iteratively refines its generations by communicating with a simulated user, either passively receiving feedback instructions or proactively asking clarification questions. We comprehensively analyze ten commercial and open-source models, showing that Sketch2Code is challenging for existing VLMs; even the most capable models struggle to accurately interpret sketches and formulate effective questions that lead to steady improvement. Nevertheless, a user study with UI/UX experts reveals a significant preference for proactive question-asking over passive feedback reception, highlighting the need to develop more effective paradigms for multi-turn conversational agents.
Comment: preprint, 9 pages
Published: 2024

2. Distilling an End-to-End Voice Assistant Without Instruction Training Data

Author: Held, William, Li, Ella, Ryan, Michael, Shi, Weiyan, Zhang, Yanzhe, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Voice assistants, such as Siri and Google Assistant, typically model audio and text separately, resulting in lost speech information and increased complexity. Recent efforts to address this with end-to-end Speech Large Language Models (LLMs) trained with supervised finetuning (SFT) have led to models "forgetting" capabilities from text-only LLMs. Our work proposes an alternative paradigm for training Speech LLMs without instruction data, using the response of a text-only LLM to transcripts as self-supervision. Importantly, this process can be performed without annotated responses. We show that our Distilled Voice Assistant (DiVA) generalizes to Spoken Question Answering, Classification, and Translation. Furthermore, we show that DiVA better meets user preferences, achieving a 72% win rate compared with state-of-the-art models like Qwen 2 Audio, despite using >100x less training compute.
Published: 2024

3. TRINS: Towards Multimodal Language Models that Can Read

Author: Zhang, Ruiyi, Zhang, Yanzhe, Chen, Jian, Zhou, Yufan, Gu, Jiuxiang, Chen, Changyou, and Sun, Tong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Large multimodal language models have shown remarkable proficiency in understanding and editing images. However, a majority of these visually-tuned models struggle to comprehend the textual content embedded in images, primarily due to the limitation of training data. In this work, we introduce TRINS: a Text-Rich image INStruction dataset, with the objective of enhancing the reading ability of the multimodal large language model. TRINS is built upon LAION using hybrid data annotation strategies that include machine-assisted and human-assisted annotation processes. It contains 39,153 text-rich images, captions, and 102,437 questions. Specifically, we show that the number of words per annotation in TRINS is significantly longer than that of related datasets, providing new challenges. Furthermore, we introduce a simple and effective architecture, called a Language-vision Reading Assistant (LaRA), which is good at understanding textual content within images. LaRA outperforms existing state-of-the-art multimodal large language models on the TRINS dataset, as well as other classical benchmarks. Lastly, we conducted a comprehensive evaluation with TRINS on various text-rich image understanding and generation tasks, demonstrating its effectiveness.
Comment: CVPR 2024
Published: 2024

4. Best Practices and Lessons Learned on Synthetic Data

Author: Liu, Ruibo, Wei, Jerry, Liu, Fangyu, Si, Chenglei, Zhang, Yanzhe, Rao, Jinmeng, Zheng, Steven, Peng, Daiyi, Yang, Diyi, Zhou, Denny, and Dai, Andrew M.
Subjects: Computer Science - Computation and Language
Abstract: The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs. Synthetic data has emerged as a promising solution by generating artificial data that mimics real-world patterns. This paper provides an overview of synthetic data research, discussing its applications, challenges, and future directions. We present empirical evidence from prior art to demonstrate its effectiveness and highlight the importance of ensuring its factuality, fidelity, and unbiasedness. We emphasize the need for responsible use of synthetic data to build more powerful, inclusive, and trustworthy language models.
Comment: In COLM 2024
Published: 2024

5. Design2Code: How Far Are We From Automating Front-End Engineering?

Author: Si, Chenglei, Zhang, Yanzhe, Yang, Zhengyuan, Liu, Ruibo, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computers and Society
Abstract: Generative AI has made rapid advancements in recent years, achieving unprecedented capabilities in multimodal understanding and code generation. This can enable a new paradigm of front-end development, in which multimodal LLMs might directly convert visual designs into code implementations. In this work, we formalize this as a Design2Code task and conduct comprehensive benchmarking. Specifically, we manually curate a benchmark of 484 diverse real-world webpages as test cases and develop a set of automatic evaluation metrics to assess how well current multimodal LLMs can generate the code implementations that directly render into the given reference webpages, given the screenshots as input. We also complement automatic metrics with comprehensive human evaluations. We develop a suite of multimodal prompting methods and show their effectiveness on GPT-4V and Gemini Pro Vision. We further finetune an open-source Design2Code-18B model that successfully matches the performance of Gemini Pro Vision. Both human evaluation and automatic metrics show that GPT-4V performs the best on this task compared to other models. Moreover, annotators think GPT-4V generated webpages can replace the original reference webpages in 49% of cases in terms of visual appearance and content; and perhaps surprisingly, in 64% of cases GPT-4V generated webpages are considered better than the original reference webpages. Our fine-grained break-down metrics indicate that open-source models mostly lag in recalling visual elements from the input webpages and in generating correct layout designs, while aspects like text content and coloring can be drastically improved with proper finetuning.
Comment: Technical Report; The first two authors contributed equally
Published: 2024

6. Probing the CP Structure of the Top Quark Yukawa at the Future Muon Collider

Author: Cassidy, Morgan E., Dong, Zhongtian, Kong, Kyoungchul, Lewis, Ian M., Zhang, Yanzhe, and Zheng, Ya-Juan
Subjects: High Energy Physics - Phenomenology
Abstract: We study the top-Higgs coupling with a CP violating phase $\xi$ at a future multi-TeV muon collider. We focus on processes that are directly sensitive to the top quark Yukawa coupling: $t\bar{t}h$, $tbh\mu\nu$, and $t\bar{t}h\nu\bar{\nu}$ with $h\rightarrow b\bar{b}$ and semileptonic top decays. At different energies, different processes dominate the cross section, providing complementary information. At and above an energy of $\mathcal{O}(10)$ TeV, vector boson fusion processes dominate. As we show, in the Standard Model there is destructive interference in the vector boson fusion processes $t\bar{t}h\nu\bar{\nu}$ and $tbh\mu\nu$ between the top quark Yukawa and Higgs-gauge boson couplings. A CP-violating phase changes this interference, and the cross section measurement is very sensitive to the size of the CP-violating angle. Although we find that the cross sections are measured to $\mathcal{O}(50\%)$ statistical uncertainty at $1\sigma$, a 10 and 30 TeV muon collider can bound the CP-violating angle $|\xi|\lesssim9.0^\circ$ and $|\xi|\lesssim5.4^\circ$, respectively. However, cross section measurements are insensitive to the sign of the CP-violating angle. To determine that the coupling is truly CP violating, observables sensitive to CP-violation must be measured. We find in the $t\bar{t}h$ process the azimuthal angle between the $t+\bar{t}$ plane and the initial state muon+Higgs plane shows good discrimination for $\xi=\pm0.1\pi$. For the $tbh\mu\nu$ and $t\bar{t}h\nu\bar{\nu}$ processes, the operator proportional to $\left(\vec{p}_\mu\times\vec{p}_h\right)\cdot \vec{p}_t$ is sensitive to the sign of CP phase $\xi$. From these observables, we construct asymmetry parameters that show good distinction between different values and signs of the CP violating angle.
Comment: v2: Matches published version, 33 pages, 11 figures, typos fixed, references added, discussion expanded, results unchanged; v1: 32 pages, 11 figures
Published: 2023

7. Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization

Author: Liu, Zijun, Zhang, Yanzhe, Li, Peng, Liu, Yang, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems
Abstract: Large language model (LLM) agents have been shown effective on a wide range of tasks, and by ensembling multiple LLM agents, their performances could be further improved. Existing approaches employ a fixed set of agents to interact with each other in a static architecture, which limits their generalizability to various tasks and requires strong human prior in designing these agents. In this work, we propose to construct a strategic team of agents communicating in a dynamic interaction architecture based on the task query. Specifically, we build a framework named Dynamic LLM-Agent Network (DyLAN) for LLM-agent collaboration on complicated tasks like reasoning and code generation. DyLAN enables agents to interact for multiple rounds in a dynamic architecture with inference-time agent selection and an early-stopping mechanism to improve performance and efficiency. We further design an automatic agent team optimization algorithm based on an unsupervised metric termed Agent Importance Score, enabling the selection of best agents based on the contribution each agent makes. Empirically, we demonstrate that DyLAN performs well in both reasoning and code generation tasks with reasonable computational cost. DyLAN achieves 13.0% and 13.3% improvement on MATH and HumanEval, respectively, compared to a single execution on GPT-35-turbo. On specific subjects of MMLU, agent team optimization in DyLAN increases accuracy by up to 25.0%.
Comment: Preprint, under review. 21 pages
Published: 2023

8. LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding

Author: Zhang, Yanzhe, Zhang, Ruiyi, Gu, Jiuxiang, Zhou, Yufan, Lipka, Nedim, Yang, Diyi, and Sun, Tong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Instruction tuning unlocks the superior capability of Large Language Models (LLM) to interact with humans. Furthermore, recent instruction-following datasets include images as visual inputs, collecting responses for image-based instructions. However, visual instruction-tuned models cannot comprehend textual details within images well. This work enhances the current visual instruction tuning pipeline with text-rich images (e.g., movie posters, book covers, etc.). Specifically, we first use publicly available OCR tools to collect results on 422K text-rich images from the LAION dataset. Moreover, we prompt text-only GPT-4 with recognized texts and image captions to generate 16K conversations, each containing question-answer pairs for text-rich images. By combining our collected data with previous multi-modal instruction-following data, our model, LLaVAR, substantially improves the LLaVA model's capability on text-based VQA datasets (up to 20% accuracy improvement) while achieving an accuracy of 91.42% on ScienceQA. The GPT-4-based instruction-following evaluation also demonstrates the improvement of our model on both natural images and text-rich images. Through qualitative analysis, LLaVAR shows promising interaction (e.g., reasoning, writing, and elaboration) skills with humans based on the latest real-world online content that combines text and images. We make our code/data/models publicly available at https://llavar.github.io/.
Comment: Preprint
Published: 2023

9. Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints

Author: Lu, Albert, Zhang, Hongxin, Zhang, Yanzhe, Wang, Xuezhi, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The limits of open-ended generative models are unclear, yet increasingly important. What causes them to succeed and what causes them to fail? In this paper, we take a prompt-centric approach to analyzing and bounding the abilities of open-ended generative models. We present a generic methodology of analysis with two challenging prompt constraint types: structural and stylistic. These constraint types are categorized into a set of well-defined constraints that are analyzable by a single prompt. We then systematically create a diverse set of simple, natural, and useful prompts to robustly analyze each individual constraint. Using the GPT-3 text-davinci-002 model as a case study, we generate outputs from our collection of prompts and analyze the model's generative failures. We also show the generalizability of our proposed method on other large models like BLOOM and OPT. Our results and our in-context mitigation strategies reveal open challenges for future research. We have publicly released our code at https://github.com/SALT-NLP/Bound-Cap-LLM.
Comment: 27 pages, 13 figures, 11 tables, to be published in EACL 2023 Findings
Published: 2023

10. Auditing Gender Presentation Differences in Text-to-Image Models

Author: Zhang, Yanzhe, Jiang, Lu, Turk, Greg, and Yang, Diyi
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Computers and Society
Abstract: Text-to-image models, which can generate high-quality images based on textual input, have recently enabled various content-creation tools. Despite significantly affecting a wide range of downstream applications, the distributions of these generated images are still not fully understood, especially when it comes to the potential stereotypical attributes of different genders. In this work, we propose a paradigm (Gender Presentation Differences) that utilizes fine-grained self-presentation attributes to study how gender is presented differently in text-to-image models. By probing gender indicators in the input text (e.g., "a woman" or "a man"), we quantify the frequency differences of presentation-centric attributes (e.g., "a shirt" and "a dress") through human annotation and introduce a novel metric: GEP. Furthermore, we propose an automatic method to estimate such differences. The automatic GEP metric based on our approach yields a higher correlation with human annotations than that based on existing CLIP scores, consistently across three state-of-the-art text-to-image models. Finally, we demonstrate the generalization ability of our metrics in the context of gender stereotypes related to occupations.
Comment: Preprint, 23 pages, 14 figures. Project page at https://salt-nlp.github.io/GEP/
Published: 2023

11. Robustness of Demonstration-based Learning Under Limited Data Scenario

Author: Zhang, Hongxin, Zhang, Yanzhe, Zhang, Ruiyi, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Demonstration-based learning has shown great potential in stimulating pretrained language models' ability under limited data scenario. Simply augmenting the input with some demonstrations can significantly improve performance on few-shot NER. However, why such demonstrations are beneficial for the learning process remains unclear since there is no explicit alignment between the demonstrations and the predictions. In this paper, we design pathological demonstrations by gradually removing intuitively useful information from the standard ones to take a deep dive of the robustness of demonstration-based sequence labeling and show that (1) demonstrations composed of random tokens still make the model a better few-shot learner; (2) the length of random demonstrations and the relevance of random tokens are the main factors affecting the performance; (3) demonstrations increase the confidence of model predictions on captured superficial patterns. We have publicly released our code at https://github.com/SALT-NLP/RobustDemo.
Comment: 14 pages, EMNLP 2022 Main Conference
Published: 2022

12. GJ 1252b: A Hot Terrestrial Super-Earth With No Atmosphere

Author: Crossfield, Ian J. M., Malik, Matej, Hill, Michelle L., Kane, Stephen R., Foley, Bradford, Polanski, Alex S., Coria, David, Brande, Jonathan, Zhang, Yanzhe, Wienke, Katherine, Kreidberg, Laura, Cowan, Nicolas B., Dragomir, Diana, Gorjian, Varoujan, Mikal-Evans, Thomas, Benneke, Bjoern, Christiansen, Jessie L., Deming, Drake, and Morales, Farisa Y.
Subjects: Astrophysics - Earth and Planetary Astrophysics
Abstract: The increasing numbers of rocky, terrestrial exoplanets known to orbit nearby stars (especially M dwarfs) has drawn increased attention to the possibility of studying these planets' surface properties, and atmospheric compositions & escape histories. Here we report the detection of the secondary eclipse of the terrestrial exoplanet GJ1252b using the Spitzer Space Telescope's IRAC2 4.5 micron channel. We measure an eclipse depth of 149(+25/-32) ppm, corresponding to a day-side brightness temperature of 1410(+91/-125) K and consistent with the prediction for no atmosphere. Comparing our measurement to atmospheric models indicates that GJ1252b has a surface pressure of <10 bar, substantially less than Venus. Assuming energy-limited escape, even a 100 bar atmosphere would be lost in <1 Myr, far shorter than estimated age of 3.9+/-0.4 Gyr. The expected mass loss could be overcome by mantle outgassing, but only if the mantle's carbon content were >7% by mass - over two orders of magnitude greater than that found in Earth. We therefore conclude that GJ1252b has no significant atmosphere. Model spectra with granitoid or feldspathic surface composition, but with no atmosphere, are disfavored at >2 sigma. The eclipse occurs just +1.4(+2.8/-1.0) min after orbital phase 0.5, indicating e cos omega=+0.0025(+0.0049/-0.0018), consistent with a circular orbit. Tidal heating is therefore likely to be negligible to GJ1252b's global energy budget. Finally, we also analyze additional, unpublished TESS transit photometry of GJ1252b which improves the precision of the transit ephemeris by a factor of ten, provides a more precise planetary radius of 1.180+/-0.078 R_E, and rules out any transit timing variations with amplitudes <1 min.
Comment: ApJL in press. 16 pages, 12 figures, 10 eclipses, 1 bandpass. Models will be available at journal website
Published: 2022

13. Probing the CP structure of the top quark Yukawa at the future muon collider

Author: Cassidy, Morgan E., Dong, Zhongtian, Kong, Kyoungchul, Lewis, Ian M., Zhang, Yanzhe, and Zheng, Ya-Juan
Published: 2024

14. Leveraging Expert Guided Adversarial Augmentation For Improving Generalization in Named Entity Recognition

Author: Reich, Aaron, Chen, Jiaao, Agrawal, Aastha, Zhang, Yanzhe, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Named Entity Recognition (NER) systems often demonstrate great performance on in-distribution data, but perform poorly on examples drawn from a shifted distribution. One way to evaluate the generalization ability of NER models is to use adversarial examples, on which the specific variations associated with named entities are rarely considered. To this end, we propose leveraging expert-guided heuristics to change the entity tokens and their surrounding contexts thereby altering their entity types as adversarial attacks. Using expert-guided heuristics, we augmented the CoNLL 2003 test set and manually annotated it to construct a high-quality challenging set. We found that state-of-the-art NER systems trained on CoNLL 2003 training data drop performance dramatically on our challenging set. By training on adversarial augmented training examples and using mixup for regularization, we were able to significantly improve the performance on the challenging set as well as improve out-of-domain generalization which we evaluated by using OntoNotes data. We have publicly released our dataset and code at https://github.com/GT-SALT/Guided-Adversarial-Augmentation.
Comment: ACL 2022 (Findings)
Published: 2022

15. Continual Sequence Generation with Adaptive Compositional Modules

Author: Zhang, Yanzhe, Wang, Xuezhi, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Continual learning is essential for real-world deployment when there is a need to quickly adapt the model to new tasks without forgetting knowledge of old tasks. Existing work on continual sequence generation either always reuses existing parameters to learn new tasks, which is vulnerable to catastrophic forgetting on dissimilar tasks, or blindly adds new parameters for every new task, which could prevent knowledge sharing between similar tasks. To get the best of both worlds, in this work, we propose continual sequence generation with adaptive compositional modules to adaptively add modules in transformer architectures and compose both old and new modules for new tasks. We also incorporate pseudo experience replay to facilitate knowledge transfer in those shared modules. Experiment results on various sequences of generation tasks show that our framework can adaptively add modules or reuse modules based on task similarity, outperforming state-of-the-art baselines in terms of both performance and parameter efficiency. We make our code public at https://github.com/GT-SALT/Adaptive-Compositional-Modules.
Comment: 15 pages, ACL 2022
Published: 2022

16. Directly Probing the CP-structure of the Higgs-Top Yukawa at HL-LHC and Future Colliders

Author: Barman, Rahool Kumar, Cassidy, Morgan E., Dong, Zhongtian, Gonçalves, Dorival, Kim, Jeong Han, Kling, Felix, Kong, Kyoungchul, Lewis, Ian M., Wu, Yongcheng, Zhang, Yanzhe, and Zheng, Ya-Juan
Subjects: High Energy Physics - Phenomenology, High Energy Physics - Experiment
Abstract: Constraining the Higgs boson properties is a cornerstone of the LHC program and future colliders. In this Snowmass contribution, we study the potential to directly probe the Higgs-top CP-structure via the $t\bar{t}h$ production at the HL-LHC, 100 TeV FCC and muon colliders. We find the limits on the CP phase ($\alpha$) at 95% CL are $|\alpha| \lesssim 36^\circ$ with dileptonic $t\bar t (h\to b\bar b) $ and $|\alpha| \lesssim 25^\circ$ with combined $t\bar t (h\to \gamma\gamma) $ at the HL-LHC. The 100 TeV FCC brings a significant improvement in sensitivity with $|\alpha| \lesssim 3^\circ$ for the dileptonic $t\bar t (h\to b\bar b) $, due to the remarkable gain in the signal cross-section and the increased luminosity. At future muon colliders, we find that the bounds with semileptonic $t\bar t (h\to b\bar b) \nu\bar\nu$ are $|\alpha| \lesssim 9^\circ$ for 10 TeV and $|\alpha| \lesssim 3^\circ$ for 30 TeV, respectively.
Comment: 12 pages, 8 figures, contribution to Snowmass 2021
Published: 2022

17. Long/short-range ordered porous carbon with interconnected structure for high performance supercapacitor

Author: Zhang, Yanzhe, Ma, Rui, Zhang, Binyuan, Jia, Dianzeng, Wang, Luxiang, and Guo, Nannan
Published: 2024

18. Trackside acoustic detection of axle-box bearing fault based on cyclic beamforming

Author: Hu, Dingyu, Zhang, Yanzhe, Chen, Hangyu, Shi, Wei, and Liao, Aihua
Published: 2024

19. Effect of gas atmospheres and SiO2 content on preparation and properties of SiO2–Si3N4 composite ceramics via nitridation of diamond-wire saw silicon waste powder

Author: Ma, Xiaobing, Wang, Rong, Wang, Yutao, Li, Juncheng, Du, Qingshan, Liu, Yanjun, Sun, Ming, Zhang, Yanzhe, Chen, Xiaohua, and Yang, Xiuming
Published: 2024

20. Processing export and firms’ social security contributions in China: The role of supply chain pressure

Author: Zhang, Yanzhe and Xu, Helian
Published: 2024

21. Are Chinese Citizens Satisfied with Lockdown Performance during the COVID-19 Outbreak Period? A Survey from Wuhan, Shulan, and Nanjing

Author: Zhang, Yanzhe, Zou, Bowen, Zhang, Huai, and Zhang, Jian
Published: 2023

22. Regulating porous structure of coal-derived carbon materials by a dual-activation for high performance supercapacitors

Author: Wang, Danting, Yan, Lihua, Zhang, Yanzhe, Ma, Rui, Zhang, Binyuan, Guo, Nannan, Wang, Luxiang, Ai, Lili, Jia, Dianzeng, and Xu, Mengjiao
Published: 2024

23. Crystallization-induced formation of two-dimensional carbon nanosheets derived from sodium lignosulfonate for fast lithium storage

Author: Ma, Rui, Zhou, Doudou, Zhang, Qing, Zhang, Binyuan, Zhang, Yanzhe, Chen, Feifei, Guo, Nannan, and Wang, Luxiang
Published: 2024

24. Continual Learning for Text Classification with Information Disentanglement Based Regularization

Author: Huang, Yufan, Zhang, Yanzhe, Chen, Jiaao, Wang, Xuezhi, and Yang, Diyi
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Continual learning has become increasingly important as it enables NLP models to constantly learn and gain knowledge over time. Previous continual learning methods are mainly designed to preserve knowledge from previous tasks, without much emphasis on how to well generalize models to new tasks. In this work, we propose an information disentanglement based regularization method for continual learning on text classification. Our proposed method first disentangles text hidden spaces into representations that are generic to all tasks and representations specific to each individual task, and further regularizes these representations differently to better constrain the knowledge required to generalize. We also introduce two simple auxiliary tasks: next sentence prediction and task-id prediction, for learning better generic and specific representation spaces. Experiments conducted on large-scale benchmarks demonstrate the effectiveness of our method in continual text classification tasks with various sequences and lengths over state-of-the-art baselines. We have publicly released our code at https://github.com/GT-SALT/IDBR.
Comment: NAACL 2021
Published: 2021

25. Nitrogen-doped coal-based microporous carbon material co-activated by HCOOK and urea for high performance supercapacitors

Author: Liu, Anjie, Yan, Lihua, Zhang, Yanzhe, Ma, Rui, Guo, Nannan, Wang, Luxiang, Zhang, Binyuan, Jia, Dianzeng, and Sheng, Rui
Published: 2024

26. Controlled lattice deformation for high-mobility two-dimensional MoTe2 growth

Author: Li, Ruishan, Hong, Mengyu, Shangguan, Wei, Zhang, Yanzhe, Liu, Yihe, Jiang, He, Yu, Huihui, Gao, Li, Zhang, Xiankun, Zhang, Zheng, and Zhang, Yue
Published: 2025

27. Challenges to Sustainable Development in China’s Banking Industry: A Structural Equipment Modeling Approach for Fighting Phishing in China

Author: Zhang, Yanzhe, Shao, Yutong, and Zhang, Jian
Published: 2023

28. Effect of column size on punching behavior of flat slabs with square columns: Numerical investigation

Author: Zheng, Bowen, Zheng, Wenzhong, Wang, Lu, and Zhang, Yanzhe
Published: 2023

29. Theory of Policy Learning, China

Author: Zhang, Yanzhe and Farazmand, Ali, editor
Published: 2022

30. Nonlinear finite element analysis of non-symmetrical punching shear of rectangular flat slabs supported on square columns

Author: Zheng, Bowen, Zheng, Wenzhong, Cao, Bang, and Zhang, Yanzhe
Published: 2023

31. Two-dimensional transition metal dichalcogenides for post-silicon electronics

Author: Zhang, Xiankun, Zhao, Hang, Wei, Xiaofu, Zhang, Yanzhe, Zhang, Zheng, and Zhang, Yue
Subjects: logic circuits, two-dimensional transition metal dichalcogenides, computing capability, post-silicon electronics, transistor number, Science, Engineering (General). Civil engineering (General), TA1-2040
Abstract: Rapid advancements in information technology push the explosive growth in data volume, requiring greater computing-capability logic circuits. However, conventional computing-capability improving technology, which mainly relies on increasing transistor number, encounters a significant challenge due to the weak field-effect characteristics of bulk silicon-based semiconductors. Still, the ultra-thin layered bodies of two-dimensional transition metal dichalcogenides (2D-TMDCs) materials enable excellent field-effect characteristics and multiple gate control ports, facilitating the integration of the functions of multiple transistors into one. Generally, the computing-capability improvement of the transistor cell in logic circuits will greatly alleviate the challenge in transistor numbers. In other words, one can only use a small number, or even just one, 2D-TMDCs-based transistors to conduct the sophisticated logic operations that have to be realized by using many traditional transistors. In this review, from material selection, device structure optimization, and circuit architecture design, we discuss the developments, challenges, and prospects for 2D-TMDCs-based logic circuits.
Published: 2023

32. Controlled lattice deformation for high-mobility two-dimensional MoTe2 growth

Author: Li, Ruishan, Hong, Mengyu, Shangguan, Wei, Zhang, Yanzhe, Liu, Yihe, Jiang, He, Yu, Huihui, Gao, Li, Zhang, Xiankun, Zhang, Zheng, and Zhang, Yue
Published: 2024

33. Call for Proposals of Articles for a Special Issue of Public Organization Review (POR) on Public Sector Accountability and Corruption Problems SI #4

Author: Farazmand, Ali, Zhang, Yanzhe, and Atkinson, Christopher L.
Published: 2021
Full Text: View/download PDF

34. Best Practices and Lessons Learned on Synthetic Data for Language Models

Author: Liu, Ruibo, Wei, Jerry, Liu, Fangyu, Si, Chenglei, Zhang, Yanzhe, Rao, Jinmeng, Zheng, Steven, Peng, Daiyi, Yang, Diyi, Zhou, Denny, and Dai, Andrew M.
Abstract: The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs. Synthetic data has emerged as a promising solution by generating artificial data that mimics real-world patterns. This paper provides an overview of synthetic data research, discussing its applications, challenges, and future directions. We present empirical evidence from prior art to demonstrate its effectiveness and highlight the importance of ensuring its factuality, fidelity, and unbiasedness. We emphasize the need for responsible use of synthetic data to build more powerful, inclusive, and trustworthy language models.
Published: 2024

35. A mechanism of stratospheric O3 intrusion into the atmospheric environment: a case study of the North China Plain.

Author: Luo, Yuehan, Zhao, Tianliang, Meng, Kai, Hu, Jun, Yang, Qingjian, Bai, Yongqing, Yang, Kai, Fu, Weikang, Tan, Chenghao, Zhang, Yifan, Zhang, Yanzhe, and Li, Zhikuan
Subjects: ATMOSPHERIC boundary layer, AIR pollution, STRATOSPHERIC circulation, TROPOSPHERE, STRATOSPHERE
Abstract: Stratosphere-to-troposphere transport results in the stratospheric intrusion (SI) of O3 into the free troposphere through the folding of the tropopause. However, the mechanism of SI that influences the atmospheric environment through the cross-layer transport of O3 from the stratosphere and free troposphere to the atmospheric boundary layer has not been elucidated thoroughly. In this study, an SI event over the North China Plain (NCP; 33–40° N, 114–121° E) during 19–20 May 2019 was chosen to investigate the mechanism of the cross-layer transport of stratospheric O3 and its impact on the near-surface O3 based on multi-source reanalysis, observation data, and air quality modeling. The results revealed a mechanism of stratospheric O3 intrusion into the atmospheric environment induced by an extratropical cyclone system. The SI with downward transport of stratospheric O3 to the near-surface layer was driven by the extratropical cyclone system, with vertical coupling of the upper westerly trough, the middle of the northeast cold vortex (NECV), and the lower extratropical cyclone, in the troposphere. The deep trough in the westerly jet aroused the tropopause folding, and the lower-stratospheric O3 penetrated the folded tropopause into the upper and middle troposphere; the westerly trough was cut off to form a typical cold vortex in the upper and middle troposphere. The compensating downdrafts of the NECV further pushed the downward transport of stratospheric O3 in the free troposphere; the NECV activated an extratropical cyclone in the lower troposphere; and the vertical cyclonic circulation governed the stratospheric O3 from the free troposphere across the boundary layer top, invading the near-surface atmosphere. In this SI event, the average contribution of stratospheric O3 to near-surface O3 was accounted for at 26.77 %.
The proposed meteorological mechanism of the vertical transport of stratospheric O3 into the near-surface atmosphere, driven by an extratropical cyclone system, could improve the understanding of the influence of stratospheric O3 on the atmospheric environment, with implications for the coordinated control of atmospheric pollution. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

36. Uncertainty of the Export VAT Rebate Policy: Measurement and Its Effects.

Author: Zhang, Yanzhe, Xu, Helian, and Shao, Xiaokai
Subjects: REBATES, PRICE increases, VALUE (Economics)
Abstract: Based on export VAT rebate rate data from 2004 to 2015, this paper measures the uncertainty of Chinese export VAT rebate policy using unexpected fluctuations in the export VAT rebate rate, and finds that the index varies across products, years and industries. Then, the paper studies the influence of the uncertainty of export VAT rebate policy on China's export growth from both theoretical and empirical perspectives. The research results show that this uncertainty reduces the expected profit of export enterprises and their optimal output, and significantly reduces the growth rate of ordinary exports to which the export VAT rebate method applies, but it has no impact on processing exports. By decomposing the total value of exports, it is found that the growth in the number of ordinary export relationships and in average sales decreases as uncertainty increases, while the average price growth rate increases slightly, and these effects are only reflected in newly entered export relationships. The findings suggest that when the uncertainty of export VAT rebate policy is high, enterprises are more cautious about entering the ordinary export market and more likely to adopt a low-quantity-high-price strategy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

37. A significant mechanism of stratospheric O3 intrusion to atmospheric environment: a case study of North China Plain

Author: Luo, Yuehan, primary, Zhao, Tianliang, additional, Meng, Kai, additional, Hu, Jun, additional, Yang, Qingjian, additional, Bai, Yongqing, additional, Yang, Kai, additional, Fu, Weikang, additional, Tan, Chenghao, additional, Zhang, Yifan, additional, Zhang, Yanzhe, additional, and Li, Zhikuan, additional
Published: 2024
Full Text: View/download PDF

38. Supplementary material to "A significant mechanism of stratospheric O3 intrusion to atmospheric environment: a case study of North China Plain"

Author: Luo, Yuehan, primary, Zhao, Tianliang, additional, Meng, Kai, additional, Hu, Jun, additional, Yang, Qingjian, additional, Bai, Yongqing, additional, Yang, Kai, additional, Fu, Weikang, additional, Tan, Chenghao, additional, Zhang, Yifan, additional, Zhang, Yanzhe, additional, and Li, Zhikuan, additional
Published: 2024
Full Text: View/download PDF

39. Low-temperature enhancement in the extraction of phosphorus from metallurgical-grade silicon by simultaneously re-constructing phosphorus-rich phases of Ca3P2 and CaAl2Si2-P

Author: Liang, Jinshan, primary, Zhang, Yanzhe, additional, Huang, Xinping, additional, Li, Juncheng, additional, Zhao, Qing, additional, Li, Yabo, additional, and Li, Jingwei, additional
Published: 2024
Full Text: View/download PDF

40. Theory of Policy Learning, China

Author: Zhang, Yanzhe and Farazmand, Ali, editor
Published: 2018
Full Text: View/download PDF

41. Adaptive unscented Kalman filter methods for identifying time‐variant parameters via state covariance re‐updating.

Author: Zhang, Yanzhe, Ding, Yong, Bu, Jianqing, and Guo, Lina
Subjects: KALMAN filtering, SHAKING table tests, PARAMETER identification, COVARIANCE matrices, NONLINEAR systems
Abstract: The conventional parameter identification process generally assumes that parameters remain constant. However, under extreme loading conditions, structures may exhibit nonlinear behavior, and parameters could demonstrate time‐variant characteristics. The unscented Kalman filter (UKF), as an efficient online recursive estimator, is widely used for identifying parameters of nonlinear systems. Nevertheless, it exhibits limitations when attempting to identify time‐variant parameters. To address this issue, this paper proposes a covariance matching technique that produces an array of adaptive UKF algorithms. Firstly, the sensitivity parameter η is defined to identify the instant when the parameter change occurs, and its threshold is calculated based on the sensitivity parameter time history curve. Secondly, an adaptive forgetting factor is introduced to simultaneously update the innovation, cross, and state covariance matrices when the kth‐step sensitive parameter surpasses the threshold. Finally, a secondary correction forgetting factor (SCFF) is employed to further re‐update the state covariance values at the identified damage locations. This creative step enhances the adaptive capability and optimizes the identification accuracy of the proposed algorithms. Both the numerical simulations and shaking table test demonstrate that the proposed adaptive algorithms can efficiently identify the time‐variant stiffness‐type parameters, and accurately capture their time‐variant characteristics. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

42. Low-temperature enhancement in the extraction of phosphorus from metallurgical-grade silicon by simultaneously re-constructing phosphorus-rich phases of Ca3P2 and CaAl2Si2-P.

Author: Liang, Jinshan, Zhang, Yanzhe, Huang, Xinping, Li, Juncheng, Zhao, Qing, Li, Yabo, and Li, Jingwei
Abstract: A novel method of combined Si-Al solvent refining with CaCl2-CaF2-CaO molten salt treatment for phosphorus removal from metallurgical-grade silicon was investigated. With the mass ratio of salt (45 wt.%CaCl2–45 wt.%CaF2–10 wt.%CaO) to the alloy (phosphorus-doped Al-Si) being 1:1, phosphorus-rich phases of Ca3P2 and CaAl2Si2-P were successfully reconstructed at 1250 °C for 1 h. After acid leaching, phosphorus content decreased from 3 wt.% to 0.24 wt.%, corresponding to a phosphorus removal rate of 91.95%. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. A Novel Adaptive Square Root UKF with Forgetting Factor for the Time-Variant Parameter Identification

Author: Zhang, Yanzhe, primary, Ding, Yong, additional, Bu, Jianqing, additional, and Guo, Lina, additional
Published: 2023
Full Text: View/download PDF

44. Synergistic antibacterial activity of multi components in lysozyme/chitosan/silver/hydroxyapatite hybrid coating

Author: Yu, Wu-Zhong, Zhang, Yanzhe, Liu, Xiangmei, Xiang, Yiming, Li, Zhaoyang, and Wu, Shuilin
Published: 2018
Full Text: View/download PDF

45. Experimental Study on the Damage Mechanism of Reinforced Concrete Beams Based on Acoustic Emission Technique

Author: Bu, Jianqing, primary, Guo, Zhibo, additional, Zhang, Jiren, additional, and Zhang, Yanzhe, additional
Published: 2023
Full Text: View/download PDF

46. A significant mechanism of stratospheric O3 intrusion to atmospheric environment: a case study of North China Plain.

Author: Luo, Yuehan, Zhao, Tianliang, Meng, Kai, Hu, Jun, Yang, Qingjian, Bai, Yongqing, Yang, Kai, Fu, Weikang, Tan, Chenghao, Zhang, Yifan, Zhang, Yanzhe, and Li, Zhikuan
Subjects: ATMOSPHERIC boundary layer, CYCLONES, QUASI-biennial oscillation (Meteorology), AIR pollution, STRATOSPHERIC circulation, TROPOSPHERE
Abstract: Stratosphere-to-troposphere transport results in the stratospheric intrusion (SI) of O3 into the free troposphere through tropopause folding. However, the mechanism by which SI influences the atmospheric environment through the cross-layer transport of O3 from the stratosphere and free troposphere to the atmospheric boundary layer has not been elucidated thoroughly. In this study, an SI event over the North China Plain (NCP) was used to investigate the mechanism of the cross-layer transport of stratospheric O3 and its impact on near-surface O3, based on multi-source reanalysis, observation data, and air quality modeling. The results revealed a significant mechanism of stratospheric O3 intrusion into the atmospheric environment induced by an extratropical cyclone system. The SI with downward transport of stratospheric O3 to the near-surface layer was driven by the extratropical cyclone system with vertical coupling of 'upper westerly trough-middle the Northeast Cold Vortex (NECV)-lower extratropical cyclone' in the troposphere. The deep trough in the westerly jet aroused the tropopause folding, and the lower stratospheric O3 penetrated the folded tropopause into the upper and middle troposphere; the westerly trough was cut off to form a typical cold vortex in the upper and middle troposphere. The compensating downdrafts of the NECV pushed the further downward transport of stratospheric O3 in the free troposphere; the NECV activated an extratropical cyclone in the lower troposphere, and the vertical cyclonic circulation governed the stratospheric O3 from the free troposphere across the boundary layer top, invading the near-surface atmosphere. In this SI event, the average contribution of stratospheric O3 to near-surface O3 was 26.77 %.
The proposed meteorological mechanism of vertical transport of stratospheric O3 into the near-surface atmosphere driven by an extratropical cyclone system could improve the understanding of the influence of stratospheric O3 on atmospheric environment with implications for the coordinated control of atmospheric pollution. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

47. Van der Waals‐Interface‐Dominated All‐2D Electronics.

Author: Zhang, Xiankun, Zhang, Yanzhe, Yu, Huihui, Zhao, Hang, Cao, Zhihong, Zhang, Zheng, and Zhang, Yue
Published: 2023
Full Text: View/download PDF

48. An Unsupervised Fundus Image Enhancement Method with Multi-Scale Transformer and Unreferenced Loss

Author: Hu, Yanzhe, Li, Yu, Zou, Hua, and Zhang, Xuedong
Subjects: unsupervised learning, color fundus images enhancement, attention
Abstract: Color fundus images are now widely used in computer-aided analysis systems for ophthalmic diseases. However, fundus imaging can be affected by human, environmental, and equipment factors, which may result in low-quality images. Such low-quality fundus images will interfere with computer-aided diagnosis. Existing methods for enhancing low-quality fundus images focus more on the overall visualization of the image than on sufficiently capturing pathological and structural features at the finer scales of the fundus image. In this paper, we design an unsupervised method that integrates a multi-scale feature fusion transformer and an unreferenced loss function. Due to the loss of microscale features caused by unpaired training, we construct the Global Feature Extraction Module (GFEM), a combination of convolution blocks and residual Swin Transformer modules, to achieve the extraction of feature information at different levels while reducing computational costs. To reduce the blurring of image details caused by deep unsupervised networks, we define unreferenced loss functions that improve the model’s ability to suppress edge sharpness degradation. In addition, uneven light distribution can also affect image quality, so we use an a priori luminance-based attention mechanism to correct uneven illumination in low-quality images. On the public dataset, we achieve an improvement of 0.88 dB in PSNR and 0.024 in SSIM compared to the state-of-the-art methods. Experimental results show that our method outperforms other deep learning methods in terms of vascular continuity and preservation of fine pathological features. Such a framework may have potential medical applications.
Published: 2023
Full Text: View/download PDF

49. Effects of Aerosol Number Concentration and Updraft Velocity on Relative Dispersion during the Collision–Coalescence Growth Stage of Warm Clouds

Author: Yang, Suying, primary, Zhang, Yanzhe, additional, Yu, Xinyang, additional, Lu, Chunsong, additional, and Li, Yiyu, additional
Published: 2023
Full Text: View/download PDF

50. Nitrogen-Doped Hierarchical Porous Carbon Derived from Coal for High-Performance Supercapacitor

Author: Cai, Leiming, primary, Zhang, Yanzhe, additional, Ma, Rui, additional, Feng, Xia, additional, Yan, Lihua, additional, Jia, Dianzeng, additional, Xu, Mengjiao, additional, Ai, Lili, additional, Guo, Nannan, additional, and Wang, Luxiang, additional
Published: 2023
Full Text: View/download PDF
