Author: "Zha, Yuheng" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Zha, Yuheng"' showing total 8 results

Start Over Author "Zha, Yuheng"

8 results on '"Zha, Yuheng"'

1. Pandora: Towards General World Model with Natural Language Actions and Video States

Author: Xiang, Jiannan, Liu, Guangyi, Gu, Yi, Gao, Qiyue, Ning, Yuting, Zha, Yuheng, Feng, Zeyu, Tao, Tianhua, Hao, Shibo, Shi, Yemin, Liu, Zhengzhong, Xing, Eric P., and Hu, Zhiting
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: World models simulate future states of the world in response to different actions. They facilitate interactive content creation and provides a foundation for grounded, long-horizon reasoning. Current foundation models do not fully meet the capabilities of general world models: large language models (LLMs) are constrained by their reliance on language modality and their limited understanding of the physical world, while video models lack interactive action control over the world simulations. This paper makes a step towards building a general world model by introducing Pandora, a hybrid autoregressive-diffusion model that simulates world states by generating videos and allows real-time control with free-text actions. Pandora achieves domain generality, video consistency, and controllability through large-scale pretraining and instruction tuning. Crucially, Pandora bypasses the cost of training-from-scratch by integrating a pretrained LLM (7B) and a pretrained video model, requiring only additional lightweight finetuning. We illustrate extensive outputs by Pandora across diverse domains (indoor/outdoor, natural/urban, human/robot, 2D/3D, etc.). The results indicate great potential of building stronger general world models with larger-scale training., Comment: Website: https://world-model.maitrix.org/
Published: 2024

2. Text Alignment Is An Efficient Unified Model for Massive NLP Tasks

Author: Zha, Yuheng, Yang, Yichi, Li, Ruichen, and Hu, Zhiting
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs), typically designed as a function of next-word prediction, have excelled across extensive NLP tasks. Despite the generality, next-word prediction is often not an efficient formulation for many of the tasks, demanding an extreme scale of model parameters (10s or 100s of billions) and sometimes yielding suboptimal performance. In practice, it is often desirable to build more efficient models -- despite being less versatile, they still apply to a substantial subset of problems, delivering on par or even superior performance with much smaller model sizes. In this paper, we propose text alignment as an efficient unified model for a wide range of crucial tasks involving text entailment, similarity, question answering (and answerability), factual consistency, and so forth. Given a pair of texts, the model measures the degree of alignment between their information. We instantiate an alignment model (Align) through lightweight finetuning of RoBERTa (355M parameters) using 5.9M examples from 28 datasets. Despite its compact size, extensive experiments show the model's efficiency and strong performance: (1) On over 20 datasets of aforementioned diverse tasks, the model matches or surpasses FLAN-T5 models that have around 2x or 10x more parameters; the single unified model also outperforms task-specific models finetuned on individual datasets; (2) When applied to evaluate factual consistency of language generation on 23 datasets, our model improves over various baselines, including the much larger GPT-3.5 (ChatGPT) and sometimes even GPT-4; (3) The lightweight model can also serve as an add-on component for LLMs such as GPT-3.5 in question answering tasks, improving the average exact match (EM) score by 17.94 and F1 score by 15.05 through identifying unanswerable questions., Comment: NeurIPS 2023 Camera Ready, Code available at https://github.com/yuh-zha/Align
Published: 2023

3. AlignScore: Evaluating Factual Consistency with a Unified Alignment Function

Author: Zha, Yuheng, Yang, Yichi, Li, Ruichen, and Hu, Zhiting
Subjects: Computer Science - Computation and Language
Abstract: Many text generation applications require the generated text to be factually consistent with input information. Automatic evaluation of factual consistency is challenging. Previous work has developed various metrics that often depend on specific functions, such as natural language inference (NLI) or question answering (QA), trained on limited data. Those metrics thus can hardly assess diverse factual inconsistencies (e.g., contradictions, hallucinations) that occur in varying inputs/outputs (e.g., sentences, documents) from different tasks. In this paper, we propose AlignScore, a new holistic metric that applies to a variety of factual inconsistency scenarios as above. AlignScore is based on a general function of information alignment between two arbitrary text pieces. Crucially, we develop a unified training framework of the alignment function by integrating a large diversity of data sources, resulting in 4.7M training examples from 7 well-established tasks (NLI, QA, paraphrasing, fact verification, information retrieval, semantic similarity, and summarization). We conduct extensive experiments on large-scale benchmarks including 22 evaluation datasets, where 19 of the datasets were never seen in the alignment training. AlignScore achieves substantial improvement over a wide range of previous metrics. Moreover, AlignScore (355M parameters) matches or even outperforms metrics based on ChatGPT and GPT-4 that are orders of magnitude larger., Comment: 19 pages, 5 figures, ACL2023
Published: 2023

4. AOMD: An Analogy-aware Approach to Offensive Meme Detection on Social Media

Author: Shang, Lanyu, Zhang, Yang, Zha, Yuheng, Chen, Yingxi, Youn, Christina, and Wang, Dong
Subjects: Computer Science - Machine Learning
Abstract: This paper focuses on an important problem of detecting offensive analogy meme on online social media where the visual content and the texts/captions of the meme together make an analogy to convey the offensive information. Existing offensive meme detection solutions often ignore the implicit relation between the visual and textual contents of the meme and are insufficient to identify the offensive analogy memes. Two important challenges exist in accurately detecting the offensive analogy memes: i) it is not trivial to capture the analogy that is often implicitly conveyed by a meme; ii) it is also challenging to effectively align the complex analogy across different data modalities in a meme. To address the above challenges, we develop a deep learning based Analogy-aware Offensive Meme Detection (AOMD) framework to learn the implicit analogy from the multi-modal contents of the meme and effectively detect offensive analogy memes. We evaluate AOMD on two real-world datasets from online social media. Evaluation results show that AOMD achieves significant performance gains compared to state-of-the-art baselines by detecting offensive analogy memes more accurately.
Published: 2021

5. AlignScore: Evaluating Factual Consistency with A Unified Alignment Function

Author: Zha, Yuheng, primary, Yang, Yichi, additional, Li, Ruichen, additional, and Hu, Zhiting, additional
Published: 2023
Full Text: View/download PDF

6. Multi‐branch angle aware spatial temporal graph convolutional neural network for model‐based gait recognition

Author: Zheng, Liyang, primary, Zha, Yuheng, additional, Kong, Da, additional, Yang, Hanqing, additional, and Zhang, Yu, additional
Published: 2022
Full Text: View/download PDF

7. KnowMeme: A Knowledge-enriched Graph Neural Network Solution to Offensive Meme Detection

Author: Shang, Lanyu, primary, Youn, Christina, additional, Zha, Yuheng, additional, Zhang, Yang, additional, and Wang, Dong, additional
Published: 2021
Full Text: View/download PDF

8. AOMD: An analogy-aware approach to offensive meme detection on social media

Author: Shang, Lanyu, primary, Zhang, Yang, additional, Zha, Yuheng, additional, Chen, Yingxi, additional, Youn, Christina, additional, and Wang, Dong, additional
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

8 results on '"Zha, Yuheng"'

1. Pandora: Towards General World Model with Natural Language Actions and Video States

2. Text Alignment Is An Efficient Unified Model for Massive NLP Tasks

3. AlignScore: Evaluating Factual Consistency with a Unified Alignment Function

4. AOMD: An Analogy-aware Approach to Offensive Meme Detection on Social Media

5. AlignScore: Evaluating Factual Consistency with A Unified Alignment Function

6. Multi‐branch angle aware spatial temporal graph convolutional neural network for model‐based gait recognition

7. KnowMeme: A Knowledge-enriched Graph Neural Network Solution to Offensive Meme Detection

8. AOMD: An analogy-aware approach to offensive meme detection on social media

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

8 results on '"Zha, Yuheng"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources