Author: "Chang, Hoyeon" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chang, Hoyeon"' showing total 4 results

Start Over Author "Chang, Hoyeon"

4 results on '"Chang, Hoyeon"'

1. How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?

Author: Lee, Seongyun, Kim, Geewook, Kim, Jiyeon, Lee, Hyunji, Chang, Hoyeon, Park, Sue Hyun, and Seo, Minjoon
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: Vision-Language adaptation (VL adaptation) transforms Large Language Models (LLMs) into Large Vision-Language Models (LVLMs) for multimodal tasks, but this process often compromises the inherent safety capabilities embedded in the original LLMs. Despite potential harmfulness due to weakened safety measures, in-depth analysis on the effects of VL adaptation on safety remains under-explored. This study examines how VL adaptation influences safety and evaluates the impact of safety fine-tuning methods. Our analysis reveals that safety degradation occurs during VL adaptation, even when the training data is safe. While safety tuning techniques like supervised fine-tuning with safety datasets or reinforcement learning from human feedback mitigate some risks, they still lead to safety degradation and a reduction in helpfulness due to over-rejection issues. Further analysis of internal model weights suggests that VL adaptation may impact certain safety-related layers, potentially lowering overall safety levels. Additionally, our findings demonstrate that the objectives of VL adaptation and safety tuning are divergent, which often results in their simultaneous application being suboptimal. To address this, we suggest the weight merging approach as an optimal solution effectively reducing safety degradation while maintaining helpfulness. These insights help guide the development of more reliable and secure LVLMs for real-world applications.
Published: 2024

2. How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Author: Chang, Hoyeon, Park, Jinho, Ye, Seonghyeon, Yang, Sohee, Seo, Youngkyung, Chang, Du-Seong, and Seo, Minjoon
Subjects: Computer Science - Computation and Language, I.2.7
Abstract: Despite the recent observation that large language models (LLMs) can store substantial factual knowledge, there is a limited understanding of the mechanisms of how they acquire factual knowledge through pretraining. This work addresses this gap by studying how LLMs acquire factual knowledge during pretraining. The findings reveal several important insights into the dynamics of factual knowledge acquisition during pretraining. First, counterintuitively, we observe that pretraining on more data shows no significant improvement in the model's capability to acquire and maintain factual knowledge. Next, there is a power-law relationship between training steps and forgetting of memorization and generalization of factual knowledge, and LLMs trained with duplicated training data exhibit faster forgetting. Third, training LLMs with larger batch sizes can enhance the models' robustness to forgetting. Overall, our observations suggest that factual knowledge acquisition in LLM pretraining occurs by progressively increasing the probability of factual knowledge presented in the pretraining data at each step. However, this increase is diluted by subsequent forgetting. Based on this interpretation, we demonstrate that we can provide plausible explanations for recently observed behaviors of LLMs, such as the poor performance of LLMs on long-tail knowledge and the benefits of deduplicating the pretraining corpus.
Published: 2024

3. Nonparametric Decoding for Generative Retrieval

Author: Lee, Hyunji, Kim, Jaeyoung, Chang, Hoyeon, Oh, Hanseok, Yang, Sohee, Karpukhin, Vlad, Lu, Yi, and Seo, Minjoon
Subjects: Computer Science - Information Retrieval, Computer Science - Artificial Intelligence
Abstract: The generative retrieval model depends solely on the information encoded in its model parameters without external memory, its information capacity is limited and fixed. To overcome the limitation, we propose Nonparametric Decoding (Np Decoding) which can be applied to existing generative retrieval models. Np Decoding uses nonparametric contextualized vocab embeddings (external memory) rather than vanilla vocab embeddings as decoder vocab embeddings. By leveraging the contextualized vocab embeddings, the generative retrieval model is able to utilize both the parametric and nonparametric space. Evaluation over 9 datasets (8 single-hop and 1 multi-hop) in the document retrieval task shows that applying Np Decoding to generative retrieval models significantly improves the performance. We also show that Np Decoding is data- and parameter-efficient, and shows high performance in the zero-shot setting., Comment: published at Findings of ACL 2023
Published: 2022

4. Nonparametric Decoding for Generative Retrieval

Author: Lee, Hyunji, primary, Kim, JaeYoung, additional, Chang, Hoyeon, additional, Oh, Hanseok, additional, Yang, Sohee, additional, Karpukhin, Vladimir, additional, Lu, Yi, additional, and Seo, Minjoon, additional
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

4 results on '"Chang, Hoyeon"'

1. How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?

2. How Do Large Language Models Acquire Factual Knowledge During Pretraining?

3. Nonparametric Decoding for Generative Retrieval

4. Nonparametric Decoding for Generative Retrieval

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

Publisher

4 results on '"Chang, Hoyeon"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources