FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

Authors :: Jiang, Yuxin
Wang, Yufei
Zeng, Xingshan
Zhong, Wanjun
Li, Liangyou
Mi, Fei
Shang, Lifeng
Jiang, Xin
Liu, Qun
Wang, Wei
Publication Year :: 2023
Abstract: The ability to follow instructions is crucial for Large Language Models (LLMs) to handle various real-world applications. Existing benchmarks primarily focus on evaluating pure response quality, rather than assessing whether the response follows constraints stated in the instruction. To fill this research gap, in this paper, we propose FollowBench, a Multi-level Fine-grained Constraints Following Benchmark for LLMs. FollowBench comprehensively includes five different types (i.e., Content, Situation, Style, Format, and Example) of fine-grained constraints. To enable a precise constraint following estimation on diverse difficulties, we introduce a Multi-level mechanism that incrementally adds a single constraint to the initial instruction at each increased level. To assess whether LLMs' outputs have satisfied every individual constraint, we propose to prompt strong LLMs with constraint-evolution paths to handle challenging open-ended instructions. By evaluating 13 closed-source and open-source popular LLMs on FollowBench, we highlight the weaknesses of LLMs in instruction following and point towards potential avenues for future work. The data and code are publicly available at https://github.com/YJiangcm/FollowBench.
Comment: 22 pages, 11 figures, 16 tables. ACL 2024 main camera-ready version

Details

Database :: OAIster
Publication Type :: Electronic Resource
Accession number :: edsoai.on1438494919
Document Type :: Electronic Resource
