Back to Search Start Over

FANTAstic SEquences and Where to Find Them: Faithful and Efficient API Call Generation through State-tracked Constrained Decoding and Reranking

Authors :
Wang, Zhuoer
Ribeiro, Leonardo F. R.
Papangelis, Alexandros
Mukherjee, Rohan
Wang, Tzu-Yen
Zhao, Xinyan
Biswas, Arijit
Caverlee, James
Metallinou, Angeliki
Publication Year :
2024

Abstract

API call generation is the cornerstone of large language models' tool-using ability that provides access to the larger world. However, existing supervised and in-context learning approaches suffer from high training costs, poor data efficiency, and generated API calls that can be unfaithful to the API documentation and the user's request. To address these limitations, we propose an output-side optimization approach called FANTASE. Two of the unique contributions of FANTASE are its State-Tracked Constrained Decoding (SCD) and Reranking components. SCD dynamically incorporates appropriate API constraints in the form of Token Search Trie for efficient and guaranteed generation faithfulness with respect to the API documentation. The Reranking component efficiently brings in the supervised signal by leveraging a lightweight model as the discriminator to rerank the beam-searched candidate generations of the large language model. We demonstrate the superior performance of FANTASE in API call generation accuracy, inference efficiency, and context efficiency with DSTC8 and API Bank datasets.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2407.13945
Document Type :
Working Paper