Back to Search Start Over

Thread Detection and Response Generation using Transformers with Prompt Optimisation

Authors :
T, Kevin Joshua
Agarwal, Arnav
Sanjay, Shriya
Sarda, Yash
Alex, John Sahaya Rani
Gupta, Saurav
Kumar, Sushant
Kamath, Vishwanath
Publication Year :
2024

Abstract

Conversational systems are crucial for human-computer interaction, managing complex dialogues by identifying threads and prioritising responses. This is especially vital in multi-party conversations, where precise identification of threads and strategic response prioritisation ensure efficient dialogue management. To address these challenges an end-to-end model that identifies threads and prioritises their response generation based on the importance was developed, involving a systematic decomposition of the problem into discrete components - thread detection, prioritisation, and performance optimisation which was meticulously analysed and optimised. These refined components seamlessly integrate into a unified framework, in conversational systems. Llama2 7b is used due to its high level of generalisation but the system can be updated with any open source Large Language Model(LLM). The computational capabilities of the Llama2 model was augmented by using fine tuning methods and strategic prompting techniques to optimise the model's performance, reducing computational time and increasing the accuracy of the model. The model achieves up to 10x speed improvement, while generating more coherent results compared to existing models.<br />Comment: 6 pages, 4 figures, submitted to 2024 IEEE International Conference on Signal Processing and Communications (SPCOM)

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2403.05931
Document Type :
Working Paper