Back to Search Start Over

Natural Language Outlines for Code: Literate Programming in the LLM Era

Authors :
Shi, Kensen
Altınbüken, Deniz
Anand, Saswat
Christodorescu, Mihai
Grünwedel, Katja
Koenings, Alexa
Naidu, Sai
Pathak, Anurag
Rasi, Marc
Ribeiro, Fredde
Ruffin, Brandon
Sanyam, Siddhant
Tabachnyk, Maxim
Toth, Sara
Tu, Roy
Welp, Tobias
Yin, Pengcheng
Zaheer, Manzil
Chandra, Satish
Sutton, Charles
Publication Year :
2024

Abstract

We propose using natural language outlines as a novel modality and interaction surface for providing AI assistance to developers throughout the software development process. An NL outline for a code function comprises multiple statements written in concise prose, which partition the code and summarize its main ideas in the style of literate programming. Crucially, we find that modern LLMs can generate accurate and high-quality NL outlines in practice. Moreover, NL outlines enable a bidirectional sync between code and NL, allowing changes in one to be automatically reflected in the other. We discuss many use cases for NL outlines: they can accelerate understanding and navigation of code and diffs, simplify code maintenance, augment code search, steer code generation, and more. We then propose and compare multiple LLM prompting techniques for generating outlines and ask professional developers to judge outline quality. Finally, we present two case studies applying NL outlines toward code review and the difficult task of malware detection.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2408.04820
Document Type :
Working Paper