Cite
RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs
MLA
Wu, Jiaxing, et al. RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs. 2024. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2409.04421&authtype=sso&custid=ns315887.
APA
Wu, J., Ning, L., Liu, L., Lee, H., Wu, N., Wang, C., Prakash, S., O’Banion, S., Green, B., & Xie, J. (2024). RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs.
Chicago
Wu, Jiaxing, Lin Ning, Luyang Liu, Harrison Lee, Neo Wu, Chao Wang, Sushant Prakash, Shawn O’Banion, Bradley Green, and Jun Xie. 2024. “RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2409.04421&authtype=sso&custid=ns315887.