Cite
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
MLA
Lin, Yen-Ting, et al. Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback. 2025. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2501.10799&authtype=sso&custid=ns315887.
APA
Lin, Y.-T., Jin, D., Xu, T., Wu, T., Sukhbaatar, S., Zhu, C., He, Y., Chen, Y.-N., Weston, J., Tian, Y., Rahnama, A., Wang, S., Ma, H., & Fang, H. (2025). Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback.
Chicago
Lin, Yen-Ting, Di Jin, Tengyu Xu, Tianhao Wu, Sainbayar Sukhbaatar, Chen Zhu, Yun He, et al. 2025. “Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2501.10799&authtype=sso&custid=ns315887.