Cite
Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification
MLA
Gajcin, Jasmina, et al. Iterative Reward Shaping Using Human Feedback for Correcting Reward Misspecification. 2023. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2308.15969&authtype=sso&custid=ns315887.
APA
Gajcin, J., McCarthy, J., Nair, R., Marinescu, R., Daly, E., & Dusparic, I. (2023). Iterative Reward Shaping using Human Feedback for Correcting Reward Misspecification.
Chicago
Gajcin, Jasmina, James McCarthy, Rahul Nair, Radu Marinescu, Elizabeth Daly, and Ivana Dusparic. 2023. “Iterative Reward Shaping Using Human Feedback for Correcting Reward Misspecification.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2308.15969&authtype=sso&custid=ns315887.