Cite
CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning
MLA
Yang, Long, et al. CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning. 2022. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2202.07565&authtype=sso&custid=ns315887.
APA
Yang, L., Ji, J., Dai, J., Zhang, Y., Li, P., & Pan, G. (2022). CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning.
Chicago
Yang, Long, Jiaming Ji, Juntao Dai, Yu Zhang, Pengfei Li, and Gang Pan. 2022. “CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2202.07565&authtype=sso&custid=ns315887.