Cite
Soft policy optimization using dual-track advantage estimator
MLA
Huang, Yubo, et al. Soft Policy Optimization Using Dual-Track Advantage Estimator. 2020. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2009.06858&authtype=sso&custid=ns315887.
APA
Huang, Y., Wang, X., Zou, L., Zhuang, Z., & Zhang, W. (2020). Soft policy optimization using dual-track advantage estimator.
Chicago
Huang, Yubo, Xuechun Wang, Luobao Zou, Zhiwei Zhuang, and Weidong Zhang. 2020. “Soft Policy Optimization Using Dual-Track Advantage Estimator.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2009.06858&authtype=sso&custid=ns315887.