Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection
MLA
Wei, Zhipeng, et al. Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection. 2024. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2411.01077&authtype=sso&custid=ns315887.
APA
Wei, Z., Liu, Y., & Erichson, N. B. (2024). Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection.
Chicago
Wei, Zhipeng, Yuqi Liu, and N. Benjamin Erichson. 2024. “Emoji Attack: A Method for Misleading Judge LLMs in Safety Risk Detection.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2411.01077&authtype=sso&custid=ns315887.