LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression
MLA
Chen, Jieneng, et al. LLaVolta: Efficient Multi-Modal Models via Stage-Wise Visual Context Compression. 2024. EBSCOhost, widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2406.20092&authtype=sso&custid=ns315887.
APA
Chen, J., Ye, L., He, J., Wang, Z.-Y., Khashabi, D., & Yuille, A. (2024). LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression.
Chicago
Chen, Jieneng, Luoxin Ye, Ju He, Zhao-Yang Wang, Daniel Khashabi, and Alan Yuille. 2024. “LLaVolta: Efficient Multi-Modal Models via Stage-Wise Visual Context Compression.” http://widgets.ebscohost.com/prod/customlink/proxify/proxify.php?count=1&encode=0&proxy=&find_1=&replace_1=&target=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2406.20092&authtype=sso&custid=ns315887.