Search

Your search keyword '"Zhou, Pan"' showing total 174 results

Search Constraints

Start Over You searched for: Author "Zhou, Pan" Remove constraint Author: "Zhou, Pan" Publication Type Reports Remove constraint Publication Type: Reports
174 results on '"Zhou, Pan"'

Search Results

1. MoExtend: Tuning New Experts for Modality and Task Extension

2. Can Large Language Models Automatically Jailbreak GPT-4V?

3. A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends

4. LoCo: Low-Bit Communication Adaptor for Large-scale Model Training

5. Self-Cognition in Large Language Models: An Exploratory Study

6. Virtual Context: Enhancing Jailbreak Attacks with Special Token Injection

7. A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning

8. GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents

9. Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?

10. MVGamba: Unify 3D Content Generation as State Space Sequence Modeling

11. AutoJailbreak: Exploring Jailbreak Attacks and Defenses through a Dependency Lens

12. 4-bit Shampoo for Memory-Efficient Network Training

13. LOVA3: Learning to Visual Question Answering, Asking and Assessment

14. Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World

15. CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code

16. Does Your Neural Code Completion Model Use My Code? A Membership Inference Approach

17. Diffusion Time-step Curriculum for One Image to 3D Generation

18. Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction

19. Optimization-based Prompt Injection Attack to LLM-as-a-Judge

20. Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models

21. Friendly Sharpness-Aware Minimization

22. What Makes Good Collaborative Views? Contrastive Mutual Information Maximization for Multi-Agent Perception

23. Few-shot Learner Parameterization by Diffusion Time-steps

24. MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark

25. Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior

26. The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023

27. ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

28. MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition

29. The Security and Privacy of Mobile Edge Computing: An Artificial Intelligence Perspective

30. U2-KWS: Unified Two-pass Open-vocabulary Keyword Spotting with Keyword Bias

31. Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

32. Towards Inductive Robustness: Distilling and Fostering Wave-induced Resonance in Transductive GCNs Against Graph Adversarial Attacks

33. Genixer: Empowering Multimodal Large Language Models as a Powerful Data Generator

34. Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

35. Exploring the Robustness of Decentralized Training for Large Language Models

36. MetaCloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning

37. Jailbreaking GPT-4V via Self-Adversarial Attacks with System Prompts

38. Instant3D: Instant Text-to-3D Generation

39. F$^2$AT: Feature-Focusing Adversarial Training via Disentanglement of Natural and Perturbed Patterns

40. ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection

41. GraphCloak: Safeguarding Task-specific Knowledge within Graph-structured Data from Unauthorized Exploitation

42. MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

43. 3DHacker: Spectrum-based Decision Boundary Generation for Hard-label 3D Point Cloud Attack

44. Fast Diffusion Model

45. Graph Agent Network: Empowering Nodes with Inference Capabilities for Adversarial Resilience

46. Transform-Equivariant Consistency Learning for Temporal Sentence Grounding

47. InceptionNeXt: When Inception Meets ConvNeXt

48. MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer

49. You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos

50. Unlearnable Graph: Protecting Graphs from Unauthorized Exploitation

Catalog

Books, media, physical & digital resources