Search

Your search keyword '"Hu, Di"' showing total 2,192 results

Search Results

1. Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection

2. KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance

3. Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation

4. Boosting Audio Visual Question Answering via Key Semantic-Aware Cues

5. Towards Effective and Efficient Continual Pre-training of Large Language Models

6. Unveiling and Mitigating Bias in Audio Visual Segmentation

7. Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation

8. Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes

9. Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

10. Diagnosing and Re-learning for Balanced Multimodal Learning

11. YuLan: An Open-source Large Language Model

12. Stability impacts from the current and pressure profile modifications within finite sized island

13. Learning Manipulation by Predicting Interaction

14. MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance

15. Multimodal Fusion on Low-quality Data: A Comprehensive Survey

16. Highly dispersed Ru nanoparticles anchored on NiAl layered double oxides catalyst for selective hydrodeoxygenation of vanillin

17. Facile synthesis of fine-grained CoFe$_2$O$_4$ anchored on porous carbon for simultaneous removal of tetracycline and arsenite

18. SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model

19. Quantifying and Enhancing Multi-modal Robustness with Modality Preference

20. Two-dimensional 5d multiferroic W$_3$Cl$_8$: breathing Kagome lattice and tunable magneto-optical Kerr effect

23. Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs

24. Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer

25. Enhancing multimodal cooperation via sample-level modality valuation

27. Two-orbital spin-fermion model study of ferromagnetism in honeycomb lattice

28. Progressive Spatio-temporal Perception for Audio-Visual Question Answering

29. Towards Long Form Audio-visual Video Understanding

30. Supervised Knowledge May Hurt Novel Class Discovery Performance

31. Multi-Scale Attention for Audio Question Answering

32. Robust Cross-Modal Knowledge Distillation for Unconstrained Videos

33. Modulation of skyrmionic magnetic textures in two-dimensional vdW materials and their heterostructures

34. MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning

35. Drift surface solver for runaway electron current dominant equilibria during the Current Quench

36. Where to Turn: Road Fork Detection in Sparse 3D Point Cloud

37. Performance Evaluation of Multi-type Energy Storage Power Station Based on AHP and FCE

38. Optimal Allocation Method of Hybrid Energy Storage Capacity to Stabilize Wind Power Fluctuation

43. Balanced Audiovisual Dataset for Imbalance Analysis

44. Revisiting Pre-training in Audio-Visual Learning

45. TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World

46. A Closer Look at Novel Class Discovery from the Labeled Set

47. Learning in Audio-visual Context: A Review, Analysis, and New Perspective

48. Dual Domain-Adversarial Learning for Audio-Visual Saliency Prediction

49. Hot-tail electrons' impact on assimilation and injection penetration of D$_2$ Shattered Pellet Injections

50. Balanced Multimodal Learning via On-the-fly Gradient Modulation
