Search

Your search for "Chen, Jingdong" returned 2,088 results

Search Constraints

You searched for: Author "Chen, Jingdong"

Search Results

1. Try-On-Adapter: A Simple and Flexible Try-On Paradigm

2. HomoMatcher: Dense Feature Matching Results with Semi-Dense Efficiency by Homography Estimation

3. LumiSculpt: A Consistency Lighting Control Network for Video Generation

4. Animate-X: Universal Character Image Animation with Enhanced Motion Representation

5. StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models

6. POA: Pre-training Once for Models of All Sizes

7. Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

8. ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting

9. SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding

10. Low algorithmic delay implementation of convolutional beamformer for online joint source separation and dereverberation

11. Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation

12. Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

13. M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining

14. Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation

15. SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery

16. A computationally efficient semi-blind source separation based approach for nonlinear echo cancellation based on an element-wise iterative source steering

17. Large Multimodal Model Compression via Efficient Pruning and Distillation at AntGroup

18. POA: Pre-training Once for Models of All Sizes

20. LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints

21. The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

22. Mapping EEG Signals to Visual Stimuli: A Deep Learning Approach to Match vs. Mismatch Classification

23. An Anchor-Point Based Image-Model for Room Impulse Response Simulation with Directional Source Radiation and Sensor Directivity Patterns

24. Distortionless Beamforming

25. Adaptive Noise Cancellation

26. Binaural Beamforming

27. Low-Rank Beamforming

30. Large Array Beamforming

31. Introduction

34. The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition

35. Dynamic control of the directional scattering of single Mie particle by laser induced metal insulator transitions

38. Robust Manifold Nonnegative Tucker Factorization for Tensor Data Representation

40. Microphone Arrays

41. SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization

42. Hierarchical Memory Learning for Fine-Grained Scene Graph Generation

43. Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching

44. CBNet: A Composite Backbone Network Architecture for Object Detection

45. MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

48. CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes

49. AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario
