Search

Your search for author "Lu, Tong" returned 3,987 results

Search Results

1. CorrAdaptor: Adaptive Local Context Learning for Correspondence Pruning

2. EAR: Edge-Aware Reconstruction of 3-D vertebrae structures from bi-planar X-ray images

3. MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity

4. EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation

5. OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

6. VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

7. How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

8. Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding

9. Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

10. PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal

11. MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

12. Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications

13. CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers

16. InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

17. Is Ego Status All You Need for Open-Loop End-to-End Autonomous Driving?

18. Evaluating the effects of high-throughput structural neuroimaging predictors on whole-brain functional connectome outcomes via network-based vector-on-matrix regression

19. Multiple Imputation Method for High-Dimensional Neuroimaging Data

26. Deep Video Restoration for Under-Display Camera

27. Memory-and-Anticipation Transformer for Online Action Understanding

28. FB-BEV: BEV Representation from Forward-Backward View Transformations

29. The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

30. AVSegFormer: Audio-Visual Segmentation with Transformer

32. GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions

33. VideoLLM: Modeling Video Sequence with Large Language Models

34. Graph Propagation Transformer for Graph Representation Learning

35. VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks

36. Network method for voxel-pair-level brain connectivity analysis under spatial-contiguity constraints

37. MRSN: Multi-Relation Support Network for Video Action Detection

38. Probabilistic Inference Based Incremental Graph Index for Similarity Search on Social Networks

49. DDP: Diffusion Model for Dense Visual Prediction

50. Champion Solution for the WSDM2023 Toloka VQA Challenge
