Search

Showing 120,657 results in total

Search Constraints

Topic: computer science - computer vision and pattern recognition
Database: arXiv

Search Results

1. Exploring the Distinctiveness and Fidelity of the Descriptions Generated by Large Vision-Language Models

2. Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields

3. Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

4. HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

5. Learning text-to-video retrieval from image captioning

6. Inhomogeneous illuminated image enhancement under extremely low visibility condition

7. Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model

8. Low Cost Machine Vision for Insect Classification

9. TextGaze: Gaze-Controllable Face Generation with Natural Language

10. PromptCIR: Blind Compressed Image Restoration with Prompt Learning

11. Cost-Sensitive Uncertainty-Based Failure Recognition for Object Detection

12. One-Shot Image Restoration

13. Multi-view Image Prompted Multi-view Diffusion for Improved 3D Generation

14. Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement

15. Frequency-Guided Multi-Level Human Action Anomaly Detection with Normalizing Flows

16. Estimating the Robustness Radius for Randomized Smoothing with 100$\times$ Sample Efficiency

17. MV-VTON: Multi-View Virtual Try-On with Diffusion Models

18. UniRGB-IR: A Unified Framework for Visible-Infrared Downstream Tasks via Adapter Tuning

19. Simultaneous Tri-Modal Medical Image Fusion and Super-Resolution using Conditional Diffusion Model

20. On the Road to Clarity: Exploring Explainable AI for World Models in a Driver Assistance System

21. Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning

22. A Novel Spike Transformer Network for Depth Estimation from Event Cameras via Cross-modality Knowledge Distillation

23. Dense Road Surface Grip Map Prediction from Multimodal Image Data

24. Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning

25. Part-Guided 3D RL for Sim2Real Articulated Object Manipulation

26. 3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

27. Adversarial Reweighting with $\alpha$-Power Maximization for Domain Adaptation

28. SDFD: Building a Versatile Synthetic Face Image Dataset with Diverse Attributes

29. Weakly Supervised Training for Hologram Verification in Identity Documents

30. Trinity Detector: text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection

31. Camera Motion Estimation from RGB-D-Inertial Scene Flow

32. Comparison of self-supervised in-domain and supervised out-domain transfer learning for bird species recognition

33. Parameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting

34. Binarizing Documents by Leveraging both Space and Frequency

35. Optimizing Universal Lesion Segmentation: State Space Model-Guided Hierarchical Networks with Feature Importance Adjustment

36. ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion

37. SAGHOG: Self-Supervised Autoencoder for Generating HOG Features for Writer Retrieval

38. SLAM for Indoor Mapping of Wide Area Construction Environments

39. Scrutinizing Data from Sky: An Examination of Its Veracity in Area Based Traffic Contexts

40. Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer

41. Self-supervised visual learning in the low-data regime: a comparative evaluation

42. Few-shot Calligraphy Style Learning

43. MCSDNet: Mesoscale Convective System Detection Network via Multi-scale Spatiotemporal Information

44. Low-Rank Knowledge Decomposition for Medical Foundation Models

45. MovieChat+: Question-aware Sparse Memory for Long Video Question Answering

46. Exploring Beyond Logits: Hierarchical Dynamic Labeling Based on Embeddings for Semi-Supervised Classification

47. S-IQA Image Quality Assessment With Compressive Sampling

48. Phase-aggregated Dual-branch Network for Efficient Fingerprint Dense Registration

49. CSCO: Connectivity Search of Convolutional Operators

50. MorphText: Deep Morphology Regularized Arbitrary-shape Scene Text Detection