Search

Showing total 160,610 results

Search Constraints

Start Over You searched for: Topic computer science - computer vision and pattern recognition Remove constraint Topic: computer science - computer vision and pattern recognition Publication Year Range Last 10 years Remove constraint Publication Year Range: Last 10 years
160,610 results

Search Results

1. Position paper: Do not explain (vision models) without context

2. NLLG Quarterly arXiv Report 09/23: What are the most influential current AI Papers?

3. Training CLIP models on Data from Scientific Papers

4. Tender Notice Extraction from E-papers Using Neural Network

5. Discussion Paper: The Threat of Real Time Deepfakes

6. Automatic Detection and Rectification of Paper Receipts on Smartphones

7. Hidden Knowledge: Mathematical Methods for the Extraction of the Fingerprint of Medieval Paper from Digital Images

8. Representation Learning for Tablet and Paper Domain Adaptation in Favor of Online Handwriting Recognition

9. Image Denoising: The Deep Learning Revolution and Beyond -- A Survey Paper --

10. From Malware Samples to Fractal Images: A New Paradigm for Classification. (Version 2.0, Previous version paper name: Have you ever seen malware?)

11. Auto Lead Extraction and Digitization of ECG Paper Records using cGAN

12. On the use of learning-based forecasting methods for ameliorating fashion business processes: A position paper

13. Attention is All They Need: Exploring the Media Archaeology of the Computer Vision Research Paper

14. FPSRS: A Fusion Approach for Paper Submission Recommendation System

15. SimCPSR: Simple Contrastive Learning for Paper Submission Recommendation System

16. Community-Driven Comprehensive Scientific Paper Summarization: Insight from cvpaper.challenge

17. Best Practices and Scoring System on Reviewing A.I. based Medical Imaging Papers: Part 1 Classification

18. A new baseline for retinal vessel segmentation: Numerical identification and correction of methodological inconsistencies affecting 100+ papers

19. Starkit: RoboCup Humanoid KidSize 2021 Worldwide Champion Team Paper

20. White Paper Assistance: A Step Forward Beyond the Shortcut Learning

21. A White Paper on Neural Network Quantization

22. A remark on a paper of Krotov and Hopfield [arXiv:2008.06996]

23. Quantum-Classical Hybrid Machine Learning for Image Classification (ICCAD Special Session Paper)

24. The MICCAI Hackathon on reproducibility, diversity, and selection of papers at the MICCAI conference

25. White Paper: Challenges and Considerations for the Creation of a Large Labelled Repository of Online Videos with Questionable Content

26. Automatic Test Suite Generation for Key-Points Detection DNNs using Many-Objective Search (Experience Paper)

27. Recognizing Families In the Wild: White Paper for the 4th Edition Data Challenge

28. Medical Imaging with Deep Learning: MIDL 2020 -- Short Paper Track

29. Modelling curvature of a bent paper leaf

30. Image Matching across Wide Baselines: From Paper to Practice

31. Additional Baseline Metrics for the paper 'Extended YouTube Faces: a Dataset for Heterogeneous Open-Set Face Identification'

32. Deep Paper Gestalt

33. Automatic Paper Summary Generation from Visual and Textual Information

34. Towards Multimodal Sarcasm Detection (An _Obviously_ Perfect Paper)

35. Sketch2code: Generating a website from a paper mockup

36. Yet Another ADNI Machine Learning Paper? Paving The Way Towards Fully-reproducible Research on Classification of Alzheimer's Disease

37. cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey

39. Visual Recognition of Paper Analytical Device Images for Detection of Falsified Pharmaceuticals

40. Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models

41. Fusion of Heterogeneous Data in Convolutional Networks for Urban Semantic Labeling (Invited Paper)

42. Position paper: Towards an observer-oriented theory of shape comparison

43. A Review Paper: Noise Models in Digital Image Processing

44. Spiking-Fer: Spiking Neural Network for Facial Expression Recognition With Event Cameras: Facial Expression Recognition (FER) is an active research domain that has shown great progress recently, notably thanks to the use of large deep learning models. However, such approaches are particularly energy intensive, which makes their deployment difficult for edge devices. To address this issue, Spiking Neural Networks (SNNs) coupled with event cameras are a promising alternative, capable of processing sparse and asynchronous events with lower energy consumption. In this paper, we establish the first use of event cameras for FER, named 'Event-based FER', and propose the first related benchmarks by converting popular video FER datasets to event streams. To deal with this new task, we propose 'Spiking-FER', a deep convolutional SNN model, and compare it against a similar Artificial Neural Network (ANN). Experiments show that the proposed approach achieves comparable performance to the ANN architecture, while consuming less energy by orders of magnitude (up to 65.39x). In addition, an experimental study of various event-based data augmentation techniques is performed to provide insights into the efficient transformations specific to event-based FER

45. FPSRS: A Fusion Approach for Paper Submission Recommendation System

46. Image Matching Across Wide Baselines: From Paper to Practice

47. 3D Convolution Neural Network based Person Identification using Gait cycles

48. Automatic test suite generation for key-points detection DNNs using many-objective search (experience paper)

49. Quantum-Classical Hybrid Machine Learning for Image Classification (ICCAD Special Session Paper)

50. A new baseline for retinal vessel segmentation: Numerical identification and correction of methodological inconsistencies affecting 100+ papers

51. Conference paper

52. Conference paper

53. Conference paper

54. Towards Multimodal Sarcasm Detection (An _Obviously_ Perfect Paper)

55. Automatic Paper Summary Generation from Visual and Textual Information

56. Yet Another ADNI Machine Learning Paper? Paving The Way Towards Fully-reproducible Research on Classification of Alzheimer's Disease

57. Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models

58. VDGD: Mitigating LVLM Hallucinations in Cognitive Prompts by Bridging the Visual Perception Gap

59. Erase to Enhance: Data-Efficient Machine Unlearning in MRI Reconstruction

60. V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM

61. Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization

62. Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras

63. Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation

64. Text Prompting for Multi-Concept Video Customization by Autoregressive Generation

65. A Survey of Artificial Intelligence in Gait-Based Neurodegenerative Disease Diagnosis

66. PoseGravity: Pose Estimation from Points and Lines with Axis Prior

67. Enhancing Active Learning for Sentinel 2 Imagery through Contrastive Learning and Uncertainty Estimation

68. Rethinking Overlooked Aspects in Vision-Language Models

69. Improving the Explain-Any-Concept by Introducing Nonlinearity to the Trainable Surrogate Model

70. Perturbing the Gradient for Alleviating Meta Overfitting

71. Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries

72. Searching Realistic-Looking Adversarial Objects For Autonomous Driving Systems

73. Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification

74. Hierarchical Selective Classification

75. Generative Artificial Intelligence: A Systematic Review and Applications

76. Air Signing and Privacy-Preserving Signature Verification for Digital Documents

77. Two-Phase Dynamics of Interactions Explains the Starting Point of a DNN Learning Over-Fitted Features

78. When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models

79. Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection

80. Rethinking Barely-Supervised Segmentation from an Unsupervised Domain Adaptation Perspective

81. Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Survey

82. ReconBoost: Boosting Can Achieve Modality Reconcilement

83. Shape-aware synthesis of pathological lung CT scans using CycleGAN for enhanced semi-supervised lung segmentation

84. StraightPCF: Straight Point Cloud Filtering

85. Exploring the Low-Pass Filtering Behavior in Image Super-Resolution

86. Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging

87. Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches

88. Who's in and who's out? A case study of multimodal CLIP-filtering in DataComp

89. Understanding and Evaluating Human Preferences for AI Generated Images with Instruction Tuning

90. Replication Study and Benchmarking of Real-Time Object Detection Models

91. Rectified Gaussian kernel multi-view k-means clustering

92. A Survey on Backbones for Deep Video Action Recognition

93. Towards Robust Physical-world Backdoor Attacks on Lane Detection

94. DisBeaNet: A Deep Neural Network to augment Unmanned Surface Vessels for maritime situational awareness

95. A Comprehensive Survey of Masked Faces: Recognition, Detection, and Unmasking

96. General Place Recognition Survey: Towards Real-World Autonomy

97. Continuous max-flow augmentation of self-supervised few-shot learning on SPECT left ventricles

98. Interpretability Needs a New Paradigm

99. Relevant Irrelevance: Generating Alterfactual Explanations for Image Classifiers

100. Picking watermarks from noise (PWFN): an improved robust watermarking model against intensive distortions