35 results on '"Baraldi, Lorenzo"'
Search Results
2. FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval
3. With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning
4. Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation
5. Embodied Agents for Efficient Exploration and Smart Scene Description
6. CaMEL: Mean Teacher Learning for Image Captioning
7. Spot the Difference: A Novel Task for Embodied Agents in Changing Environments
8. The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition
9. Dual-Branch Collaborative Transformer for Virtual Try-On
10. The Unreasonable Effectiveness of CLIP Features for Image Captioning: An Experimental Analysis
11. Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis
12. Estimating (and fixing) the Effect of Face Obfuscation in Video Recognition
13. Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions
14. Explore and Explain: Self-supervised Navigation and Recounting
15. A Novel Attention-based Aggregation Function to Combine Vision and Language
16. RMS-Net: Regression and Masking for Soccer Event Spotting
17. Meshed-Memory Transformer for Image Captioning
18. SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability
19. Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-To-Image Translation
20. Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions
21. A Hierarchical Quasi-Recurrent approach to Video Captioning
22. Aligning Text and Document Illustrations: Towards Visually Explainable Digital Humanities
23. Connected Components Labeling on DRAGs
24. Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling.
25. LAMV: Learning to Align and Match Videos with Kernelized Temporal Layers
26. SAM: Pushing the Limits of Saliency Prediction Models
27. Visual saliency for image captioning in new multimedia services
28. Hierarchical Boundary-Aware Neural Encoder for Video Captioning
29. A deep multi-level network for saliency prediction
30. Historical document digitization through layout analysis and deep content classification
31. YACCLAB - Yet Another Connected Components Labeling Benchmark
32. Scene segmentation using temporal clustering for accessing and re-using broadcast video
33. Recognizing and Presenting the Storytelling Video Structure With Deep Multimodal Networks.
34. Gesture Recognition in Ego-centric Videos Using Dense Trajectories and Hand Segmentation
35. Gesture recognition in ego-centric videos using dense trajectories and hand segmentation
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.