Your search keyword '"Baraldi, Lorenzo"' showing total 18 results

Search Constraints

Author: "Baraldi, Lorenzo"
Topic: computer science - multimedia

Search Results

1. Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training

2. Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization

3. Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

4. BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

5. Towards Retrieval-Augmented Architectures for Image Captioning

6. Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs

7. The Revolution of Multimodal Large Language Models: A Survey

8. Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

9. With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning

10. Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

11. Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images

12. Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation

13. ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval

14. Retrieval-Augmented Transformer for Image Captioning

15. CaMEL: Mean Teacher Learning for Image Captioning

16. Universal Captioner: Inducing Content-Style Separation in Vision-and-Language Model Training

17. Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features

18. A Deep Siamese Network for Scene Detection in Broadcast Videos
