Search

Your search keyword '"Baraldi, P."' showing total 2,007 results

Search Constraints

Start Over You searched for: Author "Baraldi, P." Remove constraint Author: "Baraldi, P."
2,007 results on '"Baraldi, P."'

Search Results

1. Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries

2. Causal Graphical Models for Vision-Language Compositional Understanding

3. Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis

4. Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation

5. Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

6. Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments

7. Domain decomposition for integer optimal control with total variation regularization

8. Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training

9. Design, fabrication, and testing of diamond axicons for X-ray microscopy applications

10. Optimizing Resource Consumption in Diffusion Models through Hallucination Early Detection

11. PSZ2 G282.28+49.94, a recently discovered analogue of the famous Bullet Cluster

12. Fluent and Accurate Image Captioning with a Self-Trained Reward Model

13. Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization

14. UNMuTe: Unifying Navigation and Multimodal Dialogue-like Text Generation

15. Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities

16. BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues

17. Towards Retrieval-Augmented Architectures for Image Captioning

22. Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs

23. AIGeN: An Adversarial Approach for Instruction Generation in VLN

24. Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation

25. Mapping High-level Semantic Regions in Indoor Environments without Object Recognition

26. The Revolution of Multimodal Large Language Models: A Survey

29. Investigating the influence of varying cobalt doping on the cross-sectional widths and surface composition of MnOx nanowires in the context of battery–supercapacitor systems

31. Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models

33. The genomic evolutionary dynamics and global circulation patterns of respiratory syncytial virus.

39. Comparison of “IN-REC-SUR-E” and LISA in preterm neonates with respiratory distress syndrome: a randomized controlled trial (IN-REC-LISA trial)

41. Efficacy and feasibility of a novel semi-facial respirator with chitosan nanoparticles on the incidence of SARS-CoV-2 infection in healthcare professionals: randomized controlled trial

42. With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning

46. Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation

47. Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

48. Interfacial two-dimensional oxide enhances photocatalytic activity of graphene/titania via electronic structure modification

Catalog

Books, media, physical & digital resources