112 results on '"Jordi Pont-Tuset"'
Search Results
2. Rich Human Feedback for Text-to-Image Generation.
3. Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation.
4. Connecting Vision and Language with Video Localized Narratives.
5. Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting.
6. Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning.
7. Imagen 3.
8. Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models.
9. Evaluating Numerical Reasoning in Text-to-Image Models.
10. Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings.
11. PiGLET: Pixel-Level Grounding of Language Expressions With Transformers.
12. Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset.
13. Adversarially Robust Panoptic Segmentation (ARPaS) Benchmark.
14. EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023.
15. Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation.
16. Rich Human Feedback for Text-to-Image Generation.
17. Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval.
18. Panoptic Narrative Grounding.
19. Connecting Vision and Language with Localized Narratives.
20. Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022.
21. Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting.
22. Two-Level Temporal Relation Model for Online Video Instance Segmentation.
23. The Liver Tumor Segmentation Benchmark (LiTS).
24. The Open Images Dataset V4.
25. Blazingly Fast Video Object Segmentation With Pixel-Wise Metric Learning.
26. Deep Extreme Cut: From Extreme Points to Object Segmentation.
27. Iterative Deep Retinal Topology Extraction.
28. PanGEA: The Panoramic Graph Environment Annotation Toolkit.
29. Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval.
30. Video Object Segmentation without Temporal Information.
31. One-Shot Video Object Segmentation.
32. Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks.
33. A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation.
34. Scale-Aware Alignment of Hierarchical Image Segmentation.
35. Deep Retinal Image Understanding.
36. Convolutional Oriented Boundaries.
37. Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation.
38. Boosting Object Proposals: From Pascal to COCO.
39. Semi-automatic video object segmentation by advanced manipulation of segmentation hierarchies.
40. Video content and structure description based on keyframes, clusters and storyboards.
41. The 2019 DAVIS Challenge on VOS: Unsupervised Multi-Object Segmentation.
42. Connecting Vision and Language with Localized Narratives.
43. Natural Vocabulary Emerges from Free-Form Annotations.
44. Supervised Evaluation of Image Segmentation and Object Proposal Techniques.
45. Multiscale Combinatorial Grouping.
46. Measures and Meta-Measures for the Supervised Evaluation of Image Segmentation.
47. Upper-bound assessment of the spatial accuracy of hierarchical region-based image representations.
48. Supervised Assessment of Segmentation Hierarchies.
49. Adversarially Robust Panoptic Segmentation (ARPaS) Benchmark
50. The 2018 DAVIS Challenge on Video Object Segmentation.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.