Search

Your search keyword '"Lee, Yong-Jae"' showing total 1,857 results

Search Constraints

Start Over You searched for: Author "Lee, Yong-Jae" Remove constraint Author: "Lee, Yong-Jae"
1,857 results on '"Lee, Yong-Jae"'

Search Results

1. VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation

2. LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

3. MATE: Meet At The Embedding -- Connecting Images with Long Texts

4. Yo'LLaVA: Your Personalized Language and Vision Assistant

5. Matryoshka Multimodal Models

6. LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

7. LLM Inference Unveiled: Survey and Roofline Model Insights

8. Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving

9. CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

10. Interfacing Foundation Models' Embeddings

11. Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

12. ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

13. Testing learning-enabled cyber-physical systems with Large-Language Models: A Formal Approach

14. Improved Baselines with Visual Instruction Tuning

17. A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance

18. Investigating the Catastrophic Forgetting in Multimodal Large Language Models

19. Visual Instruction Inversion: Image Editing via Visual Prompting

20. Benchmarking and Analyzing Generative Data for Visual Recognition

21. Generate Anything Anywhere in Any Scene

22. Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding

23. Visual Instruction Tuning

24. Segment Everything Everywhere All at Once

26. InPL: Pseudo-labeling the Inliers First for Imbalanced Semi-supervised Learning

27. Towards Universal Fake Image Detectors that Generalize Across Generative Models

28. Learning Customized Visual Models with Retrieval-Augmented Knowledge

29. GLIGEN: Open-Set Grounded Text-to-Image Generation

30. Generalized Decoding for Pixel, Image, and Language

31. Expeditious Saliency-guided Mix-up through Random Gradient Thresholding

32. Contrastive Learning for Diverse Disentangled Foreground Generation

36. EnergyMatch: Energy-based Pseudo-Labeling for Semi-Supervised Learning

37. What Knowledge Gets Distilled in Knowledge Distillation?

38. ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models

39. The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization

40. End-to-End Instance Edge Detection

41. GIRAFFE HD: A High-Resolution 3D-aware Generative Model

42. Masked Discrimination for Self-Supervised Learning on Point Clouds

45. Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features

46. Collaging Class-specific GANs for Semantic Image Synthesis

48. Equine Pain Behavior Classification via Self-Supervised Disentangled Pose Representation

Catalog

Books, media, physical & digital resources