1,209 results on '"G, Hauptmann"'
Search Results
2. SZTU-CMU at MER2024: Improving Emotion-LLaMA with Conv-Attention for Multimodal Emotion Recognition.
3. The Seven Faces of Stress: Understanding Facial Activity Patterns During Cognitive Stress.
4. Hyperbolic vs Euclidean Embeddings in Few-Shot Learning: Two Sides of the Same Coin.
5. Language Model Beats Diffusion - Tokenizer is key to visual generation.
6. Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative Descriptions.
7. ChartReader: A Unified Framework for Chart Derendering and Comprehension without Heuristic Rules.
8. DocumentNet: Bridging the Data Gap in Document Pre-training.
9. MAGVIT: Masked Generative Video Transformer.
10. STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition.
11. Towards Open-Domain Twitter User Profile Inference.
12. Zero-Shot and Few-Shot Stance Detection on Varied Topics via Conditional Generation.
13. MetaDesigner: Advancing Artistic Typography through AI-Driven, User-Centric, and Multilingual WordArt Synthesis.
14. Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions.
15. MM-TTS: A Unified Framework for Multimodal, Prompt-Induced Emotional Text-to-Speech Synthesis.
16. Rethinking Spatial Invariance of Convolutional Networks for Object Counting.
17. Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions.
18. TRM: Temporal Relocation Module for Video Recognition.
19. Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals.
20. SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs.
21. Deep Discrete Cross-Modal Hashing with Multiple Supervision.
22. Training Vision-Language Transformers from Captions.
23. Document Entity Retrieval with Massive and Noisy Pre-training.
24. Language Model Beats Diffusion - Tokenizer is Key to Visual Generation.
25. Importance of Parasagittal Sensor Information in Tongue Motion Capture Through a Diphonic Analysis.
26. Statistical Distance Metric Learning for Image Set Retrieval.
27. Pose Guided Person Image Generation With Hidden P-Norm Regression.
28. MSNet: A Multilevel Instance Segmentation Network for Natural Disaster Damage Assessment in Aerial Videos.
29. Event-Related Bias Removal for Real-time Disaster Events.
30. Forward and Backward Multimodal NMT for Improved Monolingual and Multilingual Cross-Modal Retrieval.
31. ZSTAD: Zero-Shot Temporal Activity Detection.
32. Zero-VIRUS*: Zero-shot Vehicle Route Understanding System for Intelligent Transportation.
33. The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction.
34. ELECTRICITY: An Efficient Multi-camera Vehicle Tracking System for Intelligent City.
35. Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting.
36. Stacked Pooling for Boosting Scale Invariance of Crowd Counting.
37. Robust Long-Term Object Tracking via Improved Discriminative Model Prediction.
38. SimAug: Learning Robust Representations from Simulation for Trajectory Prediction.
39. The Eighth Visual Object Tracking VOT2020 Challenge Results.
40. Adaptive Feature Aggregation for Video Object Detection.
41. Argus: Efficient Activity Detection System for Extended Video Analysis.
42. MAGVIT: Masked Generative Video Transformer.
43. Support-set bottlenecks for video-text representation learning.
44. Cross-Modal Transfer Hashing Based on Coherent Projection.
45. Learning Spatial Awareness to Improve Crowd Counting.
46. Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations.
47. Shooter Localization Using Videos in the Wild.
48. Multi-shot Person Re-identification through Set Distance with Visual Distributional Representation.
49. Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks.
50. Peeking Into the Future: Predicting Future Person Activities and Locations in Videos.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.