641 results for "Chong Wah Ngo"
Search Results
2. Navigating Weight Prediction with Diet Diary.
3. OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation.
4. Improving Interpretable Embeddings for Ad-hoc Video Search with Generative Captions and Multi-word Concept Bank.
5. Leveraging LLMs and Generative Models for Interactive Known-Item Video Search.
6. PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition.
7. The ACM Web Conference 2024 Report.
8. Learning Temporal Dynamics in Videos With Image Transformer.
9. (Un)likelihood Training for Interpretable Embedding.
10. Adaptive Split-Fusion Transformer.
11. ObjectFusion: Multi-modal 3D Object Detection with Object-Centric Fusion.
12. CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding.
13. Reinforcement Learning Enhanced PicHunter for Interactive Search.
14. WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines.
15. Towards Multimodal Emotional Support Conversation Systems.
16. LLM-based query paraphrasing for video search.
17. RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models.
18. Interpretable Embedding for Ad-hoc Video Search.
19. Cross-lingual Adaptation for Recipe Retrieval with Mixup.
20. MLP-3D: A MLP-like 3D Architecture with Grouped Time Mixing.
21. Group Contextualization for Video Recognition.
22. Interactive Video Corpus Moment Retrieval using Reinforcement Learning.
23. Long-term Leap Attention, Short-term Periodic Shift for Video Classification.
24. Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning.
25. Dynamic Temporal Filtering in Video Models.
26. Reinforcement Learning-Based Interactive Video Search.
27. FoodMask: Real-time food instance counting, segmentation and recognition.
28. Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance.
29. Approximate k-NN Graph Construction: A Generic Online Approach.
30. Mixed Dish Recognition With Contextual Relation and Domain Alignment.
31. On the Merge of k-NN Graph.
32. Deeply Activated Salient Region for Instance Search.
33. Interactive Video Corpus Moment Retrieval using Reinforcement Learning.
34. Cross-domain Food Image-to-Recipe Retrieval by Weighted Adversarial Learning.
35. GroundNLQ @ Ego4D Natural Language Queries Challenge 2023.
36. Incremental Learning on Food Instance Segmentation.
37. FoodLMM: A Versatile Food Assistant using Large Multi-modal Model.
38. Condensing a Sequence to One Informative Frame for Video Recognition.
39. Boosting Video Representation Learning With Multi-Faceted Integration.
40. Token Shift Transformer for Video Classification.
41. CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval.
42. k-sums Clustering: A Stochastic Optimization Approach.
43. Optimization Planning for 3D ConvNets.
44. Terrace-based Food Counting and Segmentation.
45. SQL-Like Interpretable Interactive Video Search.
46. Transferring and Regularizing Prediction for Semantic Segmentation.
47. CookGAN: Causality Based Text-to-Image Synthesis.
48. Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation.
49. Hyperbolic Visual Embedding Learning for Zero-Shot Recognition.
50. Person-level Action Recognition in Complex Events via TSD-TSM Networks.
Discovery Service for Jio Institute Digital Library