Author: "Park, Jungin" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Park, Jungin"' showing total 31 results

Start Over Author "Park, Jungin"

31 results on '"Park, Jungin"'

1. Bridging Vision and Language Spaces with Assignment Prediction

Author: Park, Jungin, Lee, Jiyoung, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: This paper introduces VLAP, a novel approach that bridges pretrained vision models and large language models (LLMs) to make frozen LLMs understand the visual world. VLAP transforms the embedding space of pretrained vision models into the LLMs' word embedding space using a single linear layer for efficient and general-purpose visual and language understanding. Specifically, we harness well-established word embeddings to bridge two modality embedding spaces. The visual and text representations are simultaneously assigned to a set of word embeddings within pretrained LLMs by formulating the assigning procedure as an optimal transport problem. We predict the assignment of one modality from the representation of another modality data, enforcing consistent assignments for paired multimodal data. This allows vision and language representations to contain the same information, grounding the frozen LLMs' word embedding space in visual data. Moreover, a robust semantic taxonomy of LLMs can be preserved with visual data since the LLMs interpret and reason linguistic information from correlations between word embeddings. Experimental results show that VLAP achieves substantial improvements over the previous linear transformation-based approaches across a range of vision-language tasks, including image captioning, visual question answering, and cross-modal retrieval. We also demonstrate the learned visual representations hold a semantic taxonomy of LLMs, making visual semantic arithmetic possible., Comment: ICLR 2024 Camera-ready
Published: 2024

2. Knowing Where to Focus: Event-aware Transformer for Video Grounding

Author: Jang, Jinhyun, Park, Jungin, Kim, Jin, Kwon, Hyeongjun, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Recent DETR-based video grounding models have made the model directly predict moment timestamps without any hand-crafted components, such as a pre-defined proposal or non-maximum suppression, by learning moment queries. However, their input-agnostic moment queries inevitably overlook an intrinsic temporal structure of a video, providing limited positional information. In this paper, we formulate an event-aware dynamic moment query to enable the model to take the input-specific content and positional information of the video into account. To this end, we present two levels of reasoning: 1) Event reasoning that captures distinctive event units constituting a given video using a slot attention mechanism; and 2) moment reasoning that fuses the moment queries with a given sentence through a gated fusion transformer layer and learns interactions between the moment queries and video-sentence representations to predict moment timestamps. Extensive experiments demonstrate the effectiveness and efficiency of the event-aware dynamic moment queries, outperforming state-of-the-art approaches on several video grounding benchmarks., Comment: ICCV 2023. Code is available at https://github.com/jinhyunj/EaTR
Published: 2023

3. PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-identification

Author: Kim, Minsu, Kim, Seungryong, Park, JungIn, Park, Seongheon, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Modern data augmentation using a mixture-based technique can regularize the models from overfitting to the training data in various computer vision applications, but a proper data augmentation technique tailored for the part-based Visible-Infrared person Re-IDentification (VI-ReID) models remains unexplored. In this paper, we present a novel data augmentation technique, dubbed PartMix, that synthesizes the augmented samples by mixing the part descriptors across the modalities to improve the performance of part-based VI-ReID models. Especially, we synthesize the positive and negative samples within the same and across different identities and regularize the backbone model through contrastive learning. In addition, we also present an entropy-based mining strategy to weaken the adverse impact of unreliable positive and negative samples. When incorporated into existing part-based VI-ReID model, PartMix consistently boosts the performance. We conduct experiments to demonstrate the effectiveness of our PartMix over the existing VI-ReID methods and provide ablation studies., Comment: CVPR 2023
Published: 2023

4. Dual-path Adaptation from Image to Video Transformers

Author: Park, Jungin, Lee, Jiyoung, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: In this paper, we efficiently transfer the surpassing representation power of the vision foundation models, such as ViT and Swin, for video understanding with only a few trainable parameters. Previous adaptation methods have simultaneously considered spatial and temporal modeling with a unified learnable module but still suffered from fully leveraging the representative capabilities of image transformers. We argue that the popular dual-path (two-stream) architecture in video models can mitigate this problem. We propose a novel DualPath adaptation separated into spatial and temporal adaptation paths, where a lightweight bottleneck adapter is employed in each transformer block. Especially for temporal dynamic modeling, we incorporate consecutive frames into a grid-like frameset to precisely imitate vision transformers' capability that extrapolates relationships between tokens. In addition, we extensively investigate the multiple baselines from a unified perspective in video understanding and compare them with DualPath. Experimental results on four action recognition benchmarks prove that pretrained image transformers with DualPath can be effectively generalized beyond the data domain., Comment: CVPR 2023. Code is available at https://github.com/park-jungin/DualPath
Published: 2023

5. SimOn: A Simple Framework for Online Temporal Action Localization

Author: Tang, Tuan N., Park, Jungin, Kim, Kwonyoung, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Online Temporal Action Localization (On-TAL) aims to immediately provide action instances from untrimmed streaming videos. The model is not allowed to utilize future frames and any processing techniques to modify past predictions, making On-TAL much more challenging. In this paper, we propose a simple yet effective framework, termed SimOn, that learns to predict action instances using the popular Transformer architecture in an end-to-end manner. Specifically, the model takes the current frame feature as a query and a set of past context information as keys and values of the Transformer. Different from the prior work that uses a set of outputs of the model as past contexts, we leverage the past visual context and the learnable context embedding for the current query. Experimental results on the THUMOS14 and ActivityNet1.3 datasets show that our model remarkably outperforms the previous methods, achieving a new state-of-the-art On-TAL performance. In addition, the evaluation for Online Detection of Action Start (ODAS) demonstrates the effectiveness and robustness of our method in the online setting. The code is available at https://github.com/TuanTNG/SimOn
Published: 2022

6. Language-free Training for Zero-shot Video Grounding

Author: Kim, Dahye, Park, Jungin, Lee, Jiyoung, Park, Seongheon, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Given an untrimmed video and a language query depicting a specific temporal moment in the video, video grounding aims to localize the time interval by understanding the text and video simultaneously. One of the most challenging issues is an extremely time- and cost-consuming annotation collection, including video captions in a natural language form and their corresponding temporal regions. In this paper, we present a simple yet novel training framework for video grounding in the zero-shot setting, which learns a network with only video data without any annotation. Inspired by the recent language-free paradigm, i.e. training without language data, we train the network without compelling the generation of fake (pseudo) text queries into a natural language form. Specifically, we propose a method for learning a video grounding model by selecting a temporal interval as a hypothetical correct answer and considering the visual feature selected by our method in the interval as a language feature, with the help of the well-aligned visual-language space of CLIP. Extensive experiments demonstrate the prominence of our language-free training framework, outperforming the existing zero-shot video grounding method and even several weakly-supervised approaches with large margins on two standard datasets., Comment: Accepted to WACV 2023
Published: 2022

7. PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation

Author: Kim, Kwonyoung, Park, Jungin, Lee, Jiyoung, Min, Dongbo, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Online stereo adaptation tackles the domain shift problem, caused by different environments between synthetic (training) and real (test) datasets, to promptly adapt stereo models in dynamic real-world applications such as autonomous driving. However, previous methods often fail to counteract particular regions related to dynamic objects with more severe environmental changes. To mitigate this issue, we propose to incorporate an auxiliary point-selective network into a meta-learning framework, called PointFix, to provide a robust initialization of stereo models for online stereo adaptation. In a nutshell, our auxiliary network learns to fix local variants intensively by effectively back-propagating local information through the meta-gradient for the robust initialization of the baseline model. This network is model-agnostic, so can be used in any kind of architectures in a plug-and-play manner. We conduct extensive experiments to verify the effectiveness of our method under three adaptation settings such as short-, mid-, and long-term sequences. Experimental results show that the proper initialization of the base stereo model by the auxiliary network enables our learning paradigm to achieve state-of-the-art performance at inference., Comment: Accepted to ECCV 2022
Published: 2022

8. Probabilistic Representations for Video Contrastive Learning

Author: Park, Jungin, Lee, Jiyoung, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper presents Probabilistic Video Contrastive Learning, a self-supervised representation learning method that bridges contrastive learning with probabilistic representation. We hypothesize that the clips composing the video have different distributions in short-term duration, but can represent the complicated and sophisticated video distribution through combination in a common embedding space. Thus, the proposed method represents video clips as normal distributions and combines them into a Mixture of Gaussians to model the whole video distribution. By sampling embeddings from the whole video distribution, we can circumvent the careful sampling strategy or transformations to generate augmented views of the clips, unlike previous deterministic methods that have mainly focused on such sample generation strategies for contrastive learning. We further propose a stochastic contrastive loss to learn proper video distributions and handle the inherent uncertainty from the nature of the raw video. Experimental results verify that our probabilistic embedding stands as a state-of-the-art video representation learning for action recognition and video retrieval on the most popular benchmarks, including UCF101 and HMDB51., Comment: CVPR 2022
Published: 2022

9. Pin the Memory: Learning to Generalize Semantic Segmentation

Author: Kim, Jin, Lee, Jiyoung, Park, Jungin, Min, Dongbo, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The rise of deep neural networks has led to several breakthroughs for semantic segmentation. In spite of this, a model trained on source domain often fails to work properly in new challenging domains, that is directly concerned with the generalization capability of the model. In this paper, we present a novel memory-guided domain generalization method for semantic segmentation based on meta-learning framework. Especially, our method abstracts the conceptual knowledge of semantic classes into categorical memory which is constant beyond the domains. Upon the meta-learning concept, we repeatedly train memory-guided networks and simulate virtual test to 1) learn how to memorize a domain-agnostic and distinct information of classes and 2) offer an externally settled memory as a class-guidance to reduce the ambiguity of representation in the test data of arbitrary unseen domain. To this end, we also propose memory divergence and feature cohesion losses, which encourage to learn memory reading and update processes for category-aware domain generalization. Extensive experiments for semantic segmentation demonstrate the superior generalization capability of our method over state-of-the-art works on various benchmarks., Comment: Accepted to CVPR 2022
Published: 2022

10. Self-balanced Learning For Domain Generalization

Author: Kim, Jin, Lee, Jiyoung, Park, Jungin, Min, Dongbo, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Domain generalization aims to learn a prediction model on multi-domain source data such that the model can generalize to a target domain with unknown statistics. Most existing approaches have been developed under the assumption that the source data is well-balanced in terms of both domain and class. However, real-world training data collected with different composition biases often exhibits severe distribution gaps for domain and class, leading to substantial performance degradation. In this paper, we propose a self-balanced domain generalization framework that adaptively learns the weights of losses to alleviate the bias caused by different distributions of the multi-domain source data. The self-balanced scheme is based on an auxiliary reweighting network that iteratively updates the weight of loss conditioned on the domain and class information by leveraging balanced meta data. Experimental results demonstrate the effectiveness of our method overwhelming state-of-the-art works for domain generalization., Comment: Accepted at International Conference on Image Processing (ICIP) 2021
Published: 2021
Full Text: View/download PDF

11. Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering

Author: Park, Jungin, Lee, Jiyoung, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Multimedia
Abstract: This paper presents a novel method, termed Bridge to Answer, to infer correct answers for questions about a given video by leveraging adequate graph interactions of heterogeneous crossmodal graphs. To realize this, we learn question conditioned visual graphs by exploiting the relation between video and question to enable each visual node using question-to-visual interactions to encompass both visual and linguistic cues. In addition, we propose bridged visual-to-visual interactions to incorporate two complementary visual information on appearance and motion by placing the question graph as an intermediate bridge. This bridged architecture allows reliable message passing through compositional semantics of the question to generate an appropriate answer. As a result, our method can learn the question conditioned visual representations attributed to appearance and motion that show powerful capability for video question answering. Extensive experiments prove that the proposed method provides effective and superior performance than state-of-the-art methods on several benchmarks., Comment: CVPR 2021
Published: 2021

12. Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

Author: Kim, Minsu, Joung, Sunghun, Kim, Seungryong, Park, JungIn, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Existing techniques to adapt semantic segmentation networks across the source and target domains within deep convolutional neural networks (CNNs) deal with all the samples from the two domains in a global or category-aware manner. They do not consider an inter-class variation within the target domain itself or estimated category, providing the limitation to encode the domains having a multi-modal data distribution. To overcome this limitation, we introduce a learnable clustering module, and a novel domain adaptation framework called cross-domain grouping and alignment. To cluster the samples across domains with an aim to maximize the domain alignment without forgetting precise segmentation ability on the source domain, we present two loss functions, in particular, for encouraging semantic consistency and orthogonality among the clusters. We also present a loss so as to solve a class imbalance problem, which is the other limitation of the previous methods. Our experiments show that our method consistently boosts the adaptation performance in semantic segmentation, outperforming the state-of-the-arts on various domain adaptation settings., Comment: AAAI 2021
Published: 2020

13. SumGraph: Video Summarization via Recursive Graph Modeling

Author: Park, Jungin, Lee, Jiyoung, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The goal of video summarization is to select keyframes that are visually diverse and can represent a whole story of an input video. State-of-the-art approaches for video summarization have mostly regarded the task as a frame-wise keyframe selection problem by aggregating all frames with equal weight. However, to find informative parts of the video, it is necessary to consider how all the frames of the video are related to each other. To this end, we cast video summarization as a graph modeling problem. We propose recursive graph modeling networks for video summarization, termed SumGraph, to represent a relation graph, where frames are regarded as nodes and nodes are connected by semantic relationships among frames. Our networks accomplish this through a recursive approach to refine an initially estimated graph to correctly classify each node as a keyframe by reasoning the graph representation via graph convolutional networks. To leverage SumGraph in a more practical environment, we also present a way to adapt our graph modeling in an unsupervised fashion. With SumGraph, we achieved state-of-the-art performance on several benchmarks for video summarization in both supervised and unsupervised manners., Comment: ECCV 2020
Published: 2020

14. Context-Aware Emotion Recognition Networks

Author: Lee, Jiyoung, Kim, Seungryong, Kim, Sunok, Park, Jungin, and Sohn, Kwanghoon
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Human-Computer Interaction, Computer Science - Multimedia
Abstract: Traditional techniques for emotion recognition have focused on the facial expression analysis only, thus providing limited ability to encode context that comprehensively represents the emotional responses. We present deep networks for context-aware emotion recognition, called CAER-Net, that exploit not only human facial expression but also context information in a joint and boosting manner. The key idea is to hide human faces in a visual scene and seek other contexts based on an attention mechanism. Our networks consist of two sub-networks, including two-stream encoding networks to seperately extract the features of face and context regions, and adaptive fusion networks to fuse such features in an adaptive fashion. We also introduce a novel benchmark for context-aware emotion recognition, called CAER, that is more appropriate than existing benchmarks both qualitatively and quantitatively. On several benchmarks, CAER-Net proves the effect of context for emotion recognition. Our dataset is available at http://caer-dataset.github.io., Comment: International Conference on Computer Vision (ICCV) 2019
Published: 2019

15. PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation

Author: Kim, Kwonyoung, primary, Park, Jungin, additional, Lee, Jiyoung, additional, Min, Dongbo, additional, and Sohn, Kwanghoon, additional
Published: 2022
Full Text: View/download PDF

16. SumGraph: Video Summarization via Recursive Graph Modeling

Author: Park, Jungin, primary, Lee, Jiyoung, additional, Kim, Ig-Jae, additional, and Sohn, Kwanghoon, additional
Published: 2020
Full Text: View/download PDF

17. Dual-Path Adaptation from Image to Video Transformers

Author: Park, Jungin, primary, Lee, Jiyoung, additional, and Sohn, Kwanghoon, additional
Published: 2023
Full Text: View/download PDF

18. PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-Identification

Author: Kim, Minsu, primary, Kim, Seungryong, additional, Park, Jungin, additional, Park, Seongheon, additional, and Sohn, Kwanghoon, additional
Published: 2023
Full Text: View/download PDF

19. AEDLE: Designing Drama Therapy Interface for Improving Pragmatic Language Skills of Children with Autism Spectrum Disorder Using AR

Author: Park, Jungin, primary, Bae, Gahyeon, additional, Park, Jueon, additional, Park, Seo Kyoung, additional, Kim, Yeon Soo, additional, and Lee, Sangsu, additional
Published: 2023
Full Text: View/download PDF

20. Language-free Training for Zero-shot Video Grounding

Author: Kim, Dahye, primary, Park, Jungin, additional, Lee, Jiyoung, additional, Park, Seongheon, additional, and Sohn, Kwanghoon, additional
Published: 2023
Full Text: View/download PDF

21. Rethinking Autocorrelation for Deep Spectrum Sensing in Cognitive Radio Networks

Author: Chae, Keunhong, primary, Park, Jungin, additional, and Kim, Yusung, additional
Published: 2023
Full Text: View/download PDF

22. Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

Author: Kim, Minsu, Joung, Sunghun, Kim, Seungryong, Park, JungIn, Kim, Ig-Jae, and Sohn, Kwanghoon
Subjects: FOS: Computer and information sciences, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, General Medicine
Abstract: Existing techniques to adapt semantic segmentation networks across the source and target domains within deep convolutional neural networks (CNNs) deal with all the samples from the two domains in a global or category-aware manner. They do not consider an inter-class variation within the target domain itself or estimated category, providing the limitation to encode the domains having a multi-modal data distribution. To overcome this limitation, we introduce a learnable clustering module, and a novel domain adaptation framework called cross-domain grouping and alignment. To cluster the samples across domains with an aim to maximize the domain alignment without forgetting precise segmentation ability on the source domain, we present two loss functions, in particular, for encouraging semantic consistency and orthogonality among the clusters. We also present a loss so as to solve a class imbalance problem, which is the other limitation of the previous methods. Our experiments show that our method consistently boosts the adaptation performance in semantic segmentation, outperforming the state-of-the-arts on various domain adaptation settings., AAAI 2021
Published: 2021

23. Pin the Memory: Learning to Generalize Semantic Segmentation

Author: Kim, Jin, primary, Lee, Jiyoung, additional, Park, Jungin, additional, Min, Dongbo, additional, and Sohn, Kwanghoon, additional
Published: 2022
Full Text: View/download PDF

24. Probabilistic Representations for Video Contrastive Learning

Author: Park, Jungin, primary, Lee, Jiyoung, additional, Kim, Ig-Jae, additional, and Sohn, Kwanghoon, additional
Published: 2022
Full Text: View/download PDF

25. Self-Balanced Learning for Domain Generalization

Author: Kim, Jin, primary, Lee, Jiyoung, additional, Park, Jungin, additional, Min, Dongbo, additional, and Sohn, Kwanghoon, additional
Published: 2021
Full Text: View/download PDF

26. Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering

Author: Park, Jungin, primary, Lee, Jiyoung, additional, and Sohn, Kwanghoon, additional
Published: 2021
Full Text: View/download PDF

27. Context-Aware Emotion Recognition Networks

Author: Lee, Jiyoung, primary, Kim, Seungryong, additional, Kim, Sunok, additional, Park, Jungin, additional, and Sohn, Kwanghoon, additional
Published: 2019
Full Text: View/download PDF

28. Video Summarization by Learning Relationships between Action and Scene

Author: Park, Jungin, primary, Lee, Jiyoung, additional, Jeon, Sangryul, additional, and Sohn, Kwanghoon, additional
Published: 2019
Full Text: View/download PDF

29. Graph Regularization Network with Semantic Affinity for Weakly-Supervised Temporal Action Localization

Author: Park, Jungin, primary, Lee, Jiyoung, additional, Jeon, Sangryul, additional, Kim, Seungryong, additional, and Sohn, Kwanghoon, additional
Published: 2019
Full Text: View/download PDF

30. Learning to Detect, Associate, and Recognize Human Actions and Surrounding Scenes in Untrimmed Videos

Author: Park, Jungin, primary, Jeon, Sangryul, additional, Kim, Seungryong, additional, Lee, Jiyoung, additional, Kim, Sunok, additional, and Sohn, Kwanghoon, additional
Published: 2018
Full Text: View/download PDF

31. F-35 Joint Strike Fighter: Current Outlook Is Improved, but Long-Term Affordability Is a Major Concern

Author: GOVERNMENT ACCOUNTABILITY OFFICE WASHINGTON DC, Sullivan, Michael, Fairbairn, Bruce, Bonner, Marvin, Roberts, W K, Stockdale, Erin, Park, Jungin, Porter, Megan, Lack, John, GOVERNMENT ACCOUNTABILITY OFFICE WASHINGTON DC, Sullivan, Michael, Fairbairn, Bruce, Bonner, Marvin, Roberts, W K, Stockdale, Erin, Park, Jungin, Porter, Megan, and Lack, John
Abstract: The F-35 Lightning II, the Joint Strike Fighter, is DOD s most costly and ambitious aircraft acquisition. The program is developing and fielding three aircraft variants for the Air Force, Navy, Marine Corps, and eight international partners. The F-35 is critical to long-term recapitalization plans as it is intended to replace hundreds of existing aircraft. This will require a long-term sustained funding commitment. Total U.S. investment is nearing $400 billion to develop and procure 2,457 aircraft through 2037. Fifty-two aircraft have been delivered through 2012. The F-35 program has been extensively restructured over the last 3 years to address prior cost, schedule, and performance problems. GAO s prior reviews of the F-35 made numerous recommendations to improve outcomes, such as increasing test resources and reducing annual procurement quantities. This report, prepared in response to the National Defense Authorization Act for 2010, addresses (1) F-35 program performance during 2012, including testing, technical risks, and software; (2) manufacturing performance indicators, production results, and design changes; and (3) acquisition and sustainment costs going forward. GAO s work included analyses of a wide range of program documents and interviews with defense and contractor officials. GAO is not making recommendations in this report. DOD s restructuring of the F-35 program and other actions are responsive to many prior recommendations. DOD agreed with GAO s report findings and conclusions., Report to Congressional Commitees.
Published: 2013

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

31 results on '"Park, Jungin"'

1. Bridging Vision and Language Spaces with Assignment Prediction

2. Knowing Where to Focus: Event-aware Transformer for Video Grounding

3. PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-identification

4. Dual-path Adaptation from Image to Video Transformers

5. SimOn: A Simple Framework for Online Temporal Action Localization

6. Language-free Training for Zero-shot Video Grounding

7. PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation

8. Probabilistic Representations for Video Contrastive Learning

9. Pin the Memory: Learning to Generalize Semantic Segmentation

10. Self-balanced Learning For Domain Generalization

11. Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering

12. Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

13. SumGraph: Video Summarization via Recursive Graph Modeling

14. Context-Aware Emotion Recognition Networks

15. PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation

16. SumGraph: Video Summarization via Recursive Graph Modeling

17. Dual-Path Adaptation from Image to Video Transformers

18. PartMix: Regularization Strategy to Learn Part Discovery for Visible-Infrared Person Re-Identification

19. AEDLE: Designing Drama Therapy Interface for Improving Pragmatic Language Skills of Children with Autism Spectrum Disorder Using AR

20. Language-free Training for Zero-shot Video Grounding

21. Rethinking Autocorrelation for Deep Spectrum Sensing in Cognitive Radio Networks

22. Cross-Domain Grouping and Alignment for Domain Adaptive Semantic Segmentation

23. Pin the Memory: Learning to Generalize Semantic Segmentation

24. Probabilistic Representations for Video Contrastive Learning

25. Self-Balanced Learning for Domain Generalization

26. Bridge to Answer: Structure-aware Graph Interaction Network for Video Question Answering

27. Context-Aware Emotion Recognition Networks

28. Video Summarization by Learning Relationships between Action and Scene

29. Graph Regularization Network with Semantic Affinity for Weakly-Supervised Temporal Action Localization

30. Learning to Detect, Associate, and Recognize Human Actions and Surrounding Scenes in Untrimmed Videos

31. F-35 Joint Strike Fighter: Current Outlook Is Improved, but Long-Term Affordability Is a Major Concern

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

31 results on '"Park, Jungin"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources