Author: "Gangisetty, Shankar" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Gangisetty, Shankar"' showing total 24 results

Start Over Author "Gangisetty, Shankar"

24 results on '"Gangisetty, Shankar"'

1. Making the V in Text-VQA Matter

Author: Hegde, Shamanthak, Jahagirdar, Soumya, and Gangisetty, Shankar
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Text-based VQA aims at answering questions by reading the text present in the images. It requires a large amount of scene-text relationship understanding compared to the VQA task. Recent studies have shown that the question-answer pairs in the dataset are more focused on the text present in the image but less importance is given to visual features and some questions do not require understanding the image. The models trained on this dataset predict biased answers due to the lack of understanding of visual context. For example, in questions like "What is written on the signboard?", the answer predicted by the model is always "STOP" which makes the model to ignore the image. To address these issues, we propose a method to learn visual features (making V matter in TextVQA) along with the OCR features and question features using VQA dataset as external knowledge for Text-based VQA. Specifically, we combine the TextVQA dataset and VQA dataset and train the model on this combined dataset. Such a simple, yet effective approach increases the understanding and correlation between the image features and text present in the image, which helps in the better answering of questions. We further test the model on different datasets and compare their qualitative and quantitative results., Comment: Accepted for the CVPR 2023 Workshop on Open-Domain Reasoning Under Multi-Modal Settings
Published: 2023

2. Weakly Supervised Visual Question Answer Generation

Author: Alampalle, Charani, Hegde, Shamanthak, Jahagirdar, Soumya, and Gangisetty, Shankar
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Growing interest in conversational agents promote twoway human-computer communications involving asking and answering visual questions have become an active area of research in AI. Thus, generation of visual questionanswer pair(s) becomes an important and challenging task. To address this issue, we propose a weakly-supervised visual question answer generation method that generates a relevant question-answer pairs for a given input image and associated caption. Most of the prior works are supervised and depend on the annotated question-answer datasets. In our work, we present a weakly supervised method that synthetically generates question-answer pairs procedurally from visual information and captions. The proposed method initially extracts list of answer words, then does nearest question generation that uses the caption and answer word to generate synthetic question. Next, the relevant question generator converts the nearest question to relevant language question by dependency parsing and in-order tree traversal, finally, fine-tune a ViLBERT model with the question-answer pair(s) generated at end. We perform an exhaustive experimental analysis on VQA dataset and see that our model significantly outperform SOTA methods on BLEU scores. We also show the results wrt baseline models and ablation study.
Published: 2023

3. Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

Author: Jahagirdar, Soumya, Gangisetty, Shankar, and Mishra, Anand
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: We present a novel problem of text-based visual question generation or TextVQG in short. Given the recent growing interest of the document image analysis community in combining text understanding with conversational artificial intelligence, e.g., text-based visual question answering, TextVQG becomes an important task. TextVQG aims to generate a natural language question for a given input image and an automatically extracted text also known as OCR token from it such that the OCR token is an answer to the generated question. TextVQG is an essential ability for a conversational agent. However, it is challenging as it requires an in-depth understanding of the scene and the ability to semantically bridge the visual content with the text present in the image. To address TextVQG, we present an OCR consistent visual question generation model that Looks into the visual content, Reads the scene text, and Asks a relevant and meaningful natural language question. We refer to our proposed model as OLRA. We perform an extensive evaluation of OLRA on two public benchmarks and compare them against baselines. Our model OLRA automatically generates questions similar to the public text-based visual question answering datasets that were curated manually. Moreover, we significantly outperform baseline approaches on the performance measures popularly used in text generation literature.
Published: 2022

4. PIG-Net: Inception based Deep Learning Architecture for 3D Point Cloud Segmentation

Author: Hegde, Sindhu and Gangisetty, Shankar
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Point clouds, being the simple and compact representation of surface geometry of 3D objects, have gained increasing popularity with the evolution of deep learning networks for classification and segmentation tasks. Unlike human, teaching the machine to analyze the segments of an object is a challenging task and quite essential in various machine vision applications. In this paper, we address the problem of segmentation and labelling of the 3D point clouds by proposing a inception based deep network architecture called PIG-Net, that effectively characterizes the local and global geometric details of the point clouds. In PIG-Net, the local features are extracted from the transformed input points using the proposed inception layers and then aligned by feature transform. These local features are aggregated using the global average pooling layer to obtain the global features. Finally, feed the concatenated local and global features to the convolution layers for segmenting the 3D point clouds. We perform an exhaustive experimental analysis of the PIG-Net architecture on two state-of-the-art datasets, namely, ShapeNet [1] and PartNet [2]. We evaluate the effectiveness of our network by performing ablation study., Comment: 11 pages, 5 Figures, 6 Tables, Accepted in Computers & Graphics Journal 2021
Published: 2021
Full Text: View/download PDF

5. SHREC’22 track: Open-Set 3D Object Retrieval

Author: Feng, Yifan, Gao, Yue, Zhao, Xibin, Guo, Yandong, Bagewadi, Nihar, Bui, Nhat-Tan, Dao, Hieu, Gangisetty, Shankar, Guan, Ripeng, Han, Xie, Hua, Cong, Hunakunti, Chidambar, Jiang, Yu, Jiao, Shichao, Ke, Yuqi, Kuang, Liqun, Liu, Anan, Nguyen, Dinh-Huan, Nguyen, Hai-Dang, Nie, Weizhi, Pham, Bang-Dang, Raikar, Karthik, Tang, Qingmei, Tran, Minh-Triet, Wan, Jialong, Yan, Chenggang, You, Haoxuan, and Zhu, Difei
Published: 2022
Full Text: View/download PDF

6. FloodNet: Underwater image restoration based on residual dense learning

Author: Gangisetty, Shankar and Rai, Raghu Raj
Published: 2022
Full Text: View/download PDF

7. Fake Review Detection Using Hybrid Ensemble Learning

Author: Hegde, Sindhu, Raj Rai, Raghu, Sunitha Hiremath, P. G., Gangisetty, Shankar, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zhang, Junjie James, Series Editor, Thampi, Sabu M., editor, Gelenbe, Erol, editor, Atiquzzaman, Mohammed, editor, Chaudhary, Vipin, editor, and Li, Kuan-Ching, editor
Published: 2021
Full Text: View/download PDF

8. Generative Adversarial Network-Based Language Identification for Closely Related Same Language Family

Author: Kar, Ashish, Sunitha Hiremath, P. G., Gangisetty, Shankar, Angrisani, Leopoldo, Series Editor, Arteaga, Marco, Series Editor, Panigrahi, Bijaya Ketan, Series Editor, Chakraborty, Samarjit, Series Editor, Chen, Jiming, Series Editor, Chen, Shanben, Series Editor, Chen, Tan Kay, Series Editor, Dillmann, Rüdiger, Series Editor, Duan, Haibin, Series Editor, Ferrari, Gianluigi, Series Editor, Ferre, Manuel, Series Editor, Hirche, Sandra, Series Editor, Jabbari, Faryar, Series Editor, Jia, Limin, Series Editor, Kacprzyk, Janusz, Series Editor, Khamis, Alaa, Series Editor, Kroeger, Torsten, Series Editor, Liang, Qilian, Series Editor, Martín, Ferran, Series Editor, Ming, Tan Cher, Series Editor, Minker, Wolfgang, Series Editor, Misra, Pradeep, Series Editor, Möller, Sebastian, Series Editor, Mukhopadhyay, Subhas, Series Editor, Ning, Cun-Zheng, Series Editor, Nishida, Toyoaki, Series Editor, Pascucci, Federica, Series Editor, Qin, Yong, Series Editor, Seng, Gan Woon, Series Editor, Speidel, Joachim, Series Editor, Veiga, Germano, Series Editor, Wu, Haitao, Series Editor, Zhang, Junjie James, Series Editor, Thampi, Sabu M., editor, Gelenbe, Erol, editor, Atiquzzaman, Mohammed, editor, Chaudhary, Vipin, editor, and Li, Kuan-Ching, editor
Published: 2021
Full Text: View/download PDF

9. Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

Author: Jahagirdar, Soumya, Gangisetty, Shankar, Mishra, Anand, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lladós, Josep, editor, Lopresti, Daniel, editor, and Uchida, Seiichi, editor
Published: 2021
Full Text: View/download PDF

10. 3D Semantic Segmentation for Large-Scale Scene Understanding

Author: Akadas, Kiran, Gangisetty, Shankar, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Sato, Imari, editor, and Han, Bohyung, editor
Published: 2021
Full Text: View/download PDF

11. Underwater image restoration using deep encoder–decoder network with symmetric skip connections

Author: Gangisetty, Shankar and Rai, Raghu Raj
Published: 2022
Full Text: View/download PDF

12. SHREC 2021: Retrieval of cultural heritage objects

Author: Sipiran, Ivan, Lazo, Patrick, Lopez, Cristian, Jimenez, Milagritos, Bagewadi, Nihar, Bustos, Benjamin, Dao, Hieu, Gangisetty, Shankar, Hanik, Martin, Ho-Thi, Ngoc-Phuong, Holenderski, Mike, Jarnikov, Dmitri, Labrada, Arniel, Lengauer, Stefan, Licandro, Roxane, Nguyen, Dinh-Huan, Nguyen-Ho, Thang-Long, Perez Rey, Luis A., Pham, Bang-Dang, Pham, Minh-Khoi, Preiner, Reinhold, Schreck, Tobias, Trinh, Quoc-Huy, Tonnaer, Loek, von Tycowicz, Christoph, and Vu-Le, The-Anh
Published: 2021
Full Text: View/download PDF

13. SHREC 2020: 3D point cloud semantic segmentation for street scenes

Author: Ku, Tao, Veltkamp, Remco C., Boom, Bas, Duque-Arias, David, Velasco-Forero, Santiago, Deschaud, Jean-Emmanuel, Goulette, Francois, Marcotegui, Beatriz, Ortega, Sebastián, Trujillo, Agustín, Suárez, José Pablo, Santana, José Miguel, Ramírez, Cristian, Akadas, Kiran, and Gangisetty, Shankar
Published: 2020
Full Text: View/download PDF

14. Indian Regional Sign Language Recognition

Author: C, Sujatha, primary, Patil, Madhura K, additional, F.M, Umadevi., additional, Jain, Neha, additional, Badni, Akshata, additional, Angadi, Ranjeeta, additional, and Gangisetty, Shankar, additional
Published: 2023
Full Text: View/download PDF

15. 3D Semantic Segmentation for Large-Scale Scene Understanding

Author: Akadas, Kiran, primary and Gangisetty, Shankar, additional
Published: 2021
Full Text: View/download PDF

16. Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

Author: Jahagirdar, Soumya, primary, Gangisetty, Shankar, additional, and Mishra, Anand, additional
Published: 2021
Full Text: View/download PDF

17. Making the V in Text-VQA Matter

Author: Hegde, Shamanthak, primary, Jahagirdar, Soumya, additional, and Gangisetty, Shankar, additional
Published: 2023
Full Text: View/download PDF

18. Weakly Supervised Visual Question Answer Generation

Author: Alampalle, Charani, primary, Hegde, Shamanthak, additional, Jahagirdar, Soumya, additional, and Gangisetty, Shankar, additional
Published: 2023
Full Text: View/download PDF

19. An Ensemble of Transformer and LSTM Approach for Multivariate Time Series Data Classification

Author: Narayan, Aryan, primary, Mishra, Bodhi Satwa, additional, Hiremath, P. G. Sunitha, additional, Pendari, Neha Tarannum, additional, and Gangisetty, Shankar, additional
Published: 2021
Full Text: View/download PDF

20. Stacked LSTM Based Wafer Classification

Author: Shinde, Neeta, primary, S, Chandana, additional, Patil, Shashank Anand, additional, Siri Chandana, K, additional, Pendari, Neha Tarannum, additional, Hiremath, P G Sunitha, additional, and Gangisetty, Shankar, additional
Published: 2021
Full Text: View/download PDF

21. REMOVED: SHREC 2021: 3D point cloud change detection for street scenes

Author: Ku, Tao, Galanakis, Sam, Boom, Bas, Veltkamp, Remco C., Bangera, Darshan, Gangisetty, Shankar, Stagakis, Nikolaos, Arvanitis, Gerasimos, and Moustakas, Konstantinos
Published: 2021
Full Text: View/download PDF

22. Underwater image restoration using deep encoder–decoder network with symmetric skip connections

Author: Gangisetty, Shankar, primary and Rai, Raghu Raj, additional
Published: 2021
Full Text: View/download PDF

23. PIG-Net: Inception based deep learning architecture for 3D point cloud segmentation

Author: Hegde, Sindhu, primary and Gangisetty, Shankar, additional
Published: 2021
Full Text: View/download PDF

24. SHREC 2021: 3D point cloud change detection for street scenes.

Author: Ku, Tao, Galanakis, Sam, Boom, Bas, Veltkamp, Remco C., Bangera, Darshan, Gangisetty, Shankar, Stagakis, Nikolaos, Arvanitis, Gerasimos, and Moustakas, Konstantinos
Subjects: *POINT cloud, *ARTIFICIAL neural networks, *GEOSTATIONARY satellites, *DATA analysis, *ALGORITHMS
Abstract: • We provide a unique classification-based 3D change detection dataset from a complex street environment. There are no other open 3D point cloud datasets released for our purpose. • We evaluate different algorithms on the dataset and help finding solutions for 3D point cloud change detection tasks. • The results show that the proposed siamese graph convolutional networks (SiamGCN) are good at extracting representative geometric features and can hereby outperform compared algorithms on the 3D change detection dataset. [Display omitted] The rapid development of 3D acquisition devices enables us to collect billions of points in a few hours. However, the analysis of the output data is a challenging task, especially in the field of 3D point cloud change detection. In this Shape Retrieval Challenge (SHREC) track, we provide a street-scene dataset for 3D point cloud change detection. The dataset consists of 866 3D object pairs in year 2016 and 2020 from 78 large-scale street scene 3D point clouds. Our goal is to detect the changes from multi-temporal point clouds in a complex street environment. We compare three methods on this benchmark, with one handcrafted (PoChaDeHH) and the other two learning-based (HGI-CD and SiamGCN). The results show that the handcrafted algorithm has balanced performance over all classes, while learning-based methods achieve overwhelming performance but suffer from the class-imbalanced problem and may fail on minority classes. The randomized oversampling metric applied in SiamGCN can alleviate this problem. Also, different siamese network architecture in HGI-CD and SiamGCN contribute to the designing of a network for the 3D change detection task. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

24 results on '"Gangisetty, Shankar"'

1. Making the V in Text-VQA Matter

2. Weakly Supervised Visual Question Answer Generation

3. Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

4. PIG-Net: Inception based Deep Learning Architecture for 3D Point Cloud Segmentation

5. SHREC’22 track: Open-Set 3D Object Retrieval

6. FloodNet: Underwater image restoration based on residual dense learning

7. Fake Review Detection Using Hybrid Ensemble Learning

8. Generative Adversarial Network-Based Language Identification for Closely Related Same Language Family

9. Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

10. 3D Semantic Segmentation for Large-Scale Scene Understanding

11. Underwater image restoration using deep encoder–decoder network with symmetric skip connections

12. SHREC 2021: Retrieval of cultural heritage objects

13. SHREC 2020: 3D point cloud semantic segmentation for street scenes

14. Indian Regional Sign Language Recognition

15. 3D Semantic Segmentation for Large-Scale Scene Understanding

16. Look, Read and Ask: Learning to Ask Questions by Reading Text in Images

17. Making the V in Text-VQA Matter

18. Weakly Supervised Visual Question Answer Generation

19. An Ensemble of Transformer and LSTM Approach for Multivariate Time Series Data Classification

20. Stacked LSTM Based Wafer Classification

21. REMOVED: SHREC 2021: 3D point cloud change detection for street scenes

22. Underwater image restoration using deep encoder–decoder network with symmetric skip connections

23. PIG-Net: Inception based deep learning architecture for 3D point cloud segmentation

24. SHREC 2021: 3D point cloud change detection for street scenes.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

24 results on '"Gangisetty, Shankar"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources