Author: "Ghosh, Shreya" / Publication Year Range: Last 3 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ghosh, Shreya"' showing total 219 results

Start Over Author "Ghosh, Shreya" Publication Year Range Last 3 years

219 results on '"Ghosh, Shreya"'

1. Exploring Language Model Generalization in Low-Resource Extractive QA

Author: Sengupta, Saptarshi, Yin, Wenpeng, Nakov, Preslav, Ghosh, Shreya, and Wang, Suhang
Subjects: Computer Science - Computation and Language
Abstract: In this paper, we investigate Extractive Question Answering (EQA) with Large Language Models (LLMs) under domain drift, i.e., can LLMs generalize well to closed-domains that require specific knowledge such as medicine and law in a zero-shot fashion without additional in-domain training? To this end, we devise a series of experiments to empirically explain the performance gap. Our findings suggest that: a) LLMs struggle with dataset demands of closed-domains such as retrieving long answer-spans; b) Certain LLMs, despite showing strong overall performance, display weaknesses in meeting basic requirements as discriminating between domain-specific senses of words which we link to pre-processing decisions; c) Scaling model parameters is not always effective for cross-domain generalization; and d) Closed-domain datasets are quantitatively much different than open-domain EQA datasets and current LLMs struggle to deal with them. Our findings point out important directions for improving existing LLMs.
Published: 2024

2. Machine Learning to Detect Anxiety Disorders from Error-Related Negativity and EEG Signals

Author: Chandrasekar, Ramya, Hasan, Md Rakibul, Ghosh, Shreya, Gedeon, Tom, and Hossain, Md Zakir
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Machine Learning
Abstract: Anxiety is a common mental health condition characterised by excessive worry, fear and apprehension about everyday situations. Even with significant progress over the past few years, predicting anxiety from electroencephalographic (EEG) signals, specifically using error-related negativity (ERN), still remains challenging. Following the PRISMA protocol, this paper systematically reviews 54 research papers on using EEG and ERN markers for anxiety detection published in the last 10 years (2013 -- 2023). Our analysis highlights the wide usage of traditional machine learning, such as support vector machines and random forests, as well as deep learning models, such as convolutional neural networks and recurrent neural networks across different data types. Our analysis reveals that the development of a robust and generic anxiety prediction method still needs to address real-world challenges, such as task-specific setup, feature selection and computational modelling. We conclude this review by offering potential future direction for non-invasive, objective anxiety diagnostics, deployed across diverse populations and anxiety sub-types.
Published: 2024

3. MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing

Author: Ghosh, Shreya, Cai, Zhixi, Dhall, Abhinav, Kollias, Dimitrios, Goecke, Roland, and Gedeon, Tom
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: With the rapid advancements in multimodal generative technology, Affective Computing research has provoked discussion about the potential consequences of AI systems equipped with emotional intelligence. Affective Computing involves the design, evaluation, and implementation of Emotion AI and related technologies aimed at improving people's lives. Designing a computational model in affective computing requires vast amounts of multimodal data, including RGB images, video, audio, text, and physiological signals. Moreover, Affective Computing research is deeply engaged with ethical considerations at various stages-from training emotionally intelligent models on large-scale human data to deploying these models in specific applications. Fundamentally, the development of any AI system must prioritize its impact on humans, aiming to augment and enhance human abilities rather than replace them, while drawing inspiration from human intelligence in a safe and responsible manner. The MRAC 2024 Track 1 workshop seeks to extend these principles from controlled, small-scale lab environments to real-world, large-scale contexts, emphasizing responsible development. The workshop also aims to highlight the potential implications of generative technology, along with the ethical consequences of its use, to researchers and industry professionals. To the best of our knowledge, this is the first workshop series to comprehensively address the full spectrum of multimodal, generative affective computing from a responsible AI perspective, and this is the second iteration of this workshop. Webpage: https://react-ws.github.io/2024/, Comment: ACM MM Workshop 2024. Workshop webpage: https://react-ws.github.io/2024/
Published: 2024

4. 1M-Deepfakes Detection Challenge

Author: Cai, Zhixi, Dhall, Abhinav, Ghosh, Shreya, Hayat, Munawar, Kollias, Dimitrios, Stefanov, Kalin, and Tariq, Usman
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The detection and localization of deepfake content, particularly when small fake segments are seamlessly mixed with real videos, remains a significant challenge in the field of digital media security. Based on the recently released AV-Deepfake1M dataset, which contains more than 1 million manipulated videos across more than 2,000 subjects, we introduce the 1M-Deepfakes Detection Challenge. This challenge is designed to engage the research community in developing advanced methods for detecting and localizing deepfake manipulations within the large-scale high-realistic audio-visual dataset. The participants can access the AV-Deepfake1M dataset and are required to submit their inference results for evaluation across the metrics for detection or localization tasks. The methodologies developed through the challenge will contribute to the development of next-generation deepfake detection and localization systems. Evaluation scripts, baseline models, and accompanying code will be available on https://github.com/ControlNet/AV-Deepfake1M., Comment: ACM MM 2024. Challenge webpage: https://deepfakes1m.github.io/
Published: 2024

5. MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding

Author: Madan, Surbhi, Ghosh, Shreya, Sookha, Lownish Rai, Ganaie, M. A., Subramanian, Ramanathan, Dhall, Abhinav, and Gedeon, Tom
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Multimedia
Abstract: Estimating the Most Important Person (MIP) in any social event setup is a challenging problem mainly due to contextual complexity and scarcity of labeled data. Moreover, the causality aspects of MIP estimation are quite subjective and diverse. To this end, we aim to address the problem by annotating a large-scale `in-the-wild' dataset for identifying human perceptions about the `Most Important Person (MIP)' in an image. The paper provides a thorough description of our proposed Multimodal Large Language Model (MLLM) based data annotation strategy, and a thorough data quality analysis. Further, we perform a comprehensive benchmarking of the proposed dataset utilizing state-of-the-art MIP localization methods, indicating a significant drop in performance compared to existing datasets. The performance drop shows that the existing MIP localization algorithms must be more robust with respect to `in-the-wild' situations. We believe the proposed dataset will play a vital role in building the next-generation social situation understanding methods. The code and data is available at https://github.com/surbhimadan92/MIP-GAF., Comment: Accepted for publication at WACV 2025
Published: 2024

6. 7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition

Author: Kollias, Dimitrios, Zafeiriou, Stefanos, Kotsia, Irene, Dhall, Abhinav, Ghosh, Shreya, Shao, Chunchang, and Hu, Guanyu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper describes the 7th Affective Behavior Analysis in-the-wild (ABAW) Competition, which is part of the respective Workshop held in conjunction with ECCV 2024. The 7th ABAW Competition addresses novel challenges in understanding human expressions and behaviors, crucial for the development of human-centered technologies. The Competition comprises of two sub-challenges: i) Multi-Task Learning (the goal is to learn at the same time, in a multi-task learning setting, to estimate two continuous affect dimensions, valence and arousal, to recognise between the mutually exclusive classes of the 7 basic expressions and 'other'), and to detect 12 Action Units); and ii) Compound Expression Recognition (the target is to recognise between the 7 mutually exclusive compound expression classes). s-Aff-Wild2, which is a static version of the A/V Aff-Wild2 database and contains annotations for valence-arousal, expressions and Action Units, is utilized for the purposes of the Multi-Task Learning Challenge; a part of C-EXPR-DB, which is an A/V in-the-wild database with compound expression annotations, is utilized for the purposes of the Compound Expression Recognition Challenge. In this paper, we introduce the two challenges, detailing their datasets and the protocols followed for each. We also outline the evaluation metrics, and highlight the baseline systems and their results. Additional information about the competition can be found at \url{https://affective-behavior-analysis-in-the-wild.github.io/7th}.
Published: 2024

7. FunnelNet: An End-to-End Deep Learning Framework to Monitor Digital Heart Murmur in Real-Time

Author: Jobayer, Md, Shawon, Md. Mehedi Hasan, Hasan, Md Rakibul, Ghosh, Shreya, Gedeon, Tom, and Hossain, Md Zakir
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Objective: Heart murmurs are abnormal sounds caused by turbulent blood flow within the heart. Several diagnostic methods are available to detect heart murmurs and their severity, such as cardiac auscultation, echocardiography, phonocardiogram (PCG), etc. However, these methods have limitations, including extensive training and experience among healthcare providers, cost and accessibility of echocardiography, as well as noise interference and PCG data processing. This study aims to develop a novel end-to-end real-time heart murmur detection approach using traditional and depthwise separable convolutional networks. Methods: Continuous wavelet transform (CWT) was applied to extract meaningful features from the PCG data. The proposed network has three parts: the Squeeze net, the Bottleneck, and the Expansion net. The Squeeze net generates a compressed data representation, whereas the Bottleneck layer reduces computational complexity using a depthwise-separable convolutional network. The Expansion net is responsible for up-sampling the compressed data to a higher dimension, capturing tiny details of the representative data. Results: For evaluation, we used four publicly available datasets and achieved state-of-the-art performance in all datasets. Furthermore, we tested our proposed network on two resource-constrained devices: a Raspberry PI and an Android device, stripping it down into a tiny machine learning model (TinyML), achieving a maximum of 99.70%. Conclusion: The proposed model offers a deep learning framework for real-time accurate heart murmur detection within limited resources. Significance: It will significantly result in more accessible and practical medical services and reduced diagnosis time to assist medical professionals. The code is publicly available at TBA., Comment: 8-page main paper and 4-page supplementary material
Published: 2024

8. Improving Transferability of Network Intrusion Detection in a Federated Learning Setup

Author: Ghosh, Shreya, Jameel, Abu Shafin Mohammad Mahdee, and Gamal, Aly El
Subjects: Computer Science - Cryptography and Security, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: Network Intrusion Detection Systems (IDS) aim to detect the presence of an intruder by analyzing network packets arriving at an internet connected device. Data-driven deep learning systems, popular due to their superior performance compared to traditional IDS, depend on availability of high quality training data for diverse intrusion classes. A way to overcome this limitation is through transferable learning, where training for one intrusion class can lead to detection of unseen intrusion classes after deployment. In this paper, we provide a detailed study on the transferability of intrusion detection. We investigate practical federated learning configurations to enhance the transferability of intrusion detection. We propose two techniques to significantly improve the transferability of a federated intrusion detection system. The code for this work can be found at https://github.com/ghosh64/transferability., Comment: This manuscript has been accepted for publication in ICMLCN 2024
Published: 2024

9. A Study on Transferability of Deep Learning Models for Network Intrusion Detection

Author: Ghosh, Shreya, Jameel, Abu Shafin Mohammad Mahdee, and Gamal, Aly El
Subjects: Computer Science - Cryptography and Security, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Signal Processing
Abstract: In this paper, we explore transferability in learning between different attack classes in a network intrusion detection setup. We evaluate transferability of attack classes by training a deep learning model with a specific attack class and testing it on a separate attack class. We observe the effects of real and synthetically generated data augmentation techniques on transferability. We investigate the nature of observed transferability relationships, which can be either symmetric or asymmetric. We also examine explainability of the transferability relationships using the recursive feature elimination algorithm. We study data preprocessing techniques to boost model performance. The code for this work can be found at https://github.com/ghosh64/transferability., Comment: A significantly revised version of this manuscript has been accepted for publication. This is a previous version of the manuscript containing results and discussions that could not be included in the accepted version
Published: 2023

10. AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

Author: Cai, Zhixi, Ghosh, Shreya, Adatia, Aman Pankaj, Hayat, Munawar, Dhall, Abhinav, Gedeon, Tom, and Stefanov, Kalin
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: The detection and localization of highly realistic deepfake audio-visual content are challenging even for the most advanced state-of-the-art methods. While most of the research efforts in this domain are focused on detecting high-quality deepfake images and videos, only a few works address the problem of the localization of small segments of audio-visual manipulations embedded in real videos. In this research, we emulate the process of such content generation and propose the AV-Deepfake1M dataset. The dataset contains content-driven (i) video manipulations, (ii) audio manipulations, and (iii) audio-visual manipulations for more than 2K subjects resulting in a total of more than 1M videos. The paper provides a thorough description of the proposed data generation pipeline accompanied by a rigorous analysis of the quality of the generated data. The comprehensive benchmark of the proposed dataset utilizing state-of-the-art deepfake detection and localization methods indicates a significant drop in performance compared to previous datasets. The proposed dataset will play a vital role in building the next-generation deepfake localization methods. The dataset and associated code are available at https://github.com/ControlNet/AV-Deepfake1M ., Comment: Accepted by ACM MM 2024
Published: 2023

11. Empathy Detection from Text, Audiovisual, Audio or Physiological Signals: Task Formulations and Machine Learning Methods

Author: Hasan, Md Rakibul, Hossain, Md Zakir, Ghosh, Shreya, Krishna, Aneesh, and Gedeon, Tom
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Machine Learning, Computer Science - Social and Information Networks
Abstract: Empathy indicates an individual's ability to understand others. Over the past few years, empathy has drawn attention from various disciplines, including but not limited to Affective Computing, Cognitive Science and Psychology. Detecting empathy has potential applications in society, healthcare and education. Despite being a broad and overlapping topic, the avenue of empathy detection leveraging Machine Learning remains underexplored from a systematic literature review perspective. We collected 828 papers from 10 well-known databases, systematically screened them and analysed the final 61 papers. Our analyses reveal several prominent task formulations $-$ including empathy on localised utterances or overall expressions, unidirectional or parallel empathy, and emotional contagion $-$ in monadic, dyadic and group interactions. Empathy detection methods are summarised based on four input modalities $-$ text, audiovisual, audio and physiological signals $-$ thereby presenting modality-specific network architecture design protocols. We discuss challenges, research gaps and potential applications in the Affective Computing-based empathy domain, which can facilitate new avenues of exploration. We further enlist the public availability of datasets and codes. We believe that our work is a stepping stone to developing a robust empathy detection system that can be deployed in practice to enhance the overall well-being of human life., Comment: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice
Published: 2023

12. Quality > Quantity: Synthetic Corpora from Foundation Models for Closed-Domain Extractive Question Answering

Author: Sengupta, Saptarshi, Heaton, Connor, Ghosh, Shreya, Nakov, Preslav, and Mitra, Prasenjit
Subjects: Computer Science - Computation and Language
Abstract: Domain adaptation, the process of training a model in one domain and applying it to another, has been extensively explored in machine learning. While training a domain-specific foundation model (FM) from scratch is an option, recent methods have focused on adapting pre-trained FMs for domain-specific tasks. However, our experiments reveal that either approach does not consistently achieve state-of-the-art (SOTA) results in the target domain. In this work, we study extractive question answering within closed domains and introduce the concept of targeted pre-training. This involves determining and generating relevant data to further pre-train our models, as opposed to the conventional philosophy of utilizing domain-specific FMs trained on a wide range of data. Our proposed framework uses Galactica to generate synthetic, ``targeted'' corpora that align with specific writing styles and topics, such as research papers and radiology reports. This process can be viewed as a form of knowledge distillation. We apply our method to two biomedical extractive question answering datasets, COVID-QA and RadQA, achieving a new benchmark on the former and demonstrating overall improvements on the latter. Code available at https://github.com/saptarshi059/CDQA-v1-Targetted-PreTraining/tree/main.
Published: 2023

13. Analysis of Elephant Movement in Sub-Saharan Africa: Ecological, Climatic, and Conservation Perspectives

Author: Hines, Matthew, Glatzer, Gregory, Ghosh, Shreya, and Mitra, Prasenjit
Subjects: Quantitative Biology - Populations and Evolution, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: The interaction between elephants and their environment has profound implications for both ecology and conservation strategies. This study presents an analytical approach to decipher the intricate patterns of elephant movement in Sub-Saharan Africa, concentrating on key ecological drivers such as seasonal variations and rainfall patterns. Despite the complexities surrounding these influential factors, our analysis provides a holistic view of elephant migratory behavior in the context of the dynamic African landscape. Our comprehensive approach enables us to predict the potential impact of these ecological determinants on elephant migration, a critical step in establishing informed conservation strategies. This projection is particularly crucial given the impacts of global climate change on seasonal and rainfall patterns, which could substantially influence elephant movements in the future. The findings of our work aim to not only advance the understanding of movement ecology but also foster a sustainable coexistence of humans and elephants in Sub-Saharan Africa. By predicting potential elephant routes, our work can inform strategies to minimize human-elephant conflict, effectively manage land use, and enhance anti-poaching efforts. This research underscores the importance of integrating movement ecology and climatic variables for effective wildlife management and conservation planning., Comment: 11 pages, 17 figures, Accepted in ACM SIGCAS SIGCHI Conference on Computing and Sustainable Societies (COMPASS 2023)
Published: 2023

14. Spatio-temporal Storytelling? Leveraging Generative Models for Semantic Trajectory Analysis

Author: Ghosh, Shreya, Sengupta, Saptarshi, and Mitra, Prasenjit
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: In this paper, we lay out a vision for analysing semantic trajectory traces and generating synthetic semantic trajectory data (SSTs) using generative language model. Leveraging the advancements in deep learning, as evident by progress in the field of natural language processing (NLP), computer vision, etc. we intend to create intelligent models that can study the semantic trajectories in various contexts, predicting future trends, increasing machine understanding of the movement of animals, humans, goods, etc. enhancing human-computer interactions, and contributing to an array of applications ranging from urban-planning to personalized recommendation engines and business strategy., Comment: 8 pages, 1 figure, Submitted for peer review
Published: 2023

15. Lumos in the Night Sky: AI-enabled Visual Tool for Exploring Night-Time Light Patterns

Author: Hederich, Jakob, Ghosh, Shreya, He, Zeyu, and Mitra, Prasenjit
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: We introduce NightPulse, an interactive tool for Night-time light (NTL) data visualization and analytics, which enables researchers and stakeholders to explore and analyze NTL data with a user-friendly platform. Powered by efficient system architecture, NightPulse supports image segmentation, clustering, and change pattern detection to identify urban development and sprawl patterns. It captures temporal trends of NTL and semantics of cities, answering questions about demographic factors, city boundaries, and unusual differences., Comment: 5 pages, 3 figures. Accepted in ECML PKDD Demo track
Published: 2023

16. Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase

Author: Hasan, Md Rakibul, Ghosh, Shreya, Agrawal, Pradyumna, Cai, Zhixi, Dhall, Abhinav, and Gedeon, Tom
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper proposes a feedback mechanism to change behavioural patterns using the Pavlok device. Pavlok utilises beeps, vibration and shocks as a mode of aversion technique to help individuals with behaviour modification. While the device can be useful in certain periodic daily life situations, like alarms and exercise notifications, the device relies on manual operations that limit its usage. To automate behaviour modification, we propose a framework that first detects targeted behaviours through a lightweight deep learning model and subsequently nudges the user through Pavlok. Our proposed solution is implemented and verified in the context of snoring, which captures audio from the environment following a prediction of whether the audio content is a snore or not using a 1D convolutional neural network. Based on the prediction, we use Pavlok to nudge users for preventive measures, such as a change in sleeping posture. We believe that this simple solution can help people to change their atomic habits, which may lead to long-term health benefits. Our proposed real-time, lightweight model (99.8% less parameters over SOTA; 1,278,049 --> 1337) achieves SOTA performance (test accuracy of 0.99) on a public domain benchmark. The code and model are publicly available at https://github.com/hasan-rakibul/pavlok-nudge-snore., Comment: Md Rakibul Hasan and Shreya Ghosh are co-first authors
Published: 2023

17. Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit

Author: Ghosh, Shreya, Cai, Zhixi, Gupta, Parul, Sharma, Garima, Dhall, Abhinav, Hayat, Munawar, and Gedeon, Tom
Subjects: Computer Science - Human-Computer Interaction
Abstract: Automatic group emotion recognition plays an important role in understanding complex human-human interaction. This paper introduces, Emolysis, a Python-based, standalone open-source group emotion analysis toolkit for use in different social situations upon getting consent from the users. Given any input video, Emolysis processes synchronized multimodal input and maps it to group level emotion, valence and arousal. Additionally, the toolkit supports major mobile and desktop platforms (Android, iOS, Windows). The Emolysis platform also comes with an intuitive graphical user interface that allows users to select different modalities and target persons for more fine-grained emotion analysis. Emolysis is freely available for academic research and encourages application developers to extend it to application specific environments on top of the existing system. We believe that the extension mechanism is quite straightforward. Our code models and interface are available at https://github.com/ControlNet/emolysis., Comment: Accepted by ACII Demo 2024. Both Shreya Ghosh and Zhixi Cai contributed equally to this research
Published: 2023

18. Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

Author: Cai, Zhixi, Ghosh, Shreya, Dhall, Abhinav, Gedeon, Tom, Stefanov, Kalin, and Hayat, Munawar
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Most deepfake detection methods focus on detecting spatial and/or spatio-temporal changes in facial attributes and are centered around the binary classification task of detecting whether a video is real or fake. This is because available benchmark datasets contain mostly visual-only modifications present in the entirety of the video. However, a sophisticated deepfake may include small segments of audio or audio-visual manipulations that can completely change the meaning of the video content. To addresses this gap, we propose and benchmark a new dataset, Localized Audio Visual DeepFake (LAV-DF), consisting of strategic content-driven audio, visual and audio-visual manipulations. The proposed baseline method, Boundary Aware Temporal Forgery Detection (BA-TFD), is a 3D Convolutional Neural Network-based architecture which effectively captures multimodal manipulations. We further improve (i.e. BA-TFD+) the baseline method by replacing the backbone with a Multiscale Vision Transformer and guide the training process with contrastive, frame classification, boundary matching and multimodal boundary matching loss functions. The quantitative analysis demonstrates the superiority of BA-TFD+ on temporal forgery localization and deepfake detection tasks using several benchmark datasets including our newly proposed dataset. The dataset, models and code are available at https://github.com/ControlNet/LAV-DF., Comment: The paper is under consideration/review at Computer Vision and Image Understanding Journal
Published: 2023

19. AI-based Fog and Edge Computing: A Systematic Review, Taxonomy and Future Directions

Author: Iftikhar, Sundas, Gill, Sukhpal Singh, Song, Chenghao, Xu, Minxian, Aslanpour, Mohammad Sadegh, Toosi, Adel N., Du, Junhui, Wu, Huaming, Ghosh, Shreya, Chowdhury, Deepraj, Golec, Muhammed, Kumar, Mohit, Abdelmoniem, Ahmed M., Cuadrado, Felix, Varghese, Blesson, Rana, Omer, Dustdar, Schahram, and Uhlig, Steve
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing
Abstract: Resource management in computing is a very challenging problem that involves making sequential decisions. Resource limitations, resource heterogeneity, dynamic and diverse nature of workload, and the unpredictability of fog/edge computing environments have made resource management even more challenging to be considered in the fog landscape. Recently Artificial Intelligence (AI) and Machine Learning (ML) based solutions are adopted to solve this problem. AI/ML methods with the capability to make sequential decisions like reinforcement learning seem most promising for these type of problems. But these algorithms come with their own challenges such as high variance, explainability, and online training. The continuously changing fog/edge environment dynamics require solutions that learn online, adopting changing computing environment. In this paper, we used standard review methodology to conduct this Systematic Literature Review (SLR) to analyze the role of AI/ML algorithms and the challenges in the applicability of these algorithms for resource management in fog/edge computing environments. Further, various machine learning, deep learning and reinforcement learning techniques for edge AI management have been discussed. Furthermore, we have presented the background and current status of AI/ML-based Fog/Edge Computing. Moreover, a taxonomy of AI/ML-based resource management techniques for fog/edge computing has been proposed and compared the existing techniques based on the proposed taxonomy. Finally, open challenges and promising future research directions have been identified and discussed in the area of AI/ML-based fog/edge computing., Comment: 49 page, 15 figures, 10 tables
Published: 2022
Full Text: View/download PDF

20. Attention-Based Multi-layer Perceptron to Categorize Affective Videos from Viewer’s Physiological Signals

Author: Shaiok, Lazib Sharar, Hoque, Ishtiaqul, Hasan, Md Rakibul, Ghosh, Shreya, Gedeon, Tom, Hossain, Md Zakir, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Nguyen, Ngoc Thanh, editor, Chbeir, Richard, editor, Manolopoulos, Yannis, editor, Fujita, Hamido, editor, Hong, Tzung-Pei, editor, Nguyen, Le Minh, editor, and Wojtkiewicz, Krystian, editor
Published: 2024
Full Text: View/download PDF

21. Internet of Things and Dew Computing-Based System for Smart Agriculture

Author: Bera, Somnath, Dey, Tanushree, Ghosh, Shreya, Mukherjee, Anwesha, Fortino, Giancarlo, Series Editor, Liotta, Antonio, Series Editor, De, Debashis, editor, and Roy, Samarjit, editor
Published: 2024
Full Text: View/download PDF

22. MARLIN: Masked Autoencoder for facial video Representation LearnINg

Author: Cai, Zhixi, Ghosh, Shreya, Stefanov, Kalin, Dhall, Abhinav, Cai, Jianfei, Rezatofighi, Hamid, Haffari, Reza, and Hayat, Munawar
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: This paper proposes a self-supervised approach to learn universal facial representations from videos, that can transfer across a variety of facial analysis tasks such as Facial Attribute Recognition (FAR), Facial Expression Recognition (FER), DeepFake Detection (DFD), and Lip Synchronization (LS). Our proposed framework, named MARLIN, is a facial video masked autoencoder, that learns highly robust and generic facial embeddings from abundantly available non-annotated web crawled facial videos. As a challenging auxiliary task, MARLIN reconstructs the spatio-temporal details of the face from the densely masked facial regions which mainly include eyes, nose, mouth, lips, and skin to capture local and global aspects that in turn help in encoding generic and transferable features. Through a variety of experiments on diverse downstream tasks, we demonstrate MARLIN to be an excellent facial video encoder as well as feature extractor, that performs consistently well across a variety of downstream tasks including FAR (1.13% gain over supervised benchmark), FER (2.64% gain over unsupervised benchmark), DFD (1.86% gain over unsupervised benchmark), LS (29.36% gain for Frechet Inception Distance), and even in low data regime. Our code and models are available at https://github.com/ControlNet/MARLIN ., Comment: CVPR 2023
Published: 2022

23. A Critical Review on the Recovery of Base and Critical Elements from Electronic Waste-Contaminated Streams Using Microbial Biotechnology

Author: Mishra, Sunanda, Ghosh, Shreya, van Hullebusch, Eric D., Singh, Shikha, and Das, Alok Prasad
Published: 2023
Full Text: View/download PDF

24. RAZE: Region Guided Self-Supervised Gaze Representation Learning

Author: Dubey, Neeru, Ghosh, Shreya, and Dhall, Abhinav
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Automatic eye gaze estimation is an important problem in vision based assistive technology with use cases in different emerging topics such as augmented reality, virtual reality and human-computer interaction. Over the past few years, there has been an increasing interest in unsupervised and self-supervised learning paradigms as it overcomes the requirement of large scale annotated data. In this paper, we propose RAZE, a Region guided self-supervised gAZE representation learning framework which leverage from non-annotated facial image data. RAZE learns gaze representation via auxiliary supervision i.e. pseudo-gaze zone classification where the objective is to classify visual field into different gaze zones (i.e. left, right and center) by leveraging the relative position of pupil-centers. Thus, we automatically annotate pseudo gaze zone labels of 154K web-crawled images and learn feature representations via `Ize-Net' framework. `Ize-Net' is a capsule layer based CNN architecture which can efficiently capture rich eye representation. The discriminative behaviour of the feature representation is evaluated on four benchmark datasets: CAVE, TabletGaze, MPII and RT-GENE. Additionally, we evaluate the generalizability of the proposed network on two other downstream task (i.e. driver gaze estimation and visual attention estimation) which demonstrate the effectiveness of the learnt eye gaze representation., Comment: arXiv admin note: substantial text overlap with arXiv:1904.02459
Published: 2022

25. 'Labelling the Gaps': A Weakly Supervised Automatic Eye Gaze Estimation

Author: Ghosh, Shreya, Dhall, Abhinav, Knibbe, Jarrod, and Hayat, Munawar
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Over the past few years, there has been an increasing interest to interpret gaze direction in an unconstrained environment with limited supervision. Owing to data curation and annotation issues, replicating gaze estimation method to other platforms, such as unconstrained outdoor or AR/VR, might lead to significant drop in performance due to insufficient availability of accurately annotated data for model training. In this paper, we explore an interesting yet challenging problem of gaze estimation method with a limited amount of labelled data. The proposed method distills knowledge from the labelled subset with visual features; including identity-specific appearance, gaze trajectory consistency and motion features. Given a gaze trajectory, the method utilizes label information of only the start and the end frames of a gaze sequence. An extension of the proposed method further reduces the requirement of labelled frames to only the start frame with a minor drop in the generated label's quality. We evaluate the proposed method on four benchmark datasets (CAVE, TabletGaze, MPII and Gaze360) as well as web-crawled YouTube videos. Our proposed method reduces the annotation effort to as low as 2.67%, with minimal impact on performance; indicating the potential of our model enabling gaze estimation 'in-the-wild' setup.
Published: 2022

26. AV-Gaze: A Study on the Effectiveness of Audio Guided Visual Attention Estimation for Non-Profilic Faces

Author: Ghosh, Shreya, Dhall, Abhinav, Hayat, Munawar, and Knibbe, Jarrod
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In challenging real-life conditions such as extreme head-pose, occlusions, and low-resolution images where the visual information fails to estimate visual attention/gaze direction, audio signals could provide important and complementary information. In this paper, we explore if audio-guided coarse head-pose can further enhance visual attention estimation performance for non-prolific faces. Since it is difficult to annotate audio signals for estimating the head-pose of the speaker, we use off-the-shelf state-of-the-art models to facilitate cross-modal weak-supervision. During the training phase, the framework learns complementary information from synchronized audio-visual modality. Our model can utilize any of the available modalities i.e. audio, visual or audio-visual for task-specific inference. It is interesting to note that, when AV-Gaze is tested on benchmark datasets with these specific modalities, it achieves competitive results on multiple datasets, while being highly adaptive toward challenging scenarios.
Published: 2022

27. Bridging Semantics: Mobility Analytics Framework for Knowledge Transfer

Author: Ghosh, Shreya, primary and Mitra, Prasenjit, additional
Published: 2024
Full Text: View/download PDF

28. Macromolecular crowding effects on protein dynamics

Author: Das, Nilimesh, Khan, Tanmoy, Halder, Bisal, Ghosh, Shreya, and Sen, Pratik
Published: 2024
Full Text: View/download PDF

29. Mobi-Sense: mobility-aware sensor-fog paradigm for mission-critical applications using network coding and steganography

Author: Mukherjee, Anwesha, Ghosh, Shreya, Ghosh, Soumya K., and Buyya, Rajkumar
Published: 2023
Full Text: View/download PDF

30. Internet of Things and Dew Computing-Based System for Smart Agriculture

Author: Bera, Somnath, primary, Dey, Tanushree, additional, Ghosh, Shreya, additional, and Mukherjee, Anwesha, additional
Published: 2023
Full Text: View/download PDF

31. How Early Can We Detect? Detecting Misinformation on Social Media Using User Profiling and Network Characteristics

Author: Ghosh, Shreya, Mitra, Prasenjit, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, De Francisci Morales, Gianmarco, editor, Perlich, Claudia, editor, Ruchansky, Natali, editor, Kourtellis, Nicolas, editor, Baralis, Elena, editor, and Bonchi, Francesco, editor
Published: 2023
Full Text: View/download PDF

32. ‘Labelling the Gaps’: A Weakly Supervised Automatic Eye Gaze Estimation

Author: Ghosh, Shreya, Dhall, Abhinav, Hayat, Munawar, Knibbe, Jarrod, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Wang, Lei, editor, Gall, Juergen, editor, Chin, Tat-Jun, editor, Sato, Imari, editor, and Chellappa, Rama, editor
Published: 2023
Full Text: View/download PDF

33. Conformational Differences in the Light Chain Constant Domain of Immunoglobulin G and Free Light Chain May Influence Proteolysis in AL Amyloidosis

Author: Klimtchuk, Elena S., Prokaeva, Tatiana, Spencer, Brian H., Wong, Sherry, Ghosh, Shreya, Urdaneta, Angela, Morgan, Gareth, Wales, Thomas E., and Gursky, Olga
Published: 2024
Full Text: View/download PDF

34. Optimum relative humidity enhances CO2 uptake in diamine-appended M2(dobpdc)

Author: Holmes, Hannah E., Ghosh, Shreya, Li, Chunyi, Kalyanaraman, Jayashree, Realff, Matthew J., Weston, Simon C., and Lively, Ryan P.
Published: 2023
Full Text: View/download PDF

35. SocialSense: Mobile crowd sensing-based physical distance monitoring model leveraging federated learning for pandemic

Author: De, Debashis, Ghosh, Shreya, and Mukherjee, Anwesha
Published: 2023
Full Text: View/download PDF

36. Transthyretin Cardiac Amyloidosis: Recent Advances in Diagnosis and Treatment

Author: Ghosh, Shreya, primary, Thakur, Ashwani Kumar, additional, and Khanra, Dibbendhu, additional
Published: 2023
Full Text: View/download PDF

37. AI-based fog and edge computing: A systematic review, taxonomy and future directions

Author: Iftikhar, Sundas, Gill, Sukhpal Singh, Song, Chenghao, Xu, Minxian, Aslanpour, Mohammad Sadegh, Toosi, Adel N., Du, Junhui, Wu, Huaming, Ghosh, Shreya, Chowdhury, Deepraj, Golec, Muhammed, Kumar, Mohit, Abdelmoniem, Ahmed M., Cuadrado, Felix, Varghese, Blesson, Rana, Omer, Dustdar, Schahram, and Uhlig, Steve
Published: 2023
Full Text: View/download PDF

38. Evolution of biomining technology

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

39. Role of metabolites and metabolomics for understanding of manganese bioleaching

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

40. Manganese bioleaching

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

41. Future trends

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

42. Case studies

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

43. Manganese and its application

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

44. Lumos in the Night Sky: AI-Enabled Visual Tool for Exploring Night-Time Light Patterns

Author: Hederich, Jakob, primary, Ghosh, Shreya, additional, He, Zeyu, additional, and Mitra, Prasenjit, additional
Published: 2023
Full Text: View/download PDF

45. Mixed culture bioleaching: an insight into manganese biomining process and efficacy

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

46. Mechanism of manganese bioleaching

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

47. Microorganisms in manganese biomining processes

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

48. Introduction

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

49. Metagenomic insights into unculturable microbial diversity

Author: Das, Alok Prasad, primary and Ghosh, Shreya, additional
Published: 2023
Full Text: View/download PDF

50. Comparative investigation of fungal and bacterial manganese biomining mechanisms

Author: Ghosh, Shreya, primary, Tripathy, Banismita, additional, Dey, Sudeshna, additional, and Das, Alok Prasad, additional
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

219 results on '"Ghosh, Shreya"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources