Author: "Abdullah, Hasnat Md." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Abdullah, Hasnat Md."' showing total 6 results

Start Over Author "Abdullah, Hasnat Md."

6 results on '"Abdullah, Hasnat Md."'

1. UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark

Author: Abdullah, Hasnat Md, Liu, Tian, Wei, Kangda, Kong, Shu, and Huang, Ruihong
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Computation and Language
Abstract: Localizing unusual activities, such as human errors or surveillance incidents, in videos holds practical significance. However, current video understanding models struggle with localizing these unusual events likely because of their insufficient representation in models' pretraining datasets. To explore foundation models' capability in localizing unusual activity, we introduce UAL-Bench, a comprehensive benchmark for unusual activity localization, featuring three video datasets: UAG-OOPS, UAG-SSBD, UAG-FunQA, and an instruction-tune dataset: OOPS-UAG-Instruct, to improve model capabilities. UAL-Bench evaluates three approaches: Video-Language Models (Vid-LLMs), instruction-tuned Vid-LLMs, and a novel integration of Vision-Language Models and Large Language Models (VLM-LLM). Our results show the VLM-LLM approach excels in localizing short-span unusual events and predicting their onset (start time) more accurately than Vid-LLMs. We also propose a new metric, R@1, TD <= p, to address limitations in existing evaluation methods. Our findings highlight the challenges posed by long-duration videos, particularly in autism diagnosis scenarios, and the need for further advancements in localization techniques. Our work not only provides a benchmark for unusual activity localization but also outlines the key challenges for existing foundation models, suggesting future research directions on this important task.
Published: 2024

2. SynthEnsemble: A Fusion of CNN, Vision Transformer, and Hybrid Models for Multi-Label Chest X-Ray Classification

Author: Ashraf, S. M. Nabil, Mamun, Md. Adyelullahil, Abdullah, Hasnat Md., and Alam, Md. Golam Rabiul
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, I.4, I.5
Abstract: Chest X-rays are widely used to diagnose thoracic diseases, but the lack of detailed information about these abnormalities makes it challenging to develop accurate automated diagnosis systems, which is crucial for early detection and effective treatment. To address this challenge, we employed deep learning techniques to identify patterns in chest X-rays that correspond to different diseases. We conducted experiments on the "ChestX-ray14" dataset using various pre-trained CNNs, transformers, hybrid(CNN+Transformer) models and classical models. The best individual model was the CoAtNet, which achieved an area under the receiver operating characteristic curve (AUROC) of 84.2%. By combining the predictions of all trained models using a weighted average ensemble where the weight of each model was determined using differential evolution, we further improved the AUROC to 85.4%, outperforming other state-of-the-art methods in this field. Our findings demonstrate the potential of deep learning techniques, particularly ensemble deep learning, for improving the accuracy of automatic diagnosis of thoracic diseases from chest X-rays. Code available at:https://github.com/syednabilashraf/SynthEnsemble, Comment: Published in International Conference on Computer and Information Technology (ICCIT) 2023
Published: 2023
Full Text: View/download PDF

3. Affective social anthropomorphic intelligent system

Author: Mamun, Md. Adyelullahil, Abdullah, Hasnat Md., Alam, Md. Golam Rabiul, Hassan, Muhammad Mehedi, and Uddin, Md. Zia
Subjects: Computer Science - Sound, Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Human-Computer Interaction, Computer Science - Machine Learning
Abstract: Human conversational styles are measured by the sense of humor, personality, and tone of voice. These characteristics have become essential for conversational intelligent virtual assistants. However, most of the state-of-the-art intelligent virtual assistants (IVAs) are failed to interpret the affective semantics of human voices. This research proposes an anthropomorphic intelligent system that can hold a proper human-like conversation with emotion and personality. A voice style transfer method is also proposed to map the attributes of a specific emotion. Initially, the frequency domain data (Mel-Spectrogram) is created by converting the temporal audio wave data, which comprises discrete patterns for audio features such as notes, pitch, rhythm, and melody. A collateral CNN-Transformer-Encoder is used to predict seven different affective states from voice. The voice is also fed parallelly to the deep-speech, an RNN model that generates the text transcription from the spectrogram. Then the transcripted text is transferred to the multi-domain conversation agent using blended skill talk, transformer-based retrieve-and-generate generation strategy, and beam-search decoding, and an appropriate textual response is generated. The system learns an invertible mapping of data to a latent space that can be manipulated and generates a Mel-spectrogram frame based on previous Mel-spectrogram frames to voice synthesize and style transfer. Finally, the waveform is generated using WaveGlow from the spectrogram. The outcomes of the studies we conducted on individual models were auspicious. Furthermore, users who interacted with the system provided positive feedback, demonstrating the system's effectiveness., Comment: Multimedia Tools and Applications (2023)
Published: 2023
Full Text: View/download PDF

4. SynthEnsemble: A Fusion of CNN, Vision Transformer, and Hybrid Models for Multi-Label Chest X-Ray Classification

Author: Ashraf, S.M. Nabil, primary, Mamun, Md. Adyelullahil, additional, Abdullah, Hasnat Md., additional, and Alam, Md. Golam Rabiul, additional
Published: 2023
Full Text: View/download PDF

5. Affective social anthropomorphic intelligent system

Author: Mamun, Md. Adyelullahil, primary, Abdullah, Hasnat Md., additional, Alam, Md. Golam Rabiul, additional, Hassan, Muhammad Mehedi, additional, and Uddin, Md. Zia, additional
Published: 2023
Full Text: View/download PDF

6. A Comparative Analysis of Lumpy Skin Disease Prediction Through Machine Learning Approaches

Author: Dofadar, Dibyo Fabian, primary, Abdullah, Hasnat Md., additional, Khan, Riyo Hayat, additional, Rahman, Rafeed, additional, and Ahmed, Md. Sabbir, additional
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"Abdullah, Hasnat Md."'

1. UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark

2. SynthEnsemble: A Fusion of CNN, Vision Transformer, and Hybrid Models for Multi-Label Chest X-Ray Classification

3. Affective social anthropomorphic intelligent system

4. SynthEnsemble: A Fusion of CNN, Vision Transformer, and Hybrid Models for Multi-Label Chest X-Ray Classification

5. Affective social anthropomorphic intelligent system

6. A Comparative Analysis of Lumpy Skin Disease Prediction Through Machine Learning Approaches

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

6 results on '"Abdullah, Hasnat Md."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources