Author: "Mohamed W. Fakhr" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Mohamed W. Fakhr"' showing total 6 results

Start Over Author "Mohamed W. Fakhr"

6 results on '"Mohamed W. Fakhr"'

1. Scalable multimodal approach for face generation and super-resolution using a conditional diffusion model

Author: Ahmed Abotaleb, Mohamed W. Fakhr, and Mohamed Zaki
Subjects: Scalable multimodal approach, Speech conditioned face generation, Speech conditioned face super-resolution, Diffusion probabilistic models, Speaker embeddings, Medicine, Science
Abstract: Abstract Multimodal Conditioned face image generation and face super-resolution are significant areas of research. To achieve optimal results, this paper utilizes diffusion models as the primary engine for these tasks. This paper presents two main contributions: (1) “Speaking the Language of Faces” (SLF): a flexible, modular, fusion-less and architecturally simple multimodal system. (2) A Scalability scheme and a sensitivity analysis which can assist practitioners in system parameter estimation and feature selection. SLF consists of two main components: a feature vector generator (encoder), and an image generator (decoder) utilizing a conditional diffusion model. SLF can accept various inputs, including low-resolution images, speech signals, person attributes (age, gender, ethnicity), or any combination of these. Moreover, Scalability based on conditional scale values is utilized. The implementation of SLF has confirmed its versatility (e.g., speech to face image generation, conditioned face super-resolution). We trained multiple system versions to conduct a sensitivity analysis and to determine the influence of each individual feature on the output image. Consequently, speaker embeddings have proven to be sufficient audio features for our task. It was also found that the effects of audio signals are profound and are more pronounced than those of the low resolution images (8 × 8), whose effects are still significant. The effect of gender, ethnicity and age were found to be moderate. On another note, conditional scale values significantly impact the system’s behavior and performance.
Published: 2024
Full Text: View/download PDF

2. P-Wave Detection Using a Fully Convolutional Neural Network in Electrocardiogram Images

Author: Rana N. Costandy, Safa M. Gasser, Mohamed S. El-Mahallawy, Mohamed W. Fakhr, and Samir Y. Marzouk
Subjects: electrocardiogram, p-wave, atrial disorder, fully convolutional network, Technology, Engineering (General). Civil engineering (General), TA1-2040, Biology (General), QH301-705.5, Physics, QC1-999, Chemistry, QD1-999
Abstract: Electrocardiogram (ECG) signal analysis is a critical task in diagnosing the presence of any cardiac disorder. There are limited studies on detecting P-waves in various atrial arrhythmias, such as atrial fibrillation (AFIB), atrial flutter, junctional rhythm, and other arrhythmias due to P-wave variability and absence in various cases. Thus, there is a growing need to develop an efficient automated algorithm that annotates a 2D printed version of P-waves in the well-known ECG signal databases for validation purposes. To our knowledge, no one has annotated P-waves in the MIT-BIH atrial fibrillation database. Therefore, it is a challenge to manually annotate P-waves in the MIT-BIH AF database and to develop an automated algorithm to detect the absence and presence of different shapes of P-waves. In this paper, we present the manual annotation of P-waves in the well-known MIT-BIH AF database with the aid of a cardiologist. In addition, we provide an automatic P-wave segmentation for the same database using a fully convolutional neural network model (U-Net). This algorithm works on 2D imagery of printed ECG signals, as this type of imagery is the most commonly used in developing countries. The proposed automatic P-wave detection method obtained an accuracy and sensitivity of 98.56% and 98.78%, respectively, over the first 5 min of the second lead of the MIT-BIH AF database (a total of 8280 beats). Moreover, the proposed method is validated using the well-known automatically and manually annotated QT database (a total of 11,201 and 3194 automatically and manually annotated beats, respectively). This results in accuracies of 98.98 and 98.9%, and sensitivities of 98.97 and 97.24% for the automatically and manually annotated QT databases, respectively. Thus, these results indicate that the proposed automatic method can be used for analyzing long-printed ECG signals on mobile battery-driven devices using only images of the ECG signals, without the need for a cardiologist.
Published: 2020
Full Text: View/download PDF

3. Multimodal deep learning model for human handover classification

Author: Islam A. Monir, Mohamed W. Fakhr, and Nashwa El-Bendary
Subjects: Human handover, Control and Optimization, Computer Networks and Communications, Hardware and Architecture, Control and Systems Engineering, Robot-human interaction, Computer Science (miscellaneous), Electrical and Electronic Engineering, Robotic grasping, Instrumentation, Action recognition, Multimodality, Information Systems
Abstract: Giving and receiving objects between humans and robots is a critical task which collaborative robots must be able to do. In order for robots to achieve that, they must be able to classify different types of human handover motions. Previous works did not mainly focus on classifying the motion type from both giver and receiver perspectives. However, they solely focused on object grasping, handover detection, and handover classification from one side only (giver/receiver). This paper discusses the design and implementation of different deep learning architectures with long short term memory (LSTM) network; and different feature selection techniques for human handover classification from both giver and receiver perspectives. Classification performance while using unimodal and multimodal deep learning models is investigated. The data used for evaluation is a publicly available dataset with four different modalities: motion tracking sensors readings, Kinect readings for 15 joints positions, 6-axis inertial sensor readings, and video recordings. The multimodality added a huge boost in the classification performance; achieving 96% accuracy with the feature selection based deep learning architecture.
Published: 2022

4. Sentiment Analysis For Arabic Low Resource Data Using BERT-CNN

Author: Mohamed Fawzy, Mohamed W. Fakhr, and Mohamed Abo Rizka
Published: 2022

5. Recent computer vision applications for pavement distress and condition assessment

Author: Ayman H. El Hakea and Mohamed W. Fakhr
Subjects: Control and Systems Engineering, Building and Construction, Civil and Structural Engineering
Published: 2023

6. Human Handover Classification using a Deep Learning Model

Author: Islam A Monir, Nashwa El-Bendary, and Mohamed W Fakhr
Published: 2021

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"Mohamed W. Fakhr"'

1. Scalable multimodal approach for face generation and super-resolution using a conditional diffusion model

2. P-Wave Detection Using a Fully Convolutional Neural Network in Electrocardiogram Images

3. Multimodal deep learning model for human handover classification

4. Sentiment Analysis For Arabic Low Resource Data Using BERT-CNN

5. Recent computer vision applications for pavement distress and condition assessment

6. Human Handover Classification using a Deep Learning Model

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

6 results on '"Mohamed W. Fakhr"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources