Author: "Kosmas Dimitropoulos" / Topic: computer science - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Kosmas Dimitropoulos"' showing total 63 results

Start Over Author "Kosmas Dimitropoulos" Topic computer science

63 results on '"Kosmas Dimitropoulos"'

1. Continuous Sign Language Recognition Through Cross-Modal Alignment of Video and Text Embeddings in a Joint-Latent Space

Author: Kosmas Dimitropoulos, Dimitrios Konstantinidis, Ilias Papastratis, and Petros Daras
Subjects: General Computer Science, Computer science, business.industry, cross-modal learning, Feature extraction, General Engineering, Space (commercial competition), Sign language, computer.software_genre, continuous sign language recognition, deep-learning, joint latent space, Modal, Encoding (memory), Computer vision, General Materials Science, lcsh:Electrical engineering. Electronics. Nuclear engineering, Artificial intelligence, Focus (optics), Joint (audio engineering), business, Representation (mathematics), lcsh:TK1-9971, computer, Natural language processing
Abstract: Continuous Sign Language Recognition (CSLR) refers to the challenging problem of recognizing sign language glosses and their temporal boundaries from weakly annotated video sequences. Previous methods focus mostly on visual feature extraction neglecting text information and failing to effectively model the intra-gloss dependencies. In this work, a cross-modal learning approach that leverages text information to improve vision-based CSLR is proposed. To this end, two powerful encoding networks are initially used to produce video and text embeddings prior to their mapping and alignment into a joint latent representation. The purpose of the proposed cross-modal alignment is the modelling of intra-gloss dependencies and the creation of more descriptive video-based latent representations for CSLR. The proposed method is trained jointly with video and text latent representations. Finally, the aligned video latent representations are classified using a jointly trained decoder. Extensive experiments on three well-known sign language recognition datasets and comparison with state-of-the-art approaches demonstrate the great potential of the proposed approach.
Published: 2020
Full Text: View/download PDF

2. Multi-modal Variational Faster R-CNN for Improved Visual Object Detection in Manufacturing

Author: Kosmas Dimitropoulos, Panagiotis Mouzenidis, Dimitrios Konstantinidis, Antonios Louros, Theofilos D. Mastos, and Petros Daras
Subjects: Modal, business.industry, Computer science, Computer vision, Artificial intelligence, business, Object detection
Published: 2021
Full Text: View/download PDF

3. Artificial Intelligence Technologies for Sign Language

Author: Kosmas Dimitropoulos, Ilias Papastratis, Christos Chatzikonstantinou, Petros Daras, and Dimitrios Konstantinidis
Subjects: applications, Computer science, sign language representation, Chemical technology, sign language recognition, Representation (arts), TP1-1185, Sign language, Biochemistry, Data science, Atomic and Molecular Physics, and Optics, Field (computer science), Article, Analytical Chemistry, sign language capturing, Sign Language, Artificial Intelligence, Humans, Prospective Studies, Electrical and Electronic Engineering, Instrumentation, Algorithms
Abstract: AI technologies can play an important role in breaking down the communication barriers of deaf or hearing-impaired people with other communities, contributing significantly to their social inclusion. Recent advances in both sensing technologies and AI algorithms have paved the way for the development of various applications aiming at fulfilling the needs of deaf and hearing-impaired communities. To this end, this survey aims to provide a comprehensive review of state-of-the-art methods in sign language capturing, recognition, translation and representation, pinpointing their advantages and limitations. In addition, the survey presents a number of applications, while it discusses the main challenges in the field of sign language technologies. Future research direction are also proposed in order to assist prospective researchers towards further advancing the field.
Published: 2021

4. Editorial: Artificial Intelligence and Human Movement in Industries and Creation

Author: Kosmas Dimitropoulos, Frederic Fol Leymarie, Petros Daras, Sylvain Calinon, and Sotiris Manitsaris
Subjects: Cognitive science, machine learning, motion caption, Artificial Intelligence, Computer science, Movement (music), Electronic computers. Computer science, TJ1-1570, Mechanical engineering and machinery, QA75.5-76.95, human centred, human motion analysis, Computer Science Applications
Published: 2021
Full Text: View/download PDF

5. Multi-lead ECG signal analysis for myocardial infarction detection and localization through the mapping of Grassmannian and Euclidean features into a common Hilbert space

Author: Nikos Grammalidis, Anestis Apostolidis, Panagiotis Barmpoutis, and Kosmas Dimitropoulos
Subjects: Computer science, business.industry, Heart malformation, 0206 medical engineering, Hilbert space, Health Informatics, Pattern recognition, 02 engineering and technology, 020601 biomedical engineering, Outcome (probability), 03 medical and health sciences, symbols.namesake, 0302 clinical medicine, Feature (computer vision), Encoding (memory), Grassmannian, Signal Processing, Euclidean geometry, cardiovascular system, symbols, cardiovascular diseases, Artificial intelligence, business, 030217 neurology & neurosurgery, Reproducing kernel Hilbert space
Abstract: Background and objective Electrocardiogram is commonly used as a diagnostic tool for the monitoring of cardiac health and the detection of possible heart diseases. However, the procedure followed for the diagnosis of heart abnormalities is time consuming and prone to human errors. Thus, the development of computer-aided techniques for the automatic analysis of electrocardiogram signals is of vital importance for the diagnosis and prevention of heart diseases. The most serious outcome of coronary heart disease is the myocardial infarction, i.e., the rapid and irreversible damage of cardiac muscles, which, if not diagnosed and treated in time, continues to damage further the myocardial structure and function. In this paper we propose a novel approach for the automatic detection and localization of myocardial infarction from multi-lead electrocardiogram signals. Methods The proposed method initially reshapes the multidimensional signal into a third-order tensor structure and subsequently extracts feature representations in both Euclidean and Grassmannian space. In addition, two different methods are proposed for the mapping of the two different feature representations into a common Hilbert space before the final classification of signals. The first approach is based on the mapping of both Grassmannian and Euclidean features in a Reproducing Kernel Hilbert Space (RKHS), while the second one attempts to initially apply Vector of Locally Aggregated Descriptors (VLAD) encoding directly to Grassmann manifold and then concatenate the two VLAD representations. Results For the evaluation of the proposed method, we have conducted extensive tests using a publicly available dataset, namely PTB Diagnostic ECG database, containing 549 multi-lead ECG data recordings from 290 subjects and from different diagnostic classes. The method provides an excellent detection rate of 100%, and localization rate, i.e., 100% with the first fusion method and 99.7% with the second one. Conclusions The Experimental results presented in this paper show the superiority of the proposed methodology against a number of state-of-the-art approaches. The main advantage of the proposed approach is that it exploits better the intercorrelations between signals of different ECG leads, by extracting feature representations that lie in different geometrical spaces and contain complementary information with regard to the dynamics of signals.
Published: 2019
Full Text: View/download PDF

6. 3D Hand Pose Estimation via aligned latent space injection and kinematic losses

Author: Kosmas Dimitropoulos, Andreas Stergioulas, Dimitrios Konstantinidis, Theocharis Chatzis, and Petros Daras
Subjects: Background noise, Computer science, business.industry, Deep learning, RGB color model, Computer vision, Kinematics, Artificial intelligence, Space (mathematics), business, Pose, Decoding methods
Abstract: In this paper, we propose a novel multi-stage deep learning methodology to accurately tackle the problem of hand pose estimation. More specifically, we initially propose a disentanglement stage to differentiate the significant pose-specific information from the irrelevant background noise and illumination variations of RGB images. Then, we introduce a variational alignment stage to accurately align the latent spaces of the pose-specific and the true hand pose information, effectively improving the discrimination ability of the proposed methodology. Finally, we propose the use of two loss terms to impose physiological and geometrical kinematic constraints to the predicted hand poses, empowering the proposed methodology to avoid non-plausible poses. During all stages, a novel injection decoder is introduced, improving significantly the decoding accuracy of the latent space. Extensive experimentation on two well-known datasets (i.e., RHD and STB) validate the ability of the proposed methodology to achieve accurate hand pose estimation results, overcoming current state-of-the-art methods.
Published: 2021
Full Text: View/download PDF

7. Continuous Sign Language Recognition through a Context-Aware Generative Adversarial Network

Author: Kosmas Dimitropoulos, Ilias Papastratis, and Petros Daras
Subjects: Computer science, Greek Sign Language, 02 engineering and technology, 010501 environmental sciences, Sign language, computer.software_genre, Chinese Sign Language, lcsh:Chemical technology, 01 natural sciences, Biochemistry, Article, Analytical Chemistry, Pattern Recognition, Automated, Sign Language, 0202 electrical engineering, electronic engineering, information engineering, Humans, lcsh:TP1-1185, Electrical and Electronic Engineering, Instrumentation, 0105 earth and related environmental sciences, Language, Gestures, business.industry, continuous sign language recognition, Atomic and Molecular Physics, and Optics, language.human_language, language, 020201 artificial intelligence & image processing, Artificial intelligence, sign language translation, generative adversarial networks, business, computer, Natural language processing, Natural language, Sentence, Generative grammar, Sign (mathematics), Gesture
Abstract: Continuous sign language recognition is a weakly supervised task dealing with the identification of continuous sign gestures from video sequences, without any prior knowledge about the temporal boundaries between consecutive signs. Most of the existing methods focus mainly on the extraction of spatio-temporal visual features without exploiting text or contextual information to further improve the recognition accuracy. Moreover, the ability of deep generative models to effectively model data distribution has not been investigated yet in the field of sign language recognition. To this end, a novel approach for context-aware continuous sign language recognition using a generative adversarial network architecture, named as Sign Language Recognition Generative Adversarial Network (SLRGAN), is introduced. The proposed network architecture consists of a generator that recognizes sign language glosses by extracting spatial and temporal features from video sequences, as well as a discriminator that evaluates the quality of the generator’s predictions by modeling text information at the sentence and gloss levels. The paper also investigates the importance of contextual information on sign language conversations for both Deaf-to-Deaf and Deaf-to-hearing communication. Contextual information, in the form of hidden states extracted from the previous sentence, is fed into the bidirectional long short-term memory module of the generator to improve the recognition accuracy of the network. At the final stage, sign language translation is performed by a transformer network, which converts sign language glosses to natural language text. Our proposed method achieved word error rates of 23.4%, 2.1%, and 2.26% on the RWTH-Phoenix-Weather-2014 and the Chinese Sign Language (CSL) and Greek Sign Language (GSL) Signer Independent (SI) datasets, respectively.
Published: 2021

8. A Cross-Modal Variational Framework For Food Image Analysis

Author: Thomas Theodoridis, Kosmas Dimitropoulos, Petros Daras, and Vassilios Solachidis
Subjects: Computer science, 02 engineering and technology, 010501 environmental sciences, Recommender system, Machine learning, computer.software_genre, 01 natural sciences, Task (project management), Image (mathematics), 0202 electrical engineering, electronic engineering, information engineering, Eating habits, 0105 earth and related environmental sciences, Network architecture, ingredient recognition, business.industry, food analysis, variational, Information processing, Food Analysis, cross-modal, Modal, VAE, Task analysis, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer
Abstract: Food analysis resides at the core of modern nutrition recommender systems, providing the foundation for a high-level understanding of users’ eating habits. This paper focuses on the sub-task of ingredient recognition from food images using a variational framework. The framework consists of two variational encoder-decoder branches, aimed at processing information from different modalities (images and text), as well as a variational mapper branch, which accomplishes the task of aligning the distributions of the individual branches. Experimental results on the Yummly-28K data-set showcase that the proposed framework performs better than similar variational frameworks, while it surpasses current state-of-the-art approaches on the large-scale Recipe1M data-set.
Published: 2020

9. Early Fire Detection Based on Aerial 360-Degree Sensors, Deep Convolution Neural Networks and Exploitation of Fire Dynamic Textures

Author: Nikos Grammalidis, Panagiotis Barmpoutis, Kosmas Dimitropoulos, and Tania Stathaki
Subjects: 010504 meteorology & atmospheric sciences, Artificial neural network, Standard test image, Fire detection, Computer science, Flame detection, Science, Real-time computing, Field of view, flame detection, 02 engineering and technology, 01 natural sciences, remote sensing, natural disasters, 0202 electrical engineering, electronic engineering, information engineering, Equirectangular projection, 360-degree sensors, General Earth and Planetary Sciences, RGB color model, 020201 artificial intelligence & image processing, smoke detection, Projection (set theory), fire detection, 0105 earth and related environmental sciences
Abstract: The environmental challenges the world faces have never been greater or more complex. Global areas that are covered by forests and urban woodlands are threatened by large-scale forest fires that have increased dramatically during the last decades in Europe and worldwide, in terms of both frequency and magnitude. To this end, rapid advances in remote sensing systems including ground-based, unmanned aerial vehicle-based and satellite-based systems have been adopted for effective forest fire surveillance. In this paper, the recently introduced 360-degree sensor cameras are proposed for early fire detection, making it possible to obtain unlimited field of view captures which reduce the number of required sensors and the computational cost and make the systems more efficient. More specifically, once optical 360-degree raw data are obtained using an RGB 360-degree camera mounted on an unmanned aerial vehicle, we convert the equirectangular projection format images to stereographic images. Then, two DeepLab V3+ networks are applied to perform flame and smoke segmentation, respectively. Subsequently, a novel post-validation adaptive method is proposed exploiting the environmental appearance of each test image and reducing the false-positive rates. For evaluating the performance of the proposed system, a dataset, namely the “Fire detection 360-degree dataset”, consisting of 150 unlimited field of view images that contain both synthetic and real fire, was created. Experimental results demonstrate the great potential of the proposed system, which has achieved an F-score fire detection rate equal to 94.6%, hence reducing the number of required sensors. This indicates that the proposed method could significantly contribute to early fire detection.
Published: 2020

10. A Comprehensive Study on Deep Learning-Based 3D Hand Pose Estimation Methods

Author: Kosmas Dimitropoulos, Petros Daras, Andreas Stergioulas, Dimitrios Konstantinidis, and Theocharis Chatzis
Subjects: Computer science, 3D hand pose estimation, 02 engineering and technology, Machine learning, computer.software_genre, lcsh:Technology, Field (computer science), computer vision, lcsh:Chemistry, 0202 electrical engineering, electronic engineering, information engineering, General Materials Science, Instrumentation, Pose, lcsh:QH301-705.5, Fluid Flow and Transfer Processes, Modality (human–computer interaction), Artificial neural network, business.industry, lcsh:T, Process Chemistry and Technology, Deep learning, General Engineering, deep learning, 020207 software engineering, neural networks, lcsh:QC1-999, Computer Science Applications, lcsh:Biology (General), lcsh:QD1-999, lcsh:TA1-2040, RGB color model, Deep neural networks, 020201 artificial intelligence & image processing, Artificial intelligence, business, lcsh:Engineering (General). Civil engineering (General), computer, lcsh:Physics
Abstract: The field of 3D hand pose estimation has been gaining a lot of attention recently, due to its significance in several applications that require human-computer interaction (HCI). The utilization of technological advances, such as cost-efficient depth cameras coupled with the explosive progress of Deep Neural Networks (DNNs), has led to a significant boost in the development of robust markerless 3D hand pose estimation methods. Nonetheless, finger occlusions and rapid motions still pose significant challenges to the accuracy of such methods. In this survey, we provide a comprehensive study of the most representative deep learning-based methods in literature and propose a new taxonomy heavily based on the input data modality, being RGB, depth, or multimodal information. Finally, we demonstrate results on the most popular RGB and depth-based datasets and discuss potential research directions in this rapidly growing field.
Published: 2020

11. Innovative Parkinsons Disease Patients Motor Skills Assessment: The i-PROGNOSIS Paradigm

Author: Estelle Huchet, Dhaval Trivedi, K. Ray Chaudhuri, Nikos Grammalidis, Sevasti Bostantzopoulou, Vasileios Charisis, Anastasios Delopoulos, Kosmas Dimitropoulos, Zoe Katsarou, Hagen Jaeger, Hugo Silva, Elina Theodoropoulou, Theodore Savvidis, Michael Stadtschnitzer, Sofia B. Dias, Alexandros Papadopoulos, G. Lyberopoulos, Fotis Karayiannis, Konstantinos Filis, Gonçalo Telo, Konstantinos Kyritsis, José Alves Diniz, George Apostolidis, Vicky Zilidou, Vera Hoermann, George Ntakakis, Panagiotis D. Bamidis, Ioannis Ioakeimidis, Dimitrios Iakovakis, Stelios Hadjidimitriou, Heinz Reichmann, Athina Grammatikopoulou, Leontios J. Hadjileontiadis, Evdokimos I. Konstantinidis, and Lisa Klingelhoefer
Subjects: medicine.medical_specialty, Movement disorders, Parkinson's disease, Computer science, Psychological intervention, 02 engineering and technology, Disease, unified parkinson disease rating scale (UPDRS) part III, lcsh:QA75.5-76.95, parkinson's disease (PD), Physical medicine and rehabilitation, Rating scale, 0202 electrical engineering, electronic engineering, information engineering, medicine, Motor skill, Balance (ability), General Environmental Science, i-PROGNOSIS, General Engineering, 020207 software engineering, medicine.disease, Gait, motor assessment tests, General Earth and Planetary Sciences, 020201 artificial intelligence & image processing, lcsh:Electronic computers. Computer science, medicine.symptom, motor skills decline
Abstract: Being the second most common neurodegenerative disease, Parkinson's disease (PD) can be symptomatically treated, although, unfortunately, it cannot be cured yet. Moreover, diagnosing and assessing PD patients is a complex process, requiring continuous monitoring. In this vein, the design, development, and validation of innovative assessment tools may be helpful in the management of patients with PD, in particular. Based on intelligent ICT interventions, the i-PROGNOSIS project intends to mitigate PD's specific symptoms, such as neurological movement disorders of gait, balance, coordination, and posture, already characterized in the early phase of the disease. From this perspective, an innovative iPrognosis motor assessment tool is presented here, taking into consideration the Unified Parkinson Disease Rating Scale (UPDRS) Part III motor skills testing items, for evaluating the motor skills status. The efficiency of the proposed Assessment Tests to reflect the motor skills status, similarly to the UPDRS Part III items, was validated via 27 participants (18 males; mean age = 62 years, SD = 10.36 years; range, 43–79 years) with early (n = 10) and moderate (n = 17) PD who performed the Assessment Tests. Features from the latter were then correlated with the corresponding clinically assessed UPDRS Part III items, and statistically significant negative correlations (range, −0.364 to −0.802) were identified between the median values of the Assessment Tests and the UPDRS Part III items. In this vein, the iPrognosis Assessment Tests were integrated within the personalized interventions of the i-PROGNOSIS project, providing alternative means of assessing their effect on the PD patient's motor skills enhancement. The promising results presented here elaborate on the concept of using ICT-based assessment means to achieve comparable outcomes with the clinical standards in motor skills assessment.
Published: 2020
Full Text: View/download PDF

12. Session details: AGENT: The 2nd International Workshop on MultimodAl SiGnal Sensing/Analysis, Innovative Interactive Environments and PersoNalized Behavioral Modeling for Improving QualiTy-of-Life

Author: Leontios J. Hadjileontiadis, Petros Daras, Kosmas Dimitropoulos, Federico Alvarez, and Nicholas Vretos
Subjects: Quality of life (healthcare), Computer science, Human–computer interaction, SIGNAL (programming language), Session (computer science), Behavioral modeling
Published: 2020
Full Text: View/download PDF

13. Innovative interventions for Parkinson's disease patients using iPrognosis games

Author: Evdokimos I. Konstantinidis, Vasileios Charisis, Leontios J. Hadjileontiadis, Theodore Savvidis, Sofia B. Dias, Sevasti Bostantzopoulou, Vicky Zilidou, José Alves Diniz, Panagiotis D. Bamidis, Kosmas Dimitropoulos, Ioannis Ioakeimidis, Athina Grammatikopoulou, Dhaval Trivedi, Nikos Grammalidis, Lisa Klingelhoefer, Stelios Hadjidimitriou, Zoe Katsarou, and Michael Stadtschnitzer
Subjects: medicine.medical_specialty, Parkinson's disease, 020205 medical informatics, Computer science, Psychological intervention, 02 engineering and technology, Disease, medicine.disease, 3. Good health, Objective assessment, 03 medical and health sciences, 0302 clinical medicine, Quality of life (healthcare), Intervention (counseling), 0202 electrical engineering, electronic engineering, information engineering, medicine, Physical therapy, 030212 general & internal medicine
Abstract: Parkinson's disease (PD) is a chronic and progressive neurodegenerative disease that affects ~7 million people worldwide, without any cure to date; however, it can be symptomatically treated. In this vein, innovative technologies can be used for the objective assessment of clinical symptoms and to provide supportive therapies at home. The present work explores the processes and the outcomes of the i-PROGNOSIS (www.i-prognosis.eu) intervention deployment in three PD clinical centres (Greece, UK, and Germany). For that purpose, 36 PD patients were recruited to voluntarily participate in the i-PROGNOSIS feasibility study, spread across the three different countries. The PD patients interacted with the i-PROGNOSIS system for up-to-three months, mainly within the clinical environment, using the provided iPrognosis Games in dedicated gaming stations that were setup in the corresponding clinical centres. Overall, the results show that the iPrognosis Games were positively evaluated by medical experts. Moreover, based on the collected feedback, the iPrognosis Games have achieved their main goals of providing an innovative, objective and usable system for the monitoring of early PD (motor and non-motor) symptomatology, by providing tools for complementing existing clinical interventions for the improvement of PD patients' quality of life.
Published: 2020
Full Text: View/download PDF

14. Developing Accessibility Multimedia Services: The Case of EasyTV

Author: Pilar Orero, Emilio Ferreiro Lago, Chrysostomos Bourlis, Francesc Mas, Thanassis Kalvourtzis, Fabrizio Giacomelli, Jordi Mata, Sara Rovira-Esteva, Konstantinos Votis, Kiriakos Stefanidis, Pablo Calleja, Giuseppe Vitolo, Dimitrios Konstantinidis, Nikolaos Kaklanis, Salim Gannoum, Jordi Fabregat, Alvaro Llorente, Nicolamaria Manes, Petros Daras, María Poveda-Villalón, Giuliano Frittelli, Francisco José Gallego Moreno, Silvia Uribe, Pasquale Andriani, Giuseppa Caruso, Stavros Skourtis, Federico Alvarez, and Kosmas Dimitropoulos
Subjects: 030506 rehabilitation, Multimedia, Computer science, End user, 020207 software engineering, 02 engineering and technology, Sign language, Semantics, computer.software_genre, Personalization, 03 medical and health sciences, Digital society, Work (electrical), Order (business), 0202 electrical engineering, electronic engineering, information engineering, Mainstream, 0305 other medical science, computer
Abstract: People with various degrees of disabilities (e.g., visually or hearing impaired) often find it difficult to access mainstream products and services and thus they are excluded from enjoying audio-visual services on an equal basis as people without disabilities. These people feel marginalized in today's digital society and they are unable to reach their maximum potential personally, professionally and socially. EasyTV aims to address these issues by providing numerous services that not only ease the access of people with disabilities to multimedia services, but also enhance interaction with the media through a multi-language approach that adapts content based on user's preferences. In this work, the main EasyTV services are described in detail. Furthermore, an evaluation of the main services by end users is presented and discussed in order to highlight the importance of offering accessibility services to people with disabilities.
Published: 2020
Full Text: View/download PDF

15. Cross-modal Variational Alignment of Latent Spaces

Author: Petros Daras, Theocharis Chatzis, Vassilios Solachidis, Thomas Theodoridis, and Kosmas Dimitropoulos
Subjects: Modality (human–computer interaction), business.industry, Computer science, 020207 software engineering, Pattern recognition, 02 engineering and technology, Space (mathematics), Translation (geometry), Autoencoder, Image (mathematics), Distribution (mathematics), Modal, Cross-modal learning, Variational-autoencoder, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Pose
Abstract: In this paper, we propose a novel cross-modal variational alignment method in order to process and relate information across different modalities. The proposed approach consists of two variational autoencoder (VAE) networks which generate and model the latent space of eachmodality. The first network is a multi-modal variational-autoencoder that maps directly one modality to the other, while the second one is a single-modal variational autoencoder. In order to associate the two spaces, we apply variational alignment, which acts as a translation mechanismthat projects the latent space of the first VAE onto the one ofthe single-modal VAE through an intermediate distribution.Experimental results on four well-known datasets, covering two different application domains (food image analysisand 3D hand pose estimation), show the generality of theproposed method and its superiority against a number of state-of-the-art approaches.
Published: 2020
Full Text: View/download PDF

16. Neural Network Compression Using Higher-Order Statistics and Auxiliary Reconstruction Losses

Author: Georgios Th. Papadopoulos, Petros Daras, Kosmas Dimitropoulos, and Christos Chatzikonstantinou
Subjects: Network architecture, Artificial neural network, Computational complexity theory, Computer science, business.industry, Gaussian, Higher-order statistics, 02 engineering and technology, Filter (signal processing), symbols.namesake, Softmax function, 0202 electrical engineering, electronic engineering, information engineering, symbols, 020201 artificial intelligence & image processing, Pruning (decision trees), Artificial intelligence, business, Algorithm
Abstract: In this paper, the problem of pruning and compressing the weights of various layers of deep neural networks is investigated. The proposed method aims to remove redundant filters from the network to reduce computational complexity and storage requirements, while improving the performance of the original network. More specifically, a novel filter selection criterion is introduced based on the fact that filters whose weights follow a Gaussian distribution correspond to hidden units that do not capture important aspects of data. To this end, Higher Order Statistics (HOS) are used and filters with low cumulant values that do not deviate significantly from Gaussian distribution are identified and removed from the network. In addition, a novel pruning strategy is proposed aiming to decide on the pruning ratio of each layer using the Shapiro-Wilk normality test. The use of auxiliary MSE losses (intermediate and after the softmax layer) during the fine-tuning phase further improves the overall performance of the compressed network. Extensive experiments with different network architectures and comparison with state-of-the-art approaches on well-known public datasets, such as CIFAR-10, CIFAR-100 and ILSCVR-12, demonstrate the great potential of the proposed approach.
Published: 2020
Full Text: View/download PDF

17. 3D Technologies and Applications in Sign Language

Author: Petros Daras, Dimitrios Konstantinidis, Athanasios Kalvourtzis, Kiriakos Stefanidis, and Kosmas Dimitropoulos
Subjects: Computer science, 05 social sciences, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, 0501 psychology and cognitive sciences, 02 engineering and technology, Sign language, 10. No inequality, 050107 human factors, Linguistics
Abstract: Millions of people suffering from partial or complete hearing loss use variants of sign language to communicate with each other or hearing people in their everyday life. Thus, it is imperative to develop systems to assist these people by removing the barriers that affect their social inclusion. These systems should aim towards capturing sign language in an accurate way, classifying sign language to natural words and representing sign language by having avatars or synthesized videos execute the exact same moves that convey a meaning in the sign language. This chapter reviews current state-of-the-art approaches that attempt to solve sign language recognition and representation and analyzes the challenges they face. Furthermore, this chapter presents a novel AI-based solution to the problem of robust sign language capturing and representation, as well as a solution to the unavailability of annotated sign language datasets before limitations and directions for future work are discussed.
Published: 2020
Full Text: View/download PDF

18. Wood species recognition through multidimensional texture analysis

Author: Kosmas Dimitropoulos, Panagiotis Lefakis, Ioannis Barboutis, Panagiotis Barmpoutis, and Nikos Grammalidis
Subjects: 0106 biological sciences, Computer science, business.industry, Small number, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Information and Computer Science, Forestry, Pattern recognition, 02 engineering and technology, Horticulture, 01 natural sciences, GeneralLiterature_MISCELLANEOUS, Computer Science Applications, Linear dynamical system, Support vector machine, Classification rate, 010608 biotechnology, Histogram, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Agronomy and Crop Science, Classifier (UML), ComputingMethodologies_COMPUTERGRAPHICS
Abstract: Wood recognition is a crucial task for wood sciences and industries, since it leads to the identification of the anatomical features and physical properties of wood. Traditionally, the recognition process relies almost exclusively on human experts, who are based on various characteristics of wood, such as color, structure and texture. However, there are numerous types of wood species in the nature that are difficult to be identified even by experienced scientists. Towards this end, in this paper we propose a novel approach for automated wood species recognition through multidimensional texture analysis. By taking advantage of the fact that static wood images contain periodic spatially-evolving characteristics, we introduce a new spatial descriptor considering each wood image as a collection of multidimensional signals. More specifically, the proposed methodology enables the representation of wood images as concatenated histograms of higher order linear dynamical systems produced by vertical and horizontal image patches. The final classification of images, i.e., histogram representations, into wood species, is performed using a Support Vector Machines (SVM) classifier. For the evaluation of the proposed method, a dataset, namely “WOOD-AUTH”, consisting of more than 4200 wood images (from cross, radial and tangential sections of normal wood structure) of twelve common wood species existing in Greek territory, was created. Experimental results presented in this paper show the great potential of the proposed methodology, which, despite a small number of misclassification cases with regards to both anatomically similar and different species, outperforms a number of state of the art approaches, yielding a classification rate of 91.47% in wood cross sections.
Published: 2018
Full Text: View/download PDF

19. Higher Order Linear Dynamical Systems for Smoke Detection in Video Surveillance Applications

Author: Panagiotis Barmpoutis, Nikos Grammalidis, and Kosmas Dimitropoulos
Subjects: business.industry, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Particle swarm optimization, 020101 civil engineering, 02 engineering and technology, 0201 civil engineering, Linear dynamical system, Statistical classification, Wavelet, Histogram, 0202 electrical engineering, electronic engineering, information engineering, Media Technology, 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, Electrical and Electronic Engineering, Focus (optics), Representation (mathematics), business
Abstract: In this paper, we consider the problem of multi-dimensional dynamic texture analysis, and we introduce a new higher order linear dynamical system (h-LDS) descriptor. The proposed h-LDS descriptor is based on the higher order decomposition of the multidimensional image data and enables the analysis of dynamic textures by using information from various image elements. In addition, we propose a methodology for its application to video-based early warning systems that focus on smoke identification. More specifically, the proposed methodology enables the representation of video subsequences as histograms of h-LDS descriptors produced by the smoke candidate image patches in each subsequence. Finally, to further improve the classification accuracy, we propose the combination of multidimensional dynamic texture analysis with the spatiotemporal modeling of smoke by using a particle swarm optimization approach. The ability of the h-LDS to analyze the dynamic texture information is evaluated through a multivariate comparison against the standard LDS descriptor. The experimental results that use two video datasets have shown the great potential of the proposed smoke detection method.
Published: 2017
Full Text: View/download PDF

20. A Deep Network for Automatic Video-Based Food Bite Detection

Author: Petros Daras, Dimitrios Konstantinidis, Billy Langlet, Ioannis Ioakimidis, and Kosmas Dimitropoulos
Subjects: 0303 health sciences, 030309 nutrition & dietetics, business.industry, Computer science, Deep learning, 030209 endocrinology & metabolism, Healthy eating, Bite detection, Video analysis, Motion (physics), Additional research, Task (project management), 03 medical and health sciences, 0302 clinical medicine, Kitchen utensils, Human–computer interaction, Motion features, Artificial intelligence, Eating habits, business, Video based
Abstract: Past research has now provided compelling evidence pointing towardscorrelations among individual eating styles and the development of (un)healthyeating patterns, obesity and other medical conditions. In this setting, an automatic,non-invasive food bite detection system can be a really useful tool in the handsof nutritionists, dietary experts and medical doctors in order to explore real-lifeeating behaviors and dietary habits.Unfortunately, the automatic detection of foodbites can be challenging due to occlusions between hands and mouth, use of differentkitchen utensils and personalized eating habits. On the other hand, althoughaccurate, manual bite detection is time-consuming for the annotator, making itinfeasible for large scale experimental deployments or real-life settings. To thisregard, we propose a novel deep learning methodology that relies solely on humanbody and face motion data extracted from videos depicting people eating meals.The purpose is to develop a system that can accurately, robustly and automaticallyidentify food bite instances,with the long-term goal to complement or even replacemanual bite-annotation protocols currently in use. The experimental results on alarge dataset reveal the superb classification performance of the proposed methodologyon the task of bite detection and paves the way for additional research onautomatic bite detection systems.
Published: 2019

21. Session details: AGENT workshop: Multimodal signal sensing and analysis for assistive environments for improving quality-of-life workshop

Author: Kosmas Dimitropoulos
Subjects: Quality of life (healthcare), Multimedia, Computer science, SIGNAL (programming language), Session (computer science), computer.software_genre, computer
Published: 2019
Full Text: View/download PDF

22. Motion Analysis of Parkinson Diseased Patients using a Video Game Approach

Author: Nikos Grammalidis, Kosmas Dimitropoulos, Athina Grammatikopoulou, Zoe Katsarou, and Sevasti Bostantjopoulou
Subjects: 030506 rehabilitation, Motion analysis, medicine.medical_specialty, Rehabilitation, Computer science, medicine.medical_treatment, Neurological disorder, Disease, medicine.disease, 03 medical and health sciences, 0302 clinical medicine, Physical medicine and rehabilitation, Rehabilitation exercise, medicine, 0305 other medical science, Video game, 030217 neurology & neurosurgery, Motor score, Motor skill
Abstract: Parkinson’s disease (PD) is a progressive neurological disorder and the second most common age-related neurodegenerative disease after Alzheimer's disease. The primary symptoms of the disease are associated with the loss of motor skills affecting patients’ movement and coordination and disrupting their daily life. Unfortunately, such motor symptoms cannot be fully relieved by therapeutic options. On the other hand, studies have shown that regular training and exercising can prove neuroprotective in PD patients helping them maintain independent longer. Based on recent studies stating that computer-based physical therapy games can be used as an option for facilitating PD rehabilitation exercise programs, we present the development of a body motion based videogame, using the Kinect sensor, targeted for PD patients. We tested twelve patients with advanced forms of PD motor symptoms (UPDRS motor score>20) and six initial stage PD patients (UPDRS motor score
Published: 2019
Full Text: View/download PDF

23. Fire Detection from Images Using Faster R-CNN and Multidimensional Texture Analysis

Author: Kosmas Dimitropoulos, Kyriaki Kaza, Nikos Grammalidis, and Panagiotis Barmpoutis
Subjects: business.industry, Fire detection, Computer science, Deep learning, 020206 networking & telecommunications, Pattern recognition, 02 engineering and technology, Image (mathematics), Set (abstract data type), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Representation (mathematics)
Abstract: In this paper, we propose a novel image-based fire detection approach, which combines the power of modern deep learning networks with multidimensional texture analysis based on higher-order linear dynamical systems. The candidate fire regions are identified by a Faster R-CNN network trained for the task of fire detection using a set of annotated images containing actual fire as well as selected negatives. The candidate fire regions are projected to a Grassmannian space and each image is represented as a cloud of points on the manifold. Finally, a vector representation approach is applied aiming to aggregate the Grassmannian points based on a locality criterion on the manifold. For evaluating the performance of the proposed methodology, we performed experiments with annotated images of two different databases containing fire and fire-coloured objects. Experimental results demonstrate the potential of the proposed methodology compared to other state of the art approaches.
Published: 2019
Full Text: View/download PDF

24. Automated detection and classification of nuclei in PAX5 and H&E-stained tissue sections of follicular lymphoma

Author: Kosmas Dimitropoulos, Ioannis Kostopoulos, Panagiotis Barmpoutis, Triantafyllia Koletsa, and Nikos Grammalidis
Subjects: business.industry, Computer science, Speech recognition, 0206 medical engineering, Follicular lymphoma, Pattern recognition, 02 engineering and technology, medicine.disease, 020601 biomedical engineering, Stain, Tissue sections, hemic and lymphatic diseases, Cut, Signal Processing, 0202 electrical engineering, electronic engineering, information engineering, medicine, Centroblasts, 020201 artificial intelligence & image processing, Segmentation, Artificial intelligence, Electrical and Electronic Engineering, Cluster analysis, business, Grading (tumors)
Abstract: In this paper, we propose a novel framework for the detection and classification of centroblasts (CB) in follicular lymphoma (FL) tissue samples stained with PAX5 and H&E stains and sliced at 1 $$\upmu $$ m thickness level. By employing PAX5 immunohistochemistry, we facilitate the segmentation of nuclei, while the use of H&E stain enables us to extract textural information related to histological characteristics used by pathologists in the diagnosis of FL grading. For the segmentation of nuclei in PAX5-stained images, we initially apply an energy minimization technique based on graph cuts and then we propose a novel algorithm for the separation of overlapped nuclei inspired by the clustering of large-scale visual vocabularies. The morphological characteristics of nuclei extracted from PAX5-stained images are combined with a number of textural characteristics identified in H&E images through a Bayesian network classifier, which aims to model pathologists’ knowledge used in FL grading. Experimental results have already shown the great potential of the proposed methodology providing an average F-score of $$94.56\,\%$$ .
Published: 2016
Full Text: View/download PDF

25. A Deep Learning Approach For Analyzing Video And Skeletal Features In Sign Language Recognition

Author: Kosmas Dimitropoulos, Dimitrios Konstantinidis, and Petros Daras
Subjects: Computer science, business.industry, Deep learning, Optical flow, 020206 networking & telecommunications, 02 engineering and technology, Sign language, computer.software_genre, Task (project management), TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES, Face (geometry), 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Word (computer architecture), Natural language processing, Meaning (linguistics)
Abstract: Sign language recognition (SLR) refers to theclassification of signs with a specific meaning performed by thedeaf and/or hearing-impaired people in their everydaycommunication. In this work, we propose a deep learningbased framework, in which we examine and analyze thecontribution of video (image and optical flow) and skeletal(body, hand and face) features in the challenging task of isolated SLR, in which each signed video corresponds to asingle word. Moreover, we employ various fusion schemes inorder to identify the optimal way to combine the informationobtained from the various feature representations and proposea robust SLR methodology. Our experimentation on two signlanguage datasets and the comparison with state-of-the-artSLR methods reveals the superiority of optimally combiningskeletal and video features for SLR tasks.
Published: 2018
Full Text: View/download PDF

26. Skeleton-Based Action Recognition Based On Deep Learning And Grassmannian Pyramids

Author: Dimitrios Konstantinidis, Kosmas Dimitropoulos, and Petros Daras
Subjects: Artificial neural network, Computer science, business.industry, Deep learning, Feature extraction, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020207 software engineering, Pattern recognition, 02 engineering and technology, Skeleton (category theory), Manifold, Robustness (computer science), Grassmannian, Histogram, Pyramid, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Pyramid (image processing), Artificial intelligence, business
Abstract: The accuracy of modern depth sensors, the robustnessof skeletal data to illumination variations and the superbperformance of deep learning techniques on several classificationtasks have sparkled a renewed interest towards skeleton basedaction recognition. In this paper, we propose a four streamdeep neural network based on two types of spatialskeletal features and their corresponding temporal representationsextracted by the novel Grassmannian Pyramid Descriptor(GPD). The performance of the proposed action recognitionmethodology is further enhanced by the use of a meta-learnerthat takes advantage of the meta knowledge extracted fromthe processing of the different features. Experiments on severalwell-known action recognition datasets reveal that ourproposed methodology outperforms a number of state-of-the-artskeleton-based action recognition methods.
Published: 2018
Full Text: View/download PDF

27. Sign Language Recognition Based On Hand And Body Skeletal Data

Author: Dimitrios Konstantinidis, Kosmas Dimitropoulos, and Petros Daras
Subjects: Computer science, business.industry, Deep learning, Speech recognition, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020206 networking & telecommunications, 02 engineering and technology, Sign language, Field (computer science), Discriminative model, restrict, Gesture recognition, 0202 electrical engineering, electronic engineering, information engineering, RGB color model, 020201 artificial intelligence & image processing, State (computer science), Artificial intelligence, business
Abstract: Sign language recognition (SLR) is a challenging, but highly important research field for several computer vision systems that attempt to facilitate the communication among the deaf and hearing impaired people. In this work, we propose an accurate and robust deep learning-based methodology for sign language recognition from video sequences. Our novel method relies on hand and body skeletal features extracted from RGB videos and, therefore, it acquires highly discriminative for gesture recognition skeletal data without the need for any additional equipment, such as data gloves, that may restrict signer’s movements. Experimentation on a large publicly available sign language dataset reveals the superiority of our methodology with respect to other state of the art approaches relying solely on RGB features.
Published: 2018
Full Text: View/download PDF

28. ProsocialLearn: A Prosocial Games Marketplace

Author: Simon Crowle, Konstantinos C. Apostolakis, Lee Middleton, Francesco D'Andria, Kosmas Dimitropoulos, Jose Miguel Garrido, Petros Daras, Michael Boniface, and Stefano Modafferi
Subjects: business.industry, Computer science, 4. Education, 05 social sciences, Internet privacy, ComputingMilieux_PERSONALCOMPUTING, 050109 social psychology, Prosocial behavior, Technology transfer, Production (economics), 0501 psychology and cognitive sciences, 10. No inequality, Adaptation (computer science), Game Developer, business
Abstract: ProsocialLearn is a digital pro-social games platform. The ProsocialLearn project has delivered a series of disruptive innovations for the production and distribution of pro-social digital games that engage children (7-10 years old). Additionally it has stimulated technology transfer from the games industry to the educational sector. ProsocialLearn fosters the creation of a new market for digital games aimed at increasing social inclusion and academic performance, as well as a distribution channel to deliver prosocial games to children and teachers in European schools. Furthermore, it provides a proven pro-social methodology to design digital games. The ProsocialLearn platform makes available a series of APIs, which game developers use to integrate many of the ProsocialLearn functions into games, i.e. emotion and engagement monitoring, in-game achievements, games adaptation based on Prosocial Learning Objectives (PLOs), and micro-transactions.
Published: 2018
Full Text: View/download PDF

29. Mixed Reality, Gamified Presence, and Storytelling for Virtual Museums

Author: Victor M. López-Menchero, Athina Grammatikopoulou, Michael Tsioumas, George Papagiannakis, Stella Sylaiou, Fotis Liarokapis, N. Grammalidis, Alan Chalmers, Constantine Stephanidis, Nadia Magnenat-Thalmann, Kosmas Dimitropoulos, Nikolaos Partarakis, George Margetis, Maria Pateraki, Efstratios Geronikolakis, Giannis Drossis, and Martha Vassiliadi
Subjects: Computer science, Metaphor, media_common.quotation_subject, Sense of presence, 020207 software engineering, 02 engineering and technology, Virtual reality, Mixed reality, Feeling, Human–computer interaction, 0202 electrical engineering, electronic engineering, information engineering, Key (cryptography), 020201 artificial intelligence & image processing, Augmented reality, media_common, Storytelling
Abstract: Mixed Reality as display technology, gamification as motivational element, storytelling as interaction metaphor while maintaining the feeling of Presence, are identified as key elements for defining the next generation of virtual museums.
Published: 2018
Full Text: View/download PDF

30. A Multimodal Approach for the Safeguarding and Transmission of Intangible Cultural Heritage: The Case of i-Treasures

Author: Kosmas Dimitropoulos, Nikos Grammalidis, Francesca Pozzi, Anastasios Katos, Leontios J. Hadjileontiadis, Sotiris Manitsaris, Marius Cotescu, Athanasios Manitsaris, Filareti Tsalakanidou, Selami Ciftci, Lise Crevier-Buchman, Spiros Nikolopoulos, Vasileios Charisis, Stéphane Dupont, Bruce Denby, Ioannis Kompatsiaris, LPP - Laboratoire de Phonétique et Phonologie - UMR 7018 (LPP), Université Sorbonne Nouvelle - Paris 3-Centre National de la Recherche Scientifique (CNRS), Centre de Robotique (CAOR), MINES ParisTech - École nationale supérieure des mines de Paris, Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL), Université Pierre et Marie Curie - Paris 6 (UPMC), Multitel Asbl, Centre for Research & Technology (.) (CERTH), Institute for Educational Technology = The Istituto per le Tecnologie Didattiche (ITD), Multimedia Technologies and Computer Graphics Laboratory, and University of Macedonia [Thessaloniki] (UoM)
Subjects: Intangible cultural heritage, Dance, Computer Networks and Communications, Computer science, 02 engineering and technology, Safeguarding, Electronic mail, [SPI]Engineering Sciences [physics], Semantic Analysis, Artificial Intelligence, Multimodal Analysis, Cultural diversity, 0202 electrical engineering, electronic engineering, information engineering, [SHS.LANGUE]Humanities and Social Sciences/Linguistics, Digitization, ComputingMilieux_MISCELLANEOUS, e-learning, Multi-sensor Technology, 020206 networking & telecommunications, Cultural heritage, Aesthetics, Intangible Cultural Heritage, 020201 artificial intelligence & image processing, [SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing, [MATH.APPL]Mathematics [math]/domain_math.appl, Gesture
Abstract: Intangible Cultural Heritage (ICH) creations include, amongst other, music, dance, singing, theatre, human skills, and craftsmanship. These cultural expressions are usually transmitted orally and/or using gestures and are modified over a period of time, through a process of collective recreation. As the world becomes more interconnected and many different cultures come into contact, local communities run the risk of losing important elements of their ICH, while young people find it difficult to maintain the connection with the cultural heritage treasured by their elders. In this paper, we present a novel holistic approach for the safeguarding and transmission of ICH that goes beyond the mere digitization of ICH content. Based on multisensory technology for the capturing of ICH, the proposed approach enables the generation of completely novel cultural content. High-level semantics are extracted from the acquired data, enabling researchers to identify possible implicit or hidden correlations between different ICH expressions or interpretation styles and study the evolution of a specific ICH. These data, coupled with other cultural resources, are accessible through the i-Treasures Web-platform, which provides the means for supporting knowledge exchange between researchers as well as know-how transmission from ICH bearers to apprentices.
Published: 2018
Full Text: View/download PDF

31. LDS-Inspired Residual Networks

Author: Petros Daras, Kosmas Dimitropoulos, Dimitrios Ataloglou, Federico Alvarez, and Anastasios Dimou
Subjects: Contextual image classification, Artificial neural network, Computer science, Stochastic process, business.industry, Feature vector, Deep learning, 02 engineering and technology, Residual, Machine learning, computer.software_genre, Convolutional neural network, Residual neural network, Linear dynamical system, 0202 electrical engineering, electronic engineering, information engineering, Media Technology, 020201 artificial intelligence & image processing, Artificial intelligence, Electrical and Electronic Engineering, business, computer
Abstract: Residual networks (ResNets) have introduced a milestone for the deep learning community due to their outstanding performance in diverse applications. They enable efficient training of increasingly deep networks, reducing the training difficulty and error. The main intuition behind them is that, instead of mapping the input information, they are mapping a residual part of it. Since the original work, a lot of extensions have been proposed to improve information mapping. In this paper, a novel extension of the residual block is proposed inspired by linear dynamical systems (LDSs), called LDS-ResNet. Specifically, a new module is presented that improves mapping of residual information by transforming it in a hidden state and then mapping it back to the desired feature space using convolutional layers. The proposed module is utilized to construct multi-branch residual blocks for convolutional neural networks. An exploration of possible architectural choices is presented and evaluated. Experimental results show that LDS-ResNet outperforms the original ResNet in image classification and object detection tasks on public datasets such as CIFAR-10/100, ImageNet, VOC, and MOT2017. Moreover, its performance boost is complementary to other extensions of the original network such as pre-activation and bottleneck, as well as stochastic training and Squeeze-Excitation.
Published: 2018
Full Text: View/download PDF

32. Unsupervised Dance Motion Patterns Classification from Fused Skeletal Data Using Exemplar-Based HMMs

Author: Kosmas Dimitropoulos, Nikolaos V. Boulgouris, Nikos Grammalidis, and Alexandros Kitsikidis
Subjects: Sequence, Dance, business.industry, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Partition (database), Motion (physics), Displacement (vector), Frame (artificial intelligence), Computer vision, Artificial intelligence, business, Representation (mathematics), Hidden Markov model, ComputingMethodologies_COMPUTERGRAPHICS
Abstract: In this paper, we propose a method for the partitioning of dance sequences into multiple periods and motion patterns. The proposed method deploys features in the form of a skeletal representation of the dancer observed through time using multiple depth sensors. This representation is the fusion of skeletal features captured using multiple sensors and combined into a single, more robust, skeletal representation. Using this information, initially we partition the dance sequence into periods and subsequently into motion patterns. Partitioning into periods is based on observing the horizontal displacement of the dancer while each period is subsequently partitioned into motion patterns by using an exemplar-based Hidden Markov Model that classifies each frame into an exemplar representing a hidden state of the HMM. The proposed method was tested on dance sequences comprising multiple periods and motion patterns providing promising results.
Published: 2015
Full Text: View/download PDF

33. Spatio-Temporal Flame Modeling and Dynamic Texture Analysis for Automatic Video-Based Fire Detection

Author: Kosmas Dimitropoulos, Nikos Grammalidis, and Panagiotis Barmpoutis
Subjects: Background subtraction, Warning system, Fire detection, business.industry, Computer science, Feature extraction, Frame (networking), Pattern recognition, Robustness (computer science), Media Technology, Computer vision, Artificial intelligence, Electrical and Electronic Engineering, business
Abstract: Every year, a large number of wildfires all over the world burn forested lands, causing adverse ecological, economic, and social impacts. Beyond taking precautionary measures, early warning and immediate response are the only ways to avoid great losses. To this end, in this paper we propose a computer vision approach for fire-flame detection to be used by an early-warning fire monitoring system. Initially, candidate fire regions in a frame are defined using background subtraction and color analysis based on a nonparametric model. Subsequently, the fire behavior is modeled by employing various spatio-temporal features, such as color probability, flickering, spatial, and spatio-temporal energy, while dynamic texture analysis is applied in each candidate region using linear dynamical systems and a bag-of-systems approach. To increase the robustness of the algorithm, the spatio-temporal consistency energy of each candidate fire region is estimated by exploiting prior knowledge about the possible existence of fire in neighboring blocks from the current and previous video frames. As a final step, a two-class support vector machine classifier is used to classify the candidate regions. Experimental results have shown that the proposed method outperforms existing state-of-the-art algorithms.
Published: 2015
Full Text: View/download PDF

34. An adaptive framework for the creation of bodymotion-based games

Author: N. Grammalidis, Athina Grammatikopoulou, O. Sahbenderoglu, Kosmas Dimitropoulos, and Sohaib Laraba
Subjects: Game art design, Game mechanics, Game design, Multimedia, Game design document, Human–computer interaction, Game programming, Computer science, Game development tool, Level design, computer.software_genre, computer, Motion capture
Abstract: As of the early 20th century, a significant body of research has been published that shows how effective game-based learning and gamification techniques can be compared to other methods. However, creating games can be time consuming and usually demands a significant effort. Therefore, this paper focuses on the design and development of a novel framework for the rapid design of body-motion-based customizable game-like applications. This framework consists of two components: i) an interface that allows the user to design the game and capture the motion data, and ii) a customizable game for learning and training using off-the-shelf motion capture sensors like the Microsoft Kinect. The game is automatically configured based on the output of the game design interface. In order to evaluate the proposed system, a pilot use case for the Latin dance Salsa has been selected. Preliminary small-scaled experiments with latin dance students have shown the great potential of the proposed application.
Published: 2017
Full Text: View/download PDF

35. On Supporting Parkinson's Disease Patients: The i-Prognosis Personalized Game Suite Design Approach

Author: Ioannis Ioakeimidis, Vasileios Charisis, Sofia B. Dias, Stelios Hadjidimitriou, Panagiotis D. Bamidis, Leontios J. Hadjileontiadis, José Alves Diniz, Evdokimos I. Konstantinidis, Nikos Grammalidis, Kosmas Dimitropoulos, P. Fagerberg, and Michael Stadtschnitzer
Subjects: Personalized Game Suite, i-PROGNOSIS, 030506 rehabilitation, Parkinson's Disease, Process management, Pluralistic walkthrough, business.industry, Computer science, Suite, Psychological intervention, 03 medical and health sciences, 0302 clinical medicine, Human–computer interaction, Handwriting, Storyboards, Intervention (counseling), Health care, Design process, 0305 other medical science, business, Inclusion (education), 030217 neurology & neurosurgery
Abstract: The use of serious games in health care interventions sector has grown rapidly in the last years, however, there is still a gap in the understanding on how these types of interventions are used for the management of the Parkinson Disease (PD), in particular. Targeting intelligent early detection and intervention in PD area, the Personalized Game Suite (PGS) design process approach is presented as part of the H2020 i-PROGNOSIS project that introduces the integration of different serious games in a unified platform (i.e., ExerGames, DietaryGames, EmoGames, and Handwriting/Voice Games). From the methodological point of view, to facilitate the visualization of 14 game-scenarios, the system interface and the PD contexts, the storyboarding technique was adopted here. Overall, the realization of the PGS sets the basis for establishing a holistic framework that could aim at improving motor and non-motor symptoms, in order to inform health care providers and policy makers for its inclusion in routine management for PD.
Published: 2017
Full Text: View/download PDF

36. Grading of invasive breast carcinoma through Grassmannian VLAD encoding

Author: Kalliopi Patsiaoura, Christina Zioga, Nikos Grammalidis, Panagiotis Barmpoutis, Athanasios Kamas, and Kosmas Dimitropoulos
Subjects: Pathology, Medical Doctors, Computer science, Health Care Providers, lcsh:Medicine, Datasets as Topic, 02 engineering and technology, Systems Science, Topology, Linear dynamical system, 0302 clinical medicine, Grassmannian, Breast Tumors, 0202 electrical engineering, electronic engineering, information engineering, Medicine and Health Sciences, Medical Personnel, lcsh:Science, Manifolds, Multidisciplinary, Histological Techniques, Linear model, Manifold, Dynamical Systems, Professions, Oncology, 030220 oncology & carcinogenesis, Physical Sciences, 020201 artificial intelligence & image processing, Anatomy, Algorithms, Research Article, medicine.medical_specialty, Computer and Information Sciences, Histology, Dynamical systems theory, Imaging Techniques, Breast Neoplasms, Research and Analysis Methods, 03 medical and health sciences, Breast cancer, Diagnostic Medicine, Image Interpretation, Computer-Assisted, Breast Cancer, medicine, Cancer Detection and Diagnosis, Humans, Neoplasm Invasiveness, Grading (tumors), business.industry, lcsh:R, Biology and Life Sciences, Cancers and Neoplasms, Pattern recognition, medicine.disease, Pathologists, Health Care, People and Places, Linear Models, lcsh:Q, Population Groupings, Artificial intelligence, Neoplasm Grading, business, Mathematics
Abstract: In this paper we address the problem of automated grading of invasive breast carcinoma through the encoding of histological images as VLAD (Vector of Locally Aggregated Descriptors) representations on the Grassmann manifold. The proposed method considers each image as a set of multidimensional spatially-evolving signals that can be efficiently modeled through a higher-order linear dynamical systems analysis. Subsequently, each H&E (Hematoxylin and Eosin) stained breast cancer histological image is represented as a cloud of points on the Grassmann manifold, while a vector representation approach is applied aiming to aggregate the Grassmannian points based on a locality criterion on the manifold. To evaluate the efficiency of the proposed methodology, two datasets with different characteristics were used. More specifically, we created a new medium-sized dataset consisting of 300 annotated images (collected from 21 patients) of grades 1, 2 and 3, while we also provide experimental results using a large dataset, namely BreaKHis, containing 7,909 breast cancer histological images, collected from 82 patients, of both benign and malignant cases. Experimental results have shown that the proposed method outperforms a number of state of the art approaches providing average classification rates of 95.8% and 91.38% with our dataset and the BreaKHis dataset, respectively.
Published: 2017

37. Multimodal Student Engagement Recognition in Prosocial Games

Author: Konstantinos C. Apostolakis, Kosmas Dimitropoulos, Petros Daras, and Athanasios Psaltis
Subjects: Facial expression, Modalities, Artificial neural network, Computer science, 4. Education, 05 social sciences, Emotion recognition, engagement recognition, human computer interaction, serious games, student engagement, 050301 education, Student engagement, Cognition, 02 engineering and technology, Behavioral engagement, Prosocial behavior, Artificial Intelligence, Control and Systems Engineering, Human–computer interaction, 020204 information systems, Phenomenon, 0202 electrical engineering, electronic engineering, information engineering, Electrical and Electronic Engineering, 0503 education, Software
Abstract: In this paper we address the problem of recognizing student engagement in prosocial games by exploiting engagement cues from different input modalities. Since engagement is a multifaceted phenomenon with different dimensions, i.e., behavioral, cognitive and affective, we propose the modeling of student engagement using real-time data from both the students and the game. More specifically, we apply body motion and facial expression analysis to identify the affective state of students, while we extract features related to their cognitive and behavioral engagement based on the analysis of their interaction with the game. For the automatic recognition of engagement, we adopt a machine learning approach based on artificial neural networks, while for the annotation of the engagement data, we introduce a novel approach based on the use of games with different degrees of challenge in conjunction with a retrospective self-reporting method. To evaluate the proposed methodology, we conducted real-life experiments in four classes, in three primary schools, with 72 students and 144 gameplay recordings in total. Experimental results show the great potential of the proposed methodology, which improves the classification accuracy of the three distinct dimensions with a detection rate of 85%. A detailed analysis of the role of each component of the Game Engagement Questionnaire (GEQ), i.e., immersion, presence, flow and absorption, in the classification process is also presented in this paper.
Published: 2017
Full Text: View/download PDF

38. Using adaptive neuro-fuzzy inference systems for the detection of centroblasts in microscopic images of follicular lymphoma

Author: Triantafyllia Koletsa, Emmanouil Michail, Ioannis Kostopoulos, Kosmas Dimitropoulos, and Nikos Grammalidis
Subjects: Neuro-fuzzy, business.industry, Computer science, Follicular lymphoma, Inference, Pattern recognition, Classification scheme, medicine.disease, Mixture model, Tissue sections, Signal Processing, medicine, Centroblasts, Computer vision, Artificial intelligence, Electrical and Electronic Engineering, Detection rate, business
Abstract: In this paper, a complete methodology for automatic detection of centroblasts (CBs) in microscopic images acquired from tissue biopsies of follicular lymphoma is presented. In the proposed method, tissue sections are sliced at a low thickness level, around 1–1.5 $$\upmu $$ m, which provides a more detailed depiction of the nuclei and other textural information. Initially, images are segmented into their basic cytological components, i.e., blood cells, nuclei and extra-cellular material, and then a novel touching-cell splitting algorithm is applied using a Gaussian mixture model and expectation–maximization algorithm. Additionally, a morphological and textural analysis of CBs is applied in order to extract various features related to their nuclei, nucleoli and cytoplasm. In the final step, a novel classification scheme is proposed based on adaptive neuro-fuzzy inference systems to classify the candidate cells. The methodology yielded promising results with an average detection rate of 90.35 %.
Published: 2014
Full Text: View/download PDF

39. Video fire detection – Review

Author: Osman Gunay, Nikos Grammalidis, Steven Verstockt, B. Ugur Toreyin, Kosmas Dimitropoulos, Benedict Gouverneur, Y. Hakan Habiboglu, and A. Enis Cetin
Subjects: Cover (telecommunications), Computer science, Computer Vision, Flame detection, Real-time computing, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Poison control, 020101 civil engineering, 02 engineering and technology, Computer security, computer.software_genre, 0201 civil engineering, Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, Electrical and Electronic Engineering, Video based, Point (typography), Fire detection, Applied Mathematics, Video processing, Video Based Fire Detection, Wavelets Covariance Matrices, Computational Theory and Mathematics, Decision Fusion, Signal Processing, 020201 artificial intelligence & image processing, Computer Vision and Pattern Recognition, Statistics, Probability and Uncertainty, computer, Smoke Detection, Fire behavior
Abstract: Cataloged from PDF version of article. This is a review article describing the recent developments in Video based Fire Detection (VFD). Video surveillance cameras and computer vision methods are widely used in many security applications. It is also possible to use security cameras and special purpose infrared surveillance cameras for fire detection. This requires intelligent video processing techniques for detection and analysis of uncontrolled fire behavior. VFD may help reduce the detection time compared to the currently available sensors in both indoors and outdoors because cameras can monitor “volumes” and do not have transport delay that the traditional “point” sensors suffer from. It is possible to cover an area of 100 km2 using a single pan-tiltzoom camera placed on a hilltop for wildfire detection. Another benefit of the VFD systems is that they can provide crucial information about the size and growth of the fire, direction of smoke propagation. © 2013 Elsevier Inc. All rights reserved
Published: 2013
Full Text: View/download PDF

40. Multimodal affective state recognition in serious games applications

Author: Kiriakos Stefanidis, Kyriaki Kaza, Konstantinos C. Apostolakis, Spyridon Thermos, Kosmas Dimitropoulos, Athanasios Psaltis, and Petros Daras
Subjects: Motion analysis, Facial expression, Modalities, Artificial neural network, Computer science, business.industry, Feature vector, Feature extraction, 020207 software engineering, 02 engineering and technology, Machine learning, computer.software_genre, emotion recognition, multimodal fusion, serious games, Robustness (computer science), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Gesture
Abstract: A challenging research issue, which has recently attracted a lot of attention, is the incorporation of emotion recognition technology in serious games applications, in order to improve the quality of interaction and enhance the gaming experience. To this end, in this paper, we present an emotion recognition methodology that utilizes information extracted from multimodal fusion analysis to identify the affective state of players during gameplay scenarios. More specifically, two monomodal classifiers have been designed for extracting affective state information based on facial expression and body motion analysis. For the combination of different modalities a deep model is proposed that is able to make a decision about player’s affective state, while also being robust in the absence of one information cue. In order to evaluate the performance of our methodology, a bimodal database was created using Microsoft’s Kinect sensor, containing feature vectors extracted from users' facial expressions and body gestures. The proposed method achieved higher recognition rate in comparison with mono-modal, as well as early-fusion algorithms. Our methodology outperforms all other classifiers, achieving an overall recognition rate of 98.3%.
Published: 2016
Full Text: View/download PDF

41. Body Motion Analysis for Emotion Recognition in Serious Games

Author: Athanasios Psaltis, Kiriakos Stefanidis, Spyridon Thermos, Kosmas Dimitropoulos, Kyriaki Kaza, Petros Daras, and Konstantinos C. Apostolakis
Subjects: Motion analysis, business.industry, Computer science, Deep learning, media_common.quotation_subject, Emotion classification, Speech recognition, 05 social sciences, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Anger, 050105 experimental psychology, Sadness, Surprise, Happiness, 0501 psychology and cognitive sciences, Computer vision, Artificial intelligence, business, Body motion analysis, 3D body movement features, Emotion recognition, RBM, Serious games, Classifier (UML), 050107 human factors, media_common
Abstract: In this paper, we present an emotion recognition methodology that utilizes information extracted from body motion analysis to assess affective state during gameplay scenarios. A set of kinematic and geometrical features are extracted from joint-oriented skeleton tracking and are fed to a deep learning network classifier. In order to evaluate the performance of our methodology, we created a dataset with Microsoft Kinect recordings of body motions expressing the five basic emotions (anger, happiness, fear, sadness and surprise) which are likely to appear in a gameplay scenario. In this five emotions recognition problem, our methodology outperformed all other classifiers, achieving an overall recognition rate of 93 %. Furthermore, we conducted a second series of experiments to perform a qualitative analysis of the features and assess the descriptive power of different groups of features.
Published: 2016

42. Extracting Dynamics from Multi-dimensional Time-evolving Data using a Bag of Higher-order Linear Dynamical Systems

Author: Panagiotis Barmpoutis, Nikos Grammalidis, Alexandors Kitsikidis, and Kosmas Dimitropoulos
Subjects: Computer science, Order (business), Dynamics (mechanics), Multi dimensional, Statistical physics, Linear dynamical system
Published: 2016
Full Text: View/download PDF

43. Classification of Nuclei in Follicular Lyphoma Tissue Sections Using Different Stains and Bayesian Networks

Author: Panagiotis Barmpoutis, Kosmas Dimitropoulos, Triantafyllia Koletsa, Nikos Grammalidis, and Ioannis Kostopoulos
Subjects: Eosin, Computer science, business.industry, 0206 medical engineering, Bayesian network, Pattern recognition, 02 engineering and technology, Centroblast, 020601 biomedical engineering, Stain, chemistry.chemical_compound, Tissue sections, chemistry, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Segmentation, Artificial intelligence, Cluster analysis, business, Grading (tumors)
Abstract: Automated centroblast (CB) detection in Follicular Lymphoma (FL) tissue samples has recently attracted significant research interest. Most of the methods described in the literature are based on the use of Hematoxilin and Eosin (H&E) stain. However, the automated detection of CBs from H&E stained images remains a challenging issue. To this end, this paper presents a novel approach which is based on the use of both PAX5 and H&E stains in tissue sections sliced at the thickness of 1μm. The goal of PAX5 is three-fold: to facilitate the segmentation of nuclei, to remove a number of follicular dendritic cells and finally to extract morphological characteristics of nuclei. Furthermore, the use of H&E stain enables us to extract textural information related to histological characteristics used by pathologists in diagnosis of FL grading. In our method we propose a novel algorithm for the separation of overlapped nuclei inspired by the clustering of large scale visual vocabularies. Finally, aiming to model pathologists’ knowledge used in FL grading, we use a Bayesian Network classifier to combine the morphological and textural characteristics. Experiments conducted on a dataset of ten pairs of PAX5 and H&E images demonstrate the potential of the proposed approach providing an average detection rate of 93.46%.
Published: 2016
Full Text: View/download PDF

44. 5D modelling: An efficient approach for creating spatiotemporal predictive 3d maps of large-scale cultural resources

Author: Chryssy Potsiou, Nikos Grammalidis, Kosmas Dimitropoulos, Nikolaos Doulamis, Anastasios Doulamis, Marinos Ioannides, E. K. Stathopoulou, Charalabos Ioannidis, and C. Chrysouli
Subjects: lcsh:Applied optics. Photonics, Computer science, Process (engineering), Interoperability, CityGML, Semantics, Machine learning, computer.software_genre, lcsh:Technology, 5D modelling, Bottleneck, 11. Sustainability, Visualization, business.industry, lcsh:T, lcsh:TA1501-1820, Electrical Engineering - Electronic Engineering - Information Engineering, Data science, Metadata, Cultural heritage, Cultural Heritage Model, lcsh:TA1-2040, Engineering and Technology, Artificial intelligence, business, lcsh:Engineering (General). Civil engineering (General), computer
Abstract: Outdoor large-scale cultural sites are mostly sensitive to environmental, natural and human made factors, implying an imminent need for a spatio-temporal assessment to identify regions of potential cultural interest (material degradation, structuring, conservation). On the other hand, in Cultural Heritage research quite different actors are involved (archaeologists, curators, conservators, simple users) each of diverse needs. All these statements advocate that a 5D modelling (3D geometry plus time plus levels of details) is ideally required for preservation and assessment of outdoor large scale cultural sites, which is currently implemented as a simple aggregation of 3D digital models at different time and levels of details. The main bottleneck of such an approach is its complexity, making 5D modelling impossible to be validated in real life conditions. In this paper, a cost effective and affordable framework for 5D modelling is proposed based on a spatial-temporal dependent aggregation of 3D digital models, by incorporating a predictive assessment procedure to indicate which regions (surfaces) of an object should be reconstructed at higher levels of details at next time instances and which at lower ones. In this way, dynamic change history maps are created, indicating spatial probabilities of regions needed further 3D modelling at forthcoming instances. Using these maps, predictive assessment can be made, that is, to localize surfaces within the objects where a high accuracy reconstruction process needs to be activated at the forthcoming time instances. The proposed 5D Digital Cultural Heritage Model (5D-DCHM) is implemented using open interoperable standards based on the CityGML framework, which also allows the description of additional semantic metadata information. Visualization aspects are also supported to allow easy manipulation, interaction and representation of the 5D-DCHM geometry and the respective semantic information. The open source 3DCityDB incorporating a PostgreSQL geo-database is used to manage and manipulate 3D data and their semantics.
Published: 2015

45. Experimental study of skeleton tracking abilities from microsoft kinect non-frontal views

Author: Tao Wei, Yuansong Qiao, Kosmas Dimitropoulos, Brian Lee, Nikos Grammalidis, and Alexandros Kitsikidis
Subjects: Match moving, business.industry, Computer science, Computer graphics (images), ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Operational complexity, Tracking system, Computer vision, Artificial intelligence, Tracking (particle physics), business, Motion capture
Abstract: Human body motion tracking is important for a large variety of applications e.g., entertainemnt, human computer interaction, creative intudstry etc. Μarker-based motion capture systems offer accurate motion tracking, however, they cannot be widely used due to their high cost and operational complexity. To this end, emerging depth sensing technologies, such as Microsoft Kinect, can be employed in many cases as an alternative low cost marker-less solution. Such sensors are mainly used to track the frontal view of the user, since it provides high tracking quality. To assess the potential tracking abilities of Kinect nonfrontal views, we compare the tracking results from four views (front, left side, right side, rear) with those from an off-the-self optical motion capture system. The experimental results show that the skeleton tracking accuracy of the rear view is close to the accuracy of the frontal view, although Kinect sensor cannot distinguish between the front and back sides of the user. In addition, the left and right side views can produce valuable tracking results for the body parts closer to each sensor.
Published: 2015
Full Text: View/download PDF

46. Multi-Entity Bayesian Networks for Knowledge-Driven Analysis of ICH Content

Author: Giannis Chantas, Spiros Nikolopoulos, Ioannis Kompatsiaris, Stella Douka, Nikos Grammalidis, Alexandros Kitsikidis, and Kosmas Dimitropoulos
Subjects: Intangible cultural heritage, Computer science, business.industry, Semantic analysis (machine learning), Multi entity bayesian networks, Bayesian network, Artificial intelligence, Probabilistic inference, business, Machine learning, computer.software_genre, computer
Abstract: In this paper we introduce Multi-Entity Bayesian Networks (MEBNs) as the means to combine first-order logic with probabilistic inference and facilitate the semantic analysis of Intangible Cultural Heritage (ICH) content. First, we mention the need to capture and maintain ICH manifestations for the safeguarding of cultural treasures. Second, we present the MEBN models and stress their key features that can be used as a powerful tool for the aforementioned cause. Third, we present the methodology followed to build a MEBN model for the analysis of a traditional dance. Finally, we compare the efficiency of our MEBN model with that of a simple Bayesian network and demonstrate its superiority in cases that demand for situation-specific treatment.
Published: 2015
Full Text: View/download PDF

47. A Game-like Application for Dance Learning Using a Natural Human Computer Interface

Author: Deniz Ugurca, Alexandros Kitsikidis, Stella Douka, Nikos Grammalidis, Kosmas Dimitropoulos, Filareti Tsalakanidou, Erdal Yilmaz, and Can Bayçay
Subjects: Focus (computing), Dynamic time warping, Intangible cultural heritage, Dance, Multimedia, Computer science, Sports science, computer.software_genre, Field (computer science), Game design, Human–computer interaction, TUTOR, computer, computer.programming_language
Abstract: Game-based learning and gamification techniques are recently becoming a popular trend in the field of Technology Enhanced Learning. In this paper, we mainly focus on the use of game design elements for the transmission of Intangible Cultural Heritage (ICH) knowledge and, especially, for the learning of traditional dances. More specifically, we present a 3D game environment that employs an enjoyable natural human computer interface, which is based on the fusion of multiple depth sensors data in order to capture the body movements of the user/learner. In addition, the system automatically assesses the learner’s performance by utilizing a combination of Dynamic Time Warping (DTW) with Fuzzy Inference System (FIS) approach and provides feedback in a form of a score as well as instructions from a virtual tutor in order to promote self-learning. As a pilot use case, a Greek traditional dance, namely Tsamiko, has been selected. Preliminary small-scaled experiments with students of the Department of Physical Education and Sports Science at Aristotle University of Thessaloniki have shown the great potential of the proposed application.
Published: 2015
Full Text: View/download PDF

48. Using Intelligent Digital Cameras to Monitor Aerodrome Surface Traffic

Author: E. Piazza, Kosmas Dimitropoulos, Nikos Grammalidis, M. Strintzis, N. Pavlidou, D. Simitopoulos, R. Heidger, C. Herrlich, and A. Gilbert
Subjects: Artificial Intelligence System, Computer Networks and Communications, Computer science, business.industry, Real-time computing, Air traffic control, Intelligent sensor, Artificial Intelligence, Aerodrome, Takeoff, Visibility, Telecommunications, business, Guidance system, Wireless sensor network
Abstract: An aircraft is most at risk for an accident when it's still on the ground - when taxiing before takeoff or after landing. This is because traffic throughput on the ground is limited by inadequate airport infrastructures and is often incapacitated during conditions of poor visibility. The INTERVUSE project, funded by the European Commission, aims to address these problems by developing a cost-effective artificial intelligence system based on a network of intelligent digital cameras. The system uses image-processing techniques to detect traffic and correlates and fuses data to generate a synthetic ground-situation display.
Published: 2005
Full Text: View/download PDF

49. Interactive mobile application, web technologies and fire simulations in the service of forest fire volunteers

Author: Kosmas Dimitropoulos, Nikos Grammalidis, and Theoharis Mexis
Subjects: Vegetation types, Service (systems architecture), Database, Civil defense, Computer science, Control (management), Context (language use), Computer security, computer.software_genre, Mobile device, computer, Flame-Sim
Abstract: In this paper we present a novel interactive platform for the operational support and training of the forest fire volunteer groups that are accredited through the mechanisms of Civil Protection Services. The proposed system is based on mobile and web technologies, fire simulation techniques and shortest-path routing calculation. The system consists of a mobile application (client) allowing the volunteers to report fire incidents and vegetation types of an area to a Control Center (server). The Control Center is also capable of organizing Training Scenarios, based on fire simulations and detailed safe routes, which are sent to the volunteers' mobile devices. The system was tested by Greek and Bulgarian volunteer firefighters in the context of the `OUTLAND' project. The architecture and the functionalities of the proposed system are presented accompanied by figures and indicative screenshots.
Published: 2014
Full Text: View/download PDF

50. Multi-sensor Technology and Fuzzy Logic for Dancer’s Motion Analysis and Performance Evaluation within a 3D Virtual Environment

Author: Alexandros Kitsikidis, Nikos Grammalidis, Stella Douka, Erdal Yilmaz, and Kosmas Dimitropoulos
Subjects: Motion analysis, Dance, business.industry, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, computer.software_genre, Fuzzy logic, Motion (physics), Multi sensor, Fuzzy inference system, Virtual machine, Artificial intelligence, business, computer, ComputingMethodologies_COMPUTERGRAPHICS, Avatar
Abstract: In this paper, we describe a novel methodology for dance learning and evaluation using multi-sensor and 3D gaming technology. The learners are captured during dancing, while an avatar visualizes their motion using fused input from multiple sensors. Motion analysis and fuzzy-logic are employed for the evaluation of the learners’ performance against the performance of an expert. Specifically, a two level Fuzzy Inference System is proposed which uses as input low level skeletal data and high level motion recognition probabilities for the evaluation of dancer’s performance. Tests with real dancers, both learners and experts, dancing Tsamiko, a very popular traditional Greek dance, are presented showing the potential of the proposed method.
Published: 2014
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

63 results on '"Kosmas Dimitropoulos"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources