Author: "Vaufreydaz, Dominique" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Vaufreydaz, Dominique"' showing total 148 results

Start Over Author "Vaufreydaz, Dominique"

148 results on '"Vaufreydaz, Dominique"'

1. Exploring VQ-VAE with Prosody Parameters for Speaker Anonymization

Author: Leang, Sotheara, Augusma, Anderson, Castelli, Eric, Letué, Frédérique, Sam, Sethserey, and Vaufreydaz, Dominique
Subjects: Computer Science - Computer Vision and Pattern Recognition, Electrical Engineering and Systems Science - Signal Processing
Abstract: Human speech conveys prosody, linguistic content, and speaker identity. This article investigates a novel speaker anonymization approach using an end-to-end network based on a Vector-Quantized Variational Auto-Encoder (VQ-VAE) to deal with these speech components. This approach is designed to disentangle these components to specifically target and modify the speaker identity while preserving the linguistic and emotionalcontent. To do so, three separate branches compute embeddings for content, prosody, and speaker identity respectively. During synthesis, taking these embeddings, the decoder of the proposed architecture is conditioned on both speaker and prosody information, allowing for capturing more nuanced emotional states and precise adjustments to speaker identification. Findings indicate that this method outperforms most baseline techniques in preserving emotional information. However, it exhibits more limited performance on other voice privacy tasks, emphasizing the need for further improvements.
Published: 2024

2. Towards LLM-Powered Ambient Sensor Based Multi-Person Human Activity Recognition

Author: Chen, Xi, Cumin, Julien, Ramparany, Fano, and Vaufreydaz, Dominique
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence
Abstract: Human Activity Recognition (HAR) is one of the central problems in fields such as healthcare, elderly care, and security at home. However, traditional HAR approaches face challenges including data scarcity, difficulties in model generalization, and the complexity of recognizing activities in multi-person scenarios. This paper proposes a system framework called LAHAR, based on large language models. Utilizing prompt engineering techniques, LAHAR addresses HAR in multi-person scenarios by enabling subject separation and action-level descriptions of events occurring in the environment. We validated our approach on the ARAS dataset, and the results demonstrate that LAHAR achieves comparable accuracy to the state-of-the-art method at higher resolutions and maintains robustness in multi-person scenarios.
Published: 2024

3. Generative Resident Separation and Multi-label Classification for Multi-person Activity Recognition

Author: Chen, Xi, Cumin, Julien, Ramparany, Fano, and Vaufreydaz, Dominique
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Signal Processing
Abstract: This paper presents two models to address the problem of multi-person activity recognition using ambient sensors in a home. The first model, Seq2Res, uses a sequence generation approach to separate sensor events from different residents. The second model, BiGRU+Q2L, uses a Query2Label multi-label classifier to predict multiple activities simultaneously. Performances of these models are compared to a state-of-the-art model in different experimental scenarios, using a state-of-the-art dataset of two residents in a home instrumented with ambient sensors. These results lead to a discussion on the advantages and drawbacks of resident separation and multi-label classification for multi-person activity recognition., Comment: Context and Activity Modeling and Recognition (CoMoReA) Workshop at IEEE International Conference on Pervasive Computing and Communications (PerCom 2024), Mar 2024, Biarritz, France
Published: 2024

4. Multimodal Group Emotion Recognition In-the-wild Using Privacy-Compliant Features

Author: Augusma, Anderson, Vaufreydaz, Dominique, and Letué, Frédérique
Subjects: Computer Science - Artificial Intelligence, Computer Science - Cryptography and Security, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: This paper explores privacy-compliant group-level emotion recognition ''in-the-wild'' within the EmotiW Challenge 2023. Group-level emotion recognition can be useful in many fields including social robotics, conversational agents, e-coaching and learning analytics. This research imposes itself using only global features avoiding individual ones, i.e. all features that can be used to identify or track people in videos (facial landmarks, body poses, audio diarization, etc.). The proposed multimodal model is composed of a video and an audio branches with a cross-attention between modalities. The video branch is based on a fine-tuned ViT architecture. The audio branch extracts Mel-spectrograms and feed them through CNN blocks into a transformer encoder. Our training paradigm includes a generated synthetic dataset to increase the sensitivity of our model on facial expression within the image in a data-driven way. The extensive experiments show the significance of our methodology. Our privacy-compliant proposal performs fairly on the EmotiW challenge, with 79.24% and 75.13% of accuracy respectively on validation and test set for the best models. Noticeably, our findings highlight that it is possible to reach this accuracy level with privacy-compliant features using only 5 frames uniformly distributed on the video.
Published: 2023
Full Text: View/download PDF

5. A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation

Author: Airale, Louis, Vaufreydaz, Dominique, and Alameda-Pineda, Xavier
Subjects: Computer Science - Graphics, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Animating still face images with deep generative models using a speech input signal is an active research topic and has seen important recent progress. However, much of the effort has been put into lip syncing and rendering quality while the generation of natural head motion, let alone the audio-visual correlation between head motion and speech, has often been neglected. In this work, we propose a multi-scale audio-visual synchrony loss and a multi-scale autoregressive GAN to better handle short and long-term correlation between speech and the dynamics of the head and lips. In particular, we train a stack of syncer models on multimodal input pyramids and use these models as guidance in a multi-scale generator network to produce audio-aligned motion unfolding over diverse time scales. Our generator operates in the facial landmark domain, which is a standard low-dimensional head representation. The experiments show significant improvements over the state of the art in head motion dynamics quality and in multi-scale audio-visual synchrony both in the landmark domain and in the image domain.
Published: 2023

6. Preliminary Study on SSCF-derived Polar Coordinate for ASR

Author: Leang, Sotheara, Castelli, Eric, Vaufreydaz, Dominique, and Sam, Sethserey
Subjects: Electrical Engineering and Systems Science - Audio and Speech Processing, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Sound, Electrical Engineering and Systems Science - Signal Processing
Abstract: The transition angles are defined to describe the vowel-to-vowel transitions in the acoustic space of the Spectral Subband Centroids, and the findings show that they are similar among speakers and speaking rates. In this paper, we propose to investigate the usage of polar coordinates in favor of angles to describe a speech signal by characterizing its acoustic trajectory and using them in Automatic Speech Recognition. According to the experimental results evaluated on the BRAF100 dataset, the polar coordinates achieved significantly higher accuracy than the angles in the mixed and cross-gender speech recognitions, demonstrating that these representations are superior at defining the acoustic trajectory of the speech signal. Furthermore, the accuracy was significantly improved when they were utilized with their first and second-order derivatives ($\Delta$, $\Delta$$\Delta$), especially in cross-female recognition. However, the results showed they were not much more gender-independent than the conventional Mel-frequency Cepstral Coefficients (MFCCs).
Published: 2022

7. Autoregressive GAN for Semantic Unconditional Head Motion Generation

Author: Airale, Louis, Alameda-Pineda, Xavier, Lathuilière, Stéphane, and Vaufreydaz, Dominique
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this work, we address the task of unconditional head motion generation to animate still human faces in a low-dimensional semantic space from a single reference pose. Different from traditional audio-conditioned talking head generation that seldom puts emphasis on realistic head motions, we devise a GAN-based architecture that learns to synthesize rich head motion sequences over long duration while maintaining low error accumulation levels.In particular, the autoregressive generation of incremental outputs ensures smooth trajectories, while a multi-scale discriminator on input pairs drives generation toward better handling of high- and low-frequency signals and less mode collapse.We experimentally demonstrate the relevance of the proposed method and show its superiority compared to models that attained state-of-the-art performances on similar tasks.
Published: 2022

8. TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut

Author: Wang, Yangtao, Shen, Xi, Yuan, Yuan, Du, Yuming, Li, Maomao, Hu, Shell Xu, Crowley, James L, and Vaufreydaz, Dominique
Subjects: Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: In this paper, we describe a graph-based algorithm that uses the features obtained by a self-supervised transformer to detect and segment salient objects in images and videos. With this approach, the image patches that compose an image or video are organised into a fully connected graph, where the edge between each pair of patches is labeled with a similarity score between patches using features learned by the transformer. Detection and segmentation of salient objects is then formulated as a graph-cut problem and solved using the classical Normalized Cut algorithm. Despite the simplicity of this approach, it achieves state-of-the-art results on several common image and video detection and segmentation tasks. For unsupervised object discovery, this approach outperforms the competing approaches by a margin of 6.1%, 5.7%, and 2.6%, respectively, when tested with the VOC07, VOC12, and COCO20K datasets. For the unsupervised saliency detection task in images, this method improves the score for Intersection over Union (IoU) by 4.4%, 5.6% and 5.2%. When tested with the ECSSD, DUTS, and DUT-OMRON datasets, respectively, compared to current state-of-the-art techniques. This method also achieves competitive results for unsupervised video object segmentation tasks with the DAVIS, SegTV2, and FBMS datasets., Comment: arXiv admin note: text overlap with arXiv:2202.11539
Published: 2022

9. Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut

Author: Wang, Yangtao, Shen, Xi, Hu, Shell, Yuan, Yuan, Crowley, James, and Vaufreydaz, Dominique
Subjects: Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: Transformers trained with self-supervised learning using self-distillation loss (DINO) have been shown to produce attention maps that highlight salient foreground objects. In this paper, we demonstrate a graph-based approach that uses the self-supervised transformer features to discover an object from an image. Visual tokens are viewed as nodes in a weighted graph with edges representing a connectivity score based on the similarity of tokens. Foreground objects can then be segmented using a normalized graph-cut to group self-similar regions. We solve the graph-cut problem using spectral clustering with generalized eigen-decomposition and show that the second smallest eigenvector provides a cutting solution since its absolute value indicates the likelihood that a token belongs to a foreground object. Despite its simplicity, this approach significantly boosts the performance of unsupervised object discovery: we improve over the recent state of the art LOST by a margin of 6.9%, 8.1%, and 8.1% respectively on the VOC07, VOC12, and COCO20K. The performance can be further improved by adding a second stage class-agnostic detector (CAD). Our proposed method can be easily extended to unsupervised saliency detection and weakly supervised object detection. For unsupervised saliency detection, we improve IoU for 4.9%, 5.2%, 12.9% on ECSSD, DUTS, DUT-OMRON respectively compared to previous state of the art. For weakly supervised object detection, we achieve competitive performance on CUB and ImageNet.
Published: 2022

10. Navigation In Urban Environments Amongst Pedestrians Using Multi-Objective Deep Reinforcement Learning

Author: Deshpande, Niranjan, Vaufreydaz, Dominique, and Spalanzani, Anne
Subjects: Computer Science - Robotics, Statistics - Machine Learning
Abstract: Urban autonomous driving in the presence of pedestrians as vulnerable road users is still a challenging and less examined research problem. This work formulates navigation in urban environments as a multi objective reinforcement learning problem. A deep learning variant of thresholded lexicographic Q-learning is presented for autonomous navigation amongst pedestrians. The multi objective DQN agent is trained on a custom urban environment developed in CARLA simulator. The proposed method is evaluated by comparing it with a single objective DQN variant on known and unknown environments. Evaluation results show that the proposed method outperforms the single objective DQN variant with respect to all aspects.
Published: 2021

11. SocialInteractionGAN: Multi-person Interaction Sequence Generation

Author: Airale, Louis, Vaufreydaz, Dominique, and Alameda-Pineda, Xavier
Subjects: Computer Science - Neural and Evolutionary Computing, Statistics - Machine Learning
Abstract: Prediction of human actions in social interactions has important applications in the design of social robots or artificial avatars. In this paper, we focus on a unimodal representation of interactions and propose to tackle interaction generation in a data-driven fashion. In particular, we model human interaction generation as a discrete multi-sequence generation problem and present SocialInteractionGAN, a novel adversarial architecture for conditional interaction generation. Our model builds on a recurrent encoder-decoder generator network and a dual-stream discriminator, that jointly evaluates the realism of interactions and individual action sequences and operates at different time scales. Crucially, contextual information on interacting participants is shared among agents and reinjected in both the generation and the discriminator evaluation processes. Experiments show that albeit dealing with low dimensional data, SocialInteractionGAN succeeds in producing high realism action sequences of interacting people, comparing favorably to a diversity of recurrent and convolutional discriminator baselines, and we argue that this work will constitute a first stone towards higher dimensional and multimodal interaction generation. Evaluations are conducted using classical GAN metrics, that we specifically adapt for discrete sequential data. Our model is shown to properly learn the dynamics of interaction sequences, while exploiting the full range of available actions., Comment: IEEE Transactions on Affective Computing, Institute of Electrical and Electronics Engineers, 2022
Published: 2021
Full Text: View/download PDF

12. Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network

Author: Deshpande, Niranjan, Vaufreydaz, Dominique, and Spalanzani, Anne
Subjects: Computer Science - Neural and Evolutionary Computing, Computer Science - Robotics, Statistics - Machine Learning
Abstract: Decision making for autonomous driving in urban environments is challenging due to the complexity of the road structure and the uncertainty in the behavior of diverse road users. Traditional methods consist of manually designed rules as the driving policy, which require expert domain knowledge, are difficult to generalize and might give sub-optimal results as the environment gets complex. Whereas, using reinforcement learning, optimal driving policy could be learned and improved automatically through several interactions with the environment. However, current research in the field of reinforcement learning for autonomous driving is mainly focused on highway setup with little to no emphasis on urban environments. In this work, a deep reinforcement learning based decision-making approach for high-level driving behavior is proposed for urban environments in the presence of pedestrians. For this, the use of Deep Recurrent Q-Network (DRQN) is explored, a method combining state-of-the art Deep Q-Network (DQN) with a long term short term memory (LSTM) layer helping the agent gain a memory of the environment. A 3-D state representation is designed as the input combined with a well defined reward function to train the agent for learning an appropriate behavior policy in a real-world like urban simulator. The proposed method is evaluated for dense urban scenarios and compared with a rule-based approach and results show that the proposed DRQN based driving behavior decision maker outperforms the rule-based approach.
Published: 2020

13. Group-Level Emotion Recognition Using a Unimodal Privacy-Safe Non-Individual Approach

Author: Petrova, Anastasia, Vaufreydaz, Dominique, and Dessus, Philippe
Subjects: Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: This article presents our unimodal privacy-safe and non-individual proposal for the audio-video group emotion recognition subtask at the Emotion Recognition in the Wild (EmotiW) Challenge 2020 1. This sub challenge aims to classify in the wild videos into three categories: Positive, Neutral and Negative. Recent deep learning models have shown tremendous advances in analyzing interactions between people, predicting human behavior and affective evaluation. Nonetheless, their performance comes from individual-based analysis, which means summing up and averaging scores from individual detections, which inevitably leads to some privacy issues. In this research, we investigated a frugal approach towards a model able to capture the global moods from the whole image without using face or pose detection, or any individual-based feature as input. The proposed methodology mixes state-of-the-art and dedicated synthetic corpora as training sources. With an in-depth exploration of neural network architectures for group-level emotion recognition, we built a VGG-based model achieving 59.13% accuracy on the VGAF test set (eleventh place of the challenge). Given that the analysis is unimodal based only on global features and that the performance is evaluated on a real-world dataset, these results are promising and let us envision extending this model to multimodality for classroom ambiance evaluation, our final target application.
Published: 2020

14. Deep learning investigation for chess player attention prediction using eye-tracking and game data

Author: Louedec, Justin Le, Guntz, Thomas, Crowley, James, and Vaufreydaz, Dominique
Subjects: Statistics - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: This article reports on an investigation of the use of convolutional neural networks to predict the visual attention of chess players. The visual attention model described in this article has been created to generate saliency maps that capture hierarchical and spatial features of chessboard, in order to predict the probability fixation for individual pixels Using a skip-layer architecture of an autoencoder, with a unified decoder, we are able to use multiscale features to predict saliency of part of the board at different scales, showing multiple relations between pieces. We have used scan path and fixation data from players engaged in solving chess problems, to compute 6600 saliency maps associated to the corresponding chess piece configurations. This corpus is completed with synthetically generated data from actual games gathered from an online chess platform. Experiments realized using both scan-paths from chess players and the CAT2000 saliency dataset of natural images, highlights several results. Deep features, pretrained on natural images, were found to be helpful in training visual attention prediction for chess. The proposed neural network architecture is able to generate meaningful saliency maps on unseen chess configurations with good scores on standard metrics. This work provides a baseline for future work on visual attention prediction in similar contexts.
Published: 2019
Full Text: View/download PDF

15. The Role of Emotion in Problem Solving: First Results from Observing Chess

Author: Guntz, Thomas, Crowley, James, Vaufreydaz, Dominique, Balzarini, Raffaella, and Dessus, Philippe
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper we present results from recent experiments that suggest that chess players associate emotions to game situations and reactively use these associations to guide search for planning and problem solving. We describe the design of an instrument for capturing and interpreting multimodal signals of humans engaged in solving challenging problems. We review results from a pilot experiment with human experts engaged in solving challenging problems in Chess that revealed an unexpected observation of rapid changes in emotion as players attempt to solve challenging problems. We propose a cognitive model that describes the process by which subjects select chess chunks for use in interpretation of the game situation and describe initial results from a second experiment designed to test this model.
Published: 2018

16. Building Prior Knowledge: A Markov Based Pedestrian Prediction Model Using Urban Environmental Data

Author: Vasishta, Pavan, Vaufreydaz, Dominique, and Spalanzani, Anne
Subjects: Statistics - Machine Learning, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Autonomous Vehicles navigating in urban areas have a need to understand and predict future pedestrian behavior for safer navigation. This high level of situational awareness requires observing pedestrian behavior and extrapolating their positions to know future positions. While some work has been done in this field using Hidden Markov Models (HMMs), one of the few observed drawbacks of the method is the need for informed priors for learning behavior. In this work, an extension to the Growing Hidden Markov Model (GHMM) method is proposed to solve some of these drawbacks. This is achieved by building on existing work using potential cost maps and the principle of Natural Vision. As a consequence, the proposed model is able to predict pedestrian positions more precisely over a longer horizon compared to the state of the art. The method is tested over "legal" and "illegal" behavior of pedestrians, having trained the model with sparse observations and partial trajectories. The method, with no training data, is compared against a trained state of the art model. It is observed that the proposed method is robust even in new, previously unseen areas., Comment: 15 th International Conference on Control, Automation, Robotics and Vision (ICARCV 2018), Nov 2018, Singapore, Singapore
Published: 2018

17. Smartphone-based user positioning in a multiple-user context with Wi-Fi and Bluetooth

Author: Ta, Viet-Cuong, Dao, Trung-Kien, Vaufreydaz, Dominique, and Castelli, Eric
Subjects: Electrical Engineering and Systems Science - Signal Processing, Computer Science - Networking and Internet Architecture
Abstract: In a multiuser context, the Bluetooth data from the smartphone could give an approximation of the distance between users. Meanwhile, the Wi-Fi data can be used to calculate the user's position directly. However, both the Wi-Fi-based position outputs and Bluetooth-based distances are affected by some degree of noise. In our work, we propose several approaches to combine the two types of outputs for improving the tracking accuracy in the context of collaborative positioning. The two proposed approaches attempt to build a model for measuring the errors of the Bluetooth output and Wi-Fi output. In a non-temporal approach, the model establishes the relationship in a specific interval of the Bluetooth output and Wi-Fi output. In a temporal approach, the error measurement model is expanded to include the time component between users' movement. To evaluate the performance of the two approaches, we collected the data from several multiuser scenarios in indoor environment. The results show that the proposed approaches could reach a distance error around 3.0m for 75 percent of time, which outperforms the positioning results of the standard Wi-Fi fingerprinting model., Comment: International Conference on Indoor Positioning and Indoor Navigation (IPIN), Sep 2018, Nantes, France
Published: 2018

18. Personal space of autonomous car's passengers sitting in the driver's seat

Author: Ferrier-Barbut, Eleonore, Vaufreydaz, Dominique, David, Jean-Alix, Lussereau, Jérôme, and Spalanzani, Anne
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Robotics
Abstract: This article deals with the specific context of an autonomous car navigating in an urban center within a shared space between pedestrians and cars. The driver delegates the control to the autonomous system while remaining seated in the driver's seat. The proposed study aims at giving a first insight into the definition of human perception of space applied to vehicles by testing the existence of a personal space around the car.It aims at measuring proxemic information about the driver's comfort zone in such conditions.Proxemics, or human perception of space, has been largely explored when applied to humans or to robots, leading to the concept of personal space, but poorly when applied to vehicles. In this article, we highlight the existence and the characteristics of a zone of comfort around the car which is not correlated to the risk of a collision between the car and other road users. Our experiment includes 19 volunteers using a virtual reality headset to look at 30 scenarios filmed in 360{\textdegree} from the point of view of a passenger sitting in the driver's seat of an autonomous car.They were asked to say "stop" when they felt discomfort visualizing the scenarios.As said, the scenarios voluntarily avoid collision effect as we do not want to measure fear but discomfort.The scenarios involve one or three pedestrians walking past the car at different distances from the wings of the car, relative to the direction of motion of the car, on both sides. The car is either static or moving straight forward at different speeds.The results indicate the existence of a comfort zone around the car in which intrusion causes discomfort.The size of the comfort zone is sensitive neither to the side of the car where the pedestrian passes nor to the number of pedestrians. In contrast, the feeling of discomfort is relative to the car's motion (static or moving).Another outcome from this study is an illustration of the usage of first person 360{\textdegree} video and a virtual reality headset to evaluate feelings of a passenger within an autonomous car.
Published: 2018

19. Multimodal Observation and Interpretation of Subjects Engaged in Problem Solving

Author: Guntz, Thomas, Balzarini, Raffaella, Vaufreydaz, Dominique, and Crowley, James L.
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Computer Vision and Pattern Recognition, Statistics - Machine Learning
Abstract: In this paper we present the first results of a pilot experiment in the capture and interpretation of multimodal signals of human experts engaged in solving challenging chess problems. Our goal is to investigate the extent to which observations of eye-gaze, posture, emotion and other physiological signals can be used to model the cognitive state of subjects, and to explore the integration of multiple sensor modalities to improve the reliability of detection of human displays of awareness and emotion. We observed chess players engaged in problems of increasing difficulty while recording their behavior. Such recordings can be used to estimate a participant's awareness of the current situation and to predict ability to respond effectively to challenging situations. Results show that a multimodal approach is more accurate than a unimodal one. By combining body posture, visual attention and emotion, the multimodal approach can reach up to 93% of accuracy when determining player's chess expertise while unimodal approach reaches 86%. Finally this experiment validates the use of our equipment as a general and reproducible tool for the study of participants engaged in screen-based interaction and/or problem solving.
Published: 2017

20. Generative Resident Separation and Multi-label Classification for Multi-person Activity Recognition

Author: Chen, Xi, primary, Cumin, Julien, additional, Ramparany, Fano, additional, and Vaufreydaz, Dominique, additional
Published: 2024
Full Text: View/download PDF

21. Starting engagement detection towards a companion robot using multimodal features

Author: Vaufreydaz, Dominique, Johal, Wafa, and Combe, Claudine
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: Recognition of intentions is a subconscious cognitive process vital to human communication. This skill enables anticipation and increases the quality of interactions between humans. Within the context of engagement, non-verbal signals are used to communicate the intention of starting the interaction with a partner. In this paper, we investigated methods to detect these signals in order to allow a robot to know when it is about to be addressed. Originality of our approach resides in taking inspiration from social and cognitive sciences to perform our perception task. We investigate meaningful features, i.e. human readable features, and elicit which of these are important for recognizing someone's intention of starting an interaction. Classically, spatial information like the human position and speed, the human-robot distance are used to detect the engagement. Our approach integrates multimodal features gathered using a companion robot equipped with a Kinect. The evaluation on our corpus collected in spontaneous conditions highlights its robustness and validates the use of such a technique in a real environment. Experimental validation shows that multimodal features set gives better precision and recall than using only spatial and speed features. We also demonstrate that 7 selected features are sufficient to provide a good starting engagement detection score. In our last investigation, we show that among our full 99 features set, the space reduction is not a solved task. This result opens new researches perspectives on multimodal engagement detection.
Published: 2015
Full Text: View/download PDF

22. Autoregressive GAN for Semantic Unconditional Head Motion Generation

Author: Airale, Louis, primary, Alameda-Pineda, Xavier, additional, Lathuilière, Stéphane, additional, and Vaufreydaz, Dominique, additional
Published: 2023
Full Text: View/download PDF

23. TokenCut: Segmenting Objects in Images and Videos With Self-Supervised Transformer and Normalized Cut

Author: Wang, Yangtao, primary, Shen, Xi, additional, Yuan, Yuan, additional, Du, Yuming, additional, Li, Maomao, additional, Hu, Shell Xu, additional, Crowley, James L., additional, and Vaufreydaz, Dominique, additional
Published: 2023
Full Text: View/download PDF

24. Multimodal Group Emotion Recognition In-the-wild Using Privacy-Compliant Features

Author: Augusma, Anderson, primary, Vaufreydaz, Dominique, additional, and Letué, Frédérique, additional
Published: 2023
Full Text: View/download PDF

25. SocialInteractionGAN: Multi-Person Interaction Sequence Generation

Author: Airale, Louis, primary, Vaufreydaz, Dominique, additional, and Alameda-Pineda, Xavier, additional
Published: 2023
Full Text: View/download PDF

26. Starting engagement detection towards a companion robot using multimodal features

Author: Vaufreydaz, Dominique, Johal, Wafa, and Combe, Claudine
Published: 2016
Full Text: View/download PDF

27. L’instrumentation intelligente des salles de classe au service de l’observation des interactions enseignant-apprenants

Author: RICS, Revue, Laurent, Romain, Dessus, Philippe, and Vaufreydaz, Dominique
Abstract: Cette recherche qualitative porte sur la relation enseignant-élèves en 1re année du primaire dans le cadre d’une comparaison des systèmes éducatifs québécois et français. De nombreuses recherches (Espinosa, 2020 ; Fortin et al., 2011 ; Roffey, 2012; Virat, 2019) montrent qu’une relation enseignant-élèves de qualité est un élément majeur du bien-être social et psychologique de l’enseignant et de l’élève, mais qu’elle permet également la réussite éducative et scolaire des élèves. Dans le cadre de cette recherche, six enseignantes de 1re année de l’école primaire, dont trois québécoises et trois françaises, ont pris part à un entretien semi-dirigé qui a permis de recueillir leurs représentations et leurs pratiques déclarées au regard de la qualité de leur relation avec leurs élèves. Les résultats montrent des similitudes et des différences entre les deux systèmes éducatifs, notamment que les enseignantes françaises semblent accorder davantage d’importance aux apprentissages, contrairement aux enseignantes québécoises qui semblent plutôt accorder de l’importance aux liens affectifs.
Published: 2023
Full Text: View/download PDF

28. Unsupervised Segmentation of Meeting Configurations and Activities using Speech Activity Detection

Author: Brdiczka, Oliver, Vaufreydaz, Dominique, Maisonnasse, Jérôme, Reignier, Patrick, Maglogiannis, Ilias, editor, Karpouzis, Kostas, editor, and Bramer, Max, editor
Published: 2006
Full Text: View/download PDF

29. A Lightweight Speech Detection System for Perceptive Environments

Author: Vaufreydaz, Dominique, Emonet, Rémi, Reignier, Patrick, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Renals, Steve, editor, Bengio, Samy, editor, and Fiscus, Jonathan G., editor
Published: 2006
Full Text: View/download PDF

30. Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut

Author: Wang, Yangtao, primary, Shen, Xi, additional, Hu, Shell Xu, additional, Yuan, Yuan, additional, Crowley, James L., additional, and Vaufreydaz, Dominique, additional
Published: 2022
Full Text: View/download PDF

31. Experiments on the Construction of a Phonetically Balanced Corpus from the Web

Author: Villaseñor-Pineda, Luis, Montes-y-Gómez, Manuel, Vaufreydaz, Dominique, Serignat, Jean-François, Goos, Gerhard, editor, Hartmanis, Juris, editor, van Leeuwen, Jan, editor, and Gelbukh, Alexander, editor
Published: 2004
Full Text: View/download PDF

32. A Corpus Balancing Method for Language Model Construction

Author: Villaseñor-Pineda, Luis, Montes-y-Gómez, Manuel, Pérez-Coutiño, Manuel Alberto, Vaufreydaz, Dominique, Goos, Gerhard, editor, Hartmanis, Juris, editor, van Leeuwen, Jan, editor, and Gelbukh, Alexander, editor
Published: 2003
Full Text: View/download PDF

33. Analyser automatiquement les signaux de l’enseignement : Une approche d’apprentissage social fondée sur les preuves

Author: Laurent, Romain, Dessus, Philippe, Vaufreydaz, Dominique, Laboratoire de Recherche sur les Apprentissages en Contexte (LaRAC), Université Grenoble Alpes (UGA), Multimodal Perception and Sociable Interaction (M-PSI), Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Idex Formation, ANR-15-IDEX-0002,UGA,IDEX UGA(2015), Dessus, Philippe, and IDEX UGA - - UGA2015 - ANR-15-IDEX-0002 - IDEX - VALID
Subjects: Machine Learning, Éducation fondée sur les preuves, Pédagogie, Signal Processing and Analysis, Pedagogy, Apprentissage social, [SHS.EDU]Humanities and Social Sciences/Education, [SHS.EDU] Humanities and Social Sciences/Education, Evidence-based Education, Apprentissage machine, Traitement et analyse du signal, Social Learning
Abstract: Recent advances in signal processing and analysis have made it possible to create new ways of instrumenting the observation and the analysis of educational events, and thus to gather new kinds of evidence on teaching and learning practice. This article identifies some of these, based on a “social learning” framework, which posits that pedagogy is a social activity embedded in everyday life, and relies on certain innate human capacities., Analyser automatiquement les signaux de l'enseignement : une approche d'apprentissage social fondée sur les preuves Résumé : Les récentes avancées en traitement et analyse du signal ont permis de créer de nouvelles manières d'instrumenter l'observation et l'analyse des événements scolaires, et donc de recueillir de nouveaux types de preuves des pratiques d'enseignement ou d'apprentissage. Cet article en recense certains en se fondant sur le cadre d'analyse de l'apprentissage social, posant que la pédagogie est une activité sociale intégrée à la vie de tous les jours, et reposant sur certaines capacités humaines innées.
Published: 2022

34. Sciences sociales et apprentissage machine pour l'interaction

Author: Vaufreydaz, Dominique, Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Multimodal Perception and Sociable Interaction (M-PSI), and Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )
Subjects: [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], Robot, Sciences humaines & sociales, Interactions Homme-Machine, Apprentissage machine deep learning, [SHS]Humanities and Social Sciences
Abstract: National audience; Le machine learning a aujourd'hui fait preuve de son efficacité : on peut produire, à partir d'une grande masse d'informations, des Intelligences Artificielles capables de répondre à de nombreux besoins, comme le montrent les progrès en vision par ordinateur ou en traduction automatique ces dernières années. Pour autant, cette technique a des limites, vis-à-vis des secteurs ne disposant pas de suffisamment de données, vis-à-vis de certaines questions éthiques, et vis-à-vis de son explicabilité. Pour pallier ces problèmes dans les applications où le Machine Learning seul n’est pas efficient, les sciences humaines peuvent apporter des solutions et de la précision aux systèmes automatiques. À l'aide de deux exemples concrets, Dominique Vaufreydaz illustre comment les apports des sciences humaines peuvent nourrir et améliorer un programme informatique dédié aux interactions avec les humains.
Published: 2021

35. Navigation in Urban Environments amongst pedestrians using Multi-Objective Deep Reinforcement Learning

Author: Deshpande, Niranjan, primary, Vaufreydaz, Dominique, additional, and Spalanzani, Anne, additional
Published: 2021
Full Text: View/download PDF

36. Apprendre en toute éthique dans les salles de classe intelligentes

Author: Laurent, Romain, Dessus, Philippe, Vaufreydaz, Dominique, Dessus, Philippe, IDEX UGA - - UGA2015 - ANR-15-IDEX-0002 - IDEX - VALID, and Canopé
Subjects: Salle de classe intelligente, [SHS.EDU] Humanities and Social Sciences/Education, Typologie
Abstract: Après le numérique sous des formes variées mais désormais familières aux enseignants et à leurs élèves, c'est aujourd'hui l'intelligence artificielle qui s'invite dans les salles de classe. La vision informatique, notamment, offre des opportunités inédites de captation et d'analyse de ce qui se passe dans les classes, dans une perspective d'amplification de la cognition humaine. Les rétroactions formatives à l'enseignant pourraient s'en trouver considérablement enrichies, particulièrement pour saisir l'impact de ses pratiques sur les apprenants, mais cette introduction de la « machine qui pense », souvent présentée dans la littérature scientifique comme une panacée, ne saurait nous exonérer de penser avant elle tous les tenants et aboutissants d'une telle implantation. Comme pour toutes les sphères de l'activité humaine où elle est désormais invitée (voire convoquée), les moyens et buts de l'intelligence artificielle à l'école doivent être interrogés.
Published: 2021

37. Design spatial sociotechnique

Author: Laurent, Romain, Dessus, Philippe, and Vaufreydaz, Dominique
Abstract: Introduction À la question « Quelle ingénierie pédagogique à l’ère numérique ? », Peraya et Peltier (2020) nous rappellent que ni la question de l’autonomie de l’apprenant ni la maîtrise des compétences informationnelles ne sont des nécessités nouvelles posées à l’ingénierie pédagogique. Ils nous interrogent en retour sur les dynamiques politique, sociale, économique, technologique qui pourraient, ou impliqueraient de, renouveler les pratiques ingéniériales, en réponse aux demandes des niveau...
Published: 2020

38. Ethical Teaching Analytics in a Context-Aware Classroom: A Manifesto

Author: Laurent, Romain, Vaufreydaz, Dominique, Dessus, Philippe, Laboratoire de Recherche sur les Apprentissages en Contexte (LaRAC), Université Grenoble Alpes (UGA), Interaction située avec les objets et environnements intelligents (PERVASIVE ), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), and Teaching Lab project, PIA 2 IDEX formation program, Univ. Grenoble Alpes
Subjects: learning analytics, [INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing, teacher cognition, teaching analytics, machine learning, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], [SHS.EDU]Humanities and Social Sciences/Education, ComputingMilieux_COMPUTERSANDEDUCATION, ethics and privacy, ubiquitous computing, ambient classroom
Abstract: International audience; Should Big Teacher be watching you? The Teaching Lab project at Grenoble Alpes University proposes recommendations for designing smart classrooms with ethical considerations taken into account.
Published: 2020

39. Collaborative Smartphone-Based User Positioning in a Multiple-User Context Using Wireless Technologies

Author: Ta, Viet-Cuong, Dao, Trung-Kien, Vaufreydaz, Dominique, Castelli, Eric, Vietnam National University [Hanoï] (VNU), International Research Institute MICA (MICA), Institut National Polytechnique de Grenoble (INPG)-Hanoi University of Science and Technology (HUST)-Centre National de la Recherche Scientifique (CNRS), Interaction située avec les objets et environnements intelligents (PERVASIVE ), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), and Université Grenoble Alpes (UGA)
Subjects: [INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing, indoor localization, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], multiple-user positioning, multi-sensor fusion, indoor navigation, lcsh:TP1-1185, lcsh:Chemical technology, Article
Abstract: International audience; For the localization of multiple users, Bluetooth data from the smartphone is able to complement Wi-Fi-based methods with additional information, by providing an approximation of the relative distances between users. In practice, both positions provided by Wi-Fi data and relative distance provided by Bluetooth data are subject to a certain degree of noise due to the uncertainty of radio propagation in complex indoor environments. In this study, we propose and evaluate two approaches, namely Non-temporal and Temporal ones, of collaborative positioning to combine these two cohabiting technologies to improve the tracking performance. In the Non-temporal approach, our model establishes an error observation function in a specific interval of the Bluetooth and Wi-Fi output. It is then able to reduce the positioning error by looking for ways to minimize the error function. The Temporal approach employs an extended error model that takes into account the time component between users’ movements. For performance evaluation, several multi-user scenarios in an indoor environment are set up. Results show that for certain scenarios, the proposed approaches attain over 40% of improvement in terms of average accuracy.
Published: 2020

40. Position du retour visuel pour l'interaction véhicule autonome/piétons

Author: Troel−Madec, Maureen, Alaimo, Julien, Boissieux, Laurence, Chatagnon, Sandrine, Borkoswki, Stan, Spalanzani, Anne, Vaufreydaz, Dominique, Pôle supérieur de Design Léonard de Vinci, Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria), Service Expérimentation et Développement (SED [Grenoble]), Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Interaction située avec les objets et environnements intelligents (PERVASIVE), Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), ANR-17-CE22-0010,Hianic,Navigation autonome dans des foules inspirée par les humains(2017), and SED [Grenoble]
Subjects: [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC], ComputingMilieux_MISCELLANEOUS
Abstract: National audience
Published: 2019

41. Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network

Author: Deshpande, Niranjan, primary, Vaufreydaz, Dominique, additional, and Spalanzani, Anne, additional
Published: 2020
Full Text: View/download PDF

42. Group-Level Emotion Recognition Using a Unimodal Privacy-Safe Non-Individual Approach

Author: Petrova, Anastasia, primary, Vaufreydaz, Dominique, additional, and Dessus, Philippe, additional
Published: 2020
Full Text: View/download PDF

43. Design spatial sociotechnique

Author: Laurent, Romain, primary, Dessus, Philippe, additional, and Vaufreydaz, Dominique, additional
Published: 2020
Full Text: View/download PDF

44. eHMI positioning for autonomous vehicle/pedestrians interaction

Author: Troel-Madec, Maureen, primary, Boissieux, Laurence, additional, Borkoswki, Stan, additional, Vaufreydaz, Dominique, additional, Alaimo, Julien, additional, Chatagnon, Sandrine, additional, and Spalanzani, Anne, additional
Published: 2019
Full Text: View/download PDF

45. A Lightweight Speech Detection System for Perceptive Environments

Author: Vaufreydaz, Dominique, primary, Emonet, Rémi, additional, and Reignier, Patrick, additional
Published: 2006
Full Text: View/download PDF

46. Experiments on the Construction of a Phonetically Balanced Corpus from the Web

Author: Villaseñor-Pineda, Luis, primary, Montes-y-Gómez, Manuel, additional, Vaufreydaz, Dominique, additional, and Serignat, Jean-François, additional
Published: 2004
Full Text: View/download PDF

47. A Corpus Balancing Method for Language Model Construction

Author: Villaseñor-Pineda, Luis, primary, Montes-y-Gómez, Manuel, additional, Pérez-Coutiño, Manuel Alberto, additional, and Vaufreydaz, Dominique, additional
Published: 2003
Full Text: View/download PDF

48. Multimodal perception and sociable interaction

Author: Vaufreydaz, Dominique, Interaction située avec les objets et environnements intelligents (PERVASIVE), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), Plateforme Robotique d’Assistance et de Maintien A Domicile PRAMAD FUI PRAMAD2, Université Grenoble Alpes (France), MSTII, Crowley James L., ANR-06-TCOM-0020,CASPER,Communication et Assistance Ambiante par Analyse d'ActivitéS pour les PERsonnes âgées et déficientes cognitives(2006), ANR-15-CE23-0005,CEEGE,Estimation d'expertise en echec a partir de l'observation de fixation et emotion(2015), ANR-13-JS01-0010,VALET,Renormalisation et théorèmes limites en théorie ergodique (VALET=Vershik's automorphisms, limits in ergodic theory)(2013), ANR-17-CE22-0010,Hianic,Navigation autonome dans des foules inspirée par les humains(2017), ANR-11-EQPX-0002,AmiQual4HOME,AmiLab pour Habitats Intelligents(2011), European Project: 26125,CHIL, and European Project: IST-2000-28323,FAME
Subjects: interaction sociable, perception multimodale, traitement du signal, [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV], computer vision, [INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing, machine learning, [STAT.ML]Statistics [stat]/Machine Learning [stat.ML], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, multimodal perception, vision par ordinateur, [INFO.INFO-RB]Computer Science [cs]/Robotics [cs.RO], [INFO]Computer Science [cs], apprentissage machine, sociable interaction, signal processing
Abstract: L’une des tâches les plus complexes pour laquelle les ordinateurs ont été programmés concerne le mimétisme des capacités de perception et d’interaction des humains en utilisant tout d’abord des informations monomodales (acoustiques, visuelles, tactiles, de proprioception, …) puis multimodales en combinant plusieurs modalités. À partir de ces capacités de perception, les systèmes interactifs, c’est-à-dire les systèmes interagissant avec des humains, peuvent être sensibles à l’environnement qui les entoure, aux utilisateurs présents, à la situation courante… Cela leur permet de percevoir, comprendre et prédire pour agir en conséquence, voire d’agir d’une manière sociable pour être un partenaire des humains à part entière.La perception multimodale par ordinateur et les interactions sociables sont les problématiques de fond de mes travaux depuis mon recrutement en tant que Maître de conférences en 2005, le traitement du signal (« signal processing ») et l’apprentissage automatique (« machine learning ») en étant les fondements. Ce manuscrit présente mes travaux sur la perception multimodale et les interactions sociables dans plusieurs contextes en les regroupant autour de mes thématiques de recherche principales.Ce manuscrit aborde tout d’abord la perception multimodale ubiquitaire au sein d’espaces perceptifs multimodaux tels les salles de réunions augmentées, les appartements équipés pour le maintien de personnes âgées/fragiles à domicile ou des espaces à plus grande échelle comme les bâtiments d’un campus universitaire. Faisant suite aux progrès en robotique, cette perception s’est naturellement déplacée des environnements perceptifs vers les robots mobiles, permettant des interactions sociables entre les humains et des robots compagnons (Human Robot Interaction - HRI) mais aussi avec des robots particuliers que sont les véhicules autonomes. Les travaux de recherche concernant la perception des humains et de leurs affects sont ensuite présentés via mes recherches sur la perception en champ proche (< 1 m) et sur la détection des humains et de leurs comportements autour de nos systèmes interactifs, base nécessaire à leur fonctionnement. Nos travaux préliminaires sur la détection de personnes en utilisant de l’apprentissage profond (« Deep Learning ») sont décrits. Ce manuscrit se clôt en présentant les directions et les perspectives de mon projet de recherche intitulé « Perception multimodale et interaction sociable ».
Published: 2018

49. A Framework for a Multimodal Analysis of Teaching Centered on Shared Attention and Knowledge Access

Author: Dessus , Philippe, Aubineau , Louise-Héléna, Vaufreydaz , Dominique, Crowley , James L., Interaction située avec les objets et environnements intelligents (PERVASIVE), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG ), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019])-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), Laboratoire de Recherche sur les Apprentissages en Contexte (LaRAC), Université Grenoble Alpes [2016-2019] (UGA [2016-2019]), Laboratoire des Sciences de l'Éducation (Grenoble) ( LSE ), Université Grenoble Alpes ( UGA ) -Université Pierre Mendès France - Grenoble 2 ( UPMF ), Interaction située avec les objets et environnements intelligents ( PERVASIVE ), Institut National de Recherche en Informatique et en Automatique ( Inria ) -Institut National de Recherche en Informatique et en Automatique ( Inria ) -Université de Grenoble-Alpes-Laboratoire d'Informatique de Grenoble ( LIG ), and Université Pierre Mendès France - Grenoble 2 ( UPMF ) -Université Joseph Fourier - Grenoble 1 ( UJF ) -Institut National Polytechnique de Grenoble ( INPG ) -Centre National de la Recherche Scientifique ( CNRS ) -Université Grenoble Alpes ( UGA ) -Université Pierre Mendès France - Grenoble 2 ( UPMF ) -Université Joseph Fourier - Grenoble 1 ( UJF ) -Institut National Polytechnique de Grenoble ( INPG ) -Centre National de la Recherche Scientifique ( CNRS ) -Université Grenoble Alpes ( UGA ) -Institut polytechnique de Grenoble - Grenoble Institute of Technology ( Grenoble INP )
Subjects: [INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing, Eye tracking, [SHS.EDU]Humanities and Social Sciences/Education, Classroom Observation, Teacher cognition, [ INFO.INFO-IU ] Computer Science [cs]/Ubiquitous Computing, [ SHS.EDU ] Humanities and Social Sciences/Education, Joint Attention
Abstract: National audience; The effects of teaching on learning are mostly uncertain, hidden, and not immediate. Research investigating how teaching can have an impact on learning has recently been given a significant boost with signal processing devices and data mining analyses.We devised a framework for the study of teaching and learning processes which posits that lessons are composed of episodes of joint attention and access to the taught content, and that the interplay of behaviors like joint attention, actional contingency, and feedback loops compose different levels of teaching. Teaching by social tolerance, which occurs when learners (Ls) have no attentional problems but their access to the taught knowledge depends on the teacher (T). Teaching by opportunity provisioning, when Ls can be aware on the taught content but lack access to it (e.g., lack of understanding), and T builds ad hoc situations in which Ls are provided with easier content. Teaching by stimulus or local enhancement, when Ls have fully access to the content but lack attention toward it. T explicitly shows content to Ls, slows down her behaviors, tells and acts in an adapted way (e.g., motherese).A variety of devices installed in a classroom will capture and automatically characterize these events. T’s and Ls’ utterances and gazes will be recorded through low-cost cameras installed on 3D printed glasses, and T will wear a mobile eye tracker and a mobile microphone. Instructional material is equipped with qrcodes so that Ls’ and T’s video streams are processed to determine where people are looking at, and to infer the corresponding teaching levels.This novel framework will be used to analyze instructional events in ecological situations, and will be a first step to build a ”pervasive classroom”, where eye-tracking and sensor-based devices analyze a wide range of events in a multimodal and interdisciplinary way.
Published: 2018

50. Deep learning investigation for chess player attention prediction using eye-tracking and game data

Author: Louedec, Justin Le, primary, Guntz, Thomas, additional, Crowley, James L., additional, and Vaufreydaz, Dominique, additional
Published: 2019
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

148 results on '"Vaufreydaz, Dominique"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources