Audio-visual emotion recognition: A dynamic, multimodal approach

Jérémie Nicolle; Vincent Rapp; Kevin Bailly; Lionel Prevost; Mohamed Chetouani

Conference poster. Year: 2014

Jérémie Nicolle

Role: Author

Institut des Systèmes Intelligents et de Robotique

Vincent Rapp

Role: Author
PersonId : 961979

Institut des Systèmes Intelligents et de Robotique

Kevin Bailly

Role: Author
PersonId : 181765
IdHAL : kevin-bailly
ORCID : 0000-0001-7802-3673
IdRef : 178678244

Institut des Systèmes Intelligents et de Robotique

Lionel Prevost

Role: Author
PersonId : 967227

Laboratoire de Mathématiques Informatique et Applications

Mohamed Chetouani

Role: Author
PersonId : 179528
IdHAL : mohamed-chetouani
ORCID : 0000-0002-2920-4539
IdRef : 089021916

Institut des Systèmes Intelligents et de Robotique

Abstract

Designing systems able to interact with humans in a natural manner is a complex and far from solved problem. A key aspect of natural interaction is the ability to understand and respond appropriately to human emotions. This paper details our response to the continuous Audio/Visual Emotion Challenge (AVEC'12), whose goal is to predict four affective signals describing human emotions. The proposed method uses Fourier spectra to extract multi-scale dynamic descriptions of signals characterizing face appearance, head movements and voice. We perform a kernel regression with very few representative samples, selected via a supervised weighted-distance-based clustering, which leads to a high generalization power. We also propose a particularly fast regressor-level fusion framework to merge systems based on different modalities. Experiments confirmed the contribution of each key component of the proposed method, and our results on the challenge data were the highest among 10 international research teams.
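The two central ideas of the abstract, multi-scale Fourier descriptions of a signal and kernel regression over a few representative samples, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, window sizes, number of retained coefficients, the log scaling, the Gaussian kernel and the bandwidth are all assumptions chosen for illustration, and the representative samples are simply passed in rather than selected by the supervised clustering the abstract mentions.

```python
import numpy as np


def multiscale_spectra(signal, window_sizes=(16, 32, 64), n_coeffs=4):
    """Describe each frame by the low-frequency Fourier magnitudes of the
    trailing window ending at that frame, at several window lengths.

    window_sizes and n_coeffs are illustrative values, not from the paper.
    """
    signal = np.asarray(signal, dtype=float)
    max_w = max(window_sizes)
    rows = []
    # Start once the longest window fits entirely inside the signal.
    for t in range(max_w, len(signal) + 1):
        row = []
        for w in window_sizes:
            window = signal[t - w:t]
            magnitudes = np.abs(np.fft.rfft(window))[:n_coeffs]
            # log1p compresses the dynamic range (an assumed choice).
            row.extend(np.log1p(magnitudes))
        rows.append(row)
    return np.array(rows)


def kernel_regress(queries, prototypes, proto_labels, bandwidth=1.0):
    """Nadaraya-Watson regression with a Gaussian kernel.

    Each prediction is a weighted average of the labels of a small set of
    representative samples (prototypes), weighted by feature similarity.
    """
    diff = queries[:, None, :] - prototypes[None, :, :]
    d2 = (diff ** 2).sum(axis=2)
    # Shifting by the row minimum leaves the weight ratios unchanged and
    # guarantees at least one weight equals 1, avoiding division by zero.
    d2 = d2 - d2.min(axis=1, keepdims=True)
    weights = np.exp(-d2 / (2.0 * bandwidth ** 2))
    return (weights @ proto_labels) / weights.sum(axis=1)
```

In this sketch a one-dimensional signal of 100 frames yields one 12-dimensional descriptor per frame from frame 64 onward (three scales times four coefficients). Because each prediction is a convex combination of the prototype labels, it always stays within the range of those labels, which is one intuition for why few well-chosen samples can generalize well.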

Keywords

Multimodal fusion; Facial expressions; Feature selection; Dynamic features; Affective computing

Domains

Human-Computer Interaction [cs.HC]

Main file

p44-nicole.pdf (436.52 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01089628

Submitted on: Tuesday, December 2, 2014, 09:26:24

Last modified on: Thursday, August 17, 2023, 13:30:46

Long-term archiving on: Tuesday, March 3, 2015, 10:35:57

Dates and versions

hal-01089628, version 1 (02-12-2014)

Identifiers

HAL Id: hal-01089628, version 1

Cite

Jérémie Nicolle, Vincent Rapp, Kevin Bailly, Lionel Prevost, Mohamed Chetouani. Audio-visual emotion recognition: A dynamic, multimodal approach. IHM'14, 26e conférence francophone sur l'Interaction Homme-Machine, Oct 2014, Lille, France. pp.44-51, 2014. ⟨hal-01089628⟩

Collections

UPMC UNIV-AG CNRS ISIR IHM-2014 LAMIA SORBONNE-UNIVERSITE SU-SCIENCES ISIR_PIROS
