Deep Impression: Audiovisual Deep Residual Networks for Multimodal Apparent Personality Trait Recognition

About

Here, we develop an audiovisual deep residual network for multimodal apparent personality trait recognition. The network is trained end-to-end for predicting the Big Five personality traits of people from their videos. That is, the network does not require any feature engineering or visual analysis such as face detection, face landmark alignment or facial expression recognition. Recently, the network won the third place in the ChaLearn First Impressions Challenge with a test accuracy of 0.9109.

Ya\u{g}mur G\"u\c{c}l\"ut\"urk, Umut G\"u\c{c}l\"u, Marcel A. J. van Gerven, Rob van Lier• 2016

Related benchmarks

Task	Dataset	Result
Multimodal Personality Understanding	CFI Big Five V2	Accuracy91.72	10
Multimodal Personality Understanding	DMSP MBTI	E/I Accuracy88.77	10
Personality Understanding	DMSP MBTI	Overall DP7.78	10
Personality Understanding	CFI Big Five V2	Overall DP0.3192	10

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord