A Dictionary Learning based 3D Morphable Shape Model

Ferrari, Claudio; Lisanti, Giuseppe; Berretti, Stefano; Del Bimbo, Alberto

doi:10.1109/TMM.2017.2707341

Face analysis from 2D images and videos is a central task in many multimedia applications. Methods developed to this end perform either face recognition or facial expression recognition, and in both cases results are negatively influenced by variations in pose, illumination and resolution of the face. Such variations have a lower impact on 3D face data, which has given rise to the idea of using a 3D Morphable Model as an intermediate tool to enhance face analysis on 2D data. In this paper, we propose a new approach for constructing a 3D Morphable Shape Model (called DL-3DMM) and show that our solution can reach the deformation accuracy required in applications where fine details of the face matter. To construct the model, we start from a set of 3D face scans with large variability in terms of ethnicity and expressions. Across these training scans, we compute a dense point-to-point alignment, which remains accurate even in the presence of topological variations of the face. The DL-3DMM is constructed by learning a dictionary of basis components on the aligned scans. The model is then fitted to 2D target faces using an efficient regularized ridge regression guided by 2D/3D facial landmark correspondences, in order to generate pose-normalized face images. Comparison between the DL-3DMM and the standard PCA-based 3DMM demonstrates that, in general, a lower reconstruction error can be obtained with our solution. Application to action unit detection and emotion recognition from 2D images and videos shows results competitive with state-of-the-art methods on two benchmark datasets.
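
The landmark-guided fitting step mentioned in the abstract can be illustrated with a minimal sketch of plain closed-form ridge regression. This is not the paper's implementation: the function name `fit_ridge_coefficients`, the single scalar weight `lam`, the 68-landmark layout, and the synthetic random data are all assumptions made for illustration, and the basis is assumed to be already restricted to the landmark vertices and projected to 2D.

```python
import numpy as np

def fit_ridge_coefficients(basis, mean_landmarks, target_landmarks, lam=1.0):
    """Closed-form ridge solution of
    min_a ||target - mean - basis @ a||^2 + lam * ||a||^2.

    basis            : (2L, K) basis components at the landmarks (hypothetical layout)
    mean_landmarks   : (2L,) flattened 2D landmarks of the average model
    target_landmarks : (2L,) flattened 2D landmarks detected on the target face
    """
    residual = target_landmarks - mean_landmarks
    k = basis.shape[1]
    # Normal equations with Tikhonov (ridge) regularization.
    gram = basis.T @ basis + lam * np.eye(k)
    return np.linalg.solve(gram, basis.T @ residual)

# Synthetic stand-in data (assumption): 68 landmarks, 10 basis components.
rng = np.random.default_rng(0)
n_landmarks, n_atoms = 68, 10
basis = rng.standard_normal((2 * n_landmarks, n_atoms))
mean_landmarks = rng.standard_normal(2 * n_landmarks)
true_coeffs = rng.standard_normal(n_atoms)
target_landmarks = mean_landmarks + basis @ true_coeffs

# Without regularization the noise-free coefficients are recovered;
# a larger lam shrinks the coefficients toward the average model.
coeffs_unreg = fit_ridge_coefficients(basis, mean_landmarks, target_landmarks, lam=0.0)
coeffs_reg = fit_ridge_coefficients(basis, mean_landmarks, target_landmarks, lam=50.0)
```

In this sketch the regularization weight trades landmark fidelity against how far the fitted shape may deviate from the average model; how the paper weights individual dictionary components is not stated in the abstract and is not modeled here.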

FERRARI, C., LISANTI, G., BERRETTI, S., DEL BIMBO, A. (2017). A Dictionary Learning based 3D Morphable Shape Model. IEEE TRANSACTIONS ON MULTIMEDIA, 19, 2666-2679 [10.1109/TMM.2017.2707341].