Highly accurate gaze estimation using a consumer RGB-depth sensor

Reza Ghiass, Ognjen Arandelovic

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Downloads (Pure)

Abstract

Determining the direction in which a person is looking is an important problem in a wide range of HCI applications. In this paper we describe a highly accurate algorithm that performs gaze estimation using an affordable and widely available device such as Kinect. The method we propose starts by performing accurate head pose estimation achieved by fitting a person specific morphable model of the face to depth data. The ordinarily competing requirements of high accuracy and high speed are met concurrently by formulating the fitting objective function as a combination of terms which excel either in accurate or fast fitting, and then by adaptively adjusting their relative contributions throughout fitting. Following pose estimation, pose normalization is done by re-rendering the fitted model as a frontal face. Finally gaze estimates are obtained through regression from the appearance of the eyes in synthetic, normalized images. Using EYEDIAP, the standard public dataset for the evaluation of gaze estimation algorithms from RGB-D data, we demonstrate that our method greatly outperforms the state of the art.
Original languageEnglish
Title of host publicationProceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence
Subtitle of host publicationNew York City, USA, 9–15 July 2016
EditorsSubbarao Kambhampati
Place of PublicationPalo Alto
PublisherAAAI Press
Pages3368-3374
Publication statusPublished - 9 Jul 2016
Event25th International Joint Conference on Artificial Intelligence - New York, United States
Duration: 9 Jul 201615 Jul 2016
http://ijcai-16.org/index.php/welcome/view/home

Conference

Conference25th International Joint Conference on Artificial Intelligence
Country/TerritoryUnited States
CityNew York
Period9/07/1615/07/16
Internet address

Fingerprint

Dive into the research topics of 'Highly accurate gaze estimation using a consumer RGB-depth sensor'. Together they form a unique fingerprint.

Cite this