
Gary R. Bradski, Microcomputer Research Lab, Santa Clara, CA, Intel Corporation
Index Words: computer vision, face tracking, mean shift algorithm, perceptual user interface, 3D graphics interface
Abstract
As a first step towards a perceptual user interface, a computer vision color tracking algorithm is developed and applied towards tracking human faces. Computer vision algorithms that are intended to form part of a perceptual user interface must be fast and efficient. They must be able to track in real time yet not absorb a major share of computational resources: other tasks must be able to run while the visual interface is being used. The new algorithm developed here is based on a robust non-parametric technique for climbing density gradients to find the mode (peak) of probability distributions called the mean shift algorithm. In our case, we want to find the mode of a color distribution within a video scene. Therefore, the mean shift algorithm is modified to deal with dynamically changing color probability distributions derived from video frame sequences. The modified algorithm is called the Continuously Adaptive Mean Shift (CAMSHIFT) algorithm. CAMSHIFT's tracking accuracy is compared against a Polhemus tracker. Tolerance to noise, distractors and performance is studied.
CAMSHIFT is then used as a computer interface for controlling commercial computer
games and for exploring immersive 3D graphic worlds.
Page 1 of 11