Digital video processing is becoming widely used in many aspects of our nowadays life. The availability of high-computation-power systems allows processing of huge amount of raw data to achieve content based functionalities, such as search and manipulation of objects, semantic description of scenes, detection of unusual events, and recognition of objects. This work applies the probabilistic framework for spatio-temporal video representation to supply infrastructure for face tracking in video sequences. It examines the ability to detect skin color regions in [L,a,b] color space, and handle the inherent over-segmentation problem of the framework.
It also compares 3 approaches for using the motion information to extend the feature space to distinguish between blobs that occlude each other and move at different velocity during the clip: Connected Components Analysis, Learning GMM model and Frame-by-Frame tracking. The following figures illustrates the enhanced segmentation capabilities when working in the extended feature space.
For more details see the slides
For problems or
questions regarding this web page contact