
Traditional computer interfaces have very limited input capabilities, typically restricted to keyboard typing and mouse manipulations (pointing, selecting, dragging, etc.). The area of vision-based interaction seeks to provide a wider and more expressive range of input capabilities by using computer vision techniques to process sensor data from one or more cameras in real-time, in order to reliably estimate relevant visual information about the user – i.e., to use vision as a passive, non-intrusive, non-contact input modality for human-computer interaction.