|
We present an algorithm designed for navigating around a performance that
was filmed as a "casual" multi-view video collection: real-world footage
captured on hand held cameras by a few audience members. The objective is to
easily navigate in 3D, generating a video-based rendering (VBR) of a
performance filmed with widely separated cameras. Casually filmed events are
especially challenging because they yield footage with complicated
backgrounds and camera motion. Such challenging conditions preclude the use
of most algorithms that depend on correlation-based stereo or 3D
shape-from-silhouettes.
The project aims to infer the poses of a character acting in a environment
filmed by a set of video cameras. Once his poses are estimated, a free-viewpoint
video of the entire action can be genertated.
|