Retrieving Objects From Videos Based on Affine Regions
Vittorio Ferrari, Tinne Tuytelaars and Luc Van Gool
In: EUSIPCO, September 2004, Vienna, Austria.
We present a method to (semi-)automatically
annotate video material.
More precisely, we focus on recognizing specific objects and scenes
Objects are learnt simply by having the user delineate
them in one (or a few) images.
The basic building block to achieve this goal consists of
affine invariant regions. These are local image patches that adapt
their shape based on the image content so as to be invariant
to viewpoint changes.
Instead of simply matching the regions
and counting the number of matches, we propose to
gather more evidence about the presence of the object by
exploring the image around the initial matches.
This boosts the performance, especially under difficult, real-world imaging
Experimental results on news broadcast
data demonstrate the viability of the approach.
|EPrint Type:||Conference or Workshop Item (Paper)|
|Project Keyword:||Project Keyword UNSPECIFIED|
|Deposited By:||Vittorio Ferrari|
|Deposited On:||28 November 2005|