This work demonstrates fully automatic recognition and reconstruction of 3D objects in image databases. This work has led to the first large scale systems for recognising and reconstructing 3-dimensional objects from the internet.
We pose the object recognition problem as one of finding consistent matches between all images, subject to the constraint that the images were taken from a perspective camera. We assume that the objects or scenes are rigid. For each image we associate a camera matrix, which is parameterised by rotation, translation and focal length. We use invariant local features to find matches between all images, and the RANSAC algorithm to find those that are consistent with the fundamental matrix.Objects are recognised as subsets of matching images. We then solve for the structure and motion of each object, using a sparse bundle adjustment algorithm.
Our results demonstrate that it is possible to recognise and reconstruct 3D objects from an unordered image database with no user input at all.