UnivIS
Information system of Friedrich-Alexander-University Erlangen-Nuremberg © Config eG 
FAU Logo
  Collection/class schedule    module collection Home  |  Legal Matters  |  Contact  |  Help    
search:      semester:   
 
 Layout
 
printable version

 
 
Appearance-based statistical object recognition

The goal of the project, which is carried out at the Graduate Research Center "3D Image Analysis and Synthesis", is the classification and localization of 3D objects in images. An appearance-based approach is applied. We don't use any segmentation process, which detects geometric features like edges or corners. 2-D local feature vectors are determined directly from pixel intensities in gray level images. They are computed using a wavelet transformation. The components of the feature vectors are statistically modeled as normal distributed. In this way illumination changes and noises can be handled.

In real scenes and applications objects could be placed on heterogeneous background or be partly occluded. This is why we introduced a separate background model. In the recognition phase the decision is made, which feature vectors belong to the object and which to the background. The components of the background vectors are then modeled using the uniform distribution.

A so called global assignment function makes it possible to recognize more than one object in a scene. The number of objects in an image is unknown. A special abort criterion decides, when the searching process ends. The finding of an efficient working abort criterion is very important.

The learning process begins with the image acquisition of all possible object classes in many known poses. In the laboratory environment, the images are taken with a special setup with turntable and camera arm. In real problems of object recognition in images, it is much easier to record the objects using a hand-held camera. For this reason we propose a new approach for object recognition, where the image acquisition is done in this way. The poses of the objects in all training frames are computed using a so called structure-from-motion algorithm. The image acquisition process is, therefore, much easier, but we have to deal with an additional training inaccuracy.

In order to evaluate the object recognition system we took a very large image data base 3D-REAL-ENV. With more than 30000 training images and more than 8000 test images with real heterogeneous background different algorithms can be objective compared. The illumination in the test images is different from the illumination in the training phase.

In the last time the color modeling of the object is introduced. We use 6-D local feature vectors in this case, whereas the wavelet transformation is performed separately for the red, the green, and the blue channel. The classification rate improved in this way from 55.4% (gray level modeling) to 87.3% (color modeling). The localization rate increased from 69.0% (gray level modeling) to 77.1% (color modeling).

Project manager:
Prof. em. Dr.-Ing. Dr.-Ing. h.c. Heinrich Niemann

Project participants:
Prof. Dr.-Ing. Marcin Grzegorzek

Keywords:
object recognition, object localization, object classification, statistical modeling, wavelets

Duration: 1.6.1999 - 30.6.2006

Sponsored by:
Deutsche Forschungsgemeinschaft

Contact:
Grzegorzek, Marcin
E-Mail: marcin.grzegorzek@imi.uni-luebeck.de
Publications
Grzegorzek, Marcin ; Niemann, Heinrich: Statistical Object Recognition Including Color Modeling. In: Kamel, Mohamed ; Campilho, Aurelio (Ed.) : Proceedings of the 2nd International Conference on Image Analysis and Recognition (2nd International Conference on Image Analysis and Recognition Toronto 28-30.09.2005). Berlin : Springer, 2005, pp 481-489. - ISBN 10 3-540-29069-9
Reinhold, Michael ; Grzegorzek, Marcin ; Denzler, Joachim ; Niemann, Heinrich: Appearance-Based Recognition of 3-D Objects by Cluttered Background and Occlusions. In: Pattern Recognition 38 (2005), No. 5, pp 739-753
Grzegorzek, Marcin ; Scholz, Ingo ; Reinhold, Michael ; Niemann, Heinrich: Fast Training for Object Recognition with Structure-from-Motion. In: Pattern Recognition and Image Analysis: Advanced in Mathematical Theory and Applications 15 (2005), No. 1, pp 183-186

Institution: Chair of Computer Science 5 (Pattern Recognition)
UnivIS is a product of Config eG, Buckenhof