Curious George: An Attentive Semantic Robot
D. Meger, P.-E. Forssén, K. Lai, S. Helmer, S. McCann, T. Southey, M. Baumann, J. J. Little, D. G. LoweRobotics and Autonomous Systems Journal
Volume 56, Number 6, Pages 503-511
June 2008
Abstract
State-of-the-art methods have recently achieved impressive performance for recognising the objects present in large databases of pre-collected images. There has been much less focus on building embodied systems that recognise objects present in the real world. This paper describes an intelligent system that attempts to perform robust object recognition in a realistic scenario, where a mobile robot moving through an environment must use the images collected from its camera directly to recognise objects. To perform successful recognition in this scenario, we have chosen a combination of techniques including a peripheral-foveal vision system, an attention system combining bottom-up visual saliency with structure from stereo, and a localisation and mapping technique. The result is a highly capable object recognition system that can be easily trained to locate the objects of interest in an environment, and subsequently build a spatial-semantic map of the region. This capability has been demonstrated during the Semantic Robot Vision Challenge, and is further illustrated with a demonstration of semantic mapping. We also empirically verify that the attention system outperforms an undirected approach even with a significantly lower number of foveations.
Full Paper
Portable document format file PDF (This system won the SRVC Comptetition at the AAAI07 conference in Vancouver. [Curious George page]
Bibtex entry
@Article{meger08, author = {D. Meger and P.-E. Forss{\'e}n and K. Lai and S. Helmer and S. McCann and T. Southey and M. Baumann and J. J. Little and D. G. Lowe}, title = {Curious George: An Attentive Semantic Robot}, journal = {Robotics and Autonomous Systems Journal}, year = {2008}, OPTkey = {}, volume = {56}, number = {6}, pages = {503-511}, month = {June}, issn = {0921-8890}, publisher = {Elsevier Ltd.} }
Per-Erik Forssén
Contact:
Computer Vision Laboratory
Department of Electrical Engineering
Building B
Room 2D:521
SE-581 83 Linköping, Sweden
+46(0)13 285654
< >
My pages:
Informationsansvarig: Per-Erik Forssén
Senast uppdaterad: 2024-09-28