Back to Search Start Over

Forward engineering object recognition : a scalable approach

Authors :
James J. DiCarlo.
Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences.
Pinto, Nicolas
James J. DiCarlo.
Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences.
Pinto, Nicolas
Publication Year :
2011

Abstract

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2011.<br />This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.<br />Cataloged from student-submitted PDF version of thesis.<br />Includes bibliographical references (p. 254-302).<br />The ease with which we recognize visual objects belies the computational difficulty of this feat. Despite the concerted efforts of both biological and computer vision research communities over the last forty years, human-level visual recognition remains an unsolved problem. The impact of a robust yet inexpensive solution would dramatically change computer science and neuroscience, unleashing a host of innovative applications in our modern society. In this thesis, we identify two operational barriers that have obstructed progress towards finding a solution { namely the lack of clear indicators and operational definitions of success, and the currently limited exploration of the staggeringly large hypothesis space of biologically- inspired solutions. To break down these barriers, we first establish new neuroscience-motivated baselines and new suites of fully-controlled benchmarks for object and face recognition. We also compare and contrast a variety of high-level visual systems, both artificial (state-of-the- art computer vision) and biological (humans). Then, we propose a simple high-throughput approach to undertake a systematic exploration of the biologically-inspired model class. By leveraging recent advances in massively parallel computing, we show that it is possible to generate a multitude of candidate models, screen them for desirable properties and discover robust solutions. Finally, we validate the scalability of our approach by showing its potential on large-scale real-world" applications. Taken together, this thesis represents a humble attempt towards an integrated approach to the problem of brain-inspired object recognition { spanning the engineering, specification, evaluation, and application of an interesting set of biologically-inspired ideas, driven and enabled by massively parallel technology. Even relatively early instantiations of this approach yield algorithms that achieve state-of-the-art performance in object recognition tasks and can generalize<br />by Nicolas Pinto.<br />Ph.D.

Details

Database :
OAIster
Notes :
[6], ii, 302 p., application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1134462152
Document Type :
Electronic Resource