Object Recognition with and without Objects

Object Recognition with and without Objects

Zhuotun Zhu, Lingxi Xie, Alan Yuille

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
Main track. Pages 3609-3615. https://doi.org/10.24963/ijcai.2017/505

While recent deep neural networks have achieved a promising performance on object recognition, they rely implicitly on the visual contents of the whole image. In this paper, we train deep neural networks on the foreground (object) and background (context) regions of images respectively. Considering human recognition in the same situations, networks trained on the pure background without objects achieves highly reasonable recognition performance that beats humans by a large margin if only given context. However, humans still outperform networks with pure object available, which indicates networks and human beings have different mechanisms in understanding an image. Furthermore, we straightforwardly combine multiple trained networks to explore different visual cues learned by different networks. Experiments show that useful visual hints can be explicitly learned separately and then combined to achieve higher performance, which verifies the advantages of the proposed framework.
Keywords:
Machine Learning: Classification
Machine Learning: Machine Learning
Machine Learning: Neural Networks