Education, Science, Technology, Innovation and Life
Open Access
Sign In

BLSTM Recurrent Neural Network for Object Recognition

Download as PDF

DOI: 10.23977/jaip.2016.11005 | Downloads: 88 | Views: 7461

Author(s)

Yalan Qin 1

Affiliation(s)

1 College of Computer and Information Science, Southwest University, Chongqing, 400715, China

Corresponding Author

Yalan Qin

ABSTRACT

Multi-object relationship information can help eliminate some incorrect combinations or locations of objects. Moreover, it is favorable to extract scene information for object recognition. In this paper, we introduce a new way to generate image representation and propose a deep learning framework to fuse the contextual dependencies among objects and scene information in an image. It adopts a bidirectional long short-term memory recurrent neural network (BLSTM-RNN) to deal with the problem of variable-length sequence produced by local detectors in different images. Then it is applied to the existing tree context model for further recognition. Experimental results on SUN09 dataset show that our model outperforms the state-of the-art object localization methods.

KEYWORDS

Multi-object Relationship; Object Recognition; BLSTM

CITE THIS PAPER

Yalan Q. (2016) BLSTM Recurrent Neural Network for Object Recognition. Journal of Artificial Intelligence Practice (2016) 1: 25-29.

REFERENCES

[1] P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan, “Object detection with discriminatively trained part-based models,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 9, pp. 1627–1645, Sep. 2010.
[2] D. Hoiem, A. A. Efros, and M. Hebert, “Putting objects in perspective,”Int. J. Comput. Vis., vol. 80, no. 1, pp. 3–15, 2008. 
[3] Graves A, Schmidhuber J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Networks, 2005, 18(5): 602-610.
[4] M. J. Choi, J. J. Lim, A. Torralba, and A. S. Willsky, “Exploiting hierarchical context on a large database of object categories,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), San Francisco, CA, USA,Jun. 2010, pp. 129–136.
[5] Doetsch P, Kozielski M, Ney H. Fast and robust training of recurrent neural networks for offline handwriting recognition [C]//Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on. IEEE, 2014: 279-284.
[6] M. J. Choi, A. Torralba, and A. S. Willsky, “A tree-based context model for object recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 34,no. 2, pp. 240–252, Feb. 2012.

Downloads: 5221
Visits: 171663

Sponsors, Associates, and Links


All published work is licensed under a Creative Commons Attribution 4.0 International License.

Copyright © 2016 - 2031 Clausius Scientific Press Inc. All Rights Reserved.