Ba, R. Kiros, K. Cho, A. Courville, R. Salakhutdinov, R. Zemel, Y. Bengio. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention http://arxiv.org/abs/1502.03044 [3] J. Chorowski, D. Bahdanau, K. Cho, Y. Bengio. End-to-end Continuous Speech Recognition using Attention-based Recurrent Neural Networks http://arxiv.org/abs/1412.1602 [4] J. Elson, J. Douceur, J. Howell, J. Saul. Asirra: A CAPTCHA that Exploits Interest-Aligned Manual Image Categorization. In Proceedings of 14th ACM Conference on Computer and Communications Security (CCS), Association for Computing Machinery, Inc., Oct. 2007 [5] G. Hinton, P. Dayan, M. Revow. Modelling the Manifolds of Images of Handwritten Digits. http://www.cs.toronto.edu/~fritz/absps/manifold.pdf [6] Bayes Rule. http://www.eecs.qmul.ac.uk/~norman/BBNs/Bayes_rule.htm