The Natural Language Decathlon: Multitask Learning as Question Answering. McCann, Bryan; Keskar, Nitish Shirish; Xiong, Caiming; Socher, Richard. 2018.
The Natural Language Decathlon: Multitask Learning as Question Answering. McCann, Bryan; Keskar, Nitish Shirish; Xiong, Caiming; Socher, Richard. 2018.
Attention is all you need. Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N; Kaiser, {\L}ukasz; Polosukhin, Illia. In Advances in Neural Information Processing Systems, bll 5998–6008. 2017.
Non-parametric estimation of Jensen-Shannon Divergence in Generative Adversarial Network training. Sinn, Mathieu; Rawat, Ambrish. 2017.
Wasserstein GAN. Arjovsky, Martin; Chintala, Soumith; Bottou, Léon. 2017.
DRAGNN: A Transition-based Framework for Dynamically Connected Neural Networks. Kong, Lingpeng; Alberti, Chris; Andor, Daniel; Bogatyy, Ivan; Weiss, David. 2017.
Improved Training of Wasserstein GANs. Gulrajani, Ishaan; Ahmed, Faruk; Arjovsky, Mart{\’{\i}}n; Dumoulin, Vincent; Courville, Aaron C. In CoRR, abs/1704.00028. 2017.
Derivation of Backpropagation in Convolutional Neural Network (CNN). Zhang, Zhifei. bl 7. 2016.
ConceptRDF: An RDF presentation of ConceptNet knowledge base. Najmi, Erfan; Malik, Zaki; Hashmi, Khayyam; Rezgui, Abdelmounaam. In Information and Communication Systems (ICICS), 2016 7th International Conference on, bll 145–150. IEEE, 2016.
Globally Normalized Transition-Based Neural Networks. Andor, Daniel; Alberti, Chris; Weiss, David; Severyn, Aliaksei; Presta, Alessandro; Ganchev, Kuzman; Petrov, Slav; Collins, Michael. 2016.
Enriching Word Vectors with Subword Information. Bojanowski, Piotr; Grave, Edouard; Joulin, Armand; Mikolov, Tomas. 2016.
An Ensemble Method to Produce High-Quality Word Embeddings. Speer, Robert; Chin, Joshua. 2016.
Neural Architectures for Named Entity Recognition. Lample, Guillaume; Ballesteros, Miguel; Subramanian, Sandeep; Kawakami, Kazuya; Dyer, Chris. In CoRR, abs/1603.01360. 2016.
LSTM: A Search Space Odyssey. Greff, Klaus; Srivastava, Rupesh Kumar; Koutník, Jan; Steunebrink, Bas R.; Schmidhuber, Jürgen. In CoRR, abs/1503.04069. 2015.
An Empirical Exploration of Recurrent Network Architectures. Józefowicz, Rafal; Zaremba, Wojciech; Sutskever, Ilya. In ICML, Vol. 37JMLR Workshop and Conference Proceedings, F. R. Bach, D. M. Blei (reds.), bll 2342–2350. JMLR.org, 2015.
Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Recurrent Neural Network. Wang, Peilu; Qian, Yao; Soong, Frank K.; He, Lei; Zhao, Hai. In CoRR, abs/1510.06168. 2015.
Convolutional Neural Networks for Sentence Classification. Kim, Yoon. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, {EMNLP} 2014, October 25-29, 2014, Doha, Qatar, {A} meeting of SIGDAT, a Special Interest Group of the {ACL}, bll 1746–1751. 2014.
Glove: Global Vectors for Word Representation. Pennington, Jeffrey; Socher, Richard; Manning, Christopher D. In EMNLP, Vol. 14, bll 1532–1543. 2014.
Generative Adversarial Networks. Goodfellow, Ian J.; Pouget-Abadie, Jean; Mirza, Mehdi; Xu, Bing; Warde-Farley, David; Ozair, Sherjil; Courville, Aaron; Bengio, Yoshua. 2014.
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. Cho, Kyunghyun; van Merrienboer, Bart; Gulcehre, Caglar; Bahdanau, Dzmitry; Bougares, Fethi; Schwenk, Holger; Bengio, Yoshua. 2014.
Retrofitting Word Vectors to Semantic Lexicons. Faruqui, Manaal; Dodge, Jesse; Jauhar, Sujay K.; Dyer, Chris; Hovy, Eduard; Smith, Noah A. 2014.
Neural Machine Translation by Jointly Learning to Align and Translate. Bahdanau, Dzmitry; Cho, Kyunghyun; Bengio, Yoshua. 2014.
Training recurrent neural networks. Sutskever, Ilya. In University of Toronto, Toronto, Ont., Canada. 2013.
Integration of world knowledge for natural language understanding. Ovchinnikova, Ekaterina. Vol. 3. Springer Science \& Business Media, 2012.
Natural Language Understanding and World Knowledge. Ovchinnikova, Ekaterina. In Integration of World Knowledge for Natural Language Understanding, bll 15–37. Atlantis Press, Paris, 2012.
On the difficulty of training Recurrent Neural Networks. Pascanu, Razvan; Mikolov, Tomas; Bengio, Yoshua. 2012.
BLEU: a method for automatic evaluation of machine translation. Papineni, Kishore; Roukos, Salim; Ward, Todd; Zhu, Wei-Jing. In Proceedings of the 40th annual meeting on association for computational linguistics, bll 311–318. Association for Computational Linguistics, 2002.