- [2001 PhD Thesis] Long Short-Term Memory in Recurrent Neural Networks, [Gers' Ph.D. Thesis].
- [2014 ArXiv] Recurrent Neural Network Regularization, [paper].
- [2015 ArXiv] Grid Long Short-Term Memory, [paper], sources: [Tensorflow-GridLSTMCell].
- [2016 ArXiv] Visualizing and Understanding Curriculum Learning for Long Short-Term Memory Networks, [paper].
- [2016 ArXiv] Contextual LSTM (CLSTM) models for Large scale NLP tasks, [paper], [bibtex], sources: [kafkasl/contextualLSTM].
- [2016 ICLR] Visualizing and Understanding Recurrent Networks, [paper].
- [2016 NIPS] Phased LSTM: Accelerating Recurrent Network Training for Long or Event-based Sequences, [paper], sources: [Tensorflow-PhasedLSTMCell].
- [2017 ACML] Nested LSTMs, [paper], sources: [hannw/nlstm], [titu1994/Nested-LSTM].
- [2017 ICLR] Variable Computation in Recurrent Neural Networks, [paper].
- [2018 EMNLP] Simple Recurrent Units for Highly Parallelizable Recurrence, [paper], [bibtex], sources: [taolei87/sru].
- [2018 ICLR] Skip RNN: Learning to Skip State Updates in Recurrent Neural Networks, [paper], [homepage], sources: [imatge-upc/skiprnn-2017-telecombcn].