GitHub - jostmey/rwa: Machine Learning on Sequential Data Using a Recurrent Weighted Average

Description

This repository contains the code for a new kind of RNN model for processing sequential data. The model computes a recurrent weighted average (RWA) over every previous processing step. With this approach, the model can form direct connections anywhere along a sequence. This stands in contrast to traditional RNN architectures, which use only the previous processing step. A detailed description of the RWA model has been published in a manuscript at https://arxiv.org/pdf/1703.01253.pdf.

Because the RWA can be computed as a running average, it does not need to be completely recomputed at each processing step. The numerator and denominator can be saved from the previous step and updated. Consequently, the model scales like other RNN models, such as the LSTM.
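The running-average idea can be illustrated with a minimal scalar sketch. It is not the repository's implementation: the function names are hypothetical, the values and scores are plain numbers standing in for quantities the model would learn, and exponential weights are assumed. It only shows that carrying a numerator and a denominator forward gives the same result as recomputing the weighted average over all previous steps.

```python
import math

def weighted_average_full(values, scores):
    """Recompute the weighted average from scratch over all steps given."""
    weights = [math.exp(a) for a in scores]  # assumed exponential weighting
    return sum(w * z for w, z in zip(weights, values)) / sum(weights)

def weighted_average_running(values, scores):
    """Yield the average after each step, keeping only two running sums."""
    numerator, denominator = 0.0, 0.0
    for z, a in zip(values, scores):
        w = math.exp(a)
        numerator += w * z      # saved from the previous step, then updated
        denominator += w
        yield numerator / denominator

# Hypothetical inputs for illustration only.
values = [0.5, -1.0, 2.0, 0.25]
scores = [0.1, 1.2, -0.7, 0.3]

running = list(weighted_average_running(values, scores))
full = [weighted_average_full(values[:t + 1], scores[:t + 1])
        for t in range(len(values))]
```

The running version does constant work per step, whereas recomputing from scratch does work proportional to the number of steps seen so far.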

In each folder, the RWA model is evaluated on a different task, and its performance is compared against an LSTM model. On most tasks, the RWA model trains faster by at least a factor of five. As the sequences become longer, the RWA model scales even better. See the manuscript listed above for the details of each result.

Note: The RWA model has failed to yield competitive results on natural language problems.

Download

Download: zip
Git: git clone https://github.com/jostmey/rwa

Requirements

The code is written in Python 3. The scripts have been updated to run on TensorFlow version 1.0.

Alternative Implementations

RWA model as TensorFlow RNNCell (My implementation)
RWA model as TensorFlow RNNCell (Not tested)
RWA model in Keras (Reproduced results in paper)
RWA model in Keras (Not tested)
RWA model in PyTorch (Unstable branch - Work in progress)
RWA model in PyTorch (Numerically unstable implementation)
RWA model in Go

Acknowledgements

Thanks to Alex Nichol for correcting the equations for numerical stability.

Corrections (Changelog)

March 17th, 2017: Corrected the equations used to rescale the numerator and denominator terms, which are used to avoid overflow and underflow conditions. Results for the RWA model were recomputed.
March 26th, 2017: Corrected a bug specific to the code for loading the permuted MNIST task. Results for the permuted MNIST task were recomputed.
April 3rd, 2017: Corrected a bug in the LSTM model. This bug affected all the results except for the copy problem. Results for the LSTM model were recomputed. No significant changes in performance were observed.
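
The rescaling mentioned in the March 17th entry can be sketched as follows. This is one common way to keep a running exponentially weighted average within floating-point range, not necessarily the exact equations used in this repository: it assumes exponential weights and tracks the largest score seen so far, rescaling the numerator and denominator whenever that maximum changes. The function name and inputs are hypothetical.

```python
import math

def stable_running_average(values, scores):
    """Running weighted average with weights exp(score), rescaled by the
    largest score seen so far so that no exponential overflows."""
    numerator, denominator = 0.0, 0.0
    score_max = -math.inf
    for z, a in zip(values, scores):
        new_max = max(score_max, a)
        # Rescale the saved terms to the new maximum; on the first step
        # this is exp(-inf) == 0.0, which discards the empty initial state.
        scale = math.exp(score_max - new_max)
        w = math.exp(a - new_max)  # always in (0, 1]
        numerator = numerator * scale + z * w
        denominator = denominator * scale + w
        score_max = new_max
        yield numerator / denominator

# math.exp(1000.0) would raise OverflowError, but the rescaled form is fine.
averages = list(stable_running_average([1.0, 3.0], [1000.0, 1000.0]))
```

Because the numerator and denominator are scaled by the same factor, their ratio is unchanged; only the intermediate magnitudes are kept bounded.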

Folders and files

adding_problem_100
adding_problem_1000
artwork
copy_problem_100
copy_problem_1000
length_problem_100
length_problem_1000
mnist
mnist_permuted
reber_grammar
LICENSE
README.md
