Skip to content

Paper overview | Neural Turing Machines by Google DeepMind

Seminar overview of the third article produced by Google DeepMind. This one again contains conceptual novelties: adding external memory to machine learning pipeline (using an Artificial Neural Network as a Controller, which decides how to use this memory). System is differentiable, meaning that you can give it inputs, show the outputs it should produce, define an error-function (cross-entropy in this case) and then train the whole thing using gradient descent. The amazing outcome is that the system learns not the statistical relations between the input and the output as your usual ML, but attempts to learn an algorithm, which allows it to generalize well and perform correctly on problem instances which are bigger or different from what is has been trained on.

Download PDF

No comments yet.

Leave a Reply

Your email address will not be published. Required fields are marked *

Comments (0)