Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)

August 26, 2019Artis Modus

Brandon Rohrer

Part of the End-to-End Machine Learning School course library at http://e2eml.school

Find the rest of the How Neural Networks Work video series in this free online course:
https://end-to-end-machine-learning.teachable.com/p/how-deep-neural-networks-work

A gentle walk through how they work and how they are useful.

Some other helpful resources:
RNN and LSTM slides: http://bit.ly/2sO00ZC
Luis Serrano’s Friendly Intro to RNNs: https://youtu.be/UNmqTiOnRfg
How neural networks work video: https://youtu.be/ILsA4nyG7I0
Chris Olah’s tutorial: http://bit.ly/2seO9VI
Andrej Karpathy’s blog post: http://bit.ly/1K610Ie
Andrej Karpathy’s RNN code: http://bit.ly/1TNCiT9
Andrej Karpathy’s CS231n lecture: http://bit.ly/2tijgQ9
DeepLearning4J tutorial: https://deeplearning4j.org/lstm
RNN/LSTM blog post: https://brohrer.github.io/how_rnn_lstm_work.html
Data Science and Robots blog: https://brohrer.github.io/blog.html

Follow me for announcements: https://twitter.com/_brohrer_

Source

Similar Posts

31 thoughts on “Recurrent Neural Networks (RNN) and Long Short-Term Memory (LSTM)”

allisswellable says:

April 19, 2019 at 7:07 am

Excellent explanation of LSTM in a very simplified and realistic way… Great video!
Abdulrahman Saad says:

April 21, 2019 at 2:52 am

thank you bro
abirami ravi says:

May 4, 2019 at 2:59 pm

Thanks a lot for making everything so simple to understand, which in fact we find it very difficult to understand
Stephen Kong says:

May 7, 2019 at 9:52 am

Is RRN a replacement of Hidden Markov Model?
Sanjay Dhande says:

May 11, 2019 at 11:13 am

Best explanation, Thanks a lot. one cannot understand with text and math so easily. Great and nice work!
Leonardo Bascope says:

May 21, 2019 at 10:07 am

Amazing explanation. Thank you very much!
Temp Name says:

May 21, 2019 at 10:10 pm

I heard minecraft spider at 19:01
Smoked Chicken Drum says:

May 26, 2019 at 8:35 pm

Awesome video indeed. Very abstract concepts explained in very straightforward visual figures. Really appreciate it.
Corug says:

June 1, 2019 at 12:11 pm

thank you!
sarun wiriyapistan says:

June 3, 2019 at 6:31 am

well explanation, good for the one who getting start with machine learning and neuron network.
Edvins Dancigs says:

June 4, 2019 at 8:40 pm

amazing!
EggShot says:

June 15, 2019 at 1:07 am

4:30 "it's Tuesday" -> Takes some Nasal Spray? 😀 Anyone else noticed?
Great video, btw. Still watching but you'll probably end up saving my ass <3
Sean says:

June 15, 2019 at 3:29 pm

This was fantastic, thank you so much for taking the effort to do all of this, it has really helped me get started.
Stefano Giannini says:

June 18, 2019 at 4:13 am

Thank You, great video
babakzamin says:

June 24, 2019 at 11:08 am

What type of LSTM is he explaining?! It seems somehow different from the standard version of LSTM where there are two bus of information. It's something like GRU. The explanation of LSTM was more simpler, but it was also good.
franeklubi says:

June 24, 2019 at 3:31 pm

I love You man, great video
031neezy says:

June 28, 2019 at 6:44 am

Brilliant video! Thanks!!
xTORREfiolx says:

July 11, 2019 at 2:09 am

As a complete newbie to NN, your explanation of both RNN and LSTM was amazing! However, a couple of question are still unsolved:

1. What does the ignoring gate do concretely? I suppose that, if the initial NN predicted 'Jane'/'Spot'/'.' the ignoring NN would stop the '.' from going through. But if the initial NN is trained like in the example of the video, the ignoring gate would be useless all the time…

2. I suppose that the selection gate did nothing in the video because the prediction had only 2 elements. Its effect would be noticeable if there were 100 elements and the selection gate would only let through ~ 10 of them?

3. This may be a dumb question, but… If all NN receive the same input (previous prediction + new input), the only way to get different results would be the initial random state, right? How can they play their role, if the inputs and the NN type are the same?

4. Finally, I checked Chris Olah's tutorial and I saw that the diagrams differ. In particular, the names differ: going by chronological order, your "ignoring" gate would be Chris' "forgetting" one, your "forgetting" would be Chris' "input" and finally your "selection" would be Chris' "output". Is it right? Also, in your diagram the prediction that will become the output is squished by the tanh function and gated, while Chris' is untouched (tanh and gate are only present in what is feeded in the next iteration of the LSTM).

Hope you can clarify some of my doubts, and thank you again for this video!
Aryan Mn says:

July 13, 2019 at 4:05 am

Its best and still i dont get it
Shit
Ahmad Alghooneh says:

July 17, 2019 at 12:07 pm

best, bestttt where have you been man?
Crutz says:

July 18, 2019 at 3:42 pm

Thank you!
Dream Worker 65524 says:

July 26, 2019 at 4:58 pm

Yes, this is the best LSTM video on YouTube. Today is 27th July 2019.
Linu Bajy says:

July 28, 2019 at 3:14 am

This was a great video for a beginner like me. Thanks a lot !! Looking forward seeing more videos like these.
Francesco Califano says:

July 29, 2019 at 5:10 am

I'm preparing my university exam about "Data Science and machine learning", these videos are pure gold for someone who is approaching this topics for the first time like me. Thank you so much, it was really worth the time spent to watch it.
Popo Baba says:

July 30, 2019 at 11:52 am

That was a fantastic explanation, video, and LSTM example. Thank you! Also, cool beard : )
iPoopDinosaurs says:

July 30, 2019 at 12:09 pm

Dwight Schrute imposter dictionary: {Bears, Beets, Battlestar Galactica, .}

Mistakes a Dwight Schrute imposter RNN can make:

"Bears. Beets. Battlestar Galactica."
Vipul Petkar says:

August 4, 2019 at 4:52 am

Please make the Same video about Transformers neural network
Dream Worker 65524 says:

August 14, 2019 at 2:49 am

Sorry to say that, Karpathy's lecture notes, particularly the alien language, bring me the worst study experience in ML.
Sridhar Sundaraju says:

August 20, 2019 at 7:11 am

Brandon has applied extraordinary skills in communicating the difficult (convoluted?) topic and concepts on LSTM, in simple and comprehensible language. The example on writing the "Children Text Book" brings out all of the major processes into sharp focus. I have learnt immensely. I will now be studying other videos by Brandon. Thanks a lot.
AweSam says:

August 23, 2019 at 3:19 am

this video needs to be more popular. ;–; thankyou sir
Kartik Podugu says:

August 24, 2019 at 6:06 pm

Hi Brandon, @8:59, a RNN is cell shown. what is in the small square, is it again a NN or just a single layer? Similar question regarding lot of small squares used in LSTM cell @17:30

Comments are closed.