deeplizard
In this video, we explain the concept of activation functions in a neural network and show how to specify activation functions in code with Keras.
Check out the corresponding blog and other resources for this video at: http://deeplizard.com/learn/video/m0pIlLfpXWE
Follow deeplizard on Twitter:
https://twitter.com/deeplizard
Follow deeplizard on Steemit:
https://steemit.com/@deeplizard
Become a patron:
https://www.patreon.com/deeplizard
Support deeplizard:
Bitcoin: 1AFgm3fLTiG5pNPgnfkKdsktgxLCMYpxCN
Litecoin: LTZ2AUGpDmFm85y89PFFvVR5QmfX6Rfzg3
Ether: 0x9105cd0ecbc921ad19f6d5f9dd249735da8269ef
Recommended books:
The Most Human Human: What Artificial Intelligence Teaches Us About Being Alive: http://amzn.to/2GtjKqu
*Note: starting at 1:09, the denominator of the sigmoid function should be e^x + 1 rather than e^(x+1).*
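For reference, the corrected formula is sigmoid(x) = e^x / (e^x + 1), which is equivalent to 1 / (1 + e^(-x)). A minimal NumPy sketch (not from the video):

import numpy as np

def sigmoid(x):
    # sigmoid(x) = e^x / (e^x + 1) == 1 / (1 + e^(-x))
    return 1.0 / (1.0 + np.exp(-x))

print(sigmoid(0.0))  # 0.5
print(sigmoid(6.0))  # ~0.9975, already close to 1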
Machine Learning / Deep Learning Tutorials for Programmers playlist: https://www.youtube.com/playlist?list=PLZbbT5o_s2xq7LwI2y8_QtvuXZedL6tQU
Keras Machine Learning / Deep Learning Tutorial playlist: https://www.youtube.com/playlist?list=PLZbbT5o_s2xrwRnXk_yCPtnqqo4_u2YGL
Which programming language are you using?
Thanks a lot, ma'am! This helped a lot.
In ReLU, the more positive the input is, the more activated the node is. For 1 it'll be activated, and for 3 it'll be activated too! Then what's the difference between different positive values? The node is going to be activated anyway.
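For what it's worth, ReLU isn't a binary on/off gate: it passes positive inputs through unchanged, so relu(1) = 1 and relu(3) = 3, and the larger activation contributes a proportionally stronger signal to the next layer. A minimal sketch (not from the video):

import numpy as np

def relu(x):
    # ReLU returns the input itself when positive, 0 otherwise,
    # so the magnitude of a positive activation is preserved.
    return np.maximum(0.0, x)

print(relu(np.array([-2.0, 1.0, 3.0])))  # [0. 1. 3.]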
Amazing tutorials!
Thank you so much
1:11 Where does the six come from? Is it a trainable value like a bias, or is it the number of nodes in the active layer, or is it always six? In other words, why is the sigmoid's input limited to six?
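Most likely the ±6 is just the plotting range of the graph shown, not a parameter of the function: sigmoid saturates outside roughly [-6, 6], so plots are commonly clipped there. A quick check (my sketch):

import numpy as np

x = np.array([-6.0, 0.0, 6.0])
# sigmoid is essentially flat beyond +/-6
print(1.0 / (1.0 + np.exp(-x)))  # [~0.0025, 0.5, ~0.9975]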
first and foremost,
thank you for amazing explain,
Hi, I would like to know which of the independent variables is most significant, i.e. has the highest impact on the dependent variable. My model is a 9-5-1 MLP from which I have extracted the weights and biases. My concern now is how to use those weights to rank the inputs from most relevant to least relevant. Thank you.
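One common (though approximate) approach for a single-hidden-layer network like a 9-5-1 MLP is Garson's algorithm, which aggregates products of absolute weights. A minimal NumPy sketch, with illustrative variable names and random weights standing in for the extracted ones:

import numpy as np

def garson_importance(W1, w2):
    # W1: (n_inputs, n_hidden) input-to-hidden weights
    # w2: (n_hidden,) hidden-to-output weights
    # Returns the relative importance of each input, summing to 1.
    c = np.abs(W1) * np.abs(w2)           # contribution of input i via hidden node j
    r = c / c.sum(axis=0, keepdims=True)  # normalize contributions per hidden node
    imp = r.sum(axis=1)                   # aggregate over hidden nodes
    return imp / imp.sum()

rng = np.random.default_rng(0)
print(garson_importance(rng.normal(size=(9, 5)), rng.normal(size=5)))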
You're my favorite Machine Learning teacher, please keep making such videos.
You never explain WHY we use an activation function; you just show us HOW it works.
Great video, thanks!
Nice explanation. What is softmax, and why is it so widely used in deep learning compared to other activation functions?
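For context (not from the video): softmax turns a vector of raw scores into a probability distribution, which is why it is the usual choice for the output layer of a multi-class classifier, while hidden layers typically use something like ReLU. A minimal sketch:

import numpy as np

def softmax(z):
    # Subtract the max for numerical stability; the result sums to 1.
    e = np.exp(z - np.max(z))
    return e / e.sum()

print(softmax(np.array([2.0, 1.0, 0.1])))  # approx [0.659, 0.242, 0.099]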
You saved tomorrow's ML exam for me.
I've just started ML, and thanks to you I'm able to digest the Coursera lectures practically. You've earned a subscriber!
Simple, but it helped me out a lot!
Thank you so much 🙂 Keep making such amazing videos please.
Does deep learning with Keras require no skills?
One of my friends said that coding with Keras is irrelevant :'-(
Hi, I found this somewhere: "Because of the horizontal line in ReLU for x < 0, the gradient can go towards 0. For activations in that region of ReLU, the gradient will be 0, causing the weights not to get adjusted during descent. Hence those neurons die and stop responding, making part of the NN passive."
I don't understand why a gradient of 0 (gradient in this context seems to refer to the gradient of the ReLU function rather than the gradient of the loss function) prevents the weights from getting adjusted. Please help!
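A possible way to see it (my sketch, not from the video): by the chain rule, the loss gradient for a weight feeding a ReLU unit contains the factor relu'(z). If the unit's pre-activation z is negative for every training input, that factor is always 0, so the loss gradient for those weights is 0 too, no update ever happens, and the unit stays "dead":

import numpy as np

# One ReLU unit: a = relu(w*x + b). Chain rule:
# dL/dw = dL/da * relu'(z) * x, where relu'(z) = 0 when z < 0.
w, b, x = -1.0, -0.5, 2.0
z = w * x + b                      # z = -2.5 < 0
relu_grad = 1.0 if z > 0 else 0.0  # derivative of ReLU at z
dL_da = 1.0                        # whatever upstream gradient arrives
dL_dw = dL_da * relu_grad * x
print(dL_dw)  # 0.0 -> this example never updates the weight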
thank you very much
This is totally the wrong way to explain why we need an activation function. We hide behind the biological analogy with neurons to explain why an activation function is needed.
The answer lies in linearity and non-linearity.
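To illustrate the non-linearity point (my sketch): stacking linear layers without activations collapses to a single linear map, so the network can never learn anything one layer couldn't; a non-linear activation between layers breaks that collapse.

import numpy as np

rng = np.random.default_rng(1)
W1, W2 = rng.normal(size=(4, 3)), rng.normal(size=(2, 4))
x = rng.normal(size=3)

# Two linear layers with no activation in between...
two_layers = W2 @ (W1 @ x)
# ...equal one linear layer with the combined weight matrix.
one_layer = (W2 @ W1) @ x
print(np.allclose(two_layers, one_layer))  # True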
Thank you for these videos! It's so easy to follow your explanations.
The activation function happens in the second half of a node, so it might be misleading to show all the weights pointing to the next hidden layer.
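In Keras this distinction is visible in code: the activation can be specified inline as part of a Dense layer, or as a separate Activation layer applied after the layer's linear output. A minimal sketch (layer sizes are illustrative):

from keras.models import Sequential
from keras.layers import Dense, Activation

model = Sequential([
    # Activation fused into the layer...
    Dense(32, input_shape=(10,), activation='relu'),
    # ...or applied as its own step after the linear part.
    Dense(2),
    Activation('softmax'),
])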
Without a doubt, you are one of the best instructors I have ever seen. I absolutely love your channel. Well done, and thank you so much for your great help. You saved my life!
Perfect explanation.
Extremely wonderful explanation, <3 it!
Love your voice
Can I access this Jupyter notebook?