Mask Region based Convolution Neural Networks – EXPLAINED!

February 26, 2018 | Artis Modus

CodeEmporium

In this video, we take a look at a new type of neural network architecture called the “Mask Region-based Convolutional Neural Network”, or Mask R-CNN for short. In the process, we highlight some key subproblems in computer vision.

Please SUBSCRIBE to the channel for more content on Machine Learning, Deep Learning, Data Science, and Artificial Intelligence. Hoping to build a community of AI geeks. You’ll fit right in!

REFERENCES

Main paper: https://arxiv.org/pdf/1703.06870v3.pdf
Code: https://github.com/facebookresearch/Detectron

Convolutional neural networks: https://www.youtube.com/watch?v=m8pOnJxOcqY
Semantic segmentation in deep learning: http://blog.qure.ai/notes/semantic-segmentation-deep-learning-review
Top papers: http://www.arxiv-sanity.com/top?timefilter=alltime&vfilter=all
Recurrent Instance Segmentation: http://www.robots.ox.ac.uk/~tvg/publications/2016/RIS7.pdf
Mask R-CNN Presentation by the Author: https://www.youtube.com/watch?v=g7z4mkfRjI4
Mark Jay’s Video: https://www.youtube.com/watch?v=2TikTv6PWDw
COCO dataset: http://cocodataset.org/#home
Fully Convolutional Networks: https://people.eecs.berkeley.edu/~jonlong/long_shelhamer_fcn.pdf
Faster R-CNN explained: https://medium.com/@smallfishbigsea/faster-r-cnn-explained-864d4fb7e3f8
Notes/summary of Mask R-CNN: http://www.shortscience.org/paper?bibtexKey=journals/corr/HeGDG17#aleju

Music: https://www.bensound.com/royalty-free-music/track/tenderness

37 thoughts on “Mask Region based Convolution Neural Networks – EXPLAINED!”

Tummala Anvesh says:

February 26, 2018 at 5:42 pm

Great video, keep rocking.
Tiago Freitas says:

March 1, 2018 at 2:59 am

Great Explanation, will follow your videos! Thanks for the share
Deepak Umredkar says:

March 14, 2018 at 6:43 am

How do I prepare my own dataset for this? I don't want to use the COCO dataset.
Thank you.
apple-sauce says:

March 25, 2018 at 8:36 am

Thanks!!!
Mark Jay says:

March 27, 2018 at 12:06 am

nice explanation. subbed
zzz says:

April 3, 2018 at 8:10 pm

This is great! Please keep on making stuff like this xD.
zishan ahmed says:

April 4, 2018 at 11:33 am

Great work, dude. Subscribed.
Ha Nguyen says:

April 16, 2018 at 1:29 am

At 6:41 what is "analog is 2 a 1 versus rest approach"? Thank you very much.
杨凡凡 says:

April 17, 2018 at 8:29 pm

I want to classify body movements. What are your ideas?
Henry Dozie says:

May 12, 2018 at 9:29 pm

Can I please get the ppt?
Amazing video
Ciao! says:

May 14, 2018 at 3:29 pm

Excellent video!
user123 says:

May 17, 2018 at 7:59 am

Nice work man!!!!
Rahul Deora says:

June 17, 2018 at 11:24 am

At 3:48, how exactly does max pooling give rotational invariance? I understand translational invariance, but a rotation would cause different features to be activated.
Facundo Calcagno says:

June 27, 2018 at 5:32 am

Great video and explanation. I used the model to detect road lanes on the CULane dataset, with very good results. The project is on GitHub under the name RMASK_Lane_Detection.
shivam sisodiya says:

July 25, 2018 at 7:40 pm

Just found another great tutorial on AI
邵帅 says:

August 16, 2018 at 2:01 am

Great video
Manuel Ignacio Pérez Carrasco says:

October 16, 2018 at 8:42 am

Thank you for the explanation!! Can you share your slides with me?
Qiang Lu says:

October 29, 2018 at 6:25 pm

Good summary and ROI ALIGN description.
Tugba Çekirge says:

November 21, 2018 at 3:19 am

Thanks! m/
Luke C says:

November 26, 2018 at 6:29 pm

Subbed. This is a really, really well-made, easy-to-understand video. Hope to see more from you in the future!
Sanjeet Kumar says:

December 2, 2018 at 6:57 am

very nice explanation. Thanks
Shahriar Shakir Sumit says:

December 5, 2018 at 12:09 am

First, thank you very much for this video. The quality of your videos is very good, and I have started watching them.
Using Mask R-CNN we can detect the human class. From that human class, can we also detect the human face? If so, which algorithm should I use to detect faces? Can you please give me some suggestions? And is it possible to use the same dataset for human detection along with face detection?
Rong Zhou says:

December 24, 2018 at 11:48 pm

Nice explanation especially on the ROI align part! I understood based on your explanation!!! Thanks!
shreyash kawalkar says:

December 29, 2018 at 12:19 am

Awesome. Thanks!!
Oliver Deane says:

December 29, 2018 at 5:38 am

Great explanation, thanks a lot! Can I ask what you mean when you say "when computing the mask, a loss of KM squared is incurred" at 6:44?
Jerrin JOE VARGHESE says:

March 3, 2019 at 11:36 pm

Do you know where I can find the code for it?
mihiri chathurika Amarasingha says:

April 3, 2019 at 8:01 pm

Very detailed video. Thank you very much.
Abhinav Kumar says:

May 21, 2019 at 6:46 pm

Nice, you made it look easy!
Daniel Weikert says:

June 4, 2019 at 9:48 am

Thank you, great work! Is there an easy (beginner-friendly) explanation of how ROI align works?
Archon Southpaw says:

June 30, 2019 at 11:15 pm

preciate you stay blessed
Dreamer Hatfim says:

July 8, 2019 at 5:29 am

If I want to use a pretrained R-CNN on my own dataset to segment (delineate) the background from the foreground, do I need to annotate or label my data? The data I am using consists of images of people.
Hung Pham says:

August 11, 2019 at 7:18 am

The smart home of artificial intelligence????
Hung Pham says:

August 11, 2019 at 7:20 am

Artificial intelligence collecting the behavioral habits of users who often travel along the same route
Hung Pham says:

August 11, 2019 at 7:21 am

Marking frequently visited places
Hung Pham says:

August 11, 2019 at 7:23 am

Automatically marking, distinguishing, and sorting frequently encountered people and frequently visited places into a repository
Remi Daviet says:

October 5, 2019 at 4:39 am

Thank you for taking the time and effort to make this video.
Side note: the creepy whispered "subscribe" at the end of the video has more of a repulsive effect and doesn't really make me want to subscribe (more like making me want to close the video as fast as possible). The positive energy given during the video would probably work a lot better if it were used to ask for subscription too.
monster kumar says:

October 25, 2019 at 8:40 pm

Please make a video related to visual question answering

Comments are closed.