AI learns to Speedrun QWOP using Machine Learning

February 25, 2021Artis Modus

Wesley Liao

UPDATE:
AI was able to surpass the World Record in my new video:
https://youtu.be/82sTpO_EpEc

AI bot learns to play QWOP like a human and achieves a top 10 speedrun (1m 8s). Trained using Reinforcement Learning and Imitation Learning.

Writeup:
https://towardsdatascience.com/achieving-human-level-performance-in-qwop-using-reinforcement-learning-and-imitation-learning-81b0a9bbac96

Github repo:
https://github.com/Wesleyliao/QWOP-RL

Papers mentioned:
– Sample Efficient Actor-Critic with Experience Replay
https://arxiv.org/pdf/1611.01224.pdf
– Deep Q-learning from Demonstrations
https://arxiv.org/pdf/1704.03732.pdf

Kurodo’s channel:
https://www.youtube.com/channel/UCLxJfj_Dq8Ks89tUVR3z7ug

QWOP speedrun leaderboard:
https://www.speedrun.com/qwop

Source

Similar Posts

23 thoughts on “AI learns to Speedrun QWOP using Machine Learning”

WebX says:

April 16, 2021 at 6:07 pm

Right away I'm not such a fan of the fact that you had to demonstrate a stride and then it used that. It feels like it limits the possibilities only to that which humans can already achieve.

A more general purpose AI might take longer to train, but it would have been able to discover special movies on its own. They usually find glitches in the physics engine and exploit those in a way no human can.

Great video and thanks of the upload and explanation!
Guy Smith says:

April 16, 2021 at 7:49 pm

It's frustrating to even watch someone play this game.
Arvind Iyer says:

April 16, 2021 at 8:39 pm

Ok, but why is the runner dude wearing a leotard?
Detector de Mendigo says:

April 17, 2021 at 4:52 pm

when I use my left hand:
Neithan says:

April 17, 2021 at 7:29 pm

2000 Game Devs: Let's Create fun games.
2021 ML/AI: Let's not have nice things, shall we?
robert says:

April 18, 2021 at 2:16 am

did you try training for 20 hours without pretraining? i’m curious how much the extra 12 hours of training helped compared to pretraining
MrHoliday Doc says:

April 18, 2021 at 8:52 am

we live in a strange time where we know flash games from childhood and are intelligent enough to program ai to accomplish what we couldn’t lol
David Sharkey says:

April 18, 2021 at 3:41 pm

That's normally what I look like on the way home from the pub!
Franciszek Zielony says:

April 18, 2021 at 8:33 pm

Fajne
crixus123 crixus123 says:

April 18, 2021 at 10:34 pm

Machine can calculate but never Will understand. So simple.
winnie the poop says:

April 19, 2021 at 12:53 am

Omg this is amazing
Paxmax says:

April 19, 2021 at 2:33 am

I believe the biggest problem for AI in this instance was the lack of Kurodial arteries, therefore restricting the bandwith.
Mat says:

April 19, 2021 at 4:34 am

Well, this reinforces my confidence that computers will never take over the world.
If THAT guy was chasing after me… I'd take his lunch money and then look for his brother.
Awesomesauce says:

April 19, 2021 at 6:48 am

Blast from the past. I remember being in high school when this game was at the height of its popularity. Almost every computer in the lab during lunch had this on it.
nilsvids says:

April 19, 2021 at 12:01 pm

I was very excited to see the final result with 65 hours.. but it turns out it was what we've already seen, what a disappointment
Chandrakant Nimbalkar says:

April 20, 2021 at 6:01 am

Special Olympics!
valerie Shi says:

April 20, 2021 at 7:12 am

May I ask what the reward is? The speed, or something like height of weight center?
Amaroq Starwind says:

April 21, 2021 at 3:30 pm

Ah, Semi-Supervised Learning. That is how machine learning really needs to be done.

Or really…
– A combination of Supervised Learning, Semi-Supervised Learning, and Unsupervised Learning
– Redundancies to prevent it from just always taking the laziest path.
– Hard-coded rules manually put in by a human.
– Humans always kept in the loop, along with Non-AI redundancies.
– Everything done in a safe and controlled environment, instead of a live deployment where it's possible for the AI to do real damage.
Amaroq Starwind says:

April 21, 2021 at 3:41 pm

You should look into Deep Transfer Learning as well.
Joy Kim says:

April 27, 2021 at 4:51 pm

How to train a machine to complete a difficult game for you: Learn it yourself first and teach it.
lukeves says:

June 7, 2022 at 6:08 pm

RIP VANGELIS!
Randall Stephens says:

June 14, 2022 at 11:05 am

"I'm human …" [citation needed] "…and know how to use legs." [citation needed]
Anthony A says:

June 28, 2022 at 7:13 am

this game should be an ultimate AI algorithm benchmark tester, I knew about the game since it first came out and didn't know much about programming until recently so a lot made sense easily once I got the proper vocabulary understood

Comments are closed.