I actually read this paper a couple weeks ago as part of a deep learning reading...

kordless · on Jan 27, 2014

> surpasses a human expert on three of them

I'd hazard it's not very impressed with your Space Invaders score, either.

mandor · on Jan 28, 2014

We worked on a similar experiment two years ago (deep learning + a reinforcement learning algorithm + some innovations, to learn to play Atari 2600 games). We obtained similar scores in the games we tested, but we did not submit a paper because we considered that the scores were not good enough. In particular, for Space Invaders, you can easily get 600 points by hiding behind a shelter while continuously firing, without ever learning how to avoid bullets.

So, I was not impressed by their results on Space Invaders.

Overall, we struggled to learn long-term strategies (finding purely reactive strategies is easy) and to learn to avoid bullets. They did too: "The games Q*bert, Seaquest, Space Invaders, on which we are far from human performance, are more challenging because they require the network to find a strategy that extends over long time scales."

=> that's the real challenge...

bliti · on Jan 27, 2014

About the deep learning group you mentioned: can anyone join?

tansey · on Jan 28, 2014

Sorry, it's for UT Austin PhD students only.

lispsil · on Jan 27, 2014

I suspect Google wants to port this to their D-Wave and probably sell API access to it. Rent an AI.

eru · on Jan 27, 2014

Compare http://wavewatching.net/2014/01/18/scott-aaronson-again-resi... and (more directly) http://www.scottaaronson.com/blog/?p=1643