Ciro Santilli @cirosantilli 37


Ciro's 2D reinforcement learning games Updated 2025-07-16


Prototype: github.com/cirosantilli/Urho3D-cheat

Prior art research: github.com/cirosantilli/awesome-reinforcement-learning-games

Video 1.

Top Down 2D Continuous Game with Urho3D C++ SDL and Box2D for Reinforcement learning by Ciro Santilli (2018)

Source. Source code at: github.com/cirosantilli/Urho3D-cheat.

Figure 1.
Screenshot of the basketball stage of Ciro's 2D continuous game. Source code at: github.com/cirosantilli/rl-game-2d-grid. Big kudos to game-icons.net for the sprites.

Less good discrete prototype: github.com/cirosantilli/rl-game-2d-grid. YouTube demo: Video 2. "Top Down 2D Discrete Tile Based Game with C++ SDL and Boost R-Tree for Reinforcement Learning by Ciro Santilli (2017)".

Video 2.

Top Down 2D Discrete Tile Based Game with C++ SDL and Boost R-Tree for Reinforcement Learning by Ciro Santilli (2017)

Source.

The goal of this project is to reach artificial general intelligence.

A few initiatives have created reasonable sets of robotics-like games for the purposes of AI development, most notably: OpenAI and DeepMind.

However, all projects so far have only created sets of unrelated games, or worse: focused on closed games designed for humans!

What is really needed is to create a single cohesive game world, designed specifically for this purpose, and with a very large number of game mechanics.

Notably, by "game mechanic" we mean "a magic aspect of the game world, which cannot be explained by an object's location and inertia alone", in order to test the missing link between continuous and discrete AI.
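As a rough illustration of that definition, here is a minimal Python sketch of a continuous 2D world with one body that obeys plain inertia, plus a "key and door" mechanic layered on top. Everything in it (the World class, the step function, the key and door positions, the pickup radius) is a hypothetical example and is not taken from either prototype.

```python
from dataclasses import dataclass


@dataclass
class World:
    # Continuous state: position and velocity of a single body.
    x: float = 0.0
    y: float = 0.0
    vx: float = 0.0
    vy: float = 0.0
    # Discrete state introduced by the hypothetical "key" mechanic.
    has_key: bool = False


KEY_POS = (2.0, 0.0)   # hypothetical key location
DOOR_X = 4.0           # hypothetical door: the vertical line x = 4
PICKUP_RADIUS = 0.5    # hypothetical pickup distance


def step(w: World, ax: float, ay: float, dt: float = 0.1) -> World:
    """Advance one tick: plain inertia first, then the mechanic."""
    vx, vy = w.vx + ax * dt, w.vy + ay * dt
    x, y = w.x + vx * dt, w.y + vy * dt
    has_key = w.has_key
    # Mechanic, part 1: touching the key sets a flag that position and
    # inertia alone cannot express.
    if (x - KEY_POS[0]) ** 2 + (y - KEY_POS[1]) ** 2 <= PICKUP_RADIUS ** 2:
        has_key = True
    # Mechanic, part 2: the door stops the body unless the flag is set.
    if w.x < DOOR_X <= x and not has_key:
        x, vx = w.x, 0.0
    return World(x, y, vx, vy, has_key)
```

Two bodies with the same acceleration behave differently at the door depending only on whether they passed near the key earlier, which is the kind of discrete rule an agent would have to discover on top of the continuous physics.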

Much in the spirit of gvgai, we have to do the following loop:

create an initial game that a human can solve
find an AI that beats it well
study the AI, and add a new mechanic that breaks the AI, but does not break a human!
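One pass through this loop can be sketched on a toy grid in Python. This is only a sketch under loud assumptions: the "AI" is a trivial greedy walker, the "human" is approximated by breadth-first search, and the new mechanic is a wall. None of these names or levels come from the prototypes.

```python
from collections import deque

MOVES = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def greedy_agent(pos, apple, walls):
    """Hypothetical 'AI': always step along the axis with the largest gap."""
    dx, dy = apple[0] - pos[0], apple[1] - pos[1]
    if abs(dx) >= abs(dy):
        move = ((dx > 0) - (dx < 0), 0)
    else:
        move = (0, (dy > 0) - (dy < 0))
    nxt = (pos[0] + move[0], pos[1] + move[1])
    return pos if nxt in walls else nxt  # bumping into a wall wastes the turn


def agent_solves(start, apple, walls, max_steps=50):
    """Run the greedy agent and report whether it reaches the apple."""
    pos = start
    for _ in range(max_steps):
        if pos == apple:
            return True
        pos = greedy_agent(pos, apple, walls)
    return pos == apple


def human_can_solve(start, apple, walls, size):
    """Stand-in for a human: breadth-first search finds any existing path."""
    seen, queue = {start}, deque([start])
    while queue:
        pos = queue.popleft()
        if pos == apple:
            return True
        for mx, my in MOVES:
            nxt = (pos[0] + mx, pos[1] + my)
            inside = 0 <= nxt[0] < size and 0 <= nxt[1] < size
            if inside and nxt not in walls and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return False


size, start, apple = 5, (0, 2), (4, 2)
level_1 = set()                      # step 1: open field, trivially solvable
level_2 = {(2, 1), (2, 2), (2, 3)}   # step 3: add a "wall" mechanic

print(agent_solves(start, apple, level_1))           # True: the AI beats level 1
print(agent_solves(start, apple, level_2))           # False: the wall breaks the AI
print(human_can_solve(start, apple, level_2, size))  # True: still solvable
```

The next iteration would replace the greedy walker with an agent that can route around walls, and then look for another mechanic that breaks that agent in turn.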

The question then becomes: do we have enough computational power to simulate a game world that is analogous enough to the real world, so that our AI algorithms will also apply to the real world?

To reduce computation requirements, it is better to focus on a 2D world at first. Such a world with the right mechanics can break any AI, while still being faster to simulate than a 3D world.

The initial prototype uses the Urho3D open source game engine, and that is a reasonable project. However, a raw Simple DirectMedia Layer + Box2D + OpenGL solution from scratch would be faster to develop for this use case, since Urho3D has a lot of human-gaming features that are not needed, and because the 2019 Urho3D lead developers disagree with the China censored keyword attack.

Simulations such as these can be viewed as a form of synthetic data generation procedure, where the goal is to use computer worlds to reduce the costs of experiments and to improve reproducibility.
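The reproducibility point can be illustrated with a small Python sketch: if every source of randomness in the simulated world comes from an explicit seed, the same seed always regenerates the same dataset. The rollout function, the random-walk "agent" and the reward rule below are hypothetical placeholders, not part of the prototypes.

```python
import random

ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]
APPLE = (1, 1)  # hypothetical reward location


def rollout(seed, steps=5):
    """Generate one synthetic episode as (state, action, reward) records."""
    rng = random.Random(seed)  # private generator, no hidden global state
    x, y = 0, 0
    data = []
    for _ in range(steps):
        state = (x, y)
        dx, dy = rng.choice(ACTIONS)      # placeholder random agent
        x, y = x + dx, y + dy
        reward = 1 if (x, y) == APPLE else 0
        data.append((state, (dx, dy), reward))
    return data


# Same seed, same "experiment": the two datasets are identical.
print(rollout(42) == rollout(42))  # True
```

A real simulator would also need a fixed physics time step and deterministic ordering of updates for this property to hold.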

Ciro has always had a feeling that AI research in the 2020s is too unambitious. How many teams are actually aiming for AGI? When he then read Superintelligence by Nick Bostrom (2014), it said the same. AGI research has become a taboo in the early 21st century.

Related projects:

github.com/deepmind/lab2d: 2D gridworld games, C++ with Lua bindings

Related ideas:

www.youtube.com/watch?v=MHFrhIAj0ME?t=4183 Can't get you out of my head by Adam Curtis (2021) Part 1: Bloodshed on Wolf Mountain :)
www.youtube.com/watch?v=EUjc1WuyPT8 AI alignment: Why It's Hard, and Where to Start by Eliezer Yudkowsky (2016)

Bibliography:

agents.inf.ed.ac.uk/blog/multiagent-learning-environments/ Multi-Agent Learning Environments (2021) by Lukas Schäfer from the Autonomous agents research group of the University of Edinburgh. One of their games actually uses apples as visual representation of rewards, exactly like Ciro's game. So funny. They also have a 2D continuous game: agents.inf.ed.ac.uk/blog/multiagent-learning-environments/#mpe
humanoid robot simulation
- 2022 MoCapAct by Microsoft Research: www.microsoft.com/en-us/research/blog/mocapact-training-humanoid-robots-to-move-like-jagger
Section "AI training game"
Section "Software-based artificial life"

Video 3.

DeepMind Has A Superhuman Level Quake 3 AI Team by Two Minute Papers (2018)

Source. Commentary on DeepMind's 2019 Capture the Flag paper. DeepMind does some similar simulations to what Ciro wants, but TODO do they publish source code for all of them? If not Ciro calls bullshit on non-reproducible research. Does this repo contain everything?

Video 4.

OpenAI Plays Hide and Seek... and Breaks The Game! by Two Minute Papers (2019)

Source. Commentary on OpenAI's 2019 hide and seek paper. OpenAI does some similar simulations to what Ciro wants, but TODO do they publish source code for all of them? If not Ciro calls bullshit on non-reproducible research, and even worse due to the fake "Open" in the name. Does this repo contain everything?

Video 5.

Much bigger simulation, AIs learn Phalanx by Pezzza's Work (2022)

Source. 2D agents with vision. Simple prey/predator scenario.


Ciro Santilli's open source contributions / Open source Updated 2025-07-16


Date	Project	Size	Description
2019-04	gnuplot		Why does plotting with point labels make plot generation extremely slow?
2019-04	GDB Dashboard		Limit the size of shown arguments in the Stack display
2018-03	QEMU	2	Test record and replay feature. Also here
2018-02	pandoc		Add option to produce AsciiDoc output without explicit heading ids
2017-10	Android		GLES3 content gles3jni from ndk examples fails with "java.lang.RuntimeException: createContext failed: EGL_BAD_CONFIG"
2017-09	Mozilla rr		How to automatically start replay and go directly to main instead of `_start`?
2017-09	Mozilla rr		Reverse step over time(NULL) enters rr/src/preload/syscall_hook.S and leads to "Cannot find bounds of current function"
2017-08	xsel		Why maximum 4000 characters output with xsel -b ?
2017-06	Buildroot		Don't print multiline struct function arguments on stack when set pretty print on
2017-04	GDB Dashboard		Add style option to print stack arguments on a single line
2017-05	Buildroot		Build fails with "unexpected EOF while looking for matching "'" if PATH contains a newline
2017-04	GDB Dashboard		Add style option to print stack arguments on a single line
2017-03	clBLAS		`.s[0]` + CL_DEVICE_TYPE_ALL
2017-01	game-icons.net		Use multiple separate paths, allow customizing the color of each component, and give a default color
2017-01	game-icons.net		delapouite/originals/svg/brick-wall.svg has some whitespace on top
2017-01	OpenAI Gym		examples/agents/keyboard_agent.py fails with "AttributeError: 'TimeLimit' object has no attribute 'viewer'"
2016-12	Simple DirectMedia Layer		Add C variable printf debug snippets
2015-03	tig		Accepted feature.
2014-11	GitLab		Duplicate
2014-11	GitLab		Bug.
2014-11	GitLab		Support.
2014-11	Bootstrap Hover Dropdown		Bug confirmed.
2014-11	GitLab		Bug confirmed.
2014-11	GitLab		Triaging.
2014-11	GitLab		Problem with the display icons in the left block
2014-11	sass		Bug confirmed.
2014-10	GitLab		Point duplicate.
2014-10	GitLab		Bug confirmed.
2014-10	GitLab		Bug confirmed.
2014-10	Semaphore CI		Bug confirmed.
2014-10	libgit2		Bug confirmed.
2014-10	GitLab		Support.
2014-10	GitLab		Point duplicate.
2014-09	vader.vim		Accepted feature.
2014-09	GitLab		Point already fixed.
2014-09	vader.vim		Accepted feature.
2014-09	GitLab		Bug confirmed.
2014-09	GitLab		Bug confirmed.
2014-09	GitLab		Point duplicate.
2014-09	GitLab		Point already fixed.
2014-08	markdownlint/markdownlint		Accepted feature.
2014-08	softcover		Accepted feature.
2014-08	markdownlint/markdownlint		Accepted feature.
2014-07	GitLab		Bug confirmed.
2014-07	GitLab		Accepted feature.
2014-07	GitLab		Accepted feature.
2014-06	GitLab		Accepted feature.
2014-06	GitLab		Point duplicate.
2014-06	karlcow/markdown-testsuite		Bug confirmed.
2014-06	plasticboy/vim-markdown		Close issue.
2014-06	plasticboy/vim-markdown		Review patch.
2014-06	plasticboy/vim-markdown		Review and patch patch.
2014-05	softcover		Accepted feature.
2014-04	karlcow/markdown-testsuite		Close issue with better issues.
2014-03	tig		Accepted feature.
2014-03	GitLab		Accepted feature.
2014-03	softcover		Accepted feature.
2014-03	GitLab		Add useful information.
2014-03	GitLab		Point duplicate.
2014-03	GitLab		Point duplicate.
2014-03	GitLab		Accepted feature.
2014-02	GitLab		Point duplicate.
2014-02	GitLab		Accepted feature.
2014-02	Overleaf		Feature generated considerable interest.
2014-02	GitLab		Point already fixed.
2014-02	GitLab		Link feature request to patch.
2013-10	yakuake		Bug confirmed.
2013-10	okular		Bug confirmed.
2013-06	krusader		Bug confirmed.
2013-05	NumPy		Bug confirmed + inner cause.
2012-05	krusader		Accepted feature.
2012-05	krusader		Bug confirmed.
2012-05	AutoKey		Bug confirmed.
