- github.com/deepmind/meltingpot TODO vs DeepMind Lab2D? Also 2D discrete. Started in 2021.
- github.com/deepmind/ai-safety-gridworlds mentioned e.g. at www.youtube.com/watch?v=CGTkoUidQ8I by Rober Miles
Creating Multimodal Interactive Agents from DeepMind by Two Minute Papers (2023)
Source. www.deepmind.com/blog/building-interactive-agents-in-video-game-worldsOpen-Ended Learning Leads to Generally Capable Agents by DeepMind (2021)
Short name: XLand. Whitepaper: www.deepmind.com/blog/generally-capable-agents-emerge-from-open-ended-play.We define this category as AI games in which agents are able to produce or consume natural language.
It dawned on Ciro Santilli that it would be very difficult to classify an agent as an AGI if tthat agent can't speak to take orders, read existing human generated documentation, explain what it is doing, or ask for clarification.
Prototype: github.com/cirosantilli/Urho3D-cheat
Top Down 2D Continuous Game with Urho3D C++ SDL and Box2D for Reinforcement learning by Ciro Santilli (2018)
Source. Source code at: github.com/cirosantilli/Urho3D-cheat.Screenshot of the basketball stage of Ciro's 2D continuous game
. Source code at: github.com/cirosantilli/rl-game-2d-grid. Big kudos to game-icons.net for the sprites.Less good discrete prototype: github.com/cirosantilli/rl-game-2d-grid YouTube demo: Video 1. "Top Down 2D Continuous Game with Urho3D C++ SDL and Box2D for Reinforcement learning by Ciro Santilli (2018)".
Top Down 2D Discrete Tile Based Game with C++ SDL and Boost R-Tree for Reinforcement Learning by Ciro Santilli (2017)
Source. The goal of this project is to reach artificial general intelligence.
A few initiatives have created reasonable sets of robotics-like games for the purposes of AI development, most notably: OpenAI and DeepMind.
However, all projects so far have only created sets of unrelated games, or worse: focused on closed games designed for humans!
What is really needed is to create a single cohesive game world, designed specifically for this purpose, and with a very large number of game mechanics.
Notably, by "game mechanic" is meant "a magic aspect of the game world, which cannot be explained by object's location and inertia alone" in order to test the the missing link between continuous and discrete AI.
The question then becomes: do we have enough computational power to simulation a game worlds that is analogous enough to the real world, so that our AI algorithms will also apply to the real world?
To reduce computation requirements, it is better to focus on a 2D world at first. Such world with the right mechanics can break any AI, while still being faster to simulate than a 3D world.
The initial prototype uses the Urho3D open source game engine, and that is a reasonable project, but a raw Simple DirectMedia Layer + Box2D + OpenGL solution from scratch would be faster to develop for this use case, since Urho3D has a lot of human-gaming features that are not needed, and because 2019 Urho3D lead developers disagree with the China censored keyword attack.
Simulations such as these can be viewed as a form of synthetic data generation procedure, where the goal is to use computer worlds to reduce the costs of experiments and to improve reproducibility.
Ciro has always had a feeling that AI research in the 2020's is too unambitious. How many teams are actually aiming for AGI? When he then read Superintelligence by Nick Bostrom (2014) it said the same. AGI research has become a taboo in the early 21st century.
Related projects:
- github.com/deepmind/lab2d: 2D gridworld games, C++ with Lua bindings
Related ideas:
- www.youtube.com/watch?v=MHFrhIAj0ME?t=4183 Can't get you out of my head by Adam Curtis (2021) Part 1: Bloodshed on Wolf Mountain :)
- www.youtube.com/watch?v=EUjc1WuyPT8 AI alignment: Why It's Hard, and Where to Start by Eliezer Yudkowsky (2016)
Bibliograpy:
- agents.inf.ed.ac.uk/blog/multiagent-learning-environments/ Multi-Agent Learning Environments (2021) by Lukas Schäfer from the Autonomous agents research group of the University of Edinburgh. One of their games actually uses apples as visual represntation of rewards, exactly like Ciro's game. So funny. They also have a 2d continuous game: agents.inf.ed.ac.uk/blog/multiagent-learning-environments/#mpe
- humanoid robot simulation
- Section "AI training game"
- Section "Software-based artificial life"
OpenAI Plays Hide and Seek... and Breaks The Game! by Two Minute Papers (2019)
Source. Commentary of OpenAi's 2019 hide and seek paper. OpenAI does some similar simulations to what Ciro wants, but TODO do they publish source code for all of them? If not Ciro calls bullshit on non-reproducible research, and even worse due to the fake "Open" in the name. Does this repo contain everything?Much bigger simulation, AIs learn Phalanx by Pezzza's Work (2022)
Source. 2d agents with vision. Simple prey/predator scenario.github.com/deepmind/lab/tree/master/game_scripts/levels/contributed/dmlab30 has some good games with video demos on YouTube, though for some weird reason they are unlisted.
TODO get one of the games running. Instructions: github.com/deepmind/lab/blob/master/docs/users/build.md. This may helpgithub.com/deepmind/lab/issues/242: "Complete installation script for Ubuntu 20.04".
It is interesting how much overlap some of those have with Ciro's 2D reinforcement learning games
At twitter.com/togelius/status/1328404390114435072 called out on DeepMind Lab2D for not giving them credit on prior work!As seen from web.archive.org/web/20220331022932/http://gvgai.net/ though, DeepMind sponsored them at some point.
This very much looks like like GVGAI which was first released in 2014, been used in dozens (maybe hundreds) of papers, and for which one of the original developers was Tom Schaul at DeepMind...
www.gvgai.net (dead as of 2023)
The project kind of died circa 2020 it seems, a shame. Likely they funding ran out. The domain is dead as of 2023, last archive from 2022: web.archive.org/web/20220331022932/http://gvgai.net/. Marks as funded by DeepMind. Researchers really should use university/GitHub domain names!
Similar goals to Ciro's 2D reinforcement learning games, but they were focusing mostly on discrete games.
They have some source at: github.com/GAIGResearch/GVGAI TODO review
From QMUL Game AI Research Group:From other universities:TODO check:
- Simon M. Lucas: gaigresearch.github.io/members/Simon-Lucas, principal investigator
- Diego Perez Liebana www.linkedin.com/in/diegoperezliebana/
- Raluca D. Gaina: www.linkedin.com/in/raluca-gaina-347518114/ from Queen Mary
- Ahmed Khalifa
- Jialin Liu
2025 round one during week of April 21st, not toning down online profiles:
- Turned down two days later before anything.As evidenced by my Stack Exchange contributions, I love trying out new software to see if it works and how well. I love benchmarking it. And I love documenting what I observed in great detail to help others to choose the best software for them. I also love meeting various new people and understanding what they are up to and how I can help.
- Alice&Bob:
I'm looking to do a meaningful job in a deeptech field, and quantum computing seems like it could become huge. I've learnt a few basics, and would like to go further with job experience in the area.
Salary: 90k.- jobs.lever.co/alice-bob/b4632e27-cf56-4570-84bb-d56a169d1c43 Senior Software Engineer - Cloud. I could do this. But do I want to.
- Pasqal: careers.pasqal.com/jobs/5817098-software-development-engineer-integration Software Development Engineer IntegrationApplication auto-reply also points to:which is cool.I'm looking to do a meaningful job in a deeptech field, and quantum computing seems like it could become huge. I've learnt a few basics, and would like to go further with job experience in the area.Rejected 1 week later without interview even though my CV seemed like a perfect match for this job. Sent an email to the contributors of Pulser.I applied for this job careers.pasqal.com/jobs/5817098-software-development-engineer-integration but got rejected by HR lady without interview.Pinging you guys here just in case because sometimes my profiles scare the HR people and then when I ping the programmers they like me. I grabbed your emails from GitHub.
- H Company:Fastest initial job application steps ever! Name, email, CV, over.
- jobs.ashbyhq.com/hcompany/e6793ce6-918b-48a6-bf56-205c477cc1c0 Member of technical staff (Evaluations)
- jobs.ashbyhq.com/hcompany/89d867e7-2bd3-4918-aebc-cabbac526b6f Senior Back-end Engineer
- Mistral AI:
- jobs.lever.co/mistral/db67d7a2-bcec-4151-9b3a-8212ddabf419 Senior Software Engineer, Data Engineering - Paris. Declined May 7th without interview, at least they said they have lots of applicants and some have more closely related qualifications.
- Poolside AI:> Of all the applications of deep learning, code generation is one of those that interest me the most as they seem one of the most pertinent in order to one day achieve AGI (the others being theorem proving and robotics), and I'd like to try and get some work experience in the area, which is why I'm applying to your company.
- poolside.ai/careers/member-of-engineering-evaluations--ba11fe78-f6f6-4165-b76b-020a46ad8fee Member of Engineering (Evaluations)
Of all the applications of deep learning, code generation is one of those that interest me the most as they seem one of the most pertinent in order to one day achieve AGI (the others being theorem proving and robotics), and I'd like to try and get some work experience in the area.
- poolside.ai/careers/member-of-engineering-data-platform--13d32f62-d530-4372-b458-0687d99eea04 Member of Engineering (Data Platform)
This job seemed like a possibility as I've done some personal "data intensive" projects in the past (not distributed unfortunately, ~500 GB so it fit on my local disk), and I kind of enjoyed it and would be interested in trying out a more "data heavy" job like this for a change.
Rejected May 6th without interview. - poolside.ai/careers/member-of-engineering-evaluations--ba11fe78-f6f6-4165-b76b-020a46ad8fee Member of Engineering (Evaluations)
- Google:
- www.google.com/about/careers/applications/jobs/results/91141790538572486-senior-software-engineer-google-pixel-graphics Senior Software Engineer, Google Pixel Graphics
- www.google.com/about/careers/applications/jobs/results/96937277808091846-senior-software-engineer-pixel-graphics-gpu-software Senior Software Engineer, Pixel Graphics, GPU Software
- InstaDeep:
- www.instadeep.com/job-offer/fea583b3-d333-447e-8811-8ce58367c003/ Software Engineer (Simulation) asking 85k
I really like the idea of using simulations to speed up development of ideas or AI. I've done this in my past jobs in the semiconductor industry, and a bit on side time tinkering with simple simulation games that might be useful to train AI. I'm curious to what you will be simulating! The general areas that your company operates in, biotech and PCB design are also in my interest.lI really like the idea of using simulations to speed up development of ideas or AI. I've done this in my past jobs in the semiconductor industry, and a bit on side time tinkering with simple simulation games that might be useful to train AI. I'm curious to what you will be simulating!
- www.instadeep.com/job-offer/fea583b3-d333-447e-8811-8ce58367c003/ Software Engineer (Simulation) asking 85k
- AMD:
* careers.amd.com/careers-home/jobs/57882 Senior Software Development Engineer on ROCm. Rejected a few days later without interview. - Common Crawl: commoncrawl.org/jobs Software Engineer/Data Scientist, Python, Spark
Was a closed source project by "Roboti LLC", which was then acquired by DeepMind in October 2021 and open sourced March 2022: www.deepmind.com/blog/open-sourcing-mujoco
Tested on Ubuntu 23.10;
git clone https://github.com/google-deepmind/mujoco
cd mujoco
git checkout 5d46c39529819d1b31249e249ca399f306a108ac
mkdir -p build
cd build
cmake ..
make -j
Now let's play. Minimal interactive UI simulation of a simple MJCF scene with one falling cube:Test soure code: github.com/google-deepmind/mujoco/blob/5d46c39529819d1b31249e249ca399f306a108ac/sample/basic.cc. The only thing you can do is rotate the scene with the computer mouse it seems. Mentioned at: mujoco.readthedocs.io/en/2.2.2/programming.html#sabasic
bin/basic ../doc/_static/hello.xml
Some more interesting models can be found under the
model/
directory: github.com/google-deepmind/mujoco/tree/5d46c39529819d1b31249e249ca399f306a108ac/model E.g. the imaginary humanoid robot DeepMind used in many demos can be seen with:bin/basic ../model/humanoid/humanoid.xml
A more advanced UI with a few controls:Test soure code: github.com/google-deepmind/mujoco/tree/5d46c39529819d1b31249e249ca399f306a108ac/simulate. Mentioned at: mujoco.readthedocs.io/en/2.2.2/programming.html#sasimulate
bin/simulate ../doc/_static/hello.xml
A very cool thing about that UI is that you can manually control joints. There are no joints in the hello.xml, but e.g. with the humanoid model:under "Control" you move each joint of the robot separately which is quite cool.
bin/simulate ../model/humanoid/humanoid.xml
There's also a Mentioned at: mujoco.readthedocs.io/en/2.2.2/programming.html#sarecord but TODO that produced a broken video, related issues:
bin/record
test executable that presumably renders the simulation directly to a file:bin/record ../doc/_static/hello.xml 5 60 rgb.out
ffmpeg -f rawvideo -pixel_format rgb24 -video_size 800x800 -framerate 60 -i rgb.out -vf "vflip" video.mp4
DeepMind likes coming up with new improved algorithms for these more specific cases, e.g. it was announced in 2025 that AlphaEvolve found a novel 4x4 complex valued algorithm that uses 48 multiplications.
Bibliography:
- fmm.univ-lille.fr/ attempts to keep an up-to-date list for various sizes
Updates Understanding the state of 3x3 matrix multiplication Updated 2025-05-23 +Created 2025-05-21
After yet another awesome announcement by DeepMind that it had improved theoretical 4x4 matrix multiplication reducing the number of scalar multiplications with its AlphaEvolve system, I decided to have a look at the smallest open size 3x3 to understand what was going on in there.