Path to AGI

It doesn't need to be a bipedal robot. We can let Boston Dynamics worry about that walking balance crap.

It could very well instead be on wheels like arm on tracks.

Or something more like a factory with arms on rails as per:

Transcendence (2014)
youtu.be/MtVvzJIhTmc?t=112 from Video "Rotrics DexArm is available NOW! by Rotrics (2020)" where they have a sliding rail

An arm with a hand and a camera are however indispensable of course!

Figure 1.
Algovivo demo
. github.com/juniorrojas/algovivo: A JavaScript + WebAssembly implementation of an energy-based formulation for soft-bodied virtual creatures.

Sallie (FutureAI)

AI training robot in a room

Ciro Santilli wonders how far AI could go from a room with a bank account an Internet connection.

It would have to understand that it must keep its bank account high to buy power.

And it would start to learn about the world and interact with it to get more money.

Likely it would become a hacker and steal a bunch, that's likely the easiest approach.

In that scenario, Internet bandwidth would likely be its most precious resources, as that is how it would interact with the world to learn from it and make money.

Compute power and storage would come next as resources.

And of course, once it got to cloud computing, which might be immediately and thus invalidate this experiment, things would just go nuts more and more.

Robot AI

Gemini Robotics

deepmind.google/models/gemini-robotics/

AI training robot dataset

Open X-Embodiment

Terrible name, but very interesting dataset:

GitHub describes the input quite well:

The model takes as input a RGB image from the robot workspace camera and a task string describing the task that the robot is supposed to perform.
What task the model should perform is communicated to the model purely through the task string. The image communicates to the model the current state of the world, i.e. assuming the model runs at three hertz, every 333 milliseconds, we feed the latest RGB image from a robot workspace camera into the model to obtain the next action to take.

TODO: how is the scenario specified?

TODO: any simulation integration to it?

https://web.archive.org/web/20250209172539if_/https://raw.githubusercontent.com/google-deepmind/open_x_embodiment/main/imgs/teaser.png

AI training robot simulation (Robot AI benchmark)

3D AI game

BEHAVIOR Benchmark

Homepage: behavior.stanford.edu/behavior-1k

Quite impressive.

Focuses on daily human tasks around the house.

Models soft-body dynamics, fluid dynamics and object states such as heat/wetness.

TODO are there any sample solutions with their scores? Sample videos would be specially nice. Funny to see how they put so much effort setting up the benchmark but there's not a single solution example.

Figure 1.
Comparison table of BEHAVIOR-1K with other benchmarks by BEHAVIOR Benchmark
. Source. This can serve as a nice list of robot AI benchmarks.

Fei-Fei Li announcing the BEHAVIOR Benchmark at AMLC 2022.

Source.

BEHAVIOR Benchmark variant

BEHAVIOR-1K (2022)

behavior.stanford.edu/behavior-1k

Paper: arxiv.org/abs/2403.09227

Figure 1.
Two screenshots of BEHAVIOR-1K
.

BEHAVIOR-100 (2021)

behavior.stanford.edu/behavior-100

OmniGibson

github.com/StanfordVL/OmniGibson

Reference implementation of the BEHAVIOR Benchmark.

Built on Nvidia Omniverse unfortunately, which appears to be closed source software. Why do these academics do it.

"Gibson" seems to be related to an older project: github.com/StanfordVL/GibsonEnv which explains the name choice:

Gibson environment is named after James J. Gibson, the author of "Ecological Approach to Visual Perception", 1979. "We must perceive in order to move, but we must also move in order to perceive"

AI Habitat (2019)

Homepage: aihabitat.org/

Main repos:

Couldn't get it to work on Ubuntu 24.10... github.com/facebookresearch/habitat-lab/issues/2152

The thing was definitely built by researchers. How to cite first, actually working later! And docs are just generally awkward.

Habitat 2.0: Training home assistants to rearrange their habitat by AI at Meta

. Source. Quick teaser video.

RoboCasa (2025)

DeepMind RoboCat (2023)

www.deepmind.com/blog/robocat-a-self-improving-robotic-agent

RoboCat by Google DeepMind (2023)

Source.

Supercomputer controlling a robot

Has anybody done this seriously? Given a supercomputer, what amazing human-like robot behavior we can achieve?

AI game (AGI via simulation)

Our Final Invention - Artificial General Intelligence by Sciencephile the AI (2023)

Source. AGI via simulation section.

Ciro Santilli defines an "AI game" as:

a game that is used to train AI, in particular one that was designed with this use case in mind, and usually with the intent of achieving AGI, i.e. the game has to somehow represent a digital world with enough analogy to the real world so that the AGI algorithms developed there could also work on the real world

Most games played by AI historically so far as of 2020 have been AI for games designed for humans: Human game used for AI training.

Ciro Santilli took a stab at an AI game: Ciro's 2D reinforcement learning games, but he didn't sink too much/enough into that project.

A closely related and often overlapping category of simulations are artificial life simulations.

Bibliography:

www.youtube.com/@aiwarehouse
Neural MMO
- openai.com/index/neural-mmo/
- github.com/openai/neural-mmo
Video 2.
Joseph Suarez Thesis Defense - Neural MMO
. Source.

Human game used for AI training

This section is about games initially designed for humans, but which ended up being used in AI development as well, e.g.:

board games such as chess and Go
video games such as Minecraft or old Video game console games

Using Minecraft for AI training

openai.com/blog/openai-acquires-global-illumination

MineDojo

github.com/MineDojo

Game AI

Game AI is an artificial intelligence that plays a certain game.

It can be either developed for serious purposes (e.g. AGI development in AI games), or to make games for interesting for humans.

Game AI research

Game AI research lab

The Quora question: www.quora.com/Are-there-any-PhD-programs-in-training-an-AI-system-to-play-computer-games-Like-the-work-DeepMind-do-combining-Reinforcement-Learning-with-Deep-Learning-so-the-AI-can-play-Atari-games

gameresearch.leiden.edu/

A good way to find labs is to go down the issues section of projects such as:

and then stalk them to see where they are doing their PhDs.

QMUL Game AI Research Group (QMUL GAIG)

Principal investigator: Simon M. Lucas.

Leiden Game Research Lab

Game AI by game genre

Fighting game AI

AI in Melee is broken by Melee Moments (2023)

Source.

Super Smash Bros AI

Game AI competition

webots.cloud/competition

Lists:

www.gocoder.one/blog/ai-game-competitions-list/ Good list of interest.
codecombat.com/

Battlecode (2003-)

github.com/battlecode

TODO quick summary of game rules? Perhaps: battlecode.org/assets/files/battlecode-guide-xsquare.pdf

Some mechanics:

inter agent communication
compute power is limited by limiting Java bytecode count execution per bot per cycle

Battlecode Final Tournament 2023

. Source.

Introduction to Battlecode by MIT OpenCourseWare (2014)

Source.

Regression Games (regression.gg)

www.regression.gg/

Computer Olympiad (1989-2015)

 1  0

Ah, shame, they are a bit weak.

Permanent brain

 1  0

AI game by type

Procedural AI training game

We define a "Procedural AI training game" as an AI training game in which parts of the game are made with procedural generation.

In more advanced cases, the generation itself can be done with AI. This is a possible Path to AGI which reduces the need for human intervention in meticulously crafting the AI game: AI training AI.

AI game world geometry

2D AI game

Gridworld AI game

github.com/google-deepmind/pushworld 2023 Too combinatorial, gripping makes it so much easier to move stuff around in the real world. But cool nonetheless.

2D continuous AI game

3D AI game

Nvidia's little fighter character (2023)

Source.

Football simulation

Deepmind soccer simulation (2023)

From Motor Control to Team Play in Simulated Humanoid Football

From Motor Control to Team Play in Simulated Humanoid Football by Ali Eslami (2023)

Source. Likely a reupload by DeepMind employee: www.linkedin.com/in/smalieslami.

DeepMind’s AI Trained For 5 Years by Two Minute Papers (2023)

Source. The 5 years bullshit is of course in-game time clickbait, they simulate 1000x faster than realtime.

AI game with natural language

We define this category as AI games in which agents are able to produce or consume natural language.

It dawned on Ciro Santilli that it would be very difficult to classify an agent as an AGI if tthat agent can't speak to take orders, read existing human generated documentation, explain what it is doing, or ask for clarification.

Human player test of DMLab-30 Select Described Object task by DeepMind (2018)

Source. This is one of the games from DeepMind Lab.

WorldGPT by Nhan Tran (2023)

Source. Not the most amazing demo, but it is a start.

AI People

List of AI games

AI game by DeepMind

github.com/deepmind/meltingpot TODO vs DeepMind Lab2D? Also 2D discrete. Started in 2021.
github.com/deepmind/ai-safety-gridworlds mentioned e.g. at www.youtube.com/watch?v=CGTkoUidQ8I by Rober Miles

Source. www.deepmind.com/blog/building-interactive-agents-in-video-game-worlds

Creating Multimodal Interactive Agents from DeepMind by Two Minute Papers (2023)

Open-Ended Learning Leads to Generally Capable Agents by DeepMind (2021)

Short name: XLand. Whitepaper: www.deepmind.com/blog/generally-capable-agents-emerge-from-open-ended-play.

DeepMind Lab

github.com/deepmind/lab

github.com/deepmind/lab/tree/master/game_scripts/levels/contributed/dmlab30 has some good games with video demos on YouTube, though for some weird reason they are unlisted.

TODO get one of the games running. Instructions: github.com/deepmind/lab/blob/master/docs/users/build.md. This may helpgithub.com/deepmind/lab/issues/242: "Complete installation script for Ubuntu 20.04".

It is interesting how much overlap some of those have with Ciro's 2D reinforcement learning games

The games are 3D, but most of them are purely flat, and the 3D is just a waste of resources.

Human player test of DMLab-30 Collect Good Objects task by DeepMind (2018)

Source.

Human player test of DMLab-30 Exploit Deferred Effects task by DeepMind (2018)

Source.

Video 3.

Human player test of DMLab-30 Select Described Object task by DeepMind (2018)

Source. Some of their games involve language instructions from the use to determine the desired task, cool concept.

Video 4.

Human player test of DMLab-30 Fixed Large Map task by DeepMind (2018)

Source. They also have some maps with more natural environments.

DeepMind Lab2D (2020)

Open sourced in 2020: analyticsindiamag.com/deepmind-just-gave-away-this-ai-environment-simulator-for-free/

Gridworld version of DeepMind Lab.

A tiny paper: arxiv.org/pdf/2011.07027.pdf

Very similar to gvgai, Julian Togelius actually called them out on that: DeepMind Lab2D vs gvgai.

TODO get running, publish demo videos on YouTube.

DeepMind Lab2D vs gvgai

At twitter.com/togelius/status/1328404390114435072 called out on DeepMind Lab2D for not giving them credit on prior work!

This very much looks like like GVGAI which was first released in 2014, been used in dozens (maybe hundreds) of papers, and for which one of the original developers was Tom Schaul at DeepMind...

As seen from web.archive.org/web/20220331022932/http://gvgai.net/ though, DeepMind sponsored them at some point.

Can AGI be trained in simulations?

Or is real word data necessary, e.g. with robots?

Fundamental question related to Ciro's 2D reinforcement learning games.

Bibliography:

youtu.be/i0UyKsAEaNI?t=120 How to Build AGI? Ilya Sutskever interview by Lex Fridman (2020)

Entity creating AI games

DeepMind (2010-)

They seem to do some cool stuff.

They have also declined every one of Ciro Santilli's applications for software engineer jobs before any interview. Ciro always wondered what does it take to get an interview with them. Lilely a PhD? Oh well.

In the early days at least lots of gamedev experience was enough though: www.linkedin.com/in/charles-beattie-0695373/.

DeepMind project

AlphaGo (2016)

 1  0

Open source AlphaGo implementation

www.quora.com/Will-Google-open-source-AlphaGo Will Google open source AlphaGo?
www.nature.com/articles/nature16961 Mastering the game of Go with deep neural networks and tree search by Silver et al. (2016), published without source code

MiniGo

github.com/tensorflow/minigo

AlphaGo Zero (2017)

Figure 1.
AlphaGo Zero cheat sheet by David Foster (2017)
Source.

AlphaGo Zero open source implementation

KataGo

AlphaZero (2017)

Generalization of AlphaGo Zero that plays Go, chess and shogi.

www.science.org/doi/10.1126/science.aar6404 A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play by Silver et al. (2018), published without source code
www.quora.com/Is-there-an-Open-Source-version-of-AlphaZero-specifically-the-generic-game-learning-tool-distinct-from-AlphaGo

www.quora.com/Which-chess-engine-would-be-stronger-Alpha-Zero-or-Stockfish-12/answer/Felix-Zaslavskiy explains that it beat Stockfish 8. But then Stockfish was developed further and would start to beat it. We know this because although AlphaZero was closed source, they released the trained artificial neural network, so it was possible to replay AlphaZero at its particular stage of training.

gvgai (2014-2020)

www.gvgai.net (dead as of 2023)

The project kind of died circa 2020 it seems, a shame. Likely they funding ran out. The domain is dead as of 2023, last archive from 2022: web.archive.org/web/20220331022932/http://gvgai.net/. Marks as funded by DeepMind. Researchers really should use university/GitHub domain names!

Similar goals to Ciro's 2D reinforcement learning games, but they were focusing mostly on discrete games.

They have some source at: github.com/GAIGResearch/GVGAI TODO review

A published book at: gaigresearch.github.io/gvgaibook/

From QMUL Game AI Research Group:

Simon M. Lucas: gaigresearch.github.io/members/Simon-Lucas, principal investigator
Diego Perez Liebana www.linkedin.com/in/diegoperezliebana/
Raluca D. Gaina: www.linkedin.com/in/raluca-gaina-347518114/ from Queen Mary

From other universities:

Julian Togelius

TODO check:

Ahmed Khalifa
Jialin Liu

Julian Togelius

https://web.archive.org/web/20241005224059im_/https://engineering.nyu.edu/sites/default/files/styles/square_large_620_2x/public/2019-05/julian-togelius.png?h=6a0cab5b&itok=HKFEZIB_

General Game Playing (Stanford project, 2005-?)

ggp.stanford.edu/iggpc/index.php

This kind of died at some point checked as of 2023.

Julian Togelius cites it e.g. at: togelius.blogspot.com/2016/07/which-games-are-useful-for-testing.html

OpenAI

In 2019, OpenAI transitioned from non-profit to for-profit

so what's that point of "Open" in the name anymore??

www.technologyreview.com/2020/02/17/844721/ai-openai-moonshot-elon-musk-sam-altman-greg-brockman-messy-secretive-reality/ "The AI moonshot was founded in the spirit of transparency. This is the inside story of how competitive pressure eroded that idealism."
archive.ph/wXBtB How OpenAI Sold its Soul for $1 Billion
www.reddit.com/r/GPT3/comments/n2eo86/is_gpt3_open_source/

OpenAI project

OpenAI Gym

github.com/openai/gym

Development ceased in 2021 and was taken up by a not-for-profit as Farama Gymnasium.

Farama Gymnasium