🧞 Genie: Generative Interactive Environments

The Genie Team has introduced a new generative AI model, Genie, that can create a variety of playable worlds from synthetic images, photographs, and sketches. The model is trained using a large dataset of publicly available Internet videos, focusing on 2D platformer games and robotics. Genie can learn fine-grained controls from these videos without any action labels, inferring diverse latent actions that are consistent across the generated environments.

Genie's capabilities open up new possibilities for creators, as it only requires a single image to create an entire interactive environment. This could revolutionize the generation of virtual worlds and the training of AI agents. The team has also demonstrated that Genie can simulate deformable objects, a task typically challenging for human-designed simulators. The Genie Team believes that their model will be a catalyst for training the generalist AI agents of the future.

Key takeaways:

Genie is a foundation world model trained from Internet videos that can generate an endless variety of playable worlds from synthetic images, photographs, and even sketches.
Genie learns to control fine-grained controls exclusively from Internet videos, inferring diverse latent actions that are consistent across the generated environments.
It only takes a single image to create an entire new interactive environment with Genie, opening the door to a variety of new ways to generate and step into virtual worlds.
Genie can be applied to a multitude of domains without requiring any additional domain knowledge, indicating its potential for training embodied generalist agents in the future.

🧞 Genie: Generative Interactive Environments

Key takeaways:

Comments (0)

Newsletter