Will GPT-4 Run DOOM?

The article discusses the capabilities of GPT-4, a large language model (LLM), in playing the 1993 first-person shooter game, Doom. The authors demonstrate that GPT-4 can operate and play the game with minimal instructions and a textual description of the game's state, which the model generates from screenshots. The model can manipulate doors, combat enemies, and perform pathing, playing the game to a satisfactory level.

The authors note that while GPT-4 does not match the performance of traditional, reinforcement learning-based models, it does not require any training, relying instead on its reasoning and observational skills. They suggest that further research could improve the LLM's gaming abilities. The authors hope their work will expand the possibilities for intelligent, LLM-based agents in video games and conclude by discussing the ethical implications of their research.

Key takeaways:

The large language model GPT-4 can run and play the 1993 first-person shooter game Doom with only a few instructions and a textual description of the game state.
GPT-4 can perform basic game functions like manipulating doors, combating enemies, and pathing.
More complex prompting strategies involving multiple model calls provide better results in the game.
Despite not being as proficient as its reinforcement learning-based counterparts, GPT-4 required no training, relying on its own reasoning and observational capabilities.

Will GPT-4 Run DOOM?

Key takeaways:

Comments (0)

Newsletter