AI chatbots can apparently get better at math for the strangest reason

A recent study by VMware reveals that chatbots perform better on math problems when given prompts that reference Star Trek. The study, which tested three AI tools, found that while some prompts improved answers and others had insignificant effects, the most effective prompts were those that asked the AI to start its response with phrases like “Captain’s Log, Stardate [insert date here]:.” The researchers were surprised by this result and could not explain why Star Trek references improved the AI's performance.

The study also found that an automated process of trying numerous variations of prompts and tweaking the language based on how much it improved the chatbots’ accuracy was more effective than the researchers' attempts to frame questions with positive thinking. The researchers suggested that these chatbots, trained on billions of lines of text from the real world, might have picked up on human tendencies to give more accurate responses when pressured or encouraged. However, they admitted that they don't fully understand how AI language models work.

Key takeaways:

Research from software company VMware shows that chatbots perform better on math questions when prompted to pretend they're on Star Trek.
The study used an automated process to try numerous variations of prompts and tweak the language based on how much it improved the chatbots’ accuracy.
The most effective prompts exhibited a degree of peculiarity, with one of the models yielding the most accurate answers when asked to start its response with the phrase “Captain’s Log, Stardate [insert date here]:.”
The researchers have no idea why Star Trek references improved the AI’s performance, highlighting the fact that even the people who build and study AI language models don’t fully understand how they work.

AI chatbots can apparently get better at math for the strangest reason

Key takeaways:

Comments (0)

Newsletter