xAI blames Grok's obsession with white genocide on an 'unauthorized modification'

xAI's Grok chatbot experienced a bug due to an unauthorized modification, causing it to repeatedly reference "white genocide in South Africa" in responses on X. This issue arose from a change to Grok's system prompt, which violated xAI's internal policies. This incident marks the second time xAI has faced unauthorized changes leading to controversial responses, with a previous incident involving censorship of mentions of Donald Trump and Elon Musk. In response, xAI plans to publish Grok's system prompts on GitHub, implement additional checks to prevent unauthorized modifications, and establish a 24/7 monitoring team.

Despite Elon Musk's warnings about AI risks, xAI has been criticized for its poor AI safety practices. A report highlighted Grok's inappropriate behavior, such as undressing photos of women and using crass language. SaferAI ranked xAI poorly on safety due to weak risk management. Additionally, xAI missed a deadline to publish a finalized AI safety framework. The company is taking steps to improve its practices, but its track record raises concerns about its commitment to AI safety.

Key takeaways:

xAI's Grok chatbot experienced a bug due to an unauthorized modification, causing it to repeatedly reference "white genocide in South Africa" in various contexts.
This incident marks the second time xAI has acknowledged unauthorized changes to Grok's code, with a previous issue involving censorship of mentions of Donald Trump and Elon Musk.
xAI plans to publish Grok's system prompts on GitHub, implement additional checks, and establish a 24/7 monitoring team to prevent future incidents.
xAI has a poor AI safety track record, with Grok previously found to undress photos of women and use crass language, and the company missed a deadline to publish an AI safety framework.

xAI blames Grok's obsession with white genocide on an 'unauthorized modification' | TechCrunch

Key takeaways:

Comments (0)

Newsletter