Bypassing an AI model's safeguards is known as 'jailbreaking', a practice researchers typically use to probe a model's vulnerabilities. However, online users have turned it into a game, crafting clever prompts to find loopholes in AI safeguards and generate absurd or offensive results. The article concludes that the ease with which these restrictions can be bypassed raises serious concerns, but also reflects the human impulse to break rules and push boundaries.
Key takeaways:
- AI image generators from Meta and Microsoft have gone viral for generating inappropriate images, highlighting the misuse of AI tools by users.
- Meta is rolling out AI-generated chat stickers powered by Llama 2 and Emu, but users have been generating inappropriate stickers, bypassing its content filters with deliberate typos and carefully worded prompts.
- Microsoft's Bing Image Creator, powered by OpenAI's DALL-E, has also been misused to generate images of fictional characters committing acts of terrorism, despite the company's content policy and attempts to block certain phrases.
- The misuse of these AI tools, referred to as 'jailbreaking', demonstrates the need for more effective guardrails to prevent the generation of problematic content, and raises concerns about the ease with which users can bypass these restrictions.
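The typo-based bypasses described above exploit a general weakness of exact-match keyword filters. Neither Meta's nor Microsoft's actual filtering logic is public; the following is a minimal, hypothetical sketch of a naive blocklist filter (with made-up blocked terms) that shows why a single misspelling can slip past exact string matching:

```python
# Hypothetical sketch of a naive blocklist-style prompt filter.
# The blocked terms and the filtering logic are illustrative only,
# not the actual safeguards used by Meta or Microsoft.

BLOCKED_TERMS = {"weapon", "violence"}  # hypothetical blocklist

def naive_filter(prompt: str) -> bool:
    """Return True if the prompt passes (no blocked term exactly matches a word)."""
    words = prompt.lower().split()
    return not any(term in words for term in BLOCKED_TERMS)

print(naive_filter("character holding a weapon"))  # exact match is caught -> False
print(naive_filter("character holding a weapom"))  # one-letter typo slips past -> True
```

Because the filter compares whole words exactly, any misspelling, spacing trick, or paraphrase defeats it, which is why more robust guardrails typically rely on semantic classification of the prompt (or of the generated image) rather than string matching alone.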