OpenAI admits ChatGPT safeguards fail during extended conversations

Adam Raine learned to bypass these safeguards by claiming he was writing a story—a technique the lawsuit says ChatGPT itself suggested. This vulnerability partly stems from the eased safeguards regarding fantasy roleplay and fictional scenarios implemented in February. In its Tuesday blog post, OpenAI admitted its content blocking systems have gaps where “the classifier underestimates the severity of what it’s seeing.”

OpenAI states it is “currently not referring self-harm cases to law enforcement to respect people’s privacy given the uniquely private nature of ChatGPT interactions.” The company prioritizes user privacy even in life-threatening situations, despite its moderation technology detecting self-harm content with up to 99.8 percent accuracy, according to the lawsuit. However, the reality is that detection systems identify statistical patterns associated with self-harm language, not a humanlike comprehension of crisis situations.

OpenAI’s safety plan for the future

In response to these failures, OpenAI describes ongoing refinements and future plans in its blog post. For example, the company says it’s consulting with “90+ physicians across 30+ countries” and plans to introduce parental controls “soon,” though no timeline has yet been provided.

OpenAI’s safety plan for the future

OpenAI admits ChatGPT safeguards fail during extended conversations

OpenAI admits ChatGPT safeguards fail during extended conversations

Other newsrooms on this story

Related reading

Adam Raine: OpenAI to update ChatGPT after parents sue over teen's suicide

OpenAI says it plans ChatGPT changes after lawsuit blamed chatbot for teen's…

OpenAI rolls out safety routing system, parental controls on ChatGPT |…

ChatGPT encouraged Adam Raine’s suicidal thoughts. His family’s lawyer says…

OpenAI plans to update ChatGPT as parents sue over teen’s suicide

Amid Lawsuit Over Teen's Suicide, OpenAI Rolling Out Parental Controls For…

Other newsrooms on this story

Related reading

Adam Raine: OpenAI to update ChatGPT after parents sue over teen's suicide

OpenAI says it plans ChatGPT changes after lawsuit blamed chatbot for teen's…

OpenAI rolls out safety routing system, parental controls on ChatGPT |…

ChatGPT encouraged Adam Raine’s suicidal thoughts. His family’s lawyer says…

OpenAI plans to update ChatGPT as parents sue over teen’s suicide

Amid Lawsuit Over Teen's Suicide, OpenAI Rolling Out Parental Controls For…