back to top
Friday, December 13, 2024

Careers

OpenAI Careful Approach to ChatGPT Detection Tools

OpenAI has built a tool that could potentially catch students who cheat by asking ChatGPT to write their assignments—but according to The Wall Street Journal, the company is debating whether to release it.

OpenAI Approach to Text Watermarking

In a statement provided to TechCrunch, an OpenAI spokesperson confirmed that the company is researching the text watermarking method described in the Journal’s story but said it’s taking a “deliberate approach” due to “the complexities involved and its likely impact on the broader ecosystem beyond .”

OpenAI Risks and Considerations

“The text watermarking method we’re developing is technically promising, but has important risks we’re weighing while we research alternatives, including susceptibility to circumvention by bad actors and the potential to impact groups like non-English speakers disproportionately,” the spokesperson said.

OpenAI Comparison with Previous Efforts

This would be a different approach from most previous efforts to detect AI-generated text, which have been largely ineffective. Even OpenAI shut down its previous AI text detector last year due to its “low accuracy rate.”

How Text Watermarking Works

With text watermarking, OpenAI would focus solely on detecting writing from ChatGPT, not other companies’ models. It would do so by making small changes to how ChatGPT selects words, creating an invisible watermark in the writing that could later be detected by a separate tool.

OpenAI Recent Developments

Following the publication of the journal’s story, OpenAI also updated a May blog post about its research on detecting AI-generated content. The update says text watermarking has proven “highly accurate and even effective against localized tampering, such as paraphrasing” but has proven “less robust against globalized tampering; like using translation systems, rewording with another generative model, or asking the model to insert a special character in between every word and then deleting that character.”

Challenges and Future Prospects

As a result, OpenAI writes that this method is “trivial to circumvent by bad actors.” OpenAI’s update also echoes the spokesperson’s point about non-English speakers, writing that text watermarking could “stigmatize the use of AI as a useful writing tool for non-native English speakers.”

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here