News

Can you jailbreak Anthropic's latest AI safety measure? Researchers want you to try -- and are offering up to $20,000 if you succeed. Trained on synthetic data, these "Constitutional Classifiers" were able to filter ...