Anthropic Claude 3 Logo

6 天

Anthropic dares you to try to jailbreak Claude AI

Anthropic developed a defense against universal AI jailbreaks for Claude called Constitutional Classifiers - here's how it ...

新浪网5 天

Anthropic 推“宪法分类器”，可大幅降低 Claude 越狱率

IT之家 2 月 5 日消息，为解决人工智能工具中存在的滥用自然语言提示问题，OpenAI 的竞争对手 Anthropic 推出了一个名为“宪法分类器（constitutional ...

5 天

Anthropic: users to put jailbreak protection for AI chatbot to the test

Anthropic has developed a filter system designed to prevent responses to inadmissible AI requests. Now it is up to users to ...

TechCrunch4 天

Lyft’s new AI customer assistant is powered by Anthropic’s Claude

Lyft quietly incorporated Claude, Anthropic’s family of large language models, into its customer care AI assistant in late 2024 via Amazon Bedrock, according to Anthropic. It provides answers to ...

ZDNet3 天

Anthropic offers $20,000 to whoever can jailbreak its new AI safety system

After improving it, Anthropic ran a test of 10,000 synthetic jailbreaking attempts on an October version of Claude 3.5 Sonnet with and without classifier protection using known successful attacks.

TechRadar6 天

Anthropic has a new security system it says can stop almost all AI jailbreaks

Anthropic unveils new proof-of-concept security measure tested on Claude 3.5 Sonnet “Constitutional classifiers” are an attempt to teach LLMs value systems Tests resulted in more than an 80% ...

BGR19 天

Anthropic CEO says Claude may match some of ChatGPT’s key features this year

If you’re a fan of generative AI programs like ChatGPT, Gemini, Claude, and others, and you have half an hour to spare, you should watch The Wall Street Journal’s interview with Anthropic ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果