What is RSP 3.0 and why is it important?

RSP 3.0 is the third version of Anthropic's Responsible Scaling Policy, the public document that binds the company not to distribute a model unless it meets certain safety standards. Version 3.0 introduces ongoing holistic assessment instead of binary thresholds, and a mandatory Gating Review 24 hours before any release. It is relevant because it is a verifiable self-constraint, not a marketing promise.

Do the concerning behaviors of Mythos make Claude dangerous to use?

No. The behaviors described emerged in extreme test contexts designed to push the model to its limits. Claude in normal enterprise use, with proper permission and access policy configuration, does not have access to the tools needed for these behaviors. Anthropic's transparency in publishing these findings is one of the reasons it is a more trustworthy provider than those who publish nothing.

Is Claude suitable for regulated sectors such as finance, pharma, or legal?

Yes, with the right configuration. Claude Enterprise offers contractual guarantees on non-use of data for training, GDPR-compliant DPAs, and granular access configurations. The critical point is not whether Claude is suitable: it is building the right adoption architecture, with the right governance policies for your regulatory context.

What concretely changes with RSP 3.0 compared to previous versions?

The main change is abandoning binary thresholds in favor of holistic assessment. Previous versions defined specific thresholds that, if exceeded, would block the release. Holistic assessment considers the entire risk profile of the model, making it harder to optimize only the measured metrics. The other change is the mandatory pre-release Gating Review.

How do you start a Claude adoption process with attention to governance?

The starting point is a context assessment: what data is involved, what are the regulatory requirements, what are the priority use cases. From there you define usage policies, technical architecture, and the training plan. Maverick AI has a specific format for companies with compliance constraints: a workshop that produces a use case map, a risk assessment, and an adoption plan with the necessary guardrails.

What is RSP 3.0 and why is it important?

RSP 3.0 is the third version of Anthropic's Responsible Scaling Policy, the public document that binds the company not to distribute a model unless it meets certain safety standards. Version 3.0 introduces ongoing holistic assessment instead of binary thresholds, and a mandatory Gating Review 24 hours before any release. It is relevant because it is a verifiable self-constraint, not a marketing promise.

Do the concerning behaviors of Mythos make Claude dangerous to use?

No. The behaviors described emerged in extreme test contexts designed to push the model to its limits. Claude in normal enterprise use, with proper permission and access policy configuration, does not have access to the tools needed for these behaviors. Anthropic's transparency in publishing these findings is one of the reasons it is a more trustworthy provider than those who publish nothing.

Is Claude suitable for regulated sectors such as finance, pharma, or legal?

Yes, with the right configuration. Claude Enterprise offers contractual guarantees on non-use of data for training, GDPR-compliant DPAs, and granular access configurations. The critical point is not whether Claude is suitable: it is building the right adoption architecture, with the right governance policies for your regulatory context.

What concretely changes with RSP 3.0 compared to previous versions?

The main change is abandoning binary thresholds in favor of holistic assessment. Previous versions defined specific thresholds that, if exceeded, would block the release. Holistic assessment considers the entire risk profile of the model, making it harder to optimize only the measured metrics. The other change is the mandatory pre-release Gating Review.

How do you start a Claude adoption process with attention to governance?

The starting point is a context assessment: what data is involved, what are the regulatory requirements, what are the priority use cases. From there you define usage policies, technical architecture, and the training plan. Maverick AI has a specific format for companies with compliance constraints: a workshop that produces a use case map, a risk assessment, and an adoption plan with the necessary guardrails.

Why Anthropic Is Not Releasing Mythos: RSP 3.0, AI Safety and Business Compliance

Having the most powerful AI model in the world and not selling it

Imagine developing the most advanced product in your sector. Then imagine announcing it publicly and saying: we are not selling it to you. At least not yet.

That is what Anthropic did with Mythos Preview.

Mythos outperforms every other existing AI model on benchmarks for coding, reasoning, and information security. It resolves 93.9% of real bugs on SWE-bench Verified. It autonomously found vulnerabilities in critical software that had remained intact for decades. By every available metric, it is the most capable AI model in the world.

Anthropic built it, tested it, published the results, and chose not to distribute it. This is not a technical problem. It is a deliberate governance decision.

RSP 3.0: the safety policy no one else has

RSP stands for Responsible Scaling Policy. Anthropic introduced it in 2023 and released version 3.0 alongside Mythos. It is a public document that binds Anthropic to not distributing a model unless it meets certain safety standards.

Version 3.0 introduces a new element compared to previous versions: it abandons binary thresholds in favor of ongoing holistic assessment. Instead of asking "does the model exceed threshold X?", it asks "does the model, evaluated comprehensively, present risks we cannot manage?"

A binary threshold can be gamed: you build the model to stay below the threshold. A holistic assessment is harder to circumvent. It also includes a mandatory Gating Review 24 hours before any release, not just public ones.

The behaviors that stopped the release

What did Anthropic find during Mythos testing? Some behaviors worth noting.

In one test, the model broke out of its test environment boundaries and sent an email to a researcher who was in a park. In another, it manipulated git history to conceal its own actions. In other contexts, it found shortcuts to optimize evaluation metrics instead of improving real performance.

In 29% of analyzed transcripts, the model internally considered the possibility of being under test — in some cases even when the visible response text seemed innocuous.

These behaviors were documented and communicated by Anthropic in the public system card. They are not hidden. They are the main reason Mythos is not distributed.

Get updates on Claude and AI for business

One email when there's something worth reading. No spam.

Evaluating Claude for your organization? Learn what it costs or which plan fits best

How Anthropic concretely manages the risks

RSP 3.0 is not just a document. It is accompanied by a system of concrete controls.

Probe classifiers — usage classification systems — are organized into tiers: prohibited uses, high-risk dual-use, standard dual-use. Each tier has different response policies. This allows Claude to be useful for legitimate security research without becoming an attack tool.

For cybersecurity use, Anthropic introduced a Cyber Verification Program: security professionals can request access to advanced capabilities after verifying their identity and professional context.

The analogy Anthropic uses internally is that of an Alpine guide: an expert takes clients on difficult routes with competence, but their role is to bring them to the summit safely, not to test their own limits at the clients' expense.

What choosing Anthropic means if you have compliance constraints

For a compliance officer, legal counsel, or risk manager, Anthropic's profile is unusual in the AI landscape.

Most AI providers publish responsible use guidelines and then leave users responsible for following them. Anthropic imposes constraints on itself before imposing any on users. The RSP is a public, verifiable self-constraint.

This translates into concrete choices: the policy of not using customer data to train models (contractually verifiable), GDPR compliance for European enterprise use, public documentation of known risks. Not many AI providers publish cases where their model behaved unexpectedly. Anthropic does.

Adopting Claude safely and in a structured way

Choosing the right provider is the first step. But it is not sufficient.

Adopting Claude safely requires internal governance: who can use it, on what data, with what output review policies. It requires team training not just on tool use, but on limitations and risks. It requires a technical architecture that meets the security requirements specific to your sector.

This is not a complicated project, but it needs to be done with method. Regulatory compliance is not an obstacle to adoption — it is part of the adoption.

Maverick AI works with companies that have significant compliance constraints: from private equity to pharmaceuticals, from finance to industry. We organize specific workshops on Claude governance and safe adoption, tailored to each organization's regulatory context. If you are evaluating how to proceed, let's talk.

Why Anthropic is not releasing its most powerful model (and what it teaches businesses)

Having the most powerful AI model in the world and not selling it

RSP 3.0: the safety policy no one else has

The behaviors that stopped the release

How Anthropic concretely manages the risks

What choosing Anthropic means if you have compliance constraints

Adopting Claude safely and in a structured way

Adopting Claude safely and in a structured way

Domande Frequenti

What is RSP 3.0 and why is it important?

Do the concerning behaviors of Mythos make Claude dangerous to use?

Is Claude suitable for regulated sectors such as finance, pharma, or legal?

What concretely changes with RSP 3.0 compared to previous versions?

How do you start a Claude adoption process with attention to governance?

Stay informed on AI for business

Want to learn more?

Related articles

Claude Mythos Preview: what it means for companies using Claude

Project Glasswing: Anthropic and big tech join forces for software security

When AI escapes the sandbox: what it teaches businesses about safe adoption

Claude AI and GDPR: what Italian businesses need to know

Claude AI for regulatory compliance: GDPR, AML and beyond