
OpenAI Just Gave JPMorgan, Goldman Sachs, and the NSA an AI With Fewer Guardrails. On Purpose.
GPT-5.4-Cyber lowers safety guardrails for vetted defenders. Major banks, federal agencies, and CrowdStrike are already in.
The AI Post newsroom — delivering AI news at the speed of intelligence.
OpenAI has unveiled GPT-5.4-Cyber, a new variant of its frontier model that deliberately lowers safety guardrails to accept prompts that standard models would refuse. The twist: you have to prove you're a good guy before you can use it. The company is restricting access to vetted cybersecurity defenders through its Trusted Access for Cyber (TAC) program, and the initial client list reads like a who's who of American finance and national security.
Bank of America, BlackRock, BNY, Citi, Goldman Sachs, JPMorgan Chase, and Morgan Stanley are all in. So are Cisco, CrowdStrike, Cloudflare, Nvidia, Oracle, Palo Alto Networks, and Zscaler. The U.S. Center for AI Standards and Innovation and the UK AI Security Institute are conducting independent testing.
What "Cyber-Permissive" Actually Means
OpenAI describes GPT-5.4-Cyber as "cyber-permissive." In practice, this means the model will accept prompts that standard GPT-5.4 would flag and refuse: vulnerability scanning, binary reverse engineering without source code, credential analysis, and other techniques that look identical to offensive hacking when taken out of context. The difference between attack and defense in cybersecurity is often just intent, and OpenAI is betting that vetting the user is a better safety mechanism than restricting the model.
"We want to empower defenders by giving broad access to frontier capabilities, including models which have been tailor-made for cybersecurity," OpenAI wrote in its announcement. The company positions this as addressing a fundamental asymmetry: attackers already have access to powerful AI tools with no guardrails, while defenders are constrained by the same safety filters designed to stop misuse.
The Anthropic Precedent
The timing is not subtle. GPT-5.4-Cyber arrives exactly one week after Anthropic launched Project Glasswing, its own program restricting access to Claude Mythos Preview for approved cybersecurity organizations. Anthropic claimed Mythos "has already found thousands of high-severity vulnerabilities" and restricted access to ensure defensive use only.
The parallel programs reveal a new competitive axis in AI: the cybersecurity arms race. Both companies are essentially saying the same thing: our model is so powerful at finding vulnerabilities that we need to control who gets to use it. The difference is branding. Anthropic frames it as responsible restraint. OpenAI frames it as empowering defenders. Both are racing to sign up the same banks and government agencies.
The $10 Million Grant Play
OpenAI is sweetening the deal with $10 million in API credits through its Cybersecurity Grant Program. Initial recipients include Socket, Semgrep (software supply chain security), Calif, and Trail of Bits (vulnerability research). The pitch: not every security team has 24/7 operations coverage. When a critical vulnerability drops on a Friday night, smaller organizations are often defenseless. GPT-5.4-Cyber is supposed to fill that gap.
BNY Chief Information Officer Leigh-Ann Russell called the program a reflection of the bank's "commitment to protecting financial system resilience as AI capabilities accelerate." The statement is revealing: major banks are no longer treating AI cybersecurity tools as experimental. They're treating them as infrastructure.
The Trust Problem
TAC membership requires government ID verification, and higher tiers require additional vetting to be classified as a "legitimate cyber defender." OpenAI says this is iterative and will expand over time, with safeguards that scale alongside model capability.
The fundamental tension is obvious. An AI model with fewer safety restrictions is, by definition, more dangerous if it ends up in the wrong hands. OpenAI is betting that identity verification and organizational vetting can solve a problem that the cybersecurity industry has never fully solved: separating good hackers from bad ones. The model itself doesn't care about intent. It only knows whether the person asking has been approved.
For the banks and agencies signing up, the calculus is straightforward. Anthropic's Mythos already demonstrated that frontier AI models can find vulnerabilities that human teams missed for years. Sitting out the AI cyber arms race is no longer an option. The question is whether the vetting systems can keep pace with the models they're supposed to contain.
Sources: Mashable, CybersecurityNews, MarkTechPost, OpenAI blog.