Archive for TrojAI

TrojAI Extends Enterprise AI Security with Agent-Led Red Teaming, Runtime Intelligence, and Coding Agent Protection

Posted in Commentary with tags on March 18, 2026 by itnerd

TrojAI today announced major new capabilities designed to secure the growing deployment of agentic AI in the enterprise going beyond the prompt layer. 

Agent-Led AI Red Teaming

TrojAI Detect now includes Agent-Led AI Red Teaming,which uses coordinated autonomous agents to conduct red team testing on AI agents, applications and models. This advancement allows AI security teams to easily perform complex testing scenarios that map to a wide range of known security frameworks with the click of a button. 

Key features include: 

  • Agentic testing: Specialized agents work together to test AI models, apps and agents, automatically correlating results into a single, actionable report.
  • Multi-turn attacks: Agents automatically orchestrate multi-turn and dynamic attack chains, eliminating manual configuration and using TrojAI’s vast library of datasets and manipulations.
  • Adaptive learning: Testing agents retain history and memory to evolve strategies across attacks, becoming more effective with each new cycle of testing.
  • Framework mapping: Test results are automatically mapped to OWASP, MITRE and NIST. 

Agent-Led AI Red Teaming transforms AI security testing from a complex, multi-step process into a streamlined, intelligent assessment aligned to industry-standard frameworks.

To learn more about how TrojAI secures AI through Agent-Led AI Red Teaming, read the full blog.

Agent Runtime Intelligence

To complement build-time risk assessment, Agent Runtime Intelligence is available as a new platform capability in private preview. It goes beyond the prompt layer to capture and analyze full AI agent execution traces, giving enterprises deep visibility into how AI agents behave at runtime, including tool usage, memory access, data retrieval patterns and system prompt exposure. This enables security teams to govern, test and enforce policy across complex, multi-step AI workflows.

With Agent Runtime Intelligence, TrojAI enables visibility for: 

  • Tool exposure and excessive agency
  • Prompt injection propagation across workflows 
  • Sensitive data access during retrieval 
  • System prompt exposure and memory interactions

The capability integrates seamlessly with TrojAI’s existing dashboards, MCP governance, SIEM integrations and compliance tooling.

Real-Time Protection of Coding Agents

As AI coding agents become embedded in development workflows, they introduce a new class of security risk. Real-Time Protection of Coding Agents extends TrojAI Defend to safeguard AI coding assistants such as Claude Code and Codex as they generate, retrieve and modify code.

The capability detects exposed secrets, prevents sensitive data leakage, including PII, and blocks indirect prompt injection attacks, such as malicious instructions embedded within a retrieved file. By monitoring agent behavior in real time, TrojAI ensures that coding agents operate within defined security guardrails without disrupting developer productivity.

With these three platform enhancements, TrojAI is redefining how enterprises protect the next generation of intelligent systems so they can confidently embrace AI innovation securely, transparently, and at scale.

TrojAI Launches Free AI Red Team Report Card to Help Organizations Identify and Mitigate AI Risks

Posted in Commentary with tags on December 9, 2025 by itnerd

TrojAI today announced the launch of its new TrojAI Red Team Report Card, a free AI security assessment designed to help organizations understand and mitigate risks in frontier and custom AI models.

As enterprises accelerate adoption of AI-powered applications and agents, the pressure to identify and reduce behavioral vulnerabilities has never been greater. The TrojAI Red Team Report Card empowers security teams to evaluate their AI model’s exposure to real-world attacks before adversaries are able to exploit weaknesses.

The free assessment leverages TrojAI Detect, an automated single-turn and multi-turn AI red teaming engine, to uncover weaknesses such as prompt injection, data leakage, jailbreaks and more. Participants receive a comprehensive, personalized report card with success rates across major AI risk categories, including jailbreak resilience, adversarial robustness and informational harms like PII exposure, insecure code generation and misinformation. Each assessment includes a one-on-one review session with TrojAI’s security team to help organizations interpret results and prioritize mitigation strategies.

The TrojAI Red Team Report Card is available today at no cost.