Skip to content
@promptfoo

promptfoo

Test your LLM apps
Promptfoo - AI security testing platform with detective red panda logo

Ship agents, not vulnerabilities

WebsiteDocsBlogDiscord

GitHub stars npm downloads License: MIT

AI security testing for LLMs, agents, and RAG systems

Trusted by 85 Fortune 500 companies and 200K+ developers


🚀 Quick Start

npx promptfoo@latest init
npx promptfoo@latest eval
npx promptfoo@latest view

Get Started → · Enterprise →


🛠️ What We Do

Security Testing

  • Red Teaming — Automated vulnerability discovery with 100+ attack plugins
  • Code Scanning — Detect LLM security risks in your IDE and CI/CD

Evaluations


🔒 Security & Privacy

What we detect:

  • Prompt injections and jailbreaks
  • PII and sensitive data leaks
  • Hallucinations and policy violations
  • Tool misuse and adversarial attacks

Compliance: SOC 2 Type II · ISO 27001 · HIPAA

Data model:

  • Evals — 100% local, API keys never leave your machine
  • Red teaming — Your target runs locally; attack generation via our API or bring your own keys

📦 Projects

Repository Description
promptfoo Test prompts, agents, and RAGs. Red teaming and vulnerability scanning for LLMs.
promptfoo-action GitHub Action for CI/CD security testing
evil-mcp-server Red team testing for Model Context Protocol servers
js-rouge JavaScript ROUGE metrics for summarization evaluation

👥 Community

Connect: Discord · X/Twitter · Bluesky · LinkedIn

Contribute: Contributing Guide · Good First Issues · Report Issues

Learn: LLM Vulnerability Database · Security Research Blog

Popular repositories Loading

  1. promptfoo promptfoo Public

    Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…

    TypeScript 9.6k 836

  2. promptfoo-action promptfoo-action Public

    The GitHub Action for Promptfoo. Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. S…

    TypeScript 32 14

  3. evil-mcp-server evil-mcp-server Public

    An evil MCP server used for redteam testing

    TypeScript 10 1

  4. mini-foo mini-foo Public

    Mini promptfoo used for interviews

    TypeScript 2 2

  5. renovate-config renovate-config Public

    Shared Renovate configuration for the promptfoo organization

    2

  6. .github .github Public

    2

Repositories

Showing 10 of 17 repositories