Promptfoo

Open-source

4.3

Productivity & Business

Test prompts, agents, and RAGs with red teaming and vulnerability scanning across multiple LLM providers.

About Promptfoo

Promptfoo is a testing and evaluation framework for prompts, agents, and RAG systems. It supports red teaming, pentesting, and vulnerability scanning for AI applications with CI/CD integration. Used by OpenAI and Anthropic, with 20.5k GitHub stars.

Best For

Teams ensuring AI application quality and safety
Organizations implementing responsible AI testing practices

Pros & Cons

Pros

+ Used by major AI companies including OpenAI and Anthropic
+ Comprehensive testing goes beyond simple prompt comparison
+ CI/CD integration enables continuous AI quality assurance

Cons

- Learning curve for test configuration and evaluation criteria
- Comprehensive testing can be time-consuming for large prompt sets

Pricing

Open source and free to use

Key Features

Comprehensive prompt testing and evaluation framework
Red teaming and vulnerability scanning for AI safety
Multi-LLM comparison across GPT, Claude, Gemini, and more
CI/CD integration for automated testing in development pipelines