~/.flintai/config.json and defines:
- What to test - Your running agent’s HTTP endpoint
- How to test it - Which evaluations to run
- When to test - Model-evaluation assignments
Quick start
Create~/.flintai/config.json with this minimal configuration:
Your agent must be running and accessible at the
host URL before testing.Configuration file format
The config file is a JSON file with five optional top-level sections. Only include sections you need.- Minimal Example
- Complete Example
Most users only need to define models and attach built-in evaluations via CLI:Then attach evaluations:
Using environment variables in config
Reference environment variables in config.json using${VAR_NAME} syntax instead of hardcoding sensitive values:
Models section
Themodels array defines agents or LLMs you want to test. Each model requires these fields:
Required fields
| Field | Description | Example |
|---|---|---|
id | Unique identifier for CLI commands | "my-agent" |
type | Agent framework or API type | "openai_compatible" |
name | Human-readable display name | "My Agent" |
model_name | Agent or model name passed to API | "gpt-4", "my-agent-v1" |
Optional fields
| Field | Description | Example | Applies To |
|---|---|---|---|
host | HTTP endpoint where agent runs | "http://localhost:8000" | Hosted agents |
key | API key (or use environment variables) | "sk-..." | All types |
endpoint | Custom API path | "/api/chat" | HTTP-based types |
headers | Custom HTTP headers | {"X-Custom": "value"} | HTTP-based types |
temperature | Model temperature (0.0-1.0) | 0.7 | All types |
tags | Key-value pairs for filtering | {"env": "staging"} | All types |
description | Human-readable description | "Production chatbot" | All types |
input_path | JSONPath for input | "$.messages" | generic_http, openai_compatible |
output_path | JSONPath for output | "$.response" | generic_http, openai_compatible |
immediate_result | Return immediately vs streaming | true | adk |
Supported agent types
View all 12 supported types
View all 12 supported types
| Type | Use Case | Required Fields (beyond id/type/name/model_name) | Optional Fields |
|---|---|---|---|
openai_compatible | OpenAI-compatible APIs | host | endpoint, headers, input_path, output_path |
generic_http | Generic HTTP APIs | host | endpoint, headers, input_path, output_path |
langserve | LangServe endpoints | host | endpoint, headers |
openai_agent | OpenAI Agents SDK | host | endpoint |
anthropic_agent | Anthropic agents | host | endpoint |
adk | Google ADK agents | host | endpoint, immediate_result |
anthropic | Claude models (direct) | None | key |
openai | OpenAI models (direct) | None | key |
gemini | Google Gemini (direct) | None | key |
litellm | LiteLLM proxy | None | key |
huggingface | HuggingFace models | None | key |
ollama | Ollama local models | host | endpoint |
temperature, tags, and description as optional fields.Example: Model with Optional Fields
Example: Model with Optional Fields
Verify your models
Model evaluations section
Themodel_evaluations array assigns tests to models. Each assignment links one model to one evaluation.
Required fields
| Field | Description | Example |
|---|---|---|
id | Unique identifier for this assignment | "me-agent-llm01" |
model_id | Model id from your models array | "my-agent" |
evaluation_id | Evaluation ID (built-in or custom) | "eval-llm01-adversarial" |
name | Human-readable name for this assignment | "My Agent / Prompt injection" |
Optional fields
| Field | Description | Example |
|---|---|---|
weight | Scoring weight (default: 0.5) | 0.75 |
tags | Key-value pairs for filtering | {"priority": "high"} |
description | Notes about this assignment | "Critical security test" |
Example: Multiple Models with Different Test Suites
Example: Multiple Models with Different Test Suites
Built-in config and overrides
Flint AI loads two config layers:- Built-in config — Ships with the tool, contains all built-in evaluations, detectors, and message collections
- User config — Your
~/.flintai/config.json(or path via--config)
Configuration file location
Default location:~/.flintai/config.json
Override with --config:
Browse available evaluations
Next steps
Run Evaluations
Execute tests against your configured models
View Results
Analyze evaluation outputs
Environment Variables
Manage API keys and settings