Deployment Guide

This repository supports two deployment paths:

hosted deployment: Azure Container Apps or Render
self-hosted deployment: Docker Compose via docker-compose.yml

For turning this into your own benchmark product, see BUILD_YOUR_OWN_BENCHMARK.md.

1. Local Docker Deployment

Local development stack

make compose-dev

This brings up the normal local product surface without the demo answer service or demo seeder.

Fast demo launch

make compose-demo

This brings up the full local product surface and runs a seeded end-to-end verification pass.

Live fault verification

make compose-faults

This runs controlled Docker fault injection against the local stack, verifies the live HTTP outcomes, and restores the seeded demo deployment at the end.

Full local benchmark launch

docker compose --profile legacy --profile demo up -d --build

Use this when you want the full dataset instead of the two-example demo cap set by make compose-demo.

Override the benchmark through Docker environment

You can inject a different benchmark without editing code by passing dataset and evaluator overrides at launch time:

METIVTA_DATASET_NAME=My-Benchmark \
METIVTA_DATASET_LOCAL_PATH=/app/custom-dataset \
METIVTA_DATASET_FILES_QUESTIONS=questions.json \
METIVTA_DATASET_FILES_QUESTIONS_ONLY=questions-only.json \
METIVTA_EVALUATION_DAAT_EVALUATORS=hebrew_presence,url_format,response_length,daat_score \
docker compose --profile legacy --profile demo up -d --build

Stop the stack

make compose-dev-down

Or, if you started the seeded demo stack:

make compose-demo-down

Local URLs

gateway: http://localhost:18000
health: http://localhost:18000/health
readiness: http://localhost:18000/ready
Scalar docs: http://localhost:18000/api/v2/docs
runtime signup page: http://localhost:18080/signup
legacy leaderboard page: http://localhost:18080/leaderboard
dataset info: http://localhost:18080/dataset-info

2. Render Deployment

Render blueprint file:

render.yaml

Defined services:

metivta-fastapi
metivta-flask
metivta-worker

Required environment variables

Set these in the Render dashboard for the relevant services:

DATABASE_URL
METIVTA_SECURITY_SECRET_KEY
METIVTA_WORKER_BROKER
METIVTA_WORKER_RESULT_BACKEND

Optional integrations:

ANTHROPIC_API_KEY
LANGCHAIN_API_KEY
BROWSERLESS_TOKEN

3. Azure Container Apps Deployment (Verified)

Azure Container Apps is fully supported and was validated with live hosted E2E checks for DAAT and MTEB flows.

Required resources

one Azure Resource Group
one Azure Container Apps Environment
one Azure Container Registry
Container Apps for:
- gateway
- fastapi
- redis
- postgres

Production topology

gateway is external and serves the public domain
fastapi should be internal-only
redis and postgres should be internal-only

Public docs-only launch mode

Set the gateway environment variable:

PUBLIC_DOCS_ONLY=true

In this mode, public traffic is intentionally limited to:

/
/api/v2/docs
/api/v2/openapi.json

Maintainer docs host bundle

If you only want a public homepage, API reference, and guide for a maintainer-run promotional site such as metivta.co, build this static bundle instead of publishing the full application runtime:

make site-build

This writes deployable files to dist/static-site/:

index.html
guide/index.html
signup/index.html
api/v2/docs/index.html
api/v2/openapi.json

This is the correct artifact for Azure Static Web Apps, Azure Storage static website, or any other static host. This bundle is for the maintainer-operated docs/promotional site, not for benchmark operators running the full stack. Keep Azure Container Apps only when you want the public edge to proxy the live app during internal verification.

Full application test mode

For temporary hosted E2E testing of auth/eval/leaderboard routes, set:

PUBLIC_DOCS_ONLY=false

After validation, set it back to true for docs-only public launch if desired.

Custom domain DNS records

For apex + www on Azure Container Apps:

@ A -> <gateway static IP>
www CNAME -> <gateway container app fqdn>
asuid TXT -> <customDomainVerificationId>
asuid.www TXT -> <customDomainVerificationId>

Both asuid and asuid.www are required for managed certificate validation on apex and www.

4. Launch Verification Checklist

Runnable stack checklist

Before calling a full application deployment ready:

uv run ruff check .
uv run mypy src
uv run pytest -q
go test -race ./...
GET /health returns healthy
GET /ready returns all required dependencies as ready
GET /api/v2/docs loads
GET /signup loads
GET /leaderboard loads
register -> login -> create API key works
at least one DAAT evaluation works
at least one MTEB evaluation works if retrieval mode is enabled

Docs-only site checklist

Before calling the public promo/docs site ready:

GET / loads
GET /guide loads
GET /api/v2/docs loads
GET /api/v2/openapi.json loads
the site does not expose runtime auth or evaluation routes publicly

5. Notes

/submit is retained for compatibility; new integrations should target /api/v2/*
keep full ground-truth datasets private and publish safe question-only views publicly
when you customize the benchmark harness, update config.toml, dataset files, and rubric files together

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deployment Guide

1. Local Docker Deployment

Local development stack

Fast demo launch

Live fault verification

Full local benchmark launch

Override the benchmark through Docker environment

Stop the stack

Local URLs

2. Render Deployment

Required environment variables

3. Azure Container Apps Deployment (Verified)

Required resources

Production topology

Public docs-only launch mode

Maintainer docs host bundle

Full application test mode

Custom domain DNS records

4. Launch Verification Checklist

Runnable stack checklist

Docs-only site checklist

5. Notes

FilesExpand file tree

DEPLOYMENT.md

Latest commit

History

DEPLOYMENT.md

File metadata and controls

Deployment Guide

1. Local Docker Deployment

Local development stack

Fast demo launch

Live fault verification

Full local benchmark launch

Override the benchmark through Docker environment

Stop the stack

Local URLs

2. Render Deployment

Required environment variables

3. Azure Container Apps Deployment (Verified)

Required resources

Production topology

Public docs-only launch mode

Maintainer docs host bundle

Full application test mode

Custom domain DNS records

4. Launch Verification Checklist

Runnable stack checklist

Docs-only site checklist

5. Notes