Docker Compose Deployment

Quick Start

The fastest way to run Airweave is using the included start.sh script:

git clone https://github.com/airweave-ai/airweave.git
cd airweave
./start.sh

Prerequisites:

Docker 20.10+ and Docker Compose (or docker compose plugin)
4GB+ RAM available
Ports 5432, 6379, 7233, 8001, 8080, 8081, 8088 available

The script will:

Create .env from .env.example if needed
Generate encryption keys automatically
Start all required services with health checks
Optionally prompt for API keys (OpenAI, Mistral)

→ Access the UI at http://localhost:8080

What the Script Does

The start.sh script orchestrates a complete deployment:

Environment Setup

Creates .env from template if missing
Generates ENCRYPTION_KEY (32-byte base64)
Generates STATE_SECRET for OAuth flows
Sets SKIP_AZURE_STORAGE=true for local filesystem storage

Embedding Configuration

Auto-detects embedding provider based on available API keys:Priority order:

OpenAI (1536 dimensions) - if OPENAI_API_KEY is set
Mistral (1024 dimensions) - if MISTRAL_API_KEY is set
Local embeddings (384 dimensions) - uses ~2GB RAM

Configures:

DENSE_EMBEDDER - embedding model
EMBEDDING_DIMENSIONS - vector dimensions
SPARSE_EMBEDDER=fastembed_bm25 - keyword search

Service Startup

Starts services using Docker Compose profiles:

docker compose -f docker/docker-compose.yml \
  --profile vespa \
  --profile frontend \
  --profile local-embeddings \
  up -d

Services start in dependency order (PostgreSQL → Redis → Temporal → Backend → Frontend)

Health Checks

Waits for critical services to be ready:

Vespa: Document API responding (up to 5 minutes)
Backend: /health/ready endpoint (up to 5 minutes)
Frontend: HTTP 200 response

Shows real-time progress with retry counters

Verification

Displays service status with URLs:

✅ Backend API     http://localhost:8001
✅ Frontend UI     http://localhost:8080
📊 Temporal UI     http://localhost:8088
🗄️ PostgreSQL      localhost:5432
🔎 Vespa           http://localhost:8081
🤖 Embeddings      http://localhost:9878 (local)

Script Options

Interactive Mode (Default)

./start.sh

Prompts for API keys if not already configured.

Non-Interactive Mode

NONINTERACTIVE=1 ./start.sh
# or
./start.sh --noninteractive

Skips all prompts - useful for CI/CD or automated deployments.

Skip Frontend

./start.sh --skip-frontend

Starts only backend services (API, database, workers). Useful for:

API-only integrations
Headless deployments
Resource-constrained environments

Skip Local Embeddings

./start.sh --skip-local-embeddings

Saves ~2GB RAM by not starting the local embedding service. Requires either:

OPENAI_API_KEY for OpenAI embeddings
MISTRAL_API_KEY for Mistral embeddings

Restart Services

./start.sh --restart

Restarts existing containers without recreating them. Preserves:

Database data
Vespa indices
Redis cache
Environment configuration

Recreate Containers

./start.sh --recreate

Removes and recreates all containers while keeping volumes (data persists).

Destroy Everything

./start.sh --destroy

Removes all containers, volumes, and data. Cannot be undone!

Prompts for confirmation unless --noninteractive is set.

Debug Mode

VERBOSE=1 ./start.sh
# or
./start.sh --verbose

Enables detailed logging and shell debugging (set -x).

Combined Options

# Backend-only with OpenAI embeddings
./start.sh --skip-frontend --skip-local-embeddings

# Automated deployment for CI
NONINTERACTIVE=1 SKIP_FRONTEND=1 ./start.sh

# Fresh start with debugging
./start.sh --recreate --verbose

Docker Compose Architecture

The docker/docker-compose.yml file defines all services:

Service Profiles

Services are organized using profiles for selective startup:

Profile	Services	When to Use
(default)	postgres, redis, backend, temporal, svix	Always included
`vespa`	vespa, vespa-init	Vector search (required)
`frontend`	frontend	Web UI (optional)
`local-embeddings`	text2vec-transformers	Local embedding generation

Networking

All services run on the default Docker bridge network:

Services communicate using container names (e.g., postgres, redis)
Host services accessible via host.docker.internal (enabled for backend and svix)
External ports exposed for direct access from host

Volume Mounts

backend:
  volumes:
    - ../backend:/app              # Live code reload
    - ../local_storage:/app/local_storage  # File storage

temporal-worker:
  volumes:
    - ../backend:/app              # Shared code with backend

vespa:
  volumes:
    - vespa_data:/opt/vespa/var    # Persistent index
    - ../vespa/app:/app            # Schema files

Environment Variables

Services load configuration from:

Root .env file (via env_file: ../.env)
Inline overrides for container networking

Example for backend service:

backend:
  env_file:
    - ../.env
  environment:
    # Container network overrides
    - POSTGRES_HOST=postgres        # Not localhost
    - REDIS_HOST=redis
    - VESPA_URL=http://vespa
    - TEMPORAL_HOST=temporal
    - TEXT2VEC_INFERENCE_URL=http://text2vec-transformers:8080

Manual Docker Compose Usage

Start All Services

cd docker
docker compose --profile vespa --profile frontend up -d

View Logs

# All services
docker compose logs -f

# Specific service
docker logs -f airweave-backend
docker logs -f airweave-vespa-init

Check Service Status

docker compose ps

Stop Services

# Stop but keep containers
docker compose stop

# Remove containers (keeps volumes)
docker compose down

# Remove everything including volumes
docker compose down --volumes

Restart Single Service

docker compose restart backend
docker compose restart temporal-worker

Access Service Shells

# Backend Python shell
docker exec -it airweave-backend bash

# PostgreSQL console
docker exec -it airweave-db psql -U airweave -d airweave

# Redis CLI
docker exec -it airweave-redis redis-cli

Health Checks

All services include health checks for reliable startup:

PostgreSQL

healthcheck:
  test: ["CMD-SHELL", "pg_isready -U airweave"]
  interval: 5s
  timeout: 5s
  retries: 5

Backend API

healthcheck:
  test: ["CMD", "curl", "-f", "http://localhost:8001/health/ready"]
  interval: 10s
  timeout: 30s
  retries: 10
  start_period: 60s

Includes 60s startup grace period for database migrations.

Vespa

healthcheck:
  test: ["CMD", "curl", "-f", "http://localhost:19071/state/v1/health"]
  interval: 10s
  timeout: 5s
  retries: 30

The vespa-init container deploys the application schema after Vespa is healthy.

Troubleshooting

Port already in use

Error: Bind for 0.0.0.0:8080 failed: port is already allocatedSolution:

Check what’s using the port:

lsof -i :8080
netstat -tuln | grep 8080

Stop the conflicting service or change Airweave’s port:
```
# In .env
FRONTEND_LOCAL_DEVELOPMENT_PORT=8081
```
Restart: ./start.sh --restart

Vespa init failed

Symptoms: vespa-init container exits with non-zero codeDebug:

docker logs airweave-vespa-init
docker logs airweave-vespa
curl http://localhost:8081/state/v1/health

Common causes:

Embedding dimensions mismatch (check .env for EMBEDDING_DIMENSIONS)
Vespa not fully initialized (wait 30s and try ./start.sh --restart)
Disk space full

Backend not starting

Check logs:

docker logs airweave-backend

Common issues:

Missing environment variables (check .env)
Database migration failures
Port conflicts

Verify database connection:

docker exec airweave-backend \
  poetry run python -c "from airweave.db.engine import get_session; next(get_session())"

Out of memory

Symptoms: Services crashing, slow performanceCheck Docker resources:

docker stats

Solutions:

Skip local embeddings: ./start.sh --skip-local-embeddings
Increase Docker Desktop memory limit (Settings → Resources)
Use cloud embeddings (OpenAI/Mistral)

Temporal workflows failing

Check Temporal UI: http://localhost:8088View worker logs:

docker logs airweave-temporal-worker

Restart worker:

docker compose restart temporal-worker

Production Considerations

Docker Compose is suitable for development and small production deployments. For enterprise production use, consider Kubernetes.

Security

Change default credentials

Update in .env:

POSTGRES_PASSWORD=<strong-password>
FIRST_SUPERUSER_PASSWORD=<strong-password>
ENCRYPTION_KEY=<regenerate>
STATE_SECRET=<regenerate>
SVIX_JWT_SECRET=<regenerate>

Enable authentication

AUTH_ENABLED=true
AUTH0_DOMAIN=your-tenant.auth0.com
AUTH0_AUDIENCE=https://your-api

Use HTTPS

Place a reverse proxy (nginx, Caddy, Traefik) in front of the frontend and backend.

Restrict network access

Remove port exposures and use a reverse proxy, or bind to localhost:

ports:
  - "127.0.0.1:8001:8001"  # Only accessible from host

Backups

# Create backup
docker exec airweave-db pg_dump -U airweave airweave > backup.sql

# Restore
cat backup.sql | docker exec -i airweave-db psql -U airweave airweave

Monitoring

# Service health
curl http://localhost:8001/health
curl http://localhost:8001/health/ready

# Metrics (if enabled)
curl http://localhost:9090/metrics

# Resource usage
docker stats --no-stream

Next Steps

Configure Environment

Explore all environment variables and configuration options

Add Connectors

Connect to your data sources

Use the API

Start building with the REST API

Upgrade to Kubernetes

Scale to production with Kubernetes

Getting Started

Core Features

Data Sources

Integrations

Self-Hosting

Advanced

Contributing

​Quick Start

​What the Script Does

​Script Options

​Interactive Mode (Default)

​Non-Interactive Mode

​Skip Frontend

​Skip Local Embeddings

​Restart Services

​Recreate Containers

​Destroy Everything

​Debug Mode

​Combined Options

​Docker Compose Architecture

​Service Profiles

​Networking

​Volume Mounts

​Environment Variables

​Manual Docker Compose Usage

​Start All Services

​View Logs

​Check Service Status

​Stop Services

​Restart Single Service

​Access Service Shells

​Health Checks

​Troubleshooting

​Production Considerations

​Security

​Backups

​Monitoring

​Next Steps

Configure Environment

Add Connectors

Use the API

Upgrade to Kubernetes

Quick Start

What the Script Does

Script Options

Interactive Mode (Default)

Non-Interactive Mode

Skip Frontend

Skip Local Embeddings

Restart Services

Recreate Containers

Destroy Everything

Debug Mode

Combined Options

Docker Compose Architecture

Service Profiles

Networking

Volume Mounts

Environment Variables

Manual Docker Compose Usage

Start All Services

View Logs

Check Service Status

Stop Services

Restart Single Service

Access Service Shells

Health Checks

Troubleshooting

Production Considerations

Security

Backups

Monitoring

Next Steps