Deployment Options
Airweave can be deployed in several ways depending on your infrastructure and scalability requirements:Docker Compose
Single-server deployment for development and small production workloads
Kubernetes
Scalable production deployment with orchestration and high availability
Airweave Cloud
Fully managed hosting - no infrastructure required
Configuration
Environment variables and advanced configuration options
Architecture Components
Airweave consists of several interconnected services:Core Services
Backend API
Backend API
Technology: FastAPI (Python 3.11)The main application server that handles:
- REST API endpoints
- Authentication and authorization
- Business logic and orchestration
- Database migrations (Alembic)
8001- API server9090- Metrics/monitoring
http://localhost:8001/health/readyFrontend UI
Frontend UI
Technology: React/TypeScript with ViteWeb-based user interface for:
- Managing collections and source connections
- Configuring integrations
- Monitoring sync jobs
- Testing search queries
8080 (default)Build: Multi-stage Docker build with production optimizationPostgreSQL
PostgreSQL
Version: 16Primary data store for:
- User accounts and permissions
- Collection and source metadata
- Sync job history and state
- Webhook subscriptions
5432Optimized settings:max_connections=200shared_buffers=256MBeffective_cache_size=1GB
Redis
Redis
Version: 7-alpineUsed for:
- Pub/sub messaging
- Session storage
- Webhook queue management (via Svix)
6379Search & Indexing
Vespa
Vespa
Version: 8Vector search engine for:
- Dense vector embeddings (configurable dimensions)
- Sparse BM25 embeddings
- Hybrid search ranking
- Real-time document indexing
8081- Query/Document API19071- Config server
- Dynamic schema templating based on embedding dimensions
- Automatic deployment via init container
- Persistent storage with Docker volumes
Embeddings Service (Optional)
Embeddings Service (Optional)
Image:
semitechnologies/transformers-inferenceLocal embedding generation using:- Model:
sentence-transformers-all-MiniLM-L6-v2 - Dimensions: 384
9878Note: Automatically skipped if OPENAI_API_KEY is provided. Uses ~2GB of memory.Workflow Orchestration
Temporal
Temporal
Version: 1.24.2Durable workflow engine for:
- Data sync orchestration
- Retry logic and error handling
- Long-running background tasks
- Scheduled sync jobs
7233- gRPC API8233- Internal metrics
Temporal UI
Temporal UI
Version: 2.26.2Web interface for:
- Workflow monitoring
- Task queue inspection
- Debugging failed workflows
8088Temporal Worker
Temporal Worker
Executes workflow tasks including:
- Connector sync activities
- Document processing
- Embedding generation
- Cleanup tasks
Additional Services
Svix
Svix
Webhook delivery systemManages:
- Webhook subscriptions
- Event delivery with retries
- Delivery logs and debugging
8071Features:- Redis-based queue
- Subnet whitelisting for testing
- Automatic database initialization
Resource Requirements
Minimum (Development)
- CPU: 2 cores
- Memory: 4GB RAM
- Storage: 10GB
- Docker: 20.10+ with Docker Compose
Recommended (Production)
- CPU: 4-8 cores
- Memory: 8-16GB RAM
- Storage: 50GB+ (depends on data volume)
- Network: Stable internet for connector syncs
Service Dependencies
The startup sequence is managed automatically via health checks:Storage Options
Airweave supports multiple storage backends for file attachments:| Backend | Use Case | Configuration |
|---|---|---|
| Filesystem | Local development, K8s PVC | STORAGE_BACKEND=filesystem |
| Azure Blob | Azure-based deployments | STORAGE_BACKEND=azure |
| AWS S3 | AWS deployments or S3-compatible | STORAGE_BACKEND=aws |
| GCP Storage | Google Cloud deployments | STORAGE_BACKEND=gcp |
By default, local deployments use filesystem storage at
./local_storage. See Configuration for cloud storage setup.Next Steps
Choose your deployment method
- Docker Compose for quick setup
- Kubernetes for production scale
Configure environment variables
Review Configuration for all available options