What is Airweave?
Airweave is an open-source platform that connects to your apps, tools, and databases, continuously syncs their data, and exposes it through a unified, LLM-friendly search interface. AI agents query Airweave to retrieve relevant, grounded, up-to-date context from multiple sources in a single request.Connect your data sources
50+ integrations including GitHub, Notion, Slack, Stripe, Gmail, and databases
Search with natural language
AI-powered semantic search with filters, reranking, and query expansion
Build AI agents
Native integrations with agent frameworks via SDKs, REST API, and MCP
Deploy anywhere
Cloud-hosted at app.airweave.ai or self-hosted with Docker
Where it fits
Airweave sits between your data sources and AI systems as shared retrieval infrastructure. It handles authentication, ingestion, syncing, indexing, and retrieval so you don’t have to rebuild fragile pipelines for every agent or integration.How it works
Connect your data sources
Authenticate with your apps and databases using OAuth, API keys, or database credentials. Airweave supports 50+ integrations including:
- Productivity: Notion, Slack, Gmail, Google Drive, Confluence
- Development: GitHub, GitLab, Bitbucket, Jira, Linear
- Business: Stripe, HubSpot, Salesforce, Zendesk
- Databases: PostgreSQL, MySQL, MongoDB
Data syncs automatically
Airweave continuously syncs data from your sources, extracts entities (documents, issues, emails, customers), and indexes them for semantic search. Configure sync schedules or enable real-time continuous syncing.
Search with natural language
Query your collections using the Python SDK, TypeScript SDK, or REST API. Results are ranked by relevance, filtered by source and metadata, and optionally reranked with AI for improved accuracy.
Example: Search across all your data
Key features
Unified search across all sources
Unified search across all sources
Create collections that group multiple data sources together. Search across GitHub, Notion, Slack, databases, and more with a single query. No need to query each source separately or manage multiple APIs.
AI-powered semantic search
AI-powered semantic search
Airweave uses advanced embedding models to understand the meaning of your queries, not just keyword matching. Supports hybrid search (semantic + keyword), query expansion, and AI reranking for maximum accuracy.
Automatic sync and indexing
Automatic sync and indexing
Set up a source connection once and Airweave handles the rest. Data syncs automatically on a schedule or continuously. Incremental syncing ensures you only process changes, not entire datasets.
Flexible filtering and ranking
Flexible filtering and ranking
Filter results by source, date range, status, or custom metadata. Apply recency bias to prefer newer content. Set score thresholds for high-confidence results only.
Production-ready infrastructure
Production-ready infrastructure
Built on PostgreSQL, Vespa, Temporal, and Redis. Scales to millions of documents. Self-host with Docker or use the managed cloud service. SOC 2 Type II compliant.
Native agent integrations
Native agent integrations
Works with popular AI frameworks out of the box. Model Context Protocol (MCP) server included. Python and TypeScript SDKs with full type safety. REST API for any language.
Get started in minutes
Cloud (Recommended)
Hosted service with free tier. No setup required.
Self-hosted
Run locally with Docker in under 5 minutes.
Follow the quickstart
Create your first collection and search in under 10 minutes
Community and support
Discord
Join our community for help and discussions
GitHub
Star the repo and contribute
Follow for updates and announcements