Overview
A collection serves as a namespace for your data sources. When you create a collection, Airweave:- Assigns a unique
readable_idfor use in API calls and URLs - Configures vector storage with your organization’s embedding model
- Sets up a dedicated search endpoint
- Creates a container for source connections to sync data into
Collections are isolated per organization. Each collection can have its own sync configuration and contains data from one or more authenticated source connections.
Creating a Collection
Create a new collection to start organizing your data sources.Choose a descriptive name
Pick a name that clearly identifies the data contained within (e.g., “Customer Support”, “Finance Data”, “Engineering Docs”).
Optionally set a readable_id
If you don’t provide one, Airweave will auto-generate a URL-safe identifier from your collection name with a random suffix (e.g.,
customer-support-x7k9m).Request Parameters
Human-readable display name for the collection. Must be between 4 and 64 characters.Examples:
"Finance Data", "Customer Support", "Marketing Analytics"URL-safe unique identifier. Must contain only lowercase letters, numbers, and hyphens.
Cannot start or end with a hyphen.If not provided, automatically generated from the name with a random 6-character suffix.Examples:
"finance-data-ab123", "customer-support-xy789"Optional default sync configuration for all syncs in this collection. Can be overridden at the sync or job level.
Response Fields
The API returns aCollection object with the following fields:
Unique system identifier (auto-generated)
Display name of the collection
URL-safe identifier for API endpoints
Current operational status:
NEEDS_SOURCE: No authenticated connections or no successful syncs yetACTIVE: At least one connection has completed a syncERROR: All connections failed their last sync
Embedding dimensions (derived from deployment metadata)
Name of the embedding model (e.g.,
"text-embedding-3-large")Your organization identifier
Timestamp when created
Timestamp of last modification
Listing Collections
Retrieve all collections in your organization with optional pagination and search filtering.Query Parameters
Number of collections to skip for pagination (min: 0)
Maximum number of collections to return (1-1000)
Filter collections by name or readable_id (case-insensitive partial match)
Getting a Collection
Retrieve details of a specific collection by itsreadable_id.
Updating a Collection
Modify a collection’s name or sync configuration. Thereadable_id cannot be changed after creation.
Deleting a Collection
Permanently delete a collection and all associated data. This operation cannot be undone.Collection Status
Collections have three possible statuses that reflect their operational state:NEEDS_SOURCE
NEEDS_SOURCE
The collection has no authenticated source connections, or connections exist but haven’t completed a successful sync yet.Next steps: Add and authenticate a source connection, then trigger a sync.
ACTIVE
ACTIVE
At least one source connection has completed a sync successfully, or a sync is currently running.This is the normal operating state for collections with data.
ERROR
ERROR
All source connections have failed their most recent sync attempt.Action required: Check the sync error logs and fix authentication or configuration issues.
Next Steps
Search
Learn how to search across your collection data
Webhooks
Set up real-time notifications for collection events