Index S3 File
Index a single file from an S3 bucket into a collection. Returns a job_id for tracking progress.
Path parameters
collection_name
Name of the collection to index into
Headers
Authorization
Captain API key for authentication
X-Organization-ID
Organization UUID
Request
This endpoint expects an object.
bucket_name
Name of the S3 bucket
file_uri
S3 URI of the file to index, in the format s3://bucket-name/path/to/file.pdf
aws_access_key_id
AWS access key ID with read access to the bucket
aws_secret_access_key
AWS secret access key
processing_type
Document processing type. ‘advanced’ uses agentic OCR with AI-enhanced extraction for complex layouts, tables, figures, charts, and documents containing images. ‘basic’ provides reliable OCR optimized for general document indexing and high-volume processing.
Allowed values: advanced, basic
bucket_region
AWS region where the bucket is located
custom_metadata
Custom metadata to attach to all chunks from this file. Keys must be strings. Each value must be a string, integer, float, boolean, or array of strings.
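The request fields above can be assembled and checked on the client before anything is sent. The following is a minimal sketch, not the official client: the base URL, the route, the Bearer scheme in the Authorization header, and the function names are all hypothetical, since this page does not state them. It also assumes the body is sent as JSON and treats processing_type, bucket_region, and custom_metadata as optional, which this page does not confirm.

```python
from urllib.parse import quote, urlparse

# Hypothetical placeholders: the host and route are not given on this page.
BASE_URL = "https://api.example.com"
ROUTE = "/collections/{collection_name}/index/s3-file"

PROCESSING_TYPES = ("advanced", "basic")  # the two values named above


def validate_custom_metadata(metadata):
    """Check the documented rule: string keys; values str, int, float, bool, or list of str."""
    for key, value in metadata.items():
        if not isinstance(key, str):
            raise TypeError(f"metadata key {key!r} must be a string")
        if isinstance(value, (str, int, float, bool)):
            continue
        if isinstance(value, list) and all(isinstance(v, str) for v in value):
            continue
        raise TypeError(f"unsupported metadata value for {key!r}: {value!r}")


def build_index_request(collection_name, api_key, organization_id, *,
                        bucket_name, file_uri,
                        aws_access_key_id, aws_secret_access_key,
                        processing_type=None, bucket_region=None,
                        custom_metadata=None):
    """Return the URL, headers, and JSON body for the request without sending it."""
    parsed = urlparse(file_uri)
    if parsed.scheme != "s3" or not parsed.netloc or not parsed.path.strip("/"):
        raise ValueError("file_uri must look like s3://bucket-name/path/to/file.pdf")

    body = {
        "bucket_name": bucket_name,
        "file_uri": file_uri,
        "aws_access_key_id": aws_access_key_id,
        "aws_secret_access_key": aws_secret_access_key,
    }
    # Optional fields are omitted when not supplied (an assumption of this sketch).
    if processing_type is not None:
        if processing_type not in PROCESSING_TYPES:
            raise ValueError(f"processing_type must be one of {PROCESSING_TYPES}")
        body["processing_type"] = processing_type
    if bucket_region is not None:
        body["bucket_region"] = bucket_region
    if custom_metadata is not None:
        validate_custom_metadata(custom_metadata)
        body["custom_metadata"] = custom_metadata

    headers = {
        "Authorization": f"Bearer {api_key}",  # assumed scheme, not confirmed here
        "X-Organization-ID": organization_id,
        "Content-Type": "application/json",
    }
    url = BASE_URL + ROUTE.format(collection_name=quote(collection_name, safe=""))
    return {"url": url, "headers": headers, "json": body}
```

The returned dictionary can be handed to any HTTP client as a POST. Validating locally surfaces malformed URIs and unsupported metadata types before credentials leave the machine.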
Response
Indexing Job Started
job_id
status
Allowed values:
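A caller will usually want to keep the job_id so progress can be tracked later. The sketch below assumes the response body is JSON with top-level job_id and status fields. The IndexingJob class and parse_index_response function are hypothetical names, and the status string in the usage example is a placeholder, because the allowed status values are not listed here.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class IndexingJob:
    """Holds the two documented response fields."""
    job_id: str
    status: str


def parse_index_response(body: str) -> IndexingJob:
    """Parse a JSON response body and fail loudly if a documented field is absent."""
    data = json.loads(body)
    missing = [name for name in ("job_id", "status") if name not in data]
    if missing:
        raise ValueError(f"response is missing fields: {', '.join(missing)}")
    return IndexingJob(job_id=str(data["job_id"]), status=str(data["status"]))


# Usage with placeholder values (not real API output).
job = parse_index_response('{"job_id": "job-123", "status": "placeholder-status"}')
print(job.job_id)
```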