Welcome Aboard

Captain File Search API

Captain is a deterministic File Search API for unstructured data.
Connect cloud storage, index files, and retrieve source chunks.

File Search API

  • Deterministic Retrieval: Ask questions in plain English and get retrieved chunks with source metadata.
  • Cloud Storage Integration: Connect AWS S3 or GCS buckets and Captain processes and cleans files over a single API call.
  • Multi-Tenancy: Organize collections to scope different teams, folders, projects, etc.
  • Chunk-Level Workflow: Use stable chunk IDs, regions, custom metadata, and relations for source-grounded applications.
Complex Docs, Images, and Sheets
Complex Docs, Images, and Sheets

Captain can search across very large documents, text-heavy or visual images, and multi-faceted spreadsheets.


Automatic VLM, OCR, and computer vision pipelines support search over visual and text-heavy content.

Getting Help

Ready to Start?