Query Collection
Authentication
Path parameters
Headers
Request
Enable real-time streaming of the response.
Enable LLM-generated answers based on the relevant sections retrieved. When false, returns raw search results.
Number of results to return. Only valid when inference=false; when inference=true the agent controls its own search strategy, so this parameter is not supported.
Enable reranking with Voyage AI rerank-2.5 for improved relevance ordering. Adds roughly 100-300 ms of latency.
Custom system prompt that overrides the default RAG prompt when inference=true. Use it to control how the LLM processes the retrieved context and responds to the query.
Include normalized bounding box layout data for each search result. Returns element-level positions (titles, paragraphs, tables, figures, form fields) with page coordinates for PDF and DOCX files. Only supported with inference=false.
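The constraints above (a result count and layout data only with inference=false, a custom prompt only with inference=true) can be checked on the client before a request is sent. The following is a minimal sketch, not the official client: only `inference` and `stream` are names taken from the descriptions above, while `query`, `top_k`, `rerank`, `system_prompt`, and `include_layout` are hypothetical field names chosen for illustration, and the default for `stream` is an assumption.

```python
from typing import Any, Dict, Optional


def build_query_body(
    query: str,                           # hypothetical field name
    inference: bool,
    stream: bool = False,                 # default is an assumption
    top_k: Optional[int] = None,          # hypothetical field name
    rerank: bool = False,                 # hypothetical field name
    system_prompt: Optional[str] = None,  # hypothetical field name
    include_layout: bool = False,         # hypothetical field name
) -> Dict[str, Any]:
    """Assemble a request body, rejecting combinations the descriptions rule out."""
    if inference and top_k is not None:
        raise ValueError("result count is only valid when inference=false")
    if inference and include_layout:
        raise ValueError("layout data is only supported with inference=false")

    body: Dict[str, Any] = {"query": query, "inference": inference, "stream": stream}
    if top_k is not None:
        body["top_k"] = top_k
    if rerank:
        body["rerank"] = True
    # The custom prompt is described only for inference=true,
    # so this sketch leaves it out of raw-search requests.
    if system_prompt is not None and inference:
        body["system_prompt"] = system_prompt
    if include_layout:
        body["include_layout"] = True
    return body
```

Failing early on the client side is a design choice of this sketch; the server may report these combinations differently.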
Response
AI-generated summary/response (when inference=true)
Alias for summary (v1 compatibility)
List of relevant documents (when inference=true)
Raw search results with content (when inference=false)
Unique request identifier (used for streaming)
Streaming configuration (when stream=true)
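Because the populated response fields depend on the inference setting, a caller typically branches on it when reading a decoded response. The sketch below illustrates one way to do that under stated assumptions: `summary` is the only field name confirmed by the descriptions above, while `answer` (standing in for the v1 alias), `documents`, and `results` are hypothetical keys.

```python
from typing import Any, Dict, List, Tuple


def read_query_response(response: Dict[str, Any], inference: bool) -> Tuple[str, List[Any]]:
    """Return (answer_text, items) from an already-decoded response body."""
    if inference:
        # Prefer `summary`; fall back to the v1-compatibility alias
        # (the key `answer` is a hypothetical stand-in for its real name).
        text = response.get("summary") or response.get("answer") or ""
        items = response.get("documents") or []  # hypothetical key
    else:
        # Raw search mode: no generated text, only search results.
        text = ""
        items = response.get("results") or []    # hypothetical key
    return text, items
```

For example, `read_query_response(decoded, inference=False)` returns an empty string together with whatever raw results the response carries.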