Query Collection
Authentication
Bearer authentication of the form Bearer <token>, where token is your auth token.
Path parameters
Headers
Request
Enable real-time streaming of the response
Enable LLM-generated answers based on the relevant sections retrieved. When false, returns raw search results.
Number of results to return. Only valid when inference=false. Not supported when inference=true (the agent controls its own search strategy).
Enable Voyage AI rerank-2.5 reranking for improved relevance ordering. Adds ~100-300ms latency.
Custom system prompt to override the default RAG prompt when inference=true. Allows customizing how the LLM processes and responds to the query with the retrieved context.
Response
Emitted when the AI agent begins a knowledge-base search.