Scientific Search
Scientific Search gives you a deterministic way to find scientific material and retrieve the complete document text as markdown. It does not synthesize an answer. It returns source documents that your application can inspect, store, or index into File Search.
What It Covers
PubMed abstracts and Semantic Scholar papers are not part of this deterministic workflow because they do not guarantee a complete, freely retrievable document for every result. Scientific Search is intentionally narrower: every returned result includes full_text_available: true and a download_url for the full markdown document.
How It Works
- Search scientific documents with GET /v2/datasets/scientific/search.
- Read results[].download_url from the response.
- Fetch the full markdown document from that URL.
- Index the markdown into a Captain collection if you want retrieval, filtering, relations, or RAG over those scientific documents.
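The first three steps above can be sketched as a small loop. This is a minimal sketch, not the official client: `collect_documents` and the injected `get_json` callable are hypothetical names, and the sketch assumes the search response is a JSON object whose `results` entries carry `download_url`, as described on this page. The transport (base URL, headers, HTTP library) is left to the caller.

```python
from typing import Callable, List
from urllib.parse import urlencode

SEARCH_PATH = "/v2/datasets/scientific/search"


def collect_documents(query: str, get_json: Callable[[str], dict]) -> List[dict]:
    """Search, then fetch the full document behind every result.

    `get_json` is a hypothetical caller-supplied function that performs an
    authenticated GET on a URL or path and returns the decoded JSON body.
    """
    # Step 1: search scientific documents.
    search = get_json(f"{SEARCH_PATH}?{urlencode({'q': query})}")

    documents = []
    # Step 2: read results[].download_url from the response.
    for result in search.get("results", []):
        # Step 3: fetch the full markdown document from that URL.
        documents.append(get_json(result["download_url"]))
    return documents
```

Injecting `get_json` keeps the flow independent of any particular HTTP library and makes it easy to exercise with canned responses.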
All requests require Authorization: Bearer {api_key}. Include X-Organization-ID only when your key is not already scoped to an organization.
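The header rule above can be expressed as a small helper. This is an illustrative sketch; `build_headers` is a hypothetical name, and only the two header names come from this page.

```python
from typing import Dict, Optional


def build_headers(api_key: str, organization_id: Optional[str] = None) -> Dict[str, str]:
    """Return the headers for a Scientific Search request."""
    headers = {"Authorization": f"Bearer {api_key}"}
    # Send X-Organization-ID only when the key is not already scoped to an organization.
    if organization_id:
        headers["X-Organization-ID"] = organization_id
    return headers
```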
Search Documents
Use q for the search query. Results are gathered from the fixed source set in a fixed order, so the same query parameters produce the same result shape.
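As a rough illustration, a search request could be built with the Python standard library as follows. `BASE_URL` is a placeholder assumption (this page does not state the API host), and `build_search_request` is a hypothetical helper name. Only `q` is set because it is the only query parameter described here.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder host: an assumption, replace with the real API base URL.
BASE_URL = "https://api.example.com"


def build_search_request(query: str, api_key: str) -> Request:
    """Build (but do not send) a GET request for the search endpoint."""
    # urlencode escapes the query text so spaces and symbols are URL-safe.
    params = urlencode({"q": query})
    url = f"{BASE_URL}/v2/datasets/scientific/search?{params}"
    return Request(url, headers={"Authorization": f"Bearer {api_key}"}, method="GET")
```

Passing the returned object to `urllib.request.urlopen` would send it; decoding the body with `json.load` assumes the endpoint responds with JSON.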
Search Response
Fetch Full Markdown
Call the download_url from a search result, or construct the URL from the source type and document ID.
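Both options can be sketched in a few lines. The path template is the documents endpoint named on this page; `BASE_URL`, the helper names, and the example values `"pmc"` and `"PMC1234567"` are assumptions for illustration, since the accepted `doc_type` values are not listed here. The sketch also does not assume whether `download_url` is absolute or relative.

```python
from urllib.parse import quote, urljoin

# Placeholder host: an assumption, replace with the real API base URL.
BASE_URL = "https://api.example.com"


def document_url(doc_type: str, doc_id: str) -> str:
    """Construct the document URL from a source type and document ID."""
    # Escape each path segment so unusual IDs cannot alter the path.
    return (
        f"{BASE_URL}/v2/datasets/scientific/documents/"
        f"{quote(doc_type, safe='')}/{quote(doc_id, safe='')}"
    )


def resolve_download_url(result: dict) -> str:
    """Return a fetchable URL for a search result's download_url."""
    # urljoin keeps absolute URLs unchanged and resolves relative paths
    # against BASE_URL, so either form of download_url works.
    return urljoin(BASE_URL, result["download_url"])


# Hypothetical values, for illustration only.
example = document_url("pmc", "PMC1234567")
```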
Document Response
PMC document:
ClinicalTrials.gov document:
If a document cannot be retrieved, the response may set found to false and include a reason. The endpoint may instead return an error for invalid IDs, unknown document types, or upstream failures.
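A caller might guard against the not-found case before using a document. This is a minimal sketch that relies only on the `found` and `reason` fields mentioned above; `DocumentUnavailable` and `require_document` are hypothetical names, and the rest of the response shape is not assumed.

```python
class DocumentUnavailable(Exception):
    """Raised when a document response reports found: false."""


def require_document(payload: dict) -> dict:
    """Return the payload unchanged, or raise if the document was not found."""
    # Treat only an explicit `found: false` as a miss; a missing key is not
    # interpreted, because the full response schema is not assumed here.
    if payload.get("found") is False:
        raise DocumentUnavailable(payload.get("reason") or "no reason given")
    return payload
```

HTTP-level failures are a separate path: with the standard library they would surface as `urllib.error.HTTPError` from `urlopen`, before any payload reaches this check.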
Index Into File Search
Scientific Search returns source markdown. File Search indexes and retrieves that markdown.
Then query the collection with File Search or inspect the exact request shape in Query - v3.
Fields
Search Request
Search Result
Document Response
Source Search and Scrape
Scientific also has scraper-backed source search endpoints for supported web publications:
Use those only when you specifically need publication URL discovery and scraping. For deterministic scientific document retrieval, prefer GET /v2/datasets/scientific/search and GET /v2/datasets/scientific/documents/{doc_type}/{doc_id}.