Research Datasets Search API
Query NCBI SRA, GEO, BioSample, BioProject, dbGaP, and Taxonomy from a single endpoint. Pre-shaped JSON with accessions, platforms, strategies, and organisms for AI-driven omics-dataset discovery.
What is the Genomics Datasets Search API?
Semantic search across NCBI SRA, GEO, BioSample, BioProject, dbGaP, and Taxonomy. Returns ranked records with source URLs ready for downstream LLMs.
- Endpoint: POST /api/search/genomics-datasets
- Authentication: API key, sent in the x-api-key header
- Cost: 30 credits / call
- Rate limit: 60 req / min
- Response: Structured JSON
Integration guide
Copy a snippet, replace YOUR_API_KEY with your own key, and run it. The endpoint works with any HTTP client; the example below uses cURL.
POST https://www.apipick.com/api/search/genomics-datasets

Request parameters (JSON body):
- query (string, required): Natural-language search query
- max_num_results (integer, optional): 1–5, default 5
- relevance_threshold (number, optional): 0.0–1.0 quality filter
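
The request described by these parameters can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the function names `build_search_request` and `search` are hypothetical, and the range checks simply mirror the limits listed above.

```python
import json
import urllib.request

API_URL = "https://www.apipick.com/api/search/genomics-datasets"


def build_search_request(api_key, query, max_num_results=5, relevance_threshold=None):
    """Build (but do not send) a POST request for the search endpoint.

    Hypothetical helper: the parameter names come from the documented
    request body; the validation mirrors the documented ranges.
    """
    if not 1 <= max_num_results <= 5:
        raise ValueError("max_num_results must be between 1 and 5")
    body = {"query": query, "max_num_results": max_num_results}
    if relevance_threshold is not None:
        if not 0.0 <= relevance_threshold <= 1.0:
            raise ValueError("relevance_threshold must be between 0.0 and 1.0")
        body["relevance_threshold"] = relevance_threshold
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-api-key": api_key},
        method="POST",
    )


def search(api_key, query, **options):
    """Send the request and return the decoded JSON response."""
    request = build_search_request(api_key, query, **options)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling `search("YOUR_API_KEY", "RNA-seq human liver SRA runs", max_num_results=3)` would send one request and consume credits, so the builder is kept separate to allow inspecting the request without touching the network.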

Example request (cURL):
curl -X POST "https://www.apipick.com/api/search/genomics-datasets" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_API_KEY" \
  -d '{
    "query": "RNA-seq human liver SRA runs",
    "max_num_results": 5
  }'

Example response:
{
  "query": "RNA-seq human liver SRA runs",
  "results": [
    {
      "title": "Example result",
      "url": "https://example.com/article",
      "snippet": "Short excerpt of the page content…",
      "source_type": "web",
      "published_at": "2026-04-15",
      "score": 0.92
    }
  ],
  "result_count": 1,
  "credits_used": 30,
  "remaining_credits": 99
}

Rate limits
Throttling is per API key, sliding 60-second window. Hit the limit and you get a clean 429 with a Retry-After header.
- 60 req/min: Per API key, per endpoint. Sliding 60-second window.
- 3 concurrent: Max simultaneous in-flight requests per API key.
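
One way a client might respect both limits is sketched below: a semaphore caps in-flight calls at three, and a 429 is retried after the number of seconds given in Retry-After. The helper names `retry_delay` and `call_with_backoff` are hypothetical, `send` stands for any zero-argument callable that performs the HTTP request with `urllib`, and the one-second fallback delay is an assumption for the case where the header is missing or not numeric.

```python
import threading
import time
import urllib.error

MAX_CONCURRENT = 3  # documented cap on simultaneous requests per API key
_slots = threading.BoundedSemaphore(MAX_CONCURRENT)


def retry_delay(headers, default=1.0):
    """Return the Retry-After value in seconds, or `default` if unusable."""
    value = headers.get("Retry-After") if headers is not None else None
    try:
        return max(0.0, float(value))
    except (TypeError, ValueError):
        return default


def call_with_backoff(send, max_attempts=3, sleep=time.sleep):
    """Run send() under the concurrency cap, retrying only on HTTP 429."""
    for attempt in range(1, max_attempts + 1):
        with _slots:  # hold a slot only while the request is in flight
            try:
                return send()
            except urllib.error.HTTPError as err:
                if err.code != 429 or attempt == max_attempts:
                    raise
                delay = retry_delay(err.headers)
        # The slot is released before waiting so other threads can proceed.
        sleep(delay)
```

The semaphore only limits concurrency within one process; several processes sharing an API key would need their own coordination.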

Rate-limit headers:
- X-RateLimit-Limit: Maximum requests allowed per minute
- X-RateLimit-Remaining: Requests remaining in the current window
- X-RateLimit-Reset: Seconds until the current window resets
- Retry-After: Seconds to wait before retrying (only on 429)

Example 429 response:
HTTP/1.1 429 Too Many Requests
Retry-After: 12
X-RateLimit-Limit: 60
X-RateLimit-Remaining: 0
X-RateLimit-Reset: 12

{
  "error": "rate_limit_exceeded",
  "message": "Rate limit exceeded: 60 requests/minute per API key. Retry after 12s.",
  "retry_after": 12
}

Frequently Asked Questions
Why is this 30 credits per call?
SRA, GEO, and the other NCBI archive databases are large, frequently updated repositories that require ongoing index maintenance, so the endpoint is priced at 30 credits (≈ $0.03 per call).
Which sources are covered?
NCBI SRA (sequencing runs), GEO (gene-expression datasets), BioSample, BioProject, dbGaP (genotype–phenotype), and Taxonomy. All are queried in parallel and ranked together.
Can I search by study, organism, or assay?
Yes. Pass an accession, an organism, an assay type, or a natural-language query like 'RNA-seq human liver' and the endpoint ranks the most relevant datasets.
What fields come back?
Records include titles, accessions, organism, platform/instrument, sequencing strategy, and dataset size where available, plus a link to the NCBI record.
Tool schema for OpenAI / Claude?
GET /api/search/genomics-datasets/tool-schema returns ready-to-paste OpenAI function and Claude tool-use definitions.
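
Fetching that schema could look like the sketch below. It rests on assumptions: the host is taken to be the same as the search endpoint, the function names are hypothetical, and since this page does not say whether the schema route needs authentication or what the JSON looks like, the key is optional and the payload is returned as-is.

```python
import json
import urllib.request

# Assumed host: same base URL as the search endpoint.
SCHEMA_URL = "https://www.apipick.com/api/search/genomics-datasets/tool-schema"


def build_schema_request(api_key=None):
    """Build a GET request for the tool-schema route (hypothetical helper)."""
    headers = {"Accept": "application/json"}
    if api_key:
        # Whether this route requires a key is not documented here.
        headers["x-api-key"] = api_key
    return urllib.request.Request(SCHEMA_URL, headers=headers, method="GET")


def fetch_tool_schema(api_key=None):
    """Return the decoded JSON payload without assuming its structure."""
    with urllib.request.urlopen(build_schema_request(api_key), timeout=30) as response:
        return json.load(response)
```

Inspect the returned object before passing any part of it to an OpenAI or Claude client, since its exact layout is not specified on this page.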
Explore other search APIs
Real-time semantic web search built for LLM tool calling. Returns ranked titles, URLs, and clean snippets pre-shaped for agent consumption. Country and date filters supported.
Real-time news search across major outlets. Date-range and country filtering for time-sensitive queries. Built for morning briefings, market-news agents, and RAG pipelines.
Extract clean readable content from up to 25 URLs per call. Strips ads, nav, and boilerplate; returns markdown-flavoured text ready for LLM ingestion. 2 credits per URL.
Search peer-reviewed papers and pre-prints across arXiv, PubMed, bioRxiv, and medRxiv from one endpoint. Built for AI-driven literature review, RAG over scientific corpora, and citation extraction.
Search clinical trials, FDA drug labels, and ChEMBL bioactivity. Built for medical research, drug repurposing, and AI-driven clinical decision support workflows.
Search SEC filings (10-K, 10-Q, 8-K), US earnings call transcripts, and equity statistics. Built for AI-driven due diligence, fundamental analysis, and financial RAG pipelines.
Semantic search over global patent filings across USPTO, EPO, WIPO, and major national offices. Built for prior-art research, IP landscaping, and AI-driven competitive intelligence.
Search Polymarket and Kalshi prediction-market contracts on politics, economics, sports, and current events. Built for crowd-forecast retrieval and probability-grounded LLM answers.
Semantic search across UK case law and primary legislation from one endpoint. Built for legal research, compliance review, statutory interpretation, and AI-driven legal-tech workflows.
Search CISA Known Exploited Vulnerabilities, NVD CVE records, EPSS exploit scores, and MITRE ATT&CK techniques. Built for AI-driven vulnerability triage, threat intelligence, and security operations.
Search global and US equities, crypto, forex, ETFs, mutual funds, commodities, and US market movers. Built for AI-driven price lookups, market-data retrieval, and trading research.
Search FRED, US Bureau of Labor Statistics, World Bank indicators, IMF macro data, US federal spending, and German labour statistics. Built for AI-driven macroeconomic research and analysis.
Search US public-company balance sheets, income statements, cash flow statements, dividends, and insider transactions. Built for AI-driven fundamental analysis and due diligence.
Search the US earnings calendar for upcoming report dates, EPS/revenue estimates, and before/after-market timing. Built for AI-driven event-driven trading and research.
Search ChEMBL bioactivity, PubChem chemical structures, and Open Targets target-disease associations. Built for AI-driven drug discovery, cheminformatics, and biomedical research.
Search NCBI Gene, dbSNP, ClinVar, dbVar, MedGen, and GTR for genes, variants, and clinical significance. Built for AI-driven genomics and variant-interpretation research.
Search NCBI Nucleotide, Protein, Genome, Assembly, CDD, and Structure for sequences, assemblies, and protein domains. Built for AI-driven bioinformatics.
Search WHO global health statistics, NIH research grants, and openFDA drug adverse-event reports. Built for AI-driven public-health research, grant discovery, and pharmacovigilance.
Search UK Parliament debates, written questions, and member activity (Hansard). Built for AI-driven civic-tech tools, legislative monitoring, and policy research.
Search licensed Wiley finance journals and textbooks. Built for AI-driven academic-grade financial research, literature review, and investment analysis.