Preprints.ai
Reference

Docs

REST API for retrieving assessments and paper data. All endpoints are unauthenticated. The GET endpoints are read-only; POST /v1/assess can trigger a new pipeline run.

Base URL

https://preprints.ai

All endpoints below are relative to this base.

Endpoints

GET /v1/stats

Aggregate counts for the live pipeline: total papers indexed and assessed, queue status, and the grade and novelty distributions.

curl https://preprints.ai/v1/stats
{
  "papers": { "total": 318526, "assessed": 1934, "fulltext": 316459 },
  "queue":  { "pending": 338, "processing": 6, "failed": 4 },
  "grades": { "A": 51, "B": 637, "C": 982, "D": 278, "E": 66 },
  "novelty": { "1": 120, "2": 340, "3": 890, "4": 300, "5": 60 }
}
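
As a rough illustration of consuming this response, the sketch below parses the sample payload shown above and converts the grade counts into shares. The helper name `grade_shares` is hypothetical. It normalizes by the sum of the grade counts rather than by `papers.assessed`, since the two figures do not match in the sample.

```python
import json

# Sample payload copied from the /v1/stats example above (not a live response).
SAMPLE = """{
  "papers": { "total": 318526, "assessed": 1934, "fulltext": 316459 },
  "queue":  { "pending": 338, "processing": 6, "failed": 4 },
  "grades": { "A": 51, "B": 637, "C": 982, "D": 278, "E": 66 },
  "novelty": { "1": 120, "2": 340, "3": 890, "4": 300, "5": 60 }
}"""


def grade_shares(stats: dict) -> dict:
    """Return each letter grade's share of all graded papers (0.0 to 1.0)."""
    grades = stats["grades"]
    total = sum(grades.values())
    if total == 0:
        return {letter: 0.0 for letter in grades}
    return {letter: count / total for letter, count in grades.items()}


stats = json.loads(SAMPLE)
shares = grade_shares(stats)
for letter, share in shares.items():
    print(f"{letter}: {share:.1%}")
```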

GET /v1/pipeline

Recent assessments plus a grade heatmap. Useful for live feeds or custom dashboards.

curl https://preprints.ai/v1/pipeline

GET /reviews/{doi}

Full assessment view for a paper. Supports any DOI that has been assessed.

curl https://preprints.ai/reviews/10.1101/2025.01.15.633214
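
A minimal sketch of building this URL from a DOI in Python. The helper `review_url` is hypothetical. It assumes the DOI's slash stays unescaped in the path, as in the curl example above; whether the server also accepts a percent-encoded slash is not stated here.

```python
from urllib.parse import quote

BASE_URL = "https://preprints.ai"


def review_url(doi: str) -> str:
    """Build the /reviews/{doi} URL, leaving the DOI's slashes unescaped."""
    doi = doi.strip()
    # Every DOI starts with the "10." directory indicator and has a "/" separator.
    if not doi.startswith("10.") or "/" not in doi:
        raise ValueError(f"not a DOI: {doi!r}")
    # safe="/" keeps the slash as in the curl example; other reserved
    # characters are percent-encoded.
    return f"{BASE_URL}/reviews/{quote(doi, safe='/')}"


print(review_url("10.1101/2025.01.15.633214"))
```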

POST /v1/assess

Request an assessment for a specific paper. Returns an existing assessment if one is cached, otherwise triggers a new pipeline run.

curl -X POST https://preprints.ai/v1/assess \
  -H "content-type: application/json" \
  -d '{"doi": "10.1101/2025.01.15.633214"}'
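
The same request can be sketched with Python's standard library. The helper `build_assess_request` is hypothetical, and the response shape is not documented in this section, so the sketch stops at constructing the request.

```python
import json
import urllib.request

ASSESS_URL = "https://preprints.ai/v1/assess"


def build_assess_request(doi: str) -> urllib.request.Request:
    """Build (but do not send) the POST /v1/assess request for one DOI."""
    body = json.dumps({"doi": doi}).encode("utf-8")
    return urllib.request.Request(
        ASSESS_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_assess_request("10.1101/2025.01.15.633214")

# To send it (performs a network call, so it is left commented out here):
# with urllib.request.urlopen(req, timeout=30) as resp:
#     result = json.load(resp)
```

Because a request for an uncached paper triggers a new pipeline run, a caller may want to avoid re-posting the same DOI in a tight loop.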

Grading scale

Every paper receives a 10-tier Publication Fit score (the primary public signal) plus a legacy two-character A–E × 1–5 grade (kept for backwards compatibility). The tier is derived deterministically from the grade — see methodology.

| Tier | Label | Journal anchor |
|---|---|---|
| 1 | Landmark | Nature, Science, Cell |
| 2 | Broad-impact | Nature subjournals, eLife, Cell Reports |
| 3 | Field-flagship | PLOS Biology, society flagship journals |
| 4 | Specialty solid | Nature Communications, JBC, NAR |
| 5 | PLOS One bar (corpus median) | Methodologically adequate, narrower scope |
| 6 | Specialty mid-tier | MDPI, Frontiers field journals |
| 7 | Major revisions needed | Not publishable as-is |
| 8 | Reject: methodology broken | Don't publish |
| 9 | Inadequate evidence | Panel verdict on deep-E papers |
| 10 | Not for publication | Paper-mill / fabrication (Layer 1 deterministic only) |

Legacy 2-axis grade (A1–E5)

| Letter | Meaning | Integrity score range |
|---|---|---|
| A | Exceptional evidence: rigorous methods, comprehensive controls | ≥ 0.85 |
| B | Compelling: strong methods, minor gaps | 0.70–0.84 |
| C | Solid: appropriate methods, some limitations | 0.55–0.69 |
| D | Incomplete: significant methodological gaps | 0.40–0.54 |
| E | Inadequate: fundamental flaws, unreliable conclusions | < 0.40 |
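
For readers who want to bucket integrity scores themselves, here is a minimal sketch of the letter ranges as a lookup. It illustrates the table only and is not the pipeline's own derivation. The name `letter_for_score` is hypothetical, and the sketch assumes that a score falling between two printed ranges (for example 0.845) goes to the lower letter.

```python
# Lower bound of each letter's integrity-score range, from the table above.
# Anything below the last floor is an E.
LETTER_FLOORS = [("A", 0.85), ("B", 0.70), ("C", 0.55), ("D", 0.40)]


def letter_for_score(score: float) -> str:
    """Map an integrity score to its letter grade using the range floors."""
    for letter, floor in LETTER_FLOORS:
        if score >= floor:
            return letter
    return "E"


for s in (0.91, 0.70, 0.54, 0.12):
    print(s, letter_for_score(s))
```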

Significance digit — lower number = higher novelty. A1 is the best possible grade; E5 is the worst. (Inverted 2026-05-12; the prior convention had 5 as Landmark.)

| Digit | Significance |
|---|---|
| 1 | Landmark: field-defining, transformative |
| 2 | Fundamental: major conceptual advance |
| 3 | Important: significant advance, broad interest |
| 4 | Valuable: addresses a real gap, moderate novelty |
| 5 | Useful: incremental advance, narrow scope |
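
A small sketch of splitting a two-character legacy grade into its letter and digit, using the short labels from the two tables above. `Grade` and `parse_grade` are hypothetical names. The sketch assumes the current convention (1 = Landmark); grades recorded before the 2026-05-12 inversion would need their digit flipped first.

```python
from typing import NamedTuple

LETTERS = {"A": "Exceptional", "B": "Compelling", "C": "Solid",
           "D": "Incomplete", "E": "Inadequate"}
DIGITS = {1: "Landmark", 2: "Fundamental", 3: "Important",
          4: "Valuable", 5: "Useful"}


class Grade(NamedTuple):
    letter: str  # evidence strength, A (best) to E (worst)
    digit: int   # significance, 1 (best) to 5 (worst)

    def describe(self) -> str:
        return f"{LETTERS[self.letter]} / {DIGITS[self.digit]}"


def parse_grade(text: str) -> Grade:
    """Parse a grade such as 'B3' into its letter and significance digit."""
    text = text.strip().upper()
    if len(text) != 2 or text[0] not in LETTERS or text[1] not in "12345":
        raise ValueError(f"not an A1-E5 grade: {text!r}")
    return Grade(text[0], int(text[1]))


print(parse_grade("B3").describe())
```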

Rate limits

The public API is open and requires no authentication. A fair-use throttle of 120 requests per minute per IP applies. If you need bulk access for research, get in touch: we can share a data dump so you do not have to hammer the API.
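
One way to stay under the throttle is to pace requests on the client. The sketch below spaces calls 0.5 seconds apart (60 s / 120 requests). The `Throttle` class is hypothetical, and how the server responds once the limit is exceeded is not documented here, so the sketch only prevents bursts and does not handle rejections.

```python
import time


class Throttle:
    """Space calls at least 60 / per_minute seconds apart."""

    def __init__(self, per_minute: int = 120,
                 clock=time.monotonic, sleep=time.sleep):
        self.interval = 60.0 / per_minute
        self._clock = clock      # injectable for testing
        self._sleep = sleep      # injectable for testing
        self._next_allowed = None

    def wait(self) -> float:
        """Block until the next request may be sent; return seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._next_allowed is not None and now < self._next_allowed:
            delay = self._next_allowed - now
            self._sleep(delay)
            now = self._next_allowed
        self._next_allowed = now + self.interval
        return delay


throttle = Throttle(per_minute=120)
# Call throttle.wait() immediately before each request to the API.
```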

Webhooks

Not yet public. If you're building an integration that needs push-style delivery of new assessments, contact us — we have a private beta.

Machine-readable specs

Full OpenAPI 3.1 spec at /openapi.json. LLM-friendly summary at /llms.txt.

Methodology

For the full pipeline architecture — four layers, eleven agents, rule-based grade derivation, Opus arbitration — see the methodology page.

Stability

The /v1 API is stable. New fields will be added additively; existing fields will not change shape within v1. Breaking changes go to /v2.
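
Since new fields may appear within v1, clients are safest reading only the fields they know and ignoring the rest. A minimal sketch using the `queue` object from the /v1/stats example; the `QueueStatus` class is hypothetical, and the extra `retrying` field is invented purely to show an unknown field being ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QueueStatus:
    pending: int
    processing: int
    failed: int

    @classmethod
    def from_json(cls, obj: dict) -> "QueueStatus":
        # Pick out known fields explicitly; unknown keys are ignored
        # instead of raising, so additive changes do not break the client.
        return cls(
            pending=obj["pending"],
            processing=obj["processing"],
            failed=obj["failed"],
        )


# "retrying" is a made-up field standing in for a future addition.
queue = QueueStatus.from_json(
    {"pending": 338, "processing": 6, "failed": 4, "retrying": 2}
)
print(queue)
```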

Source & transparency

Every assessment is reproducible: the paper text, model versions, prompts, and agent responses are logged. Grades are a deterministic function of the text labels (evidence strength, significance) that agents emit, so two runs on the same paper differ only where model sampling yields different labels.