Deterministic checks
Paper-mill detection, statcheck p-value verification, GRIM tests, data availability, retraction cross-referencing — before any LLM runs.
Nine specialist agents review every preprint independently, then deliberate to produce a consensus grade on evidence strength and significance.
— papers assessed across — indexed preprints from bioRxiv and medRxiv.
Machine-generated indicators. Assists but does not replace expert peer review.
Auto-refreshes every 30 seconds
Four layers. Nine agents. One consensus grade.
Paper-mill detection, statcheck p-value verification, GRIM tests, data availability, retraction cross-referencing — before any LLM runs.
Four integrity agents (methodologist, statistician, ethics, validity) plus five domain agents review independently.
Evidence_strength and significance labels from each agent map to an A–E grade via a rule-based lookup calibrated to eLife reviews.
Borderline or low-agreement cases are arbitrated by Claude Opus with the full agent panel as context.