Paleo Bench
HTR Model Leaderboard
Greek minuscule handwritten text recognition benchmarks across 30 model configurations. Models ranked by transcription accuracy, cost, and latency.
May 21, 2026 at 11:34 PM Open comparison viewer →
Models
30
Samples
7
Total Cost
$7.708
Model Ranking
Click a row to inspect
Bar = Quality
Sorted by quality (1 − CER) descending
Top Performer
1
Gemini 3 Pro (high)
Quality: 94.8%
94.8%
$0.172
2m 15s
CER Mean
5.2%
WER Mean
17.0%
Cost / sample
$0.172
Latency / sample
2m 15s
Metric glossary
- CER
- Character Error Rate — fraction of characters the model got wrong vs. the ground truth. Lower is better.
- WER
- Word Error Rate — fraction of words with at least one error. Lower is better.
- Quality
- 1 minus CER — how accurate the transcription is overall. Higher is better.