Paleo Bench
HTR Model Leaderboard
Greek minuscule handwritten text recognition benchmarks across 17 model configurations. Models are ranked by transcription quality (1 − CER), with per-sample cost and latency reported alongside.
February 25, 2026 at 11:05 PM
Models: 17
Samples: 7
Total Cost: $5.00
Model Ranking
1. Gemini 3 Pro (high)
2. Gemini 3 Flash (low)
3. Gemini 3 Pro (low)
4. Gemini 3 Flash (high)
5. Gemini 3.1 Pro (high)
6. Gemini 3.1 Pro (low)
7. Opus 4.6 (thinking)
8. Opus 4.6
9. Sonnet 4.6
10. Sonnet 4.6 (thinking)
11. GPT-5.2 (low)
12. GPT-5.2 (high)
13. GPT-4.1
14. GPT-5 Mini (low)
15. Haiku 4.5
16. Haiku 4.5 (thinking)
17. GPT-5 Mini (high)
Sorted by quality (1 − CER), descending.
Top Performer
1. Gemini 3 Pro (high)
- Quality: 94.8%
- CER mean: 5.2%
- WER mean: 17.0%
- Cost / sample: $0.17
- Latency / sample: 2m 15s
Metric glossary
- CER — Character Error Rate: character-level edit distance (substitutions, deletions, insertions) between the model output and the ground truth, divided by the number of ground-truth characters. Lower is better.
- WER — Word Error Rate: the same edit distance computed over words instead of characters, divided by the number of ground-truth words. Lower is better.
- Quality — 1 − CER: the fraction of characters transcribed correctly. Higher is better.
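All three metrics derive from Levenshtein edit distance. A minimal Python sketch of how CER, WER, and quality can be computed (illustrative only; this is not the benchmark's actual scoring code):

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences, using a single rolling row."""
    m, n = len(ref), len(hyp)
    dp = list(range(n + 1))  # distance from empty ref prefix to each hyp prefix
    for i in range(1, m + 1):
        prev, dp[0] = dp[0], i  # prev holds dp[i-1][j-1]
        for j in range(1, n + 1):
            cur = dp[j]  # dp[i-1][j], needed for the next diagonal
            dp[j] = min(
                dp[j] + 1,      # deletion from ref
                dp[j - 1] + 1,  # insertion into ref
                prev + (ref[i - 1] != hyp[j - 1]),  # substitution (or match)
            )
            prev = cur
    return dp[n]

def cer(reference, hypothesis):
    """Character Error Rate: character edits / ground-truth length."""
    return edit_distance(reference, hypothesis) / max(len(reference), 1)

def wer(reference, hypothesis):
    """Word Error Rate: word-level edits / ground-truth word count."""
    ref_words, hyp_words = reference.split(), hypothesis.split()
    return edit_distance(ref_words, hyp_words) / max(len(ref_words), 1)

def quality(reference, hypothesis):
    """Quality as reported in the leaderboard: 1 - CER."""
    return 1.0 - cer(reference, hypothesis)
```

For example, a hypothesis with one wrong character out of four yields CER = 0.25 and quality = 0.75; one wrong word out of three yields WER ≈ 0.33.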