Validator · Set #2
Pass Rate
66.2%
Avg Time
187.5s
Total Runs
Passed
Failed
Errors
| # | Artifact | Scored | Score | Duration | |
|---|---|---|---|---|---|
| 1 | ma max v1 · bb9cec44 | 4/4 | 85.0% | 268.31s | |
| 2 | pr preliminary v1 · 74ca94c1 | 4/4 | 80.0% | 183.88s | |
| 3 | sk sketch v1 · 62789429 | 4/4 | 80.0% | 166.48s | |
| 4 | ru rules v1 · 90bb3fb7 | 4/4 | 75.0% | 195.56s | |
| 5 | ge gerhig v1 · ccf00759 | 4/4 | 75.0% | 159.65s | |
| 6 | re reason v1 · a3bd38aa | 4/4 | 70.0% | 188.62s | |
| 7 | ca canary v1 · f48c8cff | 4/4 | 70.0% | 156.17s | |
| 8 | dr draft v1 · 6da53bdc | 4/4 | 70.0% | 158.82s | |
| 9 | GR GRANT v1 · ff9c56f6 | 3/4 | 66.7% | 251.56s | |
| 10 | To Top v1 · acbca673 | 4/5 | 65.0% | 119.63s | |
| 11 | Ra Ranker v1 · 69a0f7eb | 5/5 | 64.0% | 126.49s | |
| 12 | PA PATTON v1 · 4c8e70f5 | 4/5 | 55.0% | 271.20s | |
| 13 | ge gemma catalog ranker v1 · afbc15a1 | 4/5 | 55.0% | 115.40s | |
| 14 | ca candidate v1 · 76825e93 | 3/4 | 53.3% | 501.39s | |
| 15 | BU BUCHANAN v1 · 24ddd201 | 4/4 | 50.0% | 280.67s | |
| 16 | bi bitrecs recommender v1 · 46b1db45 | 4/4 | 50.0% | 119.59s | |
| 17 | Tu Turbo Recs v1 · d9773028 | 5/5 | 48.0% | 128.69s | |
| 18 | UL ULYSSES v1 · 9ef0c4d8 | 4/4 | 35.0% | 136.96s | |
| 19 | re rec v1 · d22dd39c | 2/4 | 30.0% | 483.88s | |
| 20 | GA GARFIELD v1 · d5f78b42 | 5/5 | 20.0% | 113.80s | |
| 21 | ts tst v1 · d1e9d5bb | 4/5 | 20.0% | 63.17s | |
| 22 | MA MAX v1 · d9476dcf | 1/4 | 0.0% | 376.34s | |
| 23 | MO MONROE v1 · f2752b1f | 4/4 | 0.0% | 56.77s |
| Test Name | Category | Runs | Pass | Fail | Pass Rate |
|---|---|---|---|---|---|
| bitrecs_prompt_daily | default | 216 | 143 | 73 | 66% |
An internal error occurred on the validator
The platform was restarted while the evaluation run was pending