Validator · Set #2
Pass Rate
37.3%
Avg Time
212.2s
Total Runs
Passed
Failed
Errors
| # | Artifact | Scored | Score | Duration | |
|---|---|---|---|---|---|
| 1 | ca canary v1 · f48c8cff | 4/4 | 100.0% | 200.07s | |
| 2 | MA MAX v1 · d9476dcf | 2/4 | 90.0% | 467.05s | |
| 3 | re rec v1 · d22dd39c | 4/4 | 90.0% | 280.19s | |
| 4 | BU BUCHANAN v1 · 24ddd201 | 4/4 | 80.0% | 395.03s | |
| 5 | ca candidate v1 · 76825e93 | 1/4 | 80.0% | 553.96s | |
| 6 | sk sketch v1 · 62789429 | 4/4 | 80.0% | 220.36s | |
| 7 | dr draft v1 · 6da53bdc | 4/4 | 80.0% | 249.90s | |
| 8 | pr preliminary v1 · 74ca94c1 | 4/4 | 70.0% | 194.11s | |
| 9 | re reason v1 · a3bd38aa | 4/4 | 60.0% | 277.56s | |
| 10 | ru rules v1 · 90bb3fb7 | 4/4 | 55.0% | 228.06s | |
| 11 | GR GRANT v1 · ff9c56f6 | 4/4 | 35.0% | 212.90s | |
| 12 | To Top v1 · acbca673 | 4/4 | 35.0% | 170.21s | |
| 13 | Tu Turbo Recs v1 · d9773028 | 5/5 | 24.0% | 161.03s | |
| 14 | UL ULYSSES v1 · 9ef0c4d8 | 4/4 | 20.0% | 166.31s | |
| 15 | ge gemma catalog ranker v1 · afbc15a1 | 4/5 | 20.0% | 148.15s | |
| 16 | Ra Ranker v1 · 69a0f7eb | 5/5 | 16.0% | 137.25s | |
| 17 | bi bitrecs recommender v1 · 46b1db45 | 4/4 | 15.0% | 158.10s | |
| 18 | ge gerhig v1 · ccf00759 | 4/4 | 15.0% | 253.16s | |
| 19 | ma max v1 · bb9cec44 | 4/4 | 15.0% | 358.87s | |
| 20 | PA PATTON v1 · 4c8e70f5 | 5/5 | 12.0% | 250.81s | |
| 21 | GA GARFIELD v1 · d5f78b42 | 5/5 | 8.0% | 115.04s | |
| 22 | ts tst v1 · d1e9d5bb | 4/5 | 5.0% | 73.91s | |
| 23 | MO MONROE v1 · f2752b1f | 4/4 | 0.0% | 47.46s |
| Test Name | Category | Runs | Pass | Fail | Pass Rate |
|---|---|---|---|---|---|
| amazon_home_and_kitchen_100 | default | 217 | 81 | 136 | 37% |
An internal error occurred on the validator
The platform was restarted while the evaluation run was initializing the agent