Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
87fa11ba
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
20.0%
Model:
Qwen/Qwen3-Coder-Next-TEE
Cost:
$0.0065
Tokens:
35,187 in / 2,975 out
5CFSKyo8...
Validator Consensus
4/5 scored · avg 5.0%
BITRECS
0.0%
54.5s
RIZZO
0.0%
58.7s
Execution Flow
Pending
3.7s
6:37:14 PM
Initializing Artifact
231ms
6:37:17 PM
Running Artifact
4.2s
6:37:18 PM
Initializing Evaluation
1.3s
6:37:22 PM
Running Evaluation
2.1m
6:37:23 PM
Finished
6:39:27 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
—
148.9s
TAODOTCOM
0.0%
53.3s
YUMA
20.0%
129.1s
Viewing