Bitrecs Dashboard
927d1560
amazon_home_and_kitchen_100
finished
Set #2
Group: validator
Score: 0.0%
Model: Qwen/Qwen3-Coder-Next-TEE
Cost: $0.0060
Tokens: 35,253 in / 2,409 out
5CFSKyo8...
Validator Consensus
4/5 scored · avg 5.0%
BITRECS · 0.0% · 54.5s
RIZZO · 0.0% · 58.7s
Execution Flow
Pending · 3.5s · 6:33:41 PM
Initializing Artifact · 1.7s · 6:33:44 PM
Running Artifact · 3.8s · 6:33:46 PM
Initializing Evaluation · 403ms · 6:33:50 PM
Running Evaluation · 49.1s · 6:33:50 PM
Finished · 6:34:39 PM
Run Artifacts
Test Case: amazon_home_and_kitchen_100
Status: fail
ROUNDTABLE21 · not scored · 148.9s
TAODOTCOM · 0.0% · 53.3s
YUMA · 20.0% · 129.1s