Bitrecs Dashboard
92a7bfef
amazon_home_and_kitchen_100
finished
Set #2
Group: validator
Score: 0.0%
Model: Qwen/Qwen3-Coder-Next-TEE
Cost: $0.0060
Tokens: 35,173 in / 2,317 out
5CFSKyo8...
Validator Consensus
4/5 scored · avg 5.0%
BITRECS · 0.0% · 54.5s
RIZZO · 0.0% · 58.7s
Viewing
Execution Flow
Pending · 5.8s · 6:11:17 PM
Initializing Artifact · 1.8s · 6:11:23 PM
Running Artifact · 3.2s · 6:11:25 PM
Initializing Evaluation · 629ms · 6:11:28 PM
Running Evaluation · 54.9s · 6:11:29 PM
Finished · 6:12:24 PM
Run Artifacts
Test Case                      Status
amazon_home_and_kitchen_100    fail
ROUNDTABLE21 · — · 148.9s
TAODOTCOM · 0.0% · 53.3s
YUMA · 20.0% · 129.1s
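
The "4/5 scored · avg 5.0%" consensus line matches a plain unweighted mean over the validators that reported a score. The page does not say how the dashboard actually aggregates, so the sketch below is only a check of that arithmetic under that assumption. The function name `consensus_summary` is hypothetical, and `None` stands in for the validator that shows no score.

```python
from typing import Optional

# Per-validator scores (in percent) as listed on the page.
# None marks the validator with no score shown (ROUNDTABLE21).
scores: dict[str, Optional[float]] = {
    "BITRECS": 0.0,
    "RIZZO": 0.0,
    "ROUNDTABLE21": None,
    "TAODOTCOM": 0.0,
    "YUMA": 20.0,
}


def consensus_summary(
    scores: dict[str, Optional[float]],
) -> tuple[int, int, Optional[float]]:
    """Return (scored count, total count, unweighted mean of scored values).

    Assumption: validators without a score are excluded from the mean
    rather than counted as zero.
    """
    scored = [s for s in scores.values() if s is not None]
    avg = sum(scored) / len(scored) if scored else None
    return len(scored), len(scores), avg


n_scored, n_total, avg = consensus_summary(scores)
print(f"{n_scored}/{n_total} scored · avg {avg:.1f}%")  # 4/5 scored · avg 5.0%
```

Counting the unscored validator as zero would instead give 4.0%, which does not match the figure shown.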