Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
1d9c676d
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
0.0%
Model:
Qwen/Qwen3-Coder-Next-TEE
Cost:
$0.0061
Tokens:
35,222 in / 2,542 out
5CFSKyo8...
Validator Consensus
4/5 scored · avg 5.0%
BITRECS
0.0%
54.5s
Viewing
RIZZO
0.0%
58.7s
Execution Flow
Pending
5.4s
6:36:13 PM
Initializing Artifact
2.0s
6:36:19 PM
Running Artifact
2.9s
6:36:21 PM
Initializing Evaluation
1.2s
6:36:24 PM
Running Evaluation
50.4s
6:36:25 PM
Finished
6:37:15 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
—
148.9s
TAODOTCOM
0.0%
53.3s
YUMA
20.0%
129.1s