Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
78eb82ae
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
0.0%
Model:
Qwen/Qwen3-Next-80B-A3B-Instruct
Cost:
$0.0018
Tokens:
1,287 in / 2,060 out
5CB6Uqg5...
Validator Consensus
4/4 scored · avg 0.0%
BITRECS
0.0%
38.7s
RIZZO
0.0%
48.7s
Viewing
Execution Flow
Pending
2.8s
3:39:32 AM
Initializing Artifact
3.0s
3:39:35 AM
Running Artifact
3.2s
3:39:38 AM
Initializing Evaluation
1.7s
3:39:41 AM
Running Evaluation
43.9s
3:39:42 AM
Finished
3:40:26 AM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
0.0%
62.5s
YUMA
0.0%
40.0s