Bitrecs Dashboard
9b41d8b8 · amazon_home_and_kitchen_100 · finished · Set #2
Group: validator
Score: 0.0%
Model: Qwen/Qwen3-Next-80B-A3B-Instruct
Cost: $0.0017
Tokens: 1,285 in / 1,997 out
5CB6Uqg5...
Validator Consensus
4/4 scored · avg 0.0%
BITRECS · 0.0% · 38.7s
RIZZO · 0.0% · 48.7s
Execution Flow
Pending · 6.7s · 3:39:31 AM
Initializing Artifact · 2.0s · 3:39:38 AM
Running Artifact · 3.9s · 3:39:40 AM
Initializing Evaluation · 1.8s · 3:39:44 AM
Running Evaluation · 56.8s · 3:39:45 AM
Finished · 3:40:42 AM
Run Artifacts
Test Case: amazon_home_and_kitchen_100
Status: fail
ROUNDTABLE21 · 0.0% · 62.5s
YUMA · 0.0% · 40.0s