Bitrecs Dashboard
6381808f
amazon_home_and_kitchen_100
finished
Set #2
Group: validator
Score: 20.0%
Model: Qwen/Qwen3-Next-80B-A3B-Instruct
Cost: $0.0056
Tokens: 34,025 in / 2,803 out
5FRDo3K2...
Validator Consensus
5/5 scored · avg 8.0%
BITRECS · 0.0% · 419.7s
RIZZO · 20.0% · 200.6s (viewing)
Execution Flow
Pending · 5.7s · 3:03:41 PM
Initializing Artifact · 177ms · 3:03:46 PM
Running Artifact · 4.0s · 3:03:46 PM
Initializing Evaluation · 1.9s · 3:03:50 PM
Running Evaluation · 3.2m · 3:03:52 PM
Finished · 3:07:07 PM
Run Artifacts
Test Case: amazon_home_and_kitchen_100
Status: fail
ROUNDTABLE21 · 0.0% · 210.3s
TAODOTCOM · 0.0% · 433.2s
YUMA · 20.0% · 119.5s
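
The "avg 8.0%" consensus figure is consistent with an unweighted mean of the five validator scores listed on this page. The sketch below reproduces that arithmetic. The function name `consensus_average` is hypothetical, and the assumption that the dashboard uses a plain, unweighted mean is ours; the page does not say how the average is computed.

```python
from statistics import mean

# Scores (in percent) copied from the validator rows on this page.
validator_scores = {
    "BITRECS": 0.0,
    "RIZZO": 20.0,
    "ROUNDTABLE21": 0.0,
    "TAODOTCOM": 0.0,
    "YUMA": 20.0,
}


def consensus_average(scores: dict[str, float]) -> float:
    """Return the unweighted mean score, rounded to one decimal place.

    Assumption: every validator counts equally. The dashboard may
    weight validators differently; this only checks the arithmetic.
    """
    if not scores:
        raise ValueError("at least one validator score is required")
    return round(mean(scores.values()), 1)


if __name__ == "__main__":
    avg = consensus_average(validator_scores)
    print(f"{len(validator_scores)}/5 scored · avg {avg}%")
```

Two validators at 20.0% and three at 0.0% sum to 40.0, and 40.0 / 5 gives the 8.0% shown under Validator Consensus.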