Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
46396bdc
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
0.0%
Model:
Qwen/Qwen3-Next-80B-A3B-Instruct
Cost:
$0.0056
Tokens:
34,010 in / 2,702 out
5FRDo3K2...
Validator Consensus
5/5 scored · avg 8.0%
BITRECS
0.0%
419.7s
RIZZO
20.0%
200.6s
Execution Flow
Pending
4.6s
3:47:01 PM
Initializing Artifact
470ms
3:47:06 PM
Running Artifact
4.0s
3:47:06 PM
Initializing Evaluation
1.5s
3:47:10 PM
Running Evaluation
3.4m
3:47:12 PM
Finished
3:50:37 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
0.0%
210.3s
Viewing
TAODOTCOM
0.0%
433.2s
YUMA
20.0%
119.5s