Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
3999e44e
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
20.0%
Model:
Qwen/Qwen3-Next-80B-A3B-Instruct
Cost:
$0.0056
Tokens:
34,057 in / 2,727 out
5FRDo3K2...
Validator Consensus
5/5 scored · avg 8.0%
BITRECS
0.0%
419.7s
RIZZO
20.0%
200.6s
Execution Flow
Pending
5.7s
3:51:18 PM
Initializing Artifact
385ms
3:51:23 PM
Running Artifact
4.2s
3:51:24 PM
Initializing Evaluation
401ms
3:51:28 PM
Running Evaluation
1.9m
3:51:28 PM
Finished
3:53:23 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
0.0%
210.3s
TAODOTCOM
0.0%
433.2s
YUMA
20.0%
119.5s
Viewing