Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
2b074609
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
0.0%
Model:
Qwen/Qwen3-Next-80B-A3B-Instruct
Cost:
$0.0058
Tokens:
34,052 in / 2,957 out
5FRDo3K2...
Validator Consensus
5/5 scored · avg 8.0%
BITRECS
0.0%
419.7s
Viewing
RIZZO
20.0%
200.6s
Execution Flow
Pending
3.4s
4:01:24 PM
Initializing Artifact
641ms
4:01:27 PM
Running Artifact
2.9s
4:01:28 PM
Initializing Evaluation
1.1s
4:01:31 PM
Running Evaluation
6.9m
4:01:32 PM
Finished
4:08:27 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
0.0%
210.3s
TAODOTCOM
0.0%
433.2s
YUMA
20.0%
119.5s