Bitrecs Dashboard
429f485e
amazon_home_and_kitchen_100
finished
Set #2
Group: validator
Score: 0.0%
Model: Qwen/Qwen3-Next-80B-A3B-Instruct
Cost: $0.0056
Tokens: 34,059 in / 2,788 out
5FRDo3K2...
Validator Consensus
5/5 scored · avg 8.0%
BITRECS · 0.0% · 419.7s
RIZZO · 20.0% · 200.6s
Execution Flow
Pending · 5.5s · 4:18:25 PM
Initializing Artifact · 602ms · 4:18:31 PM
Running Artifact · 4.1s · 4:18:31 PM
Initializing Evaluation · 1.2s · 4:18:35 PM
Running Evaluation · 7.1m · 4:18:37 PM
Finished · 4:25:45 PM
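As a rough sanity check on the timings above, the sketch below compares the wall-clock span between the first and last timestamps with the sum of the listed stage durations. It assumes each timestamp marks the start of its stage and that all stages fall on the same day; neither is stated on the page. The "7.1m" figure is rounded, so the two totals should agree only approximately.

```python
from datetime import datetime

FMT = "%I:%M:%S %p"  # 12-hour clock, as shown on the page

# First and last timestamps from the execution flow.
start = datetime.strptime("4:18:25 PM", FMT)
end = datetime.strptime("4:25:45 PM", FMT)
elapsed = (end - start).total_seconds()

# Stage durations in seconds, converted from the displayed values
# (602ms -> 0.602s, 7.1m -> 426s). The displayed values are rounded.
stage_seconds = {
    "Pending": 5.5,
    "Initializing Artifact": 0.602,
    "Running Artifact": 4.1,
    "Initializing Evaluation": 1.2,
    "Running Evaluation": 7.1 * 60,
}
total = sum(stage_seconds.values())

print(f"wall clock: {elapsed:.0f}s, sum of stages: {total:.1f}s")
```

The few seconds of difference are within what rounding of the displayed durations would explain.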
Run Artifacts
Test Case: amazon_home_and_kitchen_100
Status: fail
ROUNDTABLE21 · 0.0% · 210.3s
TAODOTCOM · 0.0% · 433.2s
YUMA · 20.0% · 119.5s
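The "avg 8.0%" consensus figure is consistent with a plain arithmetic mean of the five validator scores listed on this page. The sketch below reproduces that arithmetic. Treating the consensus as an unweighted mean is an assumption; the dashboard does not say whether any weighting (for example by stake) is applied, and the two would coincide only by chance.

```python
from statistics import mean

# Scores (in percent) copied from the five validator entries on this page.
validator_scores = {
    "BITRECS": 0.0,
    "RIZZO": 20.0,
    "ROUNDTABLE21": 0.0,
    "TAODOTCOM": 0.0,
    "YUMA": 20.0,
}

# Assumption: consensus is the unweighted mean over all scored validators.
consensus_avg = mean(validator_scores.values())

print(f"{len(validator_scores)}/5 scored · avg {consensus_avg:.1f}%")
```

Two validators at 20.0% and three at 0.0% give 40.0 / 5 = 8.0%, matching the reported value.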