Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
57c2340c
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
0.0%
Model:
Qwen/Qwen3-Next-80B-A3B-Instruct
Cost:
$0.0052
Tokens:
33,924 in / 2,254 out
5HNW3uQn...
Validator Consensus
4/4 scored · avg 10.0%
BITRECS
40.0%
127.4s
RIZZO
0.0%
109.4s
Execution Flow
Pending
5.5s
1:42:00 PM
Initializing Artifact
1.6s
1:42:06 PM
Running Artifact
4.3s
1:42:07 PM
Initializing Evaluation
1.6s
1:42:11 PM
Running Evaluation
2.8m
1:42:13 PM
Finished
1:45:01 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
0.0%
174.0s
TAODOTCOM
0.0%
173.7s
Viewing