Bitrecs Dashboard
41de1802
amazon_home_and_kitchen_100
finished
Set #2
Group: validator
Score: 0.0%
Model: Qwen/Qwen3-Next-80B-A3B-Instruct
Cost: $0.0052
Tokens: 33,833 in / 2,277 out
5HNW3uQn...
Validator Consensus
4/4 scored · avg 10.0%
BITRECS · 40.0% · 127.4s
RIZZO · 0.0% · 109.4s (viewing)
Execution Flow
Pending · 3.7s · 1:53:05 PM
Initializing Artifact · 2.3s · 1:53:08 PM
Running Artifact · 3.3s · 1:53:11 PM
Initializing Evaluation · 1.3s · 1:53:14 PM
Running Evaluation · 1.7m · 1:53:15 PM
Finished · 1:55:00 PM
Run Artifacts
Test Case · Status
amazon_home_and_kitchen_100 · fail
ROUNDTABLE21 · 0.0% · 174.0s
TAODOTCOM · 0.0% · 173.7s
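
The "avg 10.0%" figure under Validator Consensus is consistent with a plain unweighted mean of the four validator scores listed on this page (BITRECS, RIZZO, ROUNDTABLE21, TAODOTCOM). The sketch below only checks that arithmetic. The function name `consensus_average` is hypothetical, and the assumption that the dashboard uses a simple mean is not confirmed by the page itself.

```python
from statistics import mean

# Per-validator scores (percent) as listed on this page.
scores = {
    "BITRECS": 40.0,
    "RIZZO": 0.0,
    "ROUNDTABLE21": 0.0,
    "TAODOTCOM": 0.0,
}


def consensus_average(validator_scores):
    """Unweighted mean of validator scores.

    Assumption: the dashboard aggregates with a simple mean; the page
    does not state its actual method.
    """
    return mean(validator_scores.values())


if __name__ == "__main__":
    n = len(scores)
    print(f"{n}/{n} scored · avg {consensus_average(scores):.1f}%")
```

Under that assumption, (40.0 + 0.0 + 0.0 + 0.0) / 4 gives the 10.0% shown in the consensus line.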