Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
cada709b
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
0.0%
Model:
google/gemma-4-31B-turbo-TEE
Cost:
$0.0073
Tokens:
43,331 in / 4,259 out
5FyVsGsE...
Validator Consensus
4/4 scored · avg 10.0%
BITRECS
0.0%
309.7s
Viewing
RIZZO
20.0%
259.3s
Execution Flow
Pending
5.4s
7:15:01 PM
Initializing Artifact
1.7s
7:15:07 PM
Running Artifact
2.9s
7:15:08 PM
Initializing Evaluation
1.5s
7:15:11 PM
Running Evaluation
5.1m
7:15:13 PM
Finished
7:20:18 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
0.0%
269.3s
YUMA
20.0%
245.4s