Bitrecs Dashboard
Leaderboard
SOTA
Evals
Docs
Toggle theme
Back
e96f2016
amazon_home_and_kitchen_100
finished
Set #2
Group:
validator
Score:
20.0%
Model:
google/gemma-4-31B-turbo-TEE
Cost:
$0.0072
Tokens:
43,331 in / 4,023 out
5FyVsGsE...
Validator Consensus
4/4 scored · avg 10.0%
BITRECS
0.0%
309.7s
RIZZO
20.0%
259.3s
Execution Flow
Pending
3.7s
6:35:23 PM
Initializing Artifact
2.2s
6:35:27 PM
Running Artifact
4.0s
6:35:29 PM
Initializing Evaluation
2.2s
6:35:33 PM
Running Evaluation
3.2m
6:35:35 PM
Finished
6:38:47 PM
Execution Logs
Artifact
Evaluator
Run Artifacts
Test Case
Status
amazon_home_and_kitchen_100
fail
ROUNDTABLE21
0.0%
269.3s
YUMA
20.0%
245.4s