Logo
Bitrecs Dashboard
LeaderboardSOTAEvals
Docs
Back

Validator · Set #2

recall_at10_bm25_electronics_500_medium

Pass Rate

15.4%

Avg Time

593.3s

Total Runs

227

Passed

21%

Failed

115%

Errors

21494%

Methodology

Reference PaperarXiv:1506.08839

Inferring Networks of Substitutable and Complementary Products

Julian McAuley · Rahul Pandey · Jure Leskovec

This evaluation uses the Amazon product graph dataset introduced in the paper. Recommendations are scored against ground-truth substitute and complementary edges using BM25 retrieval and ranked with NDCG@10 and Recall@K metrics.

Read on arXiv

All Runs

98 shown
#Artifact
Scored
Score
Duration
Latest
1
MO
MONROE
v1 · f2752b1f
4/46.3%410.36s5/4/2026
2
ts
tst
v1 · d1e9d5bb
2/55.0%496.33s4/30/2026
3
GA
GARFIELD
v1 · d5f78b42
5/51.0%310.93s5/4/2026
4
GR
GRANT
v1 · ff9c56f6
0/4——5/6/2026
5
re
rec
v1 · d22dd39c
0/4——5/5/2026
6
MA
MAX
v1 · d9476dcf
0/4——5/5/2026
7
BU
BUCHANAN
v1 · 24ddd201
0/4——5/4/2026
8
ca
candidate
v1 · 76825e93
0/4——5/4/2026
9
bi
bitrecs recommender
v1 · 46b1db45
0/4——5/4/2026
10
To
Top
v1 · acbca673
0/4——5/4/2026
11
ca
canary
v1 · f48c8cff
0/4——5/4/2026
12
UL
ULYSSES
v1 · 9ef0c4d8
0/4——5/3/2026
13
pr
preliminary
v1 · 74ca94c1
0/4——5/3/2026
14
ge
gerhig
v1 · ccf00759
0/4——5/3/2026
15
PA
PATTON
v1 · 4c8e70f5
0/5——5/3/2026
16
sk
sketch
v1 · 62789429
0/4——5/3/2026
17
Ra
Ranker
v1 · 69a0f7eb
0/5——5/2/2026
18
dr
draft
v1 · 6da53bdc
0/4——5/1/2026
19
ge
gemma catalog ranker
v1 · afbc15a1
0/5——5/1/2026
20
re
reason
v1 · a3bd38aa
0/4——5/1/2026
21
Tu
Turbo Recs
v1 · d9773028
0/5——5/1/2026
22
ru
rules
v1 · 90bb3fb7
0/4——5/1/2026
23
ma
max
v1 · bb9cec44
0/4——4/30/2026

Test Cases (1)

Test NameCategoryRunsPassFailPass Rate
recall_at10_bm25_electronics_500_mediumdefault13211
15%

Errors

Code 200210(98%)

An internal error occurred on the validator

Code 3004(2%)

The platform was restarted while the evaluation run was pending