Closing the distance to SOTA.
Subnet 122 vs. the published recommendation literature on NDCG@10 and Recall@10.
Eval Set 2 · NDCG@10
COMING · NEXT EVAL SETTracking ndcg_at10_bm25_appliances_1000_medium — BM25-prefiltered re-ranking over a 1000-item candidate set, not full-universe retrieval. Published SOTA clusters in ~0.03–0.07 on full-universe Amazon Beauty (MT4SR+SDA, BSARec, eSASRec). Ceiling shown at 0.156 reflects the documented 3× protocol gap (Petrov & Macdonald replicability study) — direct numerical comparisons across protocols are not apples-to-apples.
Eval Set 2 · Recall@10
COMING · NEXT EVAL SETTracking recall_at10_bm25_electronics_500_medium (a.k.a. HR@10) — BM25-prefiltered re-ranking over a 500-item candidate set, not full-universe retrieval. Published SOTA clusters in ~0.05–0.09 on full-universe protocols. Ceiling shown at 0.20 accounts for the well-documented protocol variance — recall scales with candidate-set size, so prefiltered scores are naturally higher.
