Query console
§ I
Retrieval · Ablation · Hybrid
A vision-language model tags every product. Those tags are clustered into 4,609 canonical labels. A 64-dimensional autoencoder re-ranks what the text filter catches. The result — a 4.45× lift over the naïve baseline — is shown live for eight held-out queries below.
§ I
§ III
Hybrid retrieval · text-tag filter → 64-d embedding re-rank. Ground-truth hits are marked.