Probabilistic Hash Embeddings for Online Learning of Categorical Features
arXiv:2511.20893v1
Abstract: We study streaming data with categorical features where the vocabulary of categorical feature values is changing and can even grow unboundedly over time. Feature hashing is commonly used as a pre-processing step to map these…
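
For readers unfamiliar with the feature hashing mentioned in the abstract, the following is a minimal sketch of the generic hashing trick, not the paper's probabilistic method. It assumes string-valued categories and a fixed table size; the names `hash_bucket`, `NUM_BUCKETS`, and `stream` are hypothetical and chosen only for illustration.

```python
import hashlib


def hash_bucket(value: str, num_buckets: int) -> int:
    """Map a categorical value to a bucket index in [0, num_buckets)."""
    # Use a stable hash (not Python's built-in hash(), which is salted per
    # process) so the same value always lands in the same bucket.
    digest = hashlib.md5(value.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


# Hypothetical fixed table size: it does not grow with the vocabulary.
NUM_BUCKETS = 16

# Hypothetical stream of categorical values; "teal" stands in for a value
# that was never seen before and still receives a valid index.
stream = ["red", "green", "blue", "red", "teal"]
indices = [hash_bucket(v, NUM_BUCKETS) for v in stream]
print(indices)
```

Because the table size is fixed, any new value can be indexed without resizing, at the cost that distinct values may collide in the same bucket.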
