
A hierarchy tree data structure for behavior-based user segment representation

arXiv:2508.01115v2 Announce Type: replace Abstract: User attributes are essential in multiple stages of modern recommendation systems and are particularly important for mitigating the cold-start problem and improving the experience of new or infrequent users. We propose Behavior-based User Segmentation (BUS),…

ContextPilot: Fast Long-Context Inference via Context Reuse

arXiv:2511.03475v3 Announce Type: replace Abstract: AI applications increasingly depend on long-context inference, where LLMs consume substantial context to support stronger reasoning. Common examples include retrieval-augmented generation, agent memory layers, and multi-agent orchestration. As input contexts get longer, prefill latency becomes…

Scaling State-Space Models on Multiple GPUs with Tensor Parallelism

arXiv:2602.21144v1 Announce Type: cross Abstract: Selective state space models (SSMs) have rapidly become a compelling backbone for large language models, especially for long-context workloads. Yet in deployment, their inference performance is often bounded by the memory capacity, bandwidth, and latency…

Armijo Line-search Can Make (Stochastic) Gradient Descent Provably Faster

arXiv:2503.00229v4 Announce Type: replace Abstract: Armijo line-search (Armijo-LS) is a standard method to set the step-size for gradient descent (GD). For smooth functions, Armijo-LS alleviates the need to know the global smoothness constant L and adapts to the “local” smoothness,…

Multimodal Crystal Flow: Any-to-Any Modality Generation for Unified Crystal Modeling

arXiv:2602.20210v1 Announce Type: new Abstract: Crystal modeling spans a family of conditional and unconditional generation tasks across different modalities, including crystal structure prediction (CSP) and de novo generation (DNG). While recent deep generative models have shown promising performance, they remain…