Archives AI News

Evolution Strategies for Deep RL pretraining

arXiv:2604.00066v1 Announce Type: new Abstract: Although Deep Reinforcement Learning has proven highly effective for complex decision-making problems, it demands significant computational resources and careful parameter adjustment in order to develop successful strategies. Evolution strategies offer a more straightforward, derivative-free approach…

D4C: Data-Free Quantization for Contrastive Language-Image Pre-training Models

arXiv:2511.15411v2 Announce Type: replace-cross Abstract: Data-Free Quantization (DFQ) offers a practical solution for model compression without requiring access to real data, making it particularly attractive in privacy-sensitive scenarios. While DFQ has shown promise for unimodal models, its extension to Vision-Language…

The No-Clash Teaching Dimension is Bounded by VC Dimension

arXiv:2603.23561v3 Announce Type: replace-cross Abstract: In the realm of machine learning theory, to prevent unnatural coding schemes between teacher and learner, No-Clash Teaching Dimension was introduced as provably optimal complexity measure for collusion-free teaching. However, whether No-Clash Teaching Dimension is…

ParetoBandit: Budget-Paced Adaptive Routing for Non-Stationary LLM Serving

arXiv:2604.00136v1 Announce Type: new Abstract: Production LLM serving often relies on multi-model portfolios spanning a ~530x cost range, where routing decisions trade off quality against cost. This trade-off is non-stationary: providers revise pricing, model quality can regress silently, and new…

Diagnosing Neural Convergence with Topological Alignment Spectra

arXiv:2411.08687v2 Announce Type: replace Abstract: Representational similarity in neural networks is inherently scale-dependent, yet widely used metrics such as Centered Kernel Alignment (CKA) and Procrustes analysis provide only global scalar estimates. These scalars often fail to distinguish micro-scale geometric jitter…