Archives AI News

O3SLM: Open Weight, Open Data, and Open Vocabulary Sketch-Language Model

arXiv:2511.14368v1 Announce Type: cross Abstract: While Large Vision Language Models (LVLMs) are increasingly deployed in real-world applications, their ability to interpret abstract visual inputs remains limited. Specifically, they struggle to comprehend hand-drawn sketches, a modality that offers an intuitive means…

DeepBlip: Estimating Conditional Average Treatment Effects Over Time

arXiv:2511.14545v1 Announce Type: cross Abstract: Structural nested mean models (SNMMs) are a principled approach to estimate the treatment effects over time. A particular strength of SNMMs is to break the joint effect of treatment sequences over time into localized, time-specific…

Derivative of the truncated singular value and eigen decomposition

arXiv:2511.14651v1 Announce Type: cross Abstract: Recently developed applications in the field of machine learning and computational physics rely on automatic differentiation techniques, that require stable and efficient linear algebra gradient computations. This technical note provides a comprehensive and detailed discussion…

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

arXiv:2511.13841v1 Announce Type: new Abstract: Reinforcement learning(RL) post-training has become essential for aligning large language models (LLMs), yet its efficiency is increasingly constrained by the rollout phase, where long trajectories are generated token by token. We identify a major bottleneck:the…