Archives AI News

FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels

arXiv:2511.02872v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have demonstrated impressive capabilities in formal theorem proving, particularly on contest-based mathematical benchmarks like the IMO. However, these contests do not reflect the depth, breadth, and abstraction of…

Exact Expressive Power of Transformers with Padding

arXiv:2505.18948v2 Announce Type: replace Abstract: Chain of thought is a natural inference-time method for increasing the computational power of transformer-based large language models (LLMs), but comes at the cost of sequential decoding. Are there more efficient alternatives to expand a…

Optimizing Reasoning Efficiency through Prompt Difficulty Prediction

arXiv:2511.03808v1 Announce Type: new Abstract: Reasoning language models perform well on complex tasks but are costly to deploy due to their size and long reasoning traces. We propose a routing approach that assigns each problem to the smallest model likely…

One Size Does Not Fit All: Architecture-Aware Adaptive Batch Scheduling with DEBA

arXiv:2511.03809v1 Announce Type: new Abstract: Adaptive batch size methods aim to accelerate neural network training, but existing approaches apply identical adaptation strategies across all architectures, assuming a one-size-fits-all solution. We introduce DEBA (Dynamic Efficient Batch Adaptation), an adaptive batch scheduler…

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

arXiv:2511.03774v1 Announce Type: new Abstract: Recent advances in Vision-Language Models (VLMs) have achieved state-of-the-art performance on numerous benchmark tasks. However, the use of internet-scale, often proprietary, pretraining corpora raises a critical concern for both practitioners and users: inflated performance due…

Laugh, Relate, Engage: Stylized Comment Generation for Short Videos

arXiv:2511.03757v1 Announce Type: new Abstract: Short-video platforms have become a central medium in the modern Internet landscape, where efficient information delivery and strong interactivity are reshaping user engagement and cultural dissemination. Among the various forms of user interaction, comments play…