Archives AI News

Input-Time Scaling

arXiv:2508.13654v4 Announce Type: replace Abstract: Current Large Language Models (LLMs) are usually post-trained on large-scale carefully curated datasets (data & training scaling) and doing reasoning in test time (inference time scaling). In this work, we present a new scaling paradigm,…

September 15, 2025

HGEN: Heterogeneous Graph Ensemble Networks

arXiv:2509.09843v1 Announce Type: new Abstract: This paper presents HGEN that pioneers ensemble learning for heterogeneous graphs. We argue that the heterogeneity in node types, nodal features, and local neighborhood topology poses significant challenges for ensemble learning, particularly in accommodating diverse…

September 15, 2025

Your Image is Secretly the Last Frame of a Pseudo Video

arXiv:2410.20158v3 Announce Type: replace-cross Abstract: Diffusion models, which can be viewed as a special case of hierarchical variational autoencoders (HVAEs), have shown profound success in generating photo-realistic images. In contrast, standard HVAEs often produce images of inferior quality compared to…

September 15, 2025

Latency and Token-Aware Test-Time Compute

arXiv:2509.09864v1 Announce Type: new Abstract: Inference-time scaling has emerged as a powerful way to improve large language model (LLM) performance by generating multiple candidate responses and selecting among them. However, existing work on dynamic allocation for test-time compute typically considers…

September 15, 2025

MEMOIR: Lifelong Model Editing with Minimal Overwrite and Informed Retention for LLMs

arXiv:2506.07899v3 Announce Type: replace-cross Abstract: Language models deployed in real-world systems often require post-hoc updates to incorporate new or corrected knowledge. However, editing such models efficiently and reliably-without retraining or forgetting previous information-remains a major challenge. Existing methods for lifelong…

September 15, 2025

Variational Neural Networks for Observable Thermodynamics (V-NOTS)

arXiv:2509.09899v1 Announce Type: new Abstract: Much attention has recently been devoted to data-based computing of evolution of physical systems. In such approaches, information about data points from past trajectories in phase space is used to reconstruct the equations of motion…

September 15, 2025

Targeted Test Selection Approach in Continuous Integration

arXiv:2509.10279v1 Announce Type: cross Abstract: In modern software development change-based testing plays a crucial role. However, as codebases expand and test suites grow, efficiently managing the testing process becomes increasingly challenging, especially given the high frequency of daily code commits.…

September 15, 2025

LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios

arXiv:2509.09926v1 Announce Type: new Abstract: Long-tailed learning has garnered increasing attention due to its wide applicability in real-world scenarios. Among existing approaches, Long-Tailed Semi-Supervised Learning (LTSSL) has emerged as an effective solution by incorporating a large amount of unlabeled data…

September 15, 2025

Matrix-free Neural Preconditioner for the Dirac Operator in Lattice Gauge Theory

arXiv:2509.10378v1 Announce Type: cross Abstract: Linear systems arise in generating samples and in calculating observables in lattice quantum chromodynamics~(QCD). Solving the Hermitian positive definite systems, which are sparse but ill-conditioned, involves using iterative methods, such as Conjugate Gradient (CG), which…

September 15, 2025

Multi-Play Combinatorial Semi-Bandit Problem

arXiv:2509.09933v1 Announce Type: new Abstract: In the combinatorial semi-bandit (CSB) problem, a player selects an action from a combinatorial action set and observes feedback from the base arms included in the action. While CSB is widely applicable to combinatorial optimization…

September 15, 2025