Archives AI News

AMiD: Knowledge Distillation for LLMs with $alpha$-mixture Assistant Distribution

arXiv:2510.15982v1 Announce Type: new Abstract: Autoregressive large language models (LLMs) have achieved remarkable improvement across many tasks but incur high computational and memory costs. Knowledge distillation (KD) mitigates this issue by transferring knowledge from a large teacher to a smaller…

October 21, 2025

Convergence of Regret Matching in Potential Games and Constrained Optimization

arXiv:2510.17067v1 Announce Type: cross Abstract: Regret matching (RM} — and its modern variants — is a foundational online algorithm that has been at the heart of many AI breakthrough results in solving benchmark zero-sum games, such as poker. Yet, surprisingly…

October 21, 2025

MEET-Sepsis: Multi-Endogenous-View Enhanced Time-Series Representation Learning for Early Sepsis Prediction Representation Learning for Early Sepsis Prediction

arXiv:2510.15985v1 Announce Type: new Abstract: Sepsis is a life-threatening infectious syndrome associated with high mortality in intensive care units (ICUs). Early and accurate sepsis prediction (SP) is critical for timely intervention, yet remains challenging due to subtle early manifestations and…

October 21, 2025

VERINA: Benchmarking Verifiable Code Generation

arXiv:2505.23135v2 Announce Type: replace Abstract: Large language models (LLMs) are increasingly integrated in software development, but ensuring correctness in LLM-generated code remains challenging and often requires costly manual review. Verifiable code generation — jointly generating code, specifications, and proofs of…

October 21, 2025

MatPROV: A Provenance Graph Dataset of Material Synthesis Extracted from Scientific Literature

arXiv:2509.01042v2 Announce Type: replace Abstract: Synthesis procedures play a critical role in materials research, as they directly affect material properties. With data-driven approaches increasingly accelerating materials discovery, there is growing interest in extracting synthesis procedures from scientific literature as structured…

October 21, 2025

UniCrossFi: A Unified Framework For Cross-Domain Wi-Fi-based Gesture Recognition

arXiv:2310.06328v4 Announce Type: replace Abstract: Wi-Fi sensing systems are severely hindered by cross domain problem when deployed in unseen real-world environments. Existing methods typically design separate frameworks for either domain adaptation or domain generalization, often relying on extensive labeled data.…

October 21, 2025

Bayesian Computation in Deep Learning

arXiv:2502.18300v4 Announce Type: replace Abstract: Bayesian methods have shown success in deep learning applications. For example, in predictive tasks, Bayesian neural networks leverage Bayesian reasoning of model uncertainty to improve the reliability and uncertainty awareness of deep neural networks. In…

October 21, 2025

One-step Diffusion Models with Bregman Density Ratio Matching

arXiv:2510.16983v1 Announce Type: cross Abstract: Diffusion and flow models achieve high generative quality but remain computationally expensive due to slow multi-step sampling. Distillation methods accelerate them by training fast student generators, yet most existing objectives lack a unified theoretical foundation.…

October 21, 2025

AWARE: Audio Watermarking with Adversarial Resistance to Edits

arXiv:2510.17512v1 Announce Type: cross Abstract: Prevailing practice in learning-based audio watermarking is to pursue robustness by expanding the set of simulated distortions during training. However, such surrogates are narrow and prone to overfitting. This paper presents AWARE (Audio Watermarking with…

October 21, 2025

How Good Are LLMs at Processing Tool Outputs?

arXiv:2510.15955v1 Announce Type: new Abstract: Most realistic task automation problems require large language models (LLMs) to call tools, which often return complex JSON responses. These responses must be further processed to derive the information necessary for task completion. The ability…

October 21, 2025