Archives AI News

MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning

arXiv:2407.20999v4 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across a wide range of tasks. Typically, LLMs are first pre-trained on large corpora and subsequently fine-tuned on task-specific datasets. However, during fine-tuning, LLMs may forget some…

October 21, 2025

Stratos: An End-to-End Distillation Pipeline for Customized LLMs under Distributed Cloud Environments

arXiv:2510.15992v1 Announce Type: new Abstract: The growing industrial demand for customized and cost-efficient large language models (LLMs) is fueled by the rise of vertical, domain-specific tasks and the need to optimize performance under constraints such as latency and budget. Knowledge…

October 21, 2025

VERINA: Benchmarking Verifiable Code Generation

arXiv:2505.23135v2 Announce Type: replace Abstract: Large language models (LLMs) are increasingly integrated in software development, but ensuring correctness in LLM-generated code remains challenging and often requires costly manual review. Verifiable code generation — jointly generating code, specifications, and proofs of…

October 21, 2025

MatPROV: A Provenance Graph Dataset of Material Synthesis Extracted from Scientific Literature

arXiv:2509.01042v2 Announce Type: replace Abstract: Synthesis procedures play a critical role in materials research, as they directly affect material properties. With data-driven approaches increasingly accelerating materials discovery, there is growing interest in extracting synthesis procedures from scientific literature as structured…

October 21, 2025

UniCrossFi: A Unified Framework For Cross-Domain Wi-Fi-based Gesture Recognition

arXiv:2310.06328v4 Announce Type: replace Abstract: Wi-Fi sensing systems are severely hindered by cross domain problem when deployed in unseen real-world environments. Existing methods typically design separate frameworks for either domain adaptation or domain generalization, often relying on extensive labeled data.…

October 21, 2025

Bayesian Computation in Deep Learning

arXiv:2502.18300v4 Announce Type: replace Abstract: Bayesian methods have shown success in deep learning applications. For example, in predictive tasks, Bayesian neural networks leverage Bayesian reasoning of model uncertainty to improve the reliability and uncertainty awareness of deep neural networks. In…

October 21, 2025

One-step Diffusion Models with Bregman Density Ratio Matching

arXiv:2510.16983v1 Announce Type: cross Abstract: Diffusion and flow models achieve high generative quality but remain computationally expensive due to slow multi-step sampling. Distillation methods accelerate them by training fast student generators, yet most existing objectives lack a unified theoretical foundation.…

October 21, 2025

AWARE: Audio Watermarking with Adversarial Resistance to Edits

arXiv:2510.17512v1 Announce Type: cross Abstract: Prevailing practice in learning-based audio watermarking is to pursue robustness by expanding the set of simulated distortions during training. However, such surrogates are narrow and prone to overfitting. This paper presents AWARE (Audio Watermarking with…

October 21, 2025

How Good Are LLMs at Processing Tool Outputs?

arXiv:2510.15955v1 Announce Type: new Abstract: Most realistic task automation problems require large language models (LLMs) to call tools, which often return complex JSON responses. These responses must be further processed to derive the information necessary for task completion. The ability…

October 21, 2025

A Principled Path to Fitted Distributional Evaluation

arXiv:2506.20048v2 Announce Type: replace-cross Abstract: In reinforcement learning, distributional off-policy evaluation (OPE) focuses on estimating the return distribution of a target policy using offline data collected under a different policy. This work focuses on extending the widely used fitted Q-evaluation…

October 21, 2025