Archives AI News

AMiD: Knowledge Distillation for LLMs with $alpha$-mixture Assistant Distribution

arXiv:2510.15982v1 Announce Type: new Abstract: Autoregressive large language models (LLMs) have achieved remarkable improvement across many tasks but incur high computational and memory costs. Knowledge distillation (KD) mitigates this issue by transferring knowledge from a large teacher to a smaller…

October 21, 2025

Convergence of Regret Matching in Potential Games and Constrained Optimization

arXiv:2510.17067v1 Announce Type: cross Abstract: Regret matching (RM} — and its modern variants — is a foundational online algorithm that has been at the heart of many AI breakthrough results in solving benchmark zero-sum games, such as poker. Yet, surprisingly…

October 21, 2025

MEET-Sepsis: Multi-Endogenous-View Enhanced Time-Series Representation Learning for Early Sepsis Prediction Representation Learning for Early Sepsis Prediction

arXiv:2510.15985v1 Announce Type: new Abstract: Sepsis is a life-threatening infectious syndrome associated with high mortality in intensive care units (ICUs). Early and accurate sepsis prediction (SP) is critical for timely intervention, yet remains challenging due to subtle early manifestations and…

October 21, 2025

Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine

arXiv:2510.17402v1 Announce Type: cross Abstract: Traditional Chinese Medicine (TCM) presents a rich and structurally unique knowledge system that challenges conventional applications of large language models (LLMs). Although previous TCM-specific LLMs have shown progress through supervised fine-tuning, they often face limitations…

October 21, 2025

User Profiles of Sleep Disorder Sufferers: Towards Explainable Clustering and Differential Variable Analysis

arXiv:2510.15986v1 Announce Type: new Abstract: Sleep disorders have a major impact on patients’ health and quality of life, but their diagnosis remains complex due to the diversity of symptoms. Today, technological advances, combined with medical data analysis, are opening new…

October 21, 2025

Non-asymptotic error bounds for probability flow ODEs under weak log-concavity

arXiv:2510.17608v1 Announce Type: cross Abstract: Score-based generative modeling, implemented through probability flow ODEs, has shown impressive results in numerous practical settings. However, most convergence guarantees rely on restrictive regularity assumptions on the target distribution — such as strong log-concavity or…

October 21, 2025

Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models

arXiv:2510.15987v1 Announce Type: new Abstract: How do latent and inference time computations enable large language models (LLMs) to solve multi-step reasoning? We introduce a framework for tracing and steering algorithmic primitives that underlie model reasoning. Our approach links reasoning traces…

October 21, 2025

Executable Knowledge Graphs for Replicating AI Research

arXiv:2510.17795v1 Announce Type: cross Abstract: Replicating AI research is a crucial yet challenging task for large language model (LLM) agents. Existing approaches often struggle to generate executable code, primarily due to insufficient background knowledge and the limitations of retrieval-augmented generation…

October 21, 2025

Can GRPO Help LLMs Transcend Their Pretraining Origin?

arXiv:2510.15990v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR), primarily driven by the Group Relative Policy Optimization (GRPO) algorithm, is a leading approach for enhancing the reasoning abilities of Large Language Models (LLMs). Despite its wide adoption, GRPO’s…

October 21, 2025

MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning

arXiv:2407.20999v4 Announce Type: replace Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across a wide range of tasks. Typically, LLMs are first pre-trained on large corpora and subsequently fine-tuned on task-specific datasets. However, during fine-tuning, LLMs may forget some…

October 21, 2025