Archives AI News

TED: Accelerate Model Training by Internal Generalization

arXiv:2405.03228v3 Announce Type: replace Abstract: Large language models have demonstrated strong performance in recent years, but the high cost of training drives the need for efficient methods to compress dataset sizes. We propose TED pruning, a method that addresses the…

Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models

arXiv:2309.01219v3 Announce Type: replace-cross Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input,…

Hallucinated Span Detection with Multi-View Attention Features

arXiv:2504.04335v2 Announce Type: replace-cross Abstract: This study addresses the problem of hallucinated span detection in the outputs of large language models. It has received less attention than output-level hallucination detection despite its practical importance. Prior work has shown that attentions…

Adaptive Preference Optimization with Uncertainty-aware Utility Anchor

arXiv:2509.10515v1 Announce Type: new Abstract: Offline preference optimization methods are efficient for large language models (LLMs) alignment. Direct Preference optimization (DPO)-like learning, one of the most popular approaches, stands out for its efficiency in reward modeling. However, these methods typically…