Archives: AI News

VIRTUE: Visual-Interactive Text-Image Universal Embedder

arXiv:2510.00523v1 Announce Type: new
Abstract: Multimodal representation learning models have demonstrated successful operation across complex tasks, and the integration of vision-language models (VLMs) has further enabled embedding models with instruction-following capabilities. However, existing embedding models lack visual-interactive capabilities to specify…

October 2, 2025

Unpacking Let Alone: Human-Scale Models Generalize to a Rare Construction in Form but not Meaning

arXiv:2506.04408v2 Announce Type: replace-cross
Abstract: Humans have a remarkable ability to acquire and understand grammatical phenomena that are seen rarely, if ever, during childhood. Recent evidence suggests that language models with human-scale pretraining data may possess a similar ability by…

October 2, 2025

Data Quality Challenges in Retrieval-Augmented Generation

arXiv:2510.00552v1 Announce Type: new
Abstract: Organizations increasingly adopt Retrieval-Augmented Generation (RAG) to enhance Large Language Models with enterprise-specific knowledge. However, current data quality (DQ) frameworks have been primarily developed for static datasets, and only inadequately address the dynamic, multi-stage nature…

October 2, 2025

LLM Watermark Evasion via Bias Inversion

arXiv:2509.23019v2 Announce Type: replace-cross
Abstract: Watermarking for large language models (LLMs) embeds a statistical signal during generation to enable detection of model-produced text. While watermarking has proven effective in benign settings, its robustness under adversarial evasion remains contested. To advance…

October 2, 2025

ACON: Optimizing Context Compression for Long-horizon LLM Agents

arXiv:2510.00615v1 Announce Type: new
Abstract: Large language models (LLMs) are increasingly deployed as agents in dynamic, real-world environments, where success requires both reasoning and effective tool use. A central challenge for agentic tasks is the growing context length, as agents…

October 2, 2025

Multi-modal Spatio-Temporal Transformer for High-resolution Land Subsidence Prediction

arXiv:2509.25393v2 Announce Type: replace-cross
Abstract: Forecasting high-resolution land subsidence is a critical yet challenging task due to its complex, non-linear dynamics. While standard architectures like ConvLSTM often fail to model long-range dependencies, we argue that a more fundamental limitation of…

October 2, 2025

HARPA: A Testability-Driven, Literature-Grounded Framework for Research Ideation

arXiv:2510.00620v1 Announce Type: new
Abstract: While there has been a surge of interest in automated scientific discovery (ASD), especially with the emergence of LLMs, it remains challenging for tools to generate hypotheses that are both testable and grounded in the…

October 2, 2025

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling

arXiv:2510.00743v1 Announce Type: cross
Abstract: Assessing the perceptual quality of synthetic speech is crucial for guiding the development and refinement of speech generation models. However, it has traditionally relied on human subjective ratings such as the Mean Opinion Score (MOS),…

October 2, 2025

Is Model Editing Built on Sand? Revealing Its Illusory Success and Fragile Foundation

arXiv:2510.00625v1 Announce Type: new
Abstract: Large language models (LLMs) inevitably encode outdated or incorrect knowledge. Updating, deleting, and forgetting such knowledge is important for alignment, safety, and other issues. To address this issue, model editing has emerged as a promising…

October 2, 2025

Learning linear dynamical systems under convex constraints

arXiv:2303.15121v4 Announce Type: replace-cross
Abstract: We consider the problem of finite-time identification of linear dynamical systems from $T$ samples of a single trajectory. Recent results have predominantly focused on the setup where either no structural assumption is made on the…

October 2, 2025