Archives AI News

VIRTUE: Visual-Interactive Text-Image Universal Embedder

arXiv:2510.00523v1 Announce Type: new Abstract: Multimodal representation learning models have demonstrated successful operation across complex tasks, and the integration of vision-language models (VLMs) has further enabled embedding models with instruction-following capabilities. However, existing embedding models lack visual-interactive capabilities to specify…

Data Quality Challenges in Retrieval-Augmented Generation

arXiv:2510.00552v1 Announce Type: new Abstract: Organizations increasingly adopt Retrieval-Augmented Generation (RAG) to enhance Large Language Models with enterprise-specific knowledge. However, current data quality (DQ) frameworks have been primarily developed for static datasets, and only inadequately address the dynamic, multi-stage nature…

LLM Watermark Evasion via Bias Inversion

arXiv:2509.23019v2 Announce Type: replace-cross Abstract: Watermarking for large language models (LLMs) embeds a statistical signal during generation to enable detection of model-produced text. While watermarking has proven effective in benign settings, its robustness under adversarial evasion remains contested. To advance…

ACON: Optimizing Context Compression for Long-horizon LLM Agents

arXiv:2510.00615v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed as agents in dynamic, real-world environments, where success requires both reasoning and effective tool use. A central challenge for agentic tasks is the growing context length, as agents…

Learning linear dynamical systems under convex constraints

arXiv:2303.15121v4 Announce Type: replace-cross Abstract: We consider the problem of finite-time identification of linear dynamical systems from $T$ samples of a single trajectory. Recent results have predominantly focused on the setup where either no structural assumption is made on the…