Archives AI News

MDKeyChunker: Single-Call LLM Enrichment with Rolling Keys and Key-Based Restructuring for High-Accuracy RAG

arXiv:2603.23533v2 Announce Type: replace-cross Abstract: RAG pipelines typically rely on fixed-size chunking, which ignores document structure, fragments semantic units across boundaries, and requires multiple LLM calls per chunk for metadata extraction. We present MDKeyChunker, a three-stage pipeline for Markdown documents…

March 30, 2026

UNIFERENCE: A Discrete Event Simulation Framework for Developing Distributed AI Models

arXiv:2603.26469v1 Announce Type: cross Abstract: Developing and evaluating distributed inference algorithms remains difficult due to the lack of standardized tools for modeling heterogeneous devices and networks. Existing studies often rely on ad-hoc testbeds or proprietary infrastructure, making results hard to…

March 30, 2026

Towards Knowledge Guided Pretraining Approaches for Multimodal Foundation Models: Applications in Remote Sensing

arXiv:2407.19660v5 Announce Type: replace-cross Abstract: Self-supervised learning has emerged as a powerful paradigm for pretraining foundation models using large-scale data. Existing pretraining approaches predominantly rely on masked reconstruction or next-token prediction strategies, demonstrating strong performance across various downstream tasks, including…

March 30, 2026

GUI-AIMA: Aligning Intrinsic Multimodal Attention with a Context Anchor for GUI Grounding

arXiv:2511.00810v3 Announce Type: replace-cross Abstract: Graphical user interface (GUI) grounding is a key capability for computer-use agents, mapping natural-language instructions to actionable regions on the screen. Existing Multimodal Large Language Model (MLLM) approaches typically formulate GUI grounding as a text-based…

March 30, 2026

FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning

arXiv:2511.22265v2 Announce Type: replace Abstract: Federated learning (FL) enables collaborative training across clients while preserving privacy. While most existing FL methods assume homogeneous model architectures, client heterogeneity in both data and resources makes this assumption impractical, thus motivating model-heterogeneous FL.…

March 30, 2026

AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems

arXiv:2603.14688v2 Announce Type: replace Abstract: As multi-agent AI systems are increasingly deployed in real-world settings – from automated customer support to DevOps remediation – failures become harder to diagnose due to cascading effects, hidden dependencies, and long execution traces. We…

March 30, 2026

DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease

arXiv:2603.25872v1 Announce Type: new Abstract: Diffusion models have achieved remarkable success in generating high-fidelity content but suffer from slow, iterative sampling, resulting in high latency that limits their use in interactive applications. We introduce DRiffusion, a parallel sampling framework that…

March 30, 2026