Archives AI News

Pruning the Paradox: How CLIP’s Most Informative Heads Enhance Performance While Amplifying Bias

arXiv:2503.11103v3 Announce Type: replace-cross Abstract: CLIP is one of the most popular foundation models and is heavily used for many vision-language tasks, yet little is known about its inner workings. As CLIP is increasingly deployed in real-world applications, it is…

September 22, 2025

Building Data-Driven Occupation Taxonomies: A Bottom-Up Multi-Stage Approach via Semantic Clustering and Multi-Agent Collaboration

arXiv:2509.15786v1 Announce Type: new Abstract: Creating robust occupation taxonomies, vital for applications ranging from job recommendation to labor market intelligence, is challenging. Manual curation is slow, while existing automated methods are either not adaptive to dynamic regional markets (top-down) or…

September 22, 2025

CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation

arXiv:2505.16325v2 Announce Type: replace-cross Abstract: Existing metrics often lack the granularity and interpretability to capture nuanced clinical differences between candidate and ground-truth radiology reports, resulting in suboptimal evaluation. We introduce a Clinically-grounded tabular framework with Expert-curated labels and Attribute-level comparison…

September 22, 2025

A Comparative Study of Rule-Based and Data-Driven Approaches in Industrial Monitoring

arXiv:2509.15848v1 Announce Type: new Abstract: Industrial monitoring systems, especially when deployed in Industry 4.0 environments, are experiencing a shift in paradigm from traditional rule-based architectures to data-driven approaches leveraging machine learning and artificial intelligence. This study presents a comparison between…

September 22, 2025

DualEdit: Dual Editing for Knowledge Updating in Vision-Language Models

arXiv:2506.13638v2 Announce Type: replace-cross Abstract: Model editing aims to efficiently update a pre-trained model’s knowledge without the need for time-consuming full retraining. While existing pioneering editing methods achieve promising results, they primarily focus on editing single-modal language models (LLMs). However,…

September 22, 2025

EHR-MCP: Real-world Evaluation of Clinical Information Retrieval by Large Language Models via Model Context Protocol

arXiv:2509.15957v1 Announce Type: new Abstract: Background: Large language models (LLMs) show promise in medicine, but their deployment in hospitals is limited by restricted access to electronic health record (EHR) systems. The Model Context Protocol (MCP) enables integration between LLMs and…

September 22, 2025

LongCat-Flash Technical Report

arXiv:2509.01322v2 Announce Type: replace-cross Abstract: We introduce LongCat-Flash, a 560-billion-parameter Mixture-of-Experts (MoE) language model designed for both computational efficiency and advanced agentic capabilities. Stemming from the need for scalable efficiency, LongCat-Flash adopts two novel designs: (a) Zero-computation Experts, which enables…

September 22, 2025

Structured Information for Improving Spatial Relationships in Text-to-Image Generation

arXiv:2509.15962v1 Announce Type: new Abstract: Text-to-image (T2I) generation has advanced rapidly, yet faithfully capturing spatial relationships described in natural language prompts remains a major challenge. Prior efforts have addressed this issue through prompt optimization, spatially grounded generation, and semantic refinement.…

September 22, 2025

Empathy-R1: A Chain-of-Empathy and Reinforcement Learning Framework for Long-Form Mental Health Support

arXiv:2509.14851v2 Announce Type: replace-cross Abstract: Empathy is critical for effective mental health support, especially when addressing Long Counseling Texts (LCTs). However, existing Large Language Models (LLMs) often generate replies that are semantically fluent but lack the structured reasoning necessary for…

September 22, 2025

Attention Schema-based Attention Control (ASAC): A Cognitive-Inspired Approach for Attention Management in Transformers

arXiv:2509.16058v1 Announce Type: new Abstract: Attention mechanisms have become integral in AI, significantly enhancing model performance and scalability by drawing inspiration from human cognition. Concurrently, the Attention Schema Theory (AST) in cognitive science posits that individuals manage their attention by…

September 22, 2025