Archives AI News

Bilevel Models for Adversarial Learning and A Case Study

arXiv:2510.25121v2 Announce Type: replace Abstract: Adversarial learning has been attracting more and more attention thanks to the fast development of machine learning and artificial intelligence. However, due to the complicated structure of most machine learning models, the mechanism of adversarial…

CID: Measuring Feature Importance Through Counterfactual Distributions

arXiv:2511.15371v2 Announce Type: replace Abstract: Assessing the importance of individual features in Machine Learning is critical to understand the model’s decision-making process. While numerous methods exist, the lack of a definitive ground truth for comparison highlights the need for alternative,…

Learning to Orchestrate Agents in Natural Language with the Conductor

arXiv:2512.04388v1 Announce Type: new Abstract: Powerful large language models (LLMs) from different providers have been expensively trained and finetuned to specialize across varying domains. In this work, we introduce a new kind of Conductor model trained with reinforcement learning to…

DAVE: Diagnostic benchmark for Audio Visual Evaluation

arXiv:2503.09321v2 Announce Type: replace-cross Abstract: Audio-visual understanding is a rapidly evolving field that seeks to integrate and interpret information from both auditory and visual modalities. Despite recent advances in multi-modal learning, existing benchmarks often suffer from strong visual bias —…

GraphBench: Next-generation graph learning benchmarking

arXiv:2512.04475v1 Announce Type: new Abstract: Machine learning on graphs has recently achieved impressive progress in various domains, including molecular property prediction and chip design. However, benchmarking practices remain fragmented, often relying on narrow, task-specific datasets and inconsistent evaluation protocols, which…