Archives AI News

Visually Prompted Benchmarks Are Surprisingly Fragile

arXiv:2512.17875v2 Announce Type: replace-cross Abstract: A key challenge in evaluating VLMs is testing models’ ability to analyze visual content independently from their textual priors. Recent benchmarks such as BLINK probe visual perception through visual prompting, where questions about visual content…

VGC-Bench: Towards Mastering Diverse Team Strategies in Competitive Pokémon

arXiv:2506.10326v3 Announce Type: replace-cross Abstract: Developing AI agents that can robustly adapt to varying strategic landscapes without retraining is a central challenge in multi-agent learning. Pokémon Video Game Championships (VGC) is a domain with a vast space of approximately $10^{139}$…

Directed Homophily-Aware Graph Neural Network

arXiv:2505.22362v3 Announce Type: replace Abstract: Graph Neural Networks (GNNs) have achieved significant success in various learning tasks on graph-structured data. Nevertheless, most GNNs struggle to generalize to heterophilic neighborhoods. Additionally, many GNNs ignore the directional nature of real-world graphs, resulting…

NOVAK: Unified adaptive optimizer for deep neural networks

arXiv:2601.07876v1 Announce Type: new Abstract: This work introduces NOVAK, a modular gradient-based optimization algorithm that integrates adaptive moment estimation, rectified learning-rate scheduling, decoupled weight regularization, multiple variants of Nesterov momentum, and lookahead synchronization into a unified, performance-oriented framework. NOVAK adopts…

E²-LLM: Bridging Neural Signals and Interpretable Affective Analysis

arXiv:2601.07877v1 Announce Type: new Abstract: Emotion recognition from electroencephalography (EEG) signals remains challenging due to high inter-subject variability, limited labeled data, and the lack of interpretable reasoning in existing approaches. While recent multimodal large language models (MLLMs) have advanced emotion…