Archives AI News

Hybrid SARIMA LSTM Model for Local Weather Forecasting: A Residual Learning Approach for Data Driven Meteorological Prediction

arXiv:2601.07951v1 Announce Type: new Abstract: Accurately forecasting long-term atmospheric variables remains a defining challenge in meteorological science due to the chaotic nature of atmospheric systems. Temperature data represents a complex superposition of deterministic cyclical climate forces and stochastic, short-term fluctuations.…

January 14, 2026

YRC-Bench: A Benchmark for Learning to Coordinate with Experts

arXiv:2502.09583v3 Announce Type: replace Abstract: When deployed in the real world, AI agents will inevitably face challenges that exceed their individual capabilities. A critical component of AI safety is an agent’s ability to recognize when it is likely to fail…

January 14, 2026

DataScribe: An AI-Native, Policy-Aligned Web Platform for Multi-Objective Materials Design and Discovery

arXiv:2601.07966v1 Announce Type: new Abstract: The acceleration of materials discovery requires digital platforms that go beyond data repositories to embed learning, optimization, and decision-making directly into research workflows. We introduce DataScribe, an AI-native, cloud-based materials discovery platform that unifies heterogeneous…

January 14, 2026

Visually Prompted Benchmarks Are Surprisingly Fragile

arXiv:2512.17875v2 Announce Type: replace-cross Abstract: A key challenge in evaluating VLMs is testing models’ ability to analyze visual content independently from their textual priors. Recent benchmarks such as BLINK probe visual perception through visual prompting, where questions about visual content…

January 14, 2026

Accelerated Methods with Complexity Separation Under Data Similarity for Federated Learning Problems

arXiv:2601.08614v1 Announce Type: cross Abstract: Heterogeneity within data distribution poses a challenge in many modern federated learning tasks. We formalize it as an optimization problem involving a computationally heavy composite under data similarity. By employing different sets of assumptions, we…

January 14, 2026

Buffered AUC maximization for scoring systems via mixed-integer optimization

arXiv:2601.05544v2 Announce Type: replace Abstract: A scoring system is a linear classifier composed of a small number of explanatory variables, each assigned a small integer coefficient. This system is highly interpretable and allows predictions to be made with simple manual…

January 14, 2026

VGC-Bench: Towards Mastering Diverse Team Strategies in Competitive Pok’emon

arXiv:2506.10326v3 Announce Type: replace-cross Abstract: Developing AI agents that can robustly adapt to varying strategic landscapes without retraining is a central challenge in multi-agent learning. Pok’emon Video Game Championships (VGC) is a domain with a vast space of approximately $10^{139}$…

January 14, 2026

Directed Homophily-Aware Graph Neural Network

arXiv:2505.22362v3 Announce Type: replace Abstract: Graph Neural Networks (GNNs) have achieved significant success in various learning tasks on graph-structured data. Nevertheless, most GNNs struggle to generalize to heterophilic neighborhoods. Additionally, many GNNs ignore the directional nature of real-world graphs, resulting…

January 14, 2026

Aligning by Misaligning: Boundary-aware Curriculum Learning for Multimodal Alignment

arXiv:2511.08399v2 Announce Type: replace Abstract: Most multimodal models treat every negative pair alike, ignoring the ambiguous negatives that differ from the positive by only a small detail. We propose Boundary-Aware Curriculum with Local Attention (BACL), a lightweight add-on that turns…

January 14, 2026

Sliced-Wasserstein Distribution Alignment Loss Improves the Ultra-Low-Bit Quantization of Large Language Models

arXiv:2601.07878v1 Announce Type: new Abstract: The benefits of most large language models come with steep and often hidden economic and environmental costs due to their resource usage inefficiency during deployment. Model quantization improves energy and memory efficiency through representing model…

January 14, 2026