Archives AI News

Benchmarking System Dynamics AI Assistants: Cloud Versus Local LLMs on CLD Extraction and Discussion

arXiv:2604.18566v1 Announce Type: cross Abstract: We present a systematic evaluation of large language model families — spanning both proprietary cloud APIs and locally-hosted open-source models — on two purpose-built benchmarks for System Dynamics AI assistance: the textbf{CLD Leaderboard} (53 tests,…

April 21, 2026

Beyond Verifiable Rewards: Rubric-Based GRM for Reinforced Fine-Tuning SWE Agents

arXiv:2604.16335v1 Announce Type: new Abstract: Despite recent progress in Large Language Model (LLM) Agents for Software Engineering (SWE) tasks, end-to-end fine-tuning typically relies on verifiable terminal rewards such as whether all unit tests pass. While these binary signals reflect whether…

April 21, 2026

Sampling for Quality: Training-Free Reward-Guided LLM Decoding via Sequential Monte Carlo

arXiv:2604.16453v1 Announce Type: new Abstract: We introduce a principled probabilistic framework for reward-guided decoding in large language models, addressing the limitations of standard decoding methods that optimize token-level likelihood rather than sequence-level quality. Our method defines a reward-augmented target distribution…

April 21, 2026

Machine Learning Based Prediction of Proton Conductivity in Metal-Organic Frameworks

arXiv:2407.09514v3 Announce Type: replace-cross Abstract: Recently, metal-organic frameworks (MOFs) have demonstrated their potential as solid-state electrolytes in proton exchange membrane fuel cells. However, the number of MOFs reported to exhibit proton conductivity remains limited, and the mechanisms underlying this phenomenon…

April 21, 2026

Non-Stationarity in the Embedding Space of Time Series Foundation Models

arXiv:2604.16428v1 Announce Type: new Abstract: Time series foundation models (TSFMs) are widely used as generic feature extractors, yet the notion of non-stationarity in their embedding spaces remains poorly understood. Recent work often conflates non-stationarity with distribution shift, blurring distinctions fundamental…

April 21, 2026

Instance-Adaptive Parametrization for Amortized Variational Inference

arXiv:2604.06796v2 Announce Type: replace Abstract: Variational autoencoders (VAEs) rely on amortized variational inference to enable efficient posterior approximation, but this efficiency comes at the cost of a shared parametrization, giving rise to the amortization gap. We propose the instance-adaptive variational…

April 21, 2026

Positive-Only Drifting Policy Optimization

arXiv:2604.16519v1 Announce Type: new Abstract: In the field of online reinforcement learning (RL), traditional Gaussian policies and flow-based methods are often constrained by their unimodal expressiveness, complex gradient clipping, or stringent trust-region requirements. Moreover, they all rely on post-hoc penalization…

April 21, 2026

DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty

arXiv:2506.12622v2 Announce Type: replace Abstract: Deep reinforcement learning (RL) has achieved remarkable success, yet its deployment in real-world scenarios is often limited by vulnerability to environmental uncertainties. Distributionally robust RL (DR-RL) algorithms have been proposed to resolve this challenge, but…

April 21, 2026

Multi-Label Phase Diagram Prediction in Complex Alloys via Physics-Informed Graph Attention Networks

arXiv:2604.16468v1 Announce Type: new Abstract: Accurate phase equilibria are foundational to alloy design because they encode the underlying thermodynamics governing stability, transformations, and processing windows. However, while the CALculation of Phase Diagrams (CALPHAD) provides a rigorous thermodynamic framework, exploring multicomponent…

April 21, 2026

Culinary Crossroads: A RAG Framework for Enhancing Diversity in Cross-Cultural Recipe Adaptation

arXiv:2507.21934v2 Announce Type: replace-cross Abstract: In cross-cultural recipe adaptation, the goal is not only to ensure cultural appropriateness and retain the original dish’s essence, but also to provide diverse options for various dietary needs and preferences. Retrieval Augmented Generation (RAG)…

April 21, 2026