Archives AI News

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

arXiv:2508.03501v2 Announce Type: replace Abstract: Research on applications of reinforcement learning (RL) to large language models has mostly been focused on single-turn problems, such as mathematical reasoning or single-shot code generation. While these problems can be viewed as token-level multi-turn…

October 14, 2025

Deep Neural Networks Inspired by Differential Equations

arXiv:2510.09685v1 Announce Type: new Abstract: Deep learning has become a pivotal technology in fields such as computer vision, scientific computing, and dynamical systems, significantly advancing these disciplines. However, neural Networks persistently face challenges related to theoretical understanding, interpretability, and generalization.…

October 14, 2025

Enhancing XAI Narratives through Multi-Narrative Refinement and Knowledge Distillation

arXiv:2510.03134v2 Announce Type: replace Abstract: Explainable Artificial Intelligence has become a crucial area of research, aiming to demystify the decision-making processes of deep learning models. Among various explainability techniques, counterfactual explanations have been proven particularly promising, as they offer insights…

October 14, 2025

On the Occurence of Critical Learning Periods in Neural Networks

arXiv:2510.09687v1 Announce Type: new Abstract: This study delves into the plasticity of neural networks, offering empirical support for the notion that critical learning periods and warm-starting performance loss can be avoided through simple adjustments to learning hyperparameters. The critical learning…

October 14, 2025

Reinforcement learning-based statistical search strategy for an axion model from flavor

arXiv:2409.10023v2 Announce Type: replace-cross Abstract: We propose a reinforcement learning-based search strategy to explore new physics beyond the Standard Model. The reinforcement learning, which is one of machine learning methods, is a powerful approach to find model parameters with phenomenological…

October 14, 2025

Evaluation of Differential Privacy Mechanisms on Federated Learning

arXiv:2510.09691v1 Announce Type: new Abstract: Federated learning is distributed model training across several clients without disclosing raw data. Despite advancements in data privacy, risks still remain. Differential Privacy (DP) is a technique to protect sensitive data by adding noise to…

October 14, 2025

Noise Injection Systemically Degrades Large Language Model Safety Guardrails

arXiv:2505.13500v2 Announce Type: replace-cross Abstract: Safety guardrails in large language models (LLMs) are a critical component in preventing harmful outputs. Yet, their resilience under perturbation remains poorly understood. In this paper, we investigate the robustness of safety fine-tuning in LLMs…

October 14, 2025

Neural PDE Solvers with Physics Constraints: A Comparative Study of PINNs, DRM, and WANs

arXiv:2510.09693v1 Announce Type: new Abstract: Partial differential equations (PDEs) underpin models across science and engineering, yet analytical solutions are atypical and classical mesh-based solvers can be costly in high dimensions. This dissertation presents a unified comparison of three mesh-free neural…

October 14, 2025

FSA: An Alternative Efficient Implementation of Native Sparse Attention Kernel

arXiv:2508.18224v2 Announce Type: replace-cross Abstract: Recent advance in sparse attention mechanisms has demonstrated strong potential for reducing the computational cost of long-context training and inference in large language models (LLMs). Native Sparse Attention (NSA), one state-of-the-art approach, introduces natively trainable,…

October 14, 2025

Kelp: A Streaming Safeguard for Large Models via Latent Dynamics-Guided Risk Detection

arXiv:2510.09694v1 Announce Type: new Abstract: Large models (LMs) are powerful content generators, yet their open-ended nature can also introduce potential risks, such as generating harmful or biased content. Existing guardrails mostly perform post-hoc detection that may expose unsafe content before…

October 14, 2025