Archives AI News

Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning

arXiv:2601.03320v1 Announce Type: new Abstract: On-policy reinforcement learning (RL), particularly Proximal Policy Optimization (PPO) and Group Relative Policy Optimization (GRPO), has become the dominant paradigm for fine-tuning large language models (LLMs). While policy ratio clipping stabilizes training, this heuristic hard…

January 8, 2026

Using Small Language Models to Reverse-Engineer Machine Learning Pipelines Structures

arXiv:2601.03988v1 Announce Type: cross Abstract: Background: Extracting the stages that structure Machine Learning (ML) pipelines from source code is key for gaining a deeper understanding of data science practices. However, the diversity caused by the constant evolution of the ML…

January 8, 2026

Physics-Informed Gaussian Process Regression for the Constitutive Modeling of Concrete: A Data-Driven Improvement to Phenomenological Models

arXiv:2601.03367v1 Announce Type: new Abstract: Understanding and modeling the constitutive behavior of concrete is crucial for civil and defense applications, yet widely used phenomenological models such as Karagozian & Case concrete (KCC) model depend on empirically calibrated failure surfaces that…

January 8, 2026

Scanner-Induced Domain Shifts Undermine the Robustness of Pathology Foundation Models

arXiv:2601.04163v1 Announce Type: cross Abstract: Pathology foundation models (PFMs) have become central to computational pathology, aiming to offer general encoders for feature extraction from whole-slide images (WSIs). Despite strong benchmark performance, PFM robustness to real-world technical domain shifts, such as…

January 8, 2026

Enhancing Small Dataset Classification Using Projected Quantum Kernels with Convolutional Neural Networks

arXiv:2601.03375v1 Announce Type: new Abstract: Convolutional Neural Networks (CNNs) have shown promising results in efficiency and accuracy in image classification. However, their efficacy often relies on large, labeled datasets, posing challenges for applications with limited data availability. Our research addresses…

January 8, 2026

Architecture independent generalization bounds for overparametrized deep ReLU networks

arXiv:2504.05695v4 Announce Type: replace Abstract: We prove that overparametrized neural networks are able to generalize with a test error that is independent of the level of overparametrization, and independent of the Vapnik-Chervonenkis (VC) dimension. We prove explicit bounds that only…

January 8, 2026

Weather-Aware Transformer for Real-Time Route Optimization in Drone-as-a-Service Operations

arXiv:2601.03376v1 Announce Type: new Abstract: This paper presents a novel framework to accelerate route prediction in Drone-as-a-Service operations through weather-aware deep learning models. While classical path-planning algorithms, such as A* and Dijkstra, provide optimal solutions, their computational complexity limits real-time…

January 8, 2026

A Hybrid Computational Intelligence Framework with Metaheuristic Optimization for Drug-Drug Interaction Prediction

arXiv:2510.09668v2 Announce Type: replace Abstract: Drug-drug interactions (DDIs) are a leading cause of preventable adverse events, often complicating treatment and increasing healthcare costs. At the same time, knowing which drugs do not interact is equally important, as such knowledge supports…

January 8, 2026

SIGMA: Scalable Spectral Insights for LLM Collapse

arXiv:2601.03385v1 Announce Type: new Abstract: The rapid adoption of synthetic data for training Large Language Models (LLMs) has introduced the technical challenge of “model collapse”-a degenerative process where recursive training on model-generated content leads to a contraction of distributional variance…

January 8, 2026

Intrinsic-Metric Physics-Informed Neural Networks (IM-PINN) for Reaction-Diffusion Dynamics on Complex Riemannian Manifolds

arXiv:2601.00834v2 Announce Type: replace Abstract: Simulating nonlinear reaction-diffusion dynamics on complex, non-Euclidean manifolds remains a fundamental challenge in computational morphogenesis, constrained by high-fidelity mesh generation costs and symplectic drift in discrete time-stepping schemes. This study introduces the Intrinsic-Metric Physics-Informed Neural…

January 8, 2026