Archives AI News

SWE-Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering

arXiv:2502.01860v5 Announce Type: replace-cross Abstract: Foundation models (FMs), particularly large language models (LLMs), have shown significant promise in various software engineering (SE) tasks, including code generation, debugging, and requirement refinement. Despite these advances, existing evaluation frameworks are insufficient for assessing…

DPCformer: An Interpretable Deep Learning Model for Genomic Prediction in Crops

arXiv:2510.08662v1 Announce Type: new Abstract: Genomic Selection (GS) uses whole-genome information to predict crop phenotypes and accelerate breeding. Traditional GS methods, however, struggle with prediction accuracy for complex traits and large datasets. We propose DPCformer, a deep learning model integrating…

FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching

arXiv:2510.08669v1 Announce Type: new Abstract: The application of diffusion transformers is suffering from their significant inference costs. Recently, feature caching has been proposed to solve this problem by reusing features from previous timesteps, thereby skipping computation in future timesteps. However,…

CATS-Linear: Classification Auxiliary Linear Model for Time Series Forecasting

arXiv:2510.08661v1 Announce Type: new Abstract: Recent research demonstrates that linear models achieve forecasting performance competitive with complex architectures, yet methodologies for enhancing linear models remain underexplored. Motivated by the hypothesis that distinct time series instances may follow heterogeneous linear mappings,…

Provably Robust Adaptation for Language-Empowered Foundation Models

arXiv:2510.08659v1 Announce Type: new Abstract: Language-empowered foundation models (LeFMs), such as CLIP and GraphCLIP, have transformed multimodal learning by aligning visual (or graph) features with textual representations, enabling powerful downstream capabilities like few-shot learning. However, the reliance on small, task-specific…

Inner-Instance Normalization for Time Series Forecasting

arXiv:2510.08657v1 Announce Type: new Abstract: Real-world time series are influenced by numerous factors and exhibit complex non-stationary characteristics. Non-stationarity can lead to distribution shifts, where the statistical properties of time series change over time, negatively impacting model performance. Several instance…

Knowledge Graph Sparsification for GNN-based Rare Disease Diagnosis

arXiv:2510.08655v1 Announce Type: new Abstract: Rare genetic disease diagnosis faces critical challenges: insufficient patient data, inaccessible full genome sequencing, and the immense number of possible causative genes. These limitations cause prolonged diagnostic journeys, inappropriate treatments, and critical delays, disproportionately affecting…

Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance

arXiv:2507.17131v2 Announce Type: replace Abstract: Large language model (LLM) agents often struggle in environments where rules and required domain knowledge frequently change, such as regulatory compliance and user risk screening. Current approaches, like offline fine-tuning and standard prompting, are insufficient…

Don’t Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting

arXiv:2510.08696v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has become a standard recipe for improving large language models (LLMs) on reasoning tasks, with Group Relative Policy Optimization (GRPO) widely used in practice. Yet GRPO wastes substantial compute…