Archives AI News

Representation-Level Counterfactual Calibration for Debiased Zero-Shot Recognition

arXiv:2510.26466v2 Announce Type: replace-cross Abstract: Object-context shortcuts remain a persistent challenge in vision-language models, undermining zero-shot reliability when test-time scenes differ from familiar training co-occurrences. We recast this issue as a causal inference problem and ask: Would the prediction remain…

November 4, 2025

Adaptive Spatio-Temporal Graphs with Self-Supervised Pretraining for Multi-Horizon Weather Forecasting

arXiv:2511.00049v1 Announce Type: new Abstract: Accurate and robust weather forecasting remains a fundamental challenge due to the inherent spatio-temporal complexity of atmospheric systems. In this paper, we propose a novel self-supervised learning framework that leverages spatio-temporal structures to improve multi-variable…

November 4, 2025

Ranking hierarchical multi-label classification results with mLPRs

arXiv:2205.07833v2 Announce Type: replace Abstract: Hierarchical multi-label classification (HMC) has gained considerable attention in recent decades. A seminal line of HMC research addresses the problem in two stages: first, training individual classifiers for each class, then integrating these classifiers to…

November 4, 2025

FLoRA: Fused forward-backward adapters for parameter efficient fine-tuning and reducing inference-time latencies of LLMs

arXiv:2511.00050v1 Announce Type: new Abstract: As the large language models (LLMs) grow in size each day, efficient training and fine-tuning has never been as important as nowadays. This resulted in the great interest in parameter efficient fine-tuning (PEFT), and effective…

November 4, 2025

Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds

arXiv:2502.02869v4 Announce Type: replace Abstract: In-Context Reinforcement Learning (ICRL) enables agents to learn automatically and on-the-fly from their interactive experiences. However, a major challenge in scaling up ICRL is the lack of scalable task collections. To address this, we propose…

November 4, 2025

Calibrating and Rotating: A Unified Framework for Weight Conditioning in PEFT

arXiv:2511.00051v1 Announce Type: new Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods are crucial for adapting large pre-trained models. Among these, LoRA is considered a foundational approach. Building on this, the influential DoRA method enhances performance by decomposing weight updates into magnitude and…

November 4, 2025

TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning

arXiv:2505.23719v2 Announce Type: replace Abstract: In-context learning, the ability of large language models to perform tasks using only examples provided in the prompt, has recently been adapted for time series forecasting. This paradigm enables zero-shot prediction, where past values serve…

November 4, 2025

Feature-Guided Analysis of Neural Networks: A Replication Study

arXiv:2511.00052v1 Announce Type: new Abstract: Understanding why neural networks make certain decisions is pivotal for their use in safety-critical applications. Feature-Guided Analysis (FGA) extracts slices of neural networks relevant to their tasks. Existing feature-guided approaches typically monitor the activation of…

November 4, 2025

A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications

arXiv:2509.18714v3 Announce Type: replace Abstract: The bisimulation metric (BSM) is a powerful tool for computing state similarities within a Markov decision process (MDP), revealing that states closer in BSM have more similar optimal value functions. While BSM has been successfully…

November 4, 2025

Quadratic Direct Forecast for Training Multi-Step Time-Series Forecast Models

arXiv:2511.00053v1 Announce Type: new Abstract: The design of training objective is central to training time-series forecasting models. Existing training objectives such as mean squared error mostly treat each future step as an independent, equally weighted task, which we found leading…

November 4, 2025