Archives AI News

A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications

arXiv:2509.18714v3 Announce Type: replace Abstract: The bisimulation metric (BSM) is a powerful tool for computing state similarities within a Markov decision process (MDP), revealing that states closer in BSM have more similar optimal value functions. While BSM has been successfully…

November 4, 2025

Quadratic Direct Forecast for Training Multi-Step Time-Series Forecast Models

arXiv:2511.00053v1 Announce Type: new Abstract: The design of training objective is central to training time-series forecasting models. Existing training objectives such as mean squared error mostly treat each future step as an independent, equally weighted task, which we found leading…

November 4, 2025

Knolling Bot: Teaching Robots the Human Notion of Tidiness

arXiv:2310.04566v3 Announce Type: replace-cross Abstract: For robots to truly collaborate and assist humans, they must understand not only logic and instructions, but also the subtle emotions, aesthetics, and feelings that define our humanity. Human art and aesthetics are among the…

November 4, 2025

SpatialTraceGen: High-Fidelity Traces for Efficient VLM Spatial Reasoning Distillation

arXiv:2511.00054v1 Announce Type: new Abstract: While Vision-Language Models (VLMs) excel in many areas, they struggle with complex spatial reasoning, which requires problem decomposition and strategic tool use. Fine-tuning smaller, more deployable models offers an efficient path to strong performance, but…

November 4, 2025

LLM Strategic Reasoning: Agentic Study through Behavioral Game Theory

arXiv:2502.20432v3 Announce Type: replace-cross Abstract: Strategic decision-making involves interactive reasoning where agents adapt their choices in response to others, yet existing evaluations of large language models (LLMs) often emphasize Nash Equilibrium (NE) approximation, overlooking the mechanisms driving their strategic choices.…

November 4, 2025

Exploring Federated Learning for Thermal Urban Feature Segmentation — A Comparison of Centralized and Decentralized Approaches

arXiv:2511.00055v1 Announce Type: new Abstract: Federated Learning (FL) is an approach for training a shared Machine Learning (ML) model with distributed training data and multiple participants. FL allows bypassing limitations of the traditional Centralized Machine Learning CL if data cannot…

November 4, 2025

SplashNet: Split-and-Share Encoders for Accurate and Efficient Typing with Surface Electromyography

arXiv:2506.12356v2 Announce Type: replace-cross Abstract: Surface electromyography (sEMG) at the wrists could enable natural, keyboard-free text entry, yet the state-of-the-art emg2qwerty baseline still misrecognizes $51.8%$ of characters in the zero-shot setting on unseen users and $7.0%$ after user-specific fine-tuning. We…

November 4, 2025

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling

arXiv:2511.00056v1 Announce Type: new Abstract: The substantial memory demands of pre-training and fine-tuning large language models (LLMs) require memory-efficient optimization algorithms. One promising approach is layer-wise optimization, which treats each transformer block as a single layer and optimizes it sequentially,…

November 4, 2025

Localist LLMs — A Mathematical Framework for Dynamic Locality Control

arXiv:2510.09338v2 Announce Type: replace-cross Abstract: We present a novel framework for training large language models with continuously adjustable internal representations that span the full spectrum from localist (interpretable, rule-based) to distributed (generalizable, efficient) encodings. The key innovation is a locality…

November 4, 2025

Automatically Finding Rule-Based Neurons in OthelloGPT

arXiv:2511.00059v1 Announce Type: new Abstract: OthelloGPT, a transformer trained to predict valid moves in Othello, provides an ideal testbed for interpretability research. The model is complex enough to exhibit rich computational patterns, yet grounded in rule-based game logic that enables…

November 4, 2025