Archives AI News

Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning

arXiv:2510.17923v3 Announce Type: replace Abstract: Reinforcement Learning (RL) has emerged as a powerful paradigm for advancing Large Language Models (LLMs), achieving remarkable performance in complex reasoning domains such as mathematics and code generation. However, current RL methods face a fundamental…

December 3, 2025

WhAM: Towards A Translative Model of Sperm Whale Vocalization

arXiv:2512.02206v1 Announce Type: new Abstract: Sperm whales communicate in short sequences of clicks known as codas. We present WhAM (Whale Acoustics Model), the first transformer-based model capable of generating synthetic sperm whale codas from any audio prompt. WhAM is built…

December 3, 2025

Sigma: The Key for Vision-Language-Action Models toward Telepathic Alignment

arXiv:2512.00783v2 Announce Type: replace Abstract: To address the gap in humanoid robot cognitive systems regarding the lack of a time-updable mediating thought space between semantics and continuous control, this study constructs and trains a VLA model named “Sigma” that runs…

December 3, 2025

InstructLR: A Scalable Approach to Create Instruction Dataset for Under-Resourced Languages

arXiv:2512.02213v1 Announce Type: new Abstract: Effective text generation and chat interfaces for low-resource languages (LRLs) remain a challenge for state-of-the-art large language models (LLMs) to support. This is mainly due to the difficulty of curating high-quality instruction datasets for LRLs,…

December 3, 2025

Rank Matters: Understanding and Defending Model Inversion Attacks via Low-Rank Feature Filtering

arXiv:2410.05814v4 Announce Type: replace-cross Abstract: Model Inversion Attacks (MIAs) pose a significant threat to data privacy by reconstructing sensitive training samples from the knowledge embedded in trained machine learning models. Despite recent progress in enhancing the effectiveness of MIAs across…

December 3, 2025

Improved Training Mechanism for Reinforcement Learning via Online Model Selection

arXiv:2512.02214v1 Announce Type: new Abstract: We study the problem of online model selection in reinforcement learning, where the selector has access to a class of reinforcement learning agents and learns to adaptively select the agent with the right configuration. Our…

December 3, 2025

CNN-Enabled Scheduling for Probabilistic Real-Time Guarantees in Industrial URLLC

arXiv:2506.14987v3 Announce Type: replace-cross Abstract: Ensuring packet-level communication quality is vital for ultra-reliable, low-latency communications (URLLC) in large-scale industrial wireless networks. We enhance the Local Deadline Partition (LDP) algorithm by introducing a CNN-based dynamic priority prediction mechanism for improved interference…

December 3, 2025

Uncertainty Reasoning with Photonic Bayesian Machines

arXiv:2512.02217v1 Announce Type: new Abstract: Artificial intelligence (AI) systems increasingly influence safety-critical aspects of society, from medical diagnosis to autonomous mobility, making uncertainty awareness a central requirement for trustworthy AI. We present a photonic Bayesian machine that leverages the inherent…

December 3, 2025

The Algorithmic Phase Transition in Correlated Spiked Models

arXiv:2511.06040v4 Announce Type: replace-cross Abstract: We study the computational task of detecting and estimating correlated signals in a pair of spiked matrices $$ X=tfrac{lambda}{sqrt{n}} xu^{top}+W, quad Y=tfrac{mu}{sqrt{n}} yv^{top}+Z $$ where the spikes $x,y$ have correlation $rho$. Specifically, we consider two…

December 3, 2025

On the Approximation of Phylogenetic Distance Functions by Artificial Neural Networks

arXiv:2512.02223v1 Announce Type: new Abstract: Inferring the phylogenetic relationships among a sample of organisms is a fundamental problem in modern biology. While distance-based hierarchical clustering algorithms achieved early success on this task, these have been supplanted by Bayesian and maximum…

December 3, 2025