Archives AI News

OceanGym: A Benchmark Environment for Underwater Embodied Agents

arXiv:2509.26536v2 Announce Type: replace-cross Abstract: We introduce OceanGym, the first comprehensive benchmark for ocean underwater embodied agents, designed to advance AI in one of the most demanding real-world environments. Unlike terrestrial or aerial domains, underwater settings present extreme perceptual and…

TouchFormer: A Robust Transformer-based Framework for Multimodal Material Perception

arXiv:2511.19509v1 Announce Type: new Abstract: Traditional vision-based material perception methods often experience substantial performance degradation under visually impaired conditions, thereby motivating the shift toward non-visual multimodal material perception. Despite this, existing approaches frequently perform naive fusion of multimodal inputs, overlooking…

Learning Degenerate Manifolds of Frustrated Magnets with Boltzmann Machines

arXiv:2511.19879v1 Announce Type: cross Abstract: We show that Restricted Boltzmann Machines (RBMs) provide a flexible generative framework for modeling spin configurations in disordered yet strongly correlated phases of frustrated magnets. As a benchmark, we first demonstrate that an RBM can…

CostNav: A Navigation Benchmark for Cost-Aware Evaluation of Embodied Agents

arXiv:2511.20216v1 Announce Type: cross Abstract: Existing navigation benchmarks focus on task success metrics while overlooking economic viability — critical for commercial deployment of autonomous delivery robots. We introduce emph{CostNav}, a textbf{Micro-Navigation Economic Testbed} that evaluates embodied agents through comprehensive cost-revenue…

Automating Deception: Scalable Multi-Turn LLM Jailbreaks

arXiv:2511.19517v1 Announce Type: new Abstract: Multi-turn conversational attacks, which leverage psychological principles like Foot-in-the-Door (FITD), where a small initial request paves the way for a more significant one, to bypass safety alignments, pose a persistent threat to Large Language Models…

Learning to Solve Weighted Maximum Satisfiability with a Co-Training Architecture

arXiv:2511.19544v1 Announce Type: new Abstract: Wepropose SplitGNN, a graph neural network (GNN)-based approach that learns to solve weighted maximum satisfiabil ity (MaxSAT) problem. SplitGNN incorporates a co-training architecture consisting of supervised message passing mech anism and unsupervised solution boosting layer.…