Archives AI News

Topological Invariance and Breakdown in Learning

arXiv:2510.02670v1 Announce Type: new Abstract: We prove that for a broad class of permutation-equivariant learning rules (including SGD, Adam, and others), the training process induces a bi-Lipschitz mapping between neurons and strongly constrains the topology of the neuron distribution during…

Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement

arXiv:2509.23799v2 Announce Type: replace Abstract: Steering has emerged as a promising approach in controlling large language models (LLMs) without modifying model parameters. However, most existing steering methods rely on large-scale datasets to learn clear behavioral information, which limits their applicability…

Model Parallelism With Subnetwork Data Parallelism

arXiv:2507.09029v4 Announce Type: replace-cross Abstract: Pre-training large neural networks at scale imposes heavy memory demands on accelerators and often requires costly communication. We introduce Subnetwork Data Parallelism (SDP), a distributed training framework that partitions a model into structured subnetworks trained…

Improving GUI Grounding with Explicit Position-to-Coordinate Mapping

arXiv:2510.03230v1 Announce Type: cross Abstract: GUI grounding, the task of mapping natural-language instructions to pixel coordinates, is crucial for autonomous agents, yet remains difficult for current VLMs. The core bottleneck is reliable patch-to-pixel mapping, which breaks when extrapolating to high-resolution…

GUI-PRA: Process Reward Agent for GUI Tasks

arXiv:2509.23263v2 Announce Type: replace Abstract: Graphical User Interface (GUI) Agents powered by Multimodal Large Language Models (MLLMs) show significant potential for automating tasks. However, they often struggle with long-horizon tasks, leading to frequent failures. Process Reward Models (PRMs) are a…

Agentic Additive Manufacturing Alloy Discovery

arXiv:2510.02567v1 Announce Type: new Abstract: Agentic systems enable the intelligent use of research tooling, augmenting a researcher’s ability to investigate and propose novel solutions to existing problems. Within Additive Manufacturing (AM), alloy discovery remains a complex challenge, often requiring expertise…