Archives AI News

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling

arXiv:2511.00056v1 Announce Type: new Abstract: The substantial memory demands of pre-training and fine-tuning large language models (LLMs) require memory-efficient optimization algorithms. One promising approach is layer-wise optimization, which treats each transformer block as a single layer and optimizes it sequentially,…
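The layer-wise scheme the abstract describes — treating each transformer block as a single unit and optimizing one block at a time, so only the active block's gradients and optimizer state need to live in memory — can be illustrated with a toy numpy sketch. This is a generic illustration of sequential block-wise optimization, not the paper's MISA algorithm; the model, loss, and step size are all made up:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "model": a stack of small linear blocks (stand-ins for transformer blocks).
blocks = [rng.normal(scale=0.1, size=(4, 4)) for _ in range(3)]

def forward(x, blocks):
    for W in blocks:
        x = np.tanh(x @ W)
    return x

X = rng.normal(size=(32, 4))
Y = rng.normal(size=(32, 4))

def loss(blocks):
    return float(np.mean((forward(X, blocks) - Y) ** 2))

def grad_block(blocks, i, eps=1e-5):
    # Finite-difference gradient w.r.t. block i only; all other blocks frozen.
    g = np.zeros_like(blocks[i])
    for idx in np.ndindex(*blocks[i].shape):
        orig = blocks[i][idx]
        blocks[i][idx] = orig + eps
        lp = loss(blocks)
        blocks[i][idx] = orig - eps
        lm = loss(blocks)
        blocks[i][idx] = orig
        g[idx] = (lp - lm) / (2 * eps)
    return g

# Layer-wise optimization: visit blocks sequentially. At any moment only one
# block's gradient (and optimizer state, if there were any) is materialized.
before = loss(blocks)
for i in range(len(blocks)):       # one sequential sweep over the blocks
    for _ in range(20):            # a few plain SGD steps on block i alone
        blocks[i] -= 0.1 * grad_block(blocks, i)
after = loss(blocks)
```

The memory saving comes from the inner loop touching only `blocks[i]`: a full-model optimizer like Adam would keep moment estimates for every parameter simultaneously, while the sequential sweep keeps state for one block at a time.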

Localist LLMs — A Mathematical Framework for Dynamic Locality Control

arXiv:2510.09338v2 Announce Type: replace-cross Abstract: We present a novel framework for training large language models with continuously adjustable internal representations that span the full spectrum from localist (interpretable, rule-based) to distributed (generalizable, efficient) encodings. The key innovation is a locality…

Automatically Finding Rule-Based Neurons in OthelloGPT

arXiv:2511.00059v1 Announce Type: new Abstract: OthelloGPT, a transformer trained to predict valid moves in Othello, provides an ideal testbed for interpretability research. The model is complex enough to exhibit rich computational patterns, yet grounded in rule-based game logic that enables…

Extremal Contours: Gradient-driven contours for compact visual attribution

arXiv:2511.01411v1 Announce Type: cross Abstract: Faithful yet compact explanations for vision models remain a challenge, as commonly used dense perturbation masks are often fragmented and overfitted, needing careful post-processing. Here, we present a training-free explanation method that replaces dense masks…

EVINGCA: Adaptive Graph Clustering with Evolving Neighborhood Statistics

arXiv:2511.00064v1 Announce Type: new Abstract: Clustering algorithms often rely on restrictive assumptions: K-Means and Gaussian Mixtures presuppose convex, Gaussian-like clusters, while DBSCAN and HDBSCAN capture non-convexity but can be highly sensitive. I introduce EVINGCA (Evolving Variance-Informed Nonparametric Graph Construction Algorithm),…

Probabilistic Robustness for Free? Revisiting Training via a Benchmark

arXiv:2511.01724v1 Announce Type: cross Abstract: Deep learning models are notoriously vulnerable to imperceptible perturbations. Most existing research centers on adversarial robustness (AR), which evaluates models under worst-case scenarios by examining the existence of deterministic adversarial examples (AEs). In contrast, probabilistic…
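The distinction the abstract draws can be made concrete: adversarial robustness asks whether *any* perturbation in a ball flips the prediction (worst case), while probabilistic robustness estimates how *likely* a random perturbation is to flip it. A minimal Monte Carlo sketch on a toy linear classifier (illustrative only; not the paper's benchmark, and the classifier, bound, and sampling distribution are assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy linear classifier: sign(w.x + b), a stand-in for a deep model.
w = np.array([1.0, -2.0])
b = 0.5

def predict(x):
    return np.sign(x @ w + b)

def misclassification_prob(x, eps, n_samples=10_000):
    """Monte Carlo estimate of P[prediction flips] under uniform
    perturbations delta with ||delta||_inf <= eps. Adversarial robustness
    would instead ask whether the flip probability is nonzero at all,
    i.e. whether a single adversarial example exists in the ball."""
    y0 = predict(x)
    deltas = rng.uniform(-eps, eps, size=(n_samples, x.size))
    flips = predict(x + deltas) != y0
    return flips.mean()

x = np.array([2.0, 0.0])                      # clean point: w.x + b = 2.5
p_small = misclassification_prob(x, eps=0.1)  # margin >> eps: never flips
p_large = misclassification_prob(x, eps=2.0)  # ball crosses the boundary
```

With a small budget the decision margin cannot be crossed, so the estimated flip probability is exactly zero; with a larger budget some perturbations flip the label, yet a worst-case (adversarial) analysis would call *both* settings robust or non-robust with no notion of how probable failure is.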

Aligning Brain Signals with Multimodal Speech and Vision Embeddings

arXiv:2511.00065v1 Announce Type: new Abstract: When we hear the word “house”, we don’t just process sound; we imagine walls, doors, memories. The brain builds meaning through layers, moving from raw acoustics to rich, multimodal associations. Inspired by this, we build…