Archives AI News

FRIDA: Free-Rider Detection using Privacy Attacks

arXiv:2410.05020v2 Announce Type: replace Abstract: Federated learning is increasingly popular as it enables multiple parties with limited datasets and resources to train a machine learning model collaboratively. However, similar to other collaborative systems, federated learning is vulnerable to free-riders –…

September 22, 2025

Computing Linear Regions in Neural Networks with Skip Connections

arXiv:2509.15441v1 Announce Type: new Abstract: Neural networks are important tools in machine learning. Representing piecewise linear activation functions with tropical arithmetic enables the application of tropical geometry. Algorithms are presented to compute regions where the neural networks are linear maps.…

September 22, 2025

Domain-invariant feature learning in brain MR imaging for content-based image retrieval

arXiv:2501.01326v2 Announce Type: replace Abstract: When conducting large-scale studies that collect brain MR images from multiple facilities, the impact of differences in imaging equipment and protocols at each site cannot be ignored, and this domain gap has become a significant…

September 22, 2025

Hierarchical Self-Attention: Generalizing Neural Attention Mechanics to Multi-Scale Problems

arXiv:2509.15448v1 Announce Type: new Abstract: Transformers and their attention mechanism have been revolutionary in the field of Machine Learning. While originally proposed for the language data, they quickly found their way to the image, video, graph, etc. data modalities with…

September 22, 2025

StFT: Spatio-temporal Fourier Transformer for Long-term Dynamics Prediction

arXiv:2503.11899v2 Announce Type: replace Abstract: Simulating the long-term dynamics of multi-scale and multi-physics systems poses a significant challenge in understanding complex phenomena across science and engineering. The complexity arises from the intricate interactions between scales and the interplay of diverse…

September 22, 2025

IMPQ: Interaction-Aware Layerwise Mixed Precision Quantization for LLMs

arXiv:2509.15455v1 Announce Type: new Abstract: Large Language Models (LLMs) promise impressive capabilities, yet their multi-billion-parameter scale makes on-device or low-resource deployment prohibitive. Mixed-precision quantization offers a compelling solution, but existing methods struggle when the average precision drops below four bits,…

September 22, 2025

ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning

arXiv:2505.04881v2 Announce Type: replace Abstract: Large Reasoning Models (LRMs) perform strongly in complex reasoning tasks via Chain-of-Thought (CoT) prompting, but often suffer from verbose outputs, increasing computational overhead. Existing fine-tuning-based compression methods either operate post-hoc pruning, risking disruption to reasoning…

September 22, 2025

Temporal Reasoning with Large Language Models Augmented by Evolving Knowledge Graphs

arXiv:2509.15464v1 Announce Type: new Abstract: Large language models (LLMs) excel at many language understanding tasks but struggle to reason over knowledge that evolves. To address this, recent work has explored augmenting LLMs with knowledge graphs (KGs) to provide structured, up-to-date…

September 22, 2025

Perception-R1: Advancing Multimodal Reasoning Capabilities of MLLMs via Visual Perception Reward

arXiv:2506.07218v2 Announce Type: replace Abstract: Enhancing the multimodal reasoning capabilities of Multimodal Large Language Models (MLLMs) is a challenging task that has attracted increasing attention in the community. Recently, several studies have applied Reinforcement Learning with Verifiable Rewards (RLVR) to…

September 22, 2025

Solar Forecasting with Causality: A Graph-Transformer Approach to Spatiotemporal Dependencies

arXiv:2509.15481v1 Announce Type: new Abstract: Accurate solar forecasting underpins effective renewable energy management. We present SolarCAST, a causally informed model predicting future global horizontal irradiance (GHI) at a target site using only historical GHI from site X and nearby stations…

September 22, 2025