Archives AI News

FRIDA: Free-Rider Detection using Privacy Attacks

arXiv:2410.05020v2 Announce Type: replace Abstract: Federated learning is increasingly popular as it enables multiple parties with limited datasets and resources to train a machine learning model collaboratively. However, similar to other collaborative systems, federated learning is vulnerable to free-riders –…

Computing Linear Regions in Neural Networks with Skip Connections

arXiv:2509.15441v1 Announce Type: new Abstract: Neural networks are important tools in machine learning. Representing piecewise linear activation functions with tropical arithmetic enables the application of tropical geometry. Algorithms are presented to compute regions where the neural networks are linear maps.…

StFT: Spatio-temporal Fourier Transformer for Long-term Dynamics Prediction

arXiv:2503.11899v2 Announce Type: replace Abstract: Simulating the long-term dynamics of multi-scale and multi-physics systems poses a significant challenge in understanding complex phenomena across science and engineering. The complexity arises from the intricate interactions between scales and the interplay of diverse…

IMPQ: Interaction-Aware Layerwise Mixed Precision Quantization for LLMs

arXiv:2509.15455v1 Announce Type: new Abstract: Large Language Models (LLMs) promise impressive capabilities, yet their multi-billion-parameter scale makes on-device or low-resource deployment prohibitive. Mixed-precision quantization offers a compelling solution, but existing methods struggle when the average precision drops below four bits,…

ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning

arXiv:2505.04881v2 Announce Type: replace Abstract: Large Reasoning Models (LRMs) perform strongly in complex reasoning tasks via Chain-of-Thought (CoT) prompting, but often suffer from verbose outputs, increasing computational overhead. Existing fine-tuning-based compression methods either operate post-hoc pruning, risking disruption to reasoning…