Archives AI News

LOTION: Smoothing the Optimization Landscape for Quantized Training

arXiv:2510.08757v1 Announce Type: new Abstract: Optimizing neural networks for quantized objectives is fundamentally challenging because the quantizer is piece-wise constant, yielding zero gradients everywhere except at quantization thresholds where the derivative is undefined. Most existing methods deal with this issue…

October 13, 2025

Robustness in Both Domains: CLIP Needs a Robust Text Encoder

arXiv:2506.03355v2 Announce Type: replace Abstract: Adversarial input attacks can cause a significant shift of CLIP embeddings. This can affect the downstream robustness of models incorporating CLIP in the pipeline, such as text-to-image generative models or large vision language models. While…

October 13, 2025

Fair Graph Machine Learning under Adversarial Missingness Processes

arXiv:2311.01591v4 Announce Type: replace Abstract: Graph Neural Networks (GNNs) have achieved state-of-the-art results in many relevant tasks where decisions might disproportionately impact specific communities. However, existing work on fair GNNs often assumes that either sensitive attributes are fully observed or…

October 13, 2025

Detecting and Filtering Unsafe Training Data via Data Attribution with Denoised Representation

arXiv:2502.11411v2 Announce Type: replace Abstract: Large language models (LLMs) are highly sensitive to even small amounts of unsafe training data, making effective detection and filtering essential for trustworthy model development. Current state-of-the-art (SOTA) detection approaches primarily rely on moderation classifiers,…

October 13, 2025

Investigating the Impact of Rational Dilated Wavelet Transform on Motor Imagery EEG Decoding with Deep Learning Models

arXiv:2510.09242v1 Announce Type: cross Abstract: The present study investigates the impact of the Rational Discrete Wavelet Transform (RDWT), used as a plug-in preprocessing step for motor imagery electroencephalographic (EEG) decoding prior to applying deep learning classifiers. A systematic paired evaluation…

October 13, 2025

Active Model Selection for Large Language Models

arXiv:2510.09418v1 Announce Type: cross Abstract: We introduce LLM SELECTOR, the first framework for active model selection of Large Language Models (LLMs). Unlike prior evaluation and benchmarking approaches that rely on fully annotated datasets, LLM SELECTOR efficiently identifies the best LLM…

October 13, 2025

SWE-Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering

arXiv:2502.01860v5 Announce Type: replace-cross Abstract: Foundation models (FMs), particularly large language models (LLMs), have shown significant promise in various software engineering (SE) tasks, including code generation, debugging, and requirement refinement. Despite these advances, existing evaluation frameworks are insufficient for assessing…

October 13, 2025

DPCformer: An Interpretable Deep Learning Model for Genomic Prediction in Crops

arXiv:2510.08662v1 Announce Type: new Abstract: Genomic Selection (GS) uses whole-genome information to predict crop phenotypes and accelerate breeding. Traditional GS methods, however, struggle with prediction accuracy for complex traits and large datasets. We propose DPCformer, a deep learning model integrating…

October 13, 2025

FreqCa: Accelerating Diffusion Models via Frequency-Aware Caching

arXiv:2510.08669v1 Announce Type: new Abstract: The application of diffusion transformers is suffering from their significant inference costs. Recently, feature caching has been proposed to solve this problem by reusing features from previous timesteps, thereby skipping computation in future timesteps. However,…

October 13, 2025

How Scale Breaks “Normalized Stress” and KL Divergence: Rethinking Quality Metrics

arXiv:2510.08660v1 Announce Type: new Abstract: Complex, high-dimensional data is ubiquitous across many scientific disciplines, including machine learning, biology, and the social sciences. One of the primary methods of visualizing these datasets is with two-dimensional scatter plots that visually capture some…

October 13, 2025