Archives AI News

Teaching Metric Distance to Discrete Autoregressive Language Models

arXiv:2503.02379v4 Announce Type: replace Abstract: As large language models expand beyond natural language to domains such as mathematics, multimodal understanding, and embodied agents, tokens increasingly reflect metric relationships rather than purely linguistic meaning. We introduce DIST2Loss, a distance-aware framework designed…

A Generative Approach to LLM Harmfulness Mitigation with Red Flag Tokens

arXiv:2502.16366v4 Announce Type: replace-cross Abstract: Many safety post-training methods for large language models (LLMs) are designed to modify the model’s behaviour from producing unsafe answers to issuing refusals. However, such distribution shifts are often brittle and degrade performance on desirable…

Adjusting the Output of Decision Transformer with Action Gradient

arXiv:2510.05285v1 Announce Type: new Abstract: Decision Transformer (DT), which integrates reinforcement learning (RL) with the transformer model, introduces a novel approach to offline RL. Unlike classical algorithms that take maximizing cumulative discounted rewards as objective, DT instead maximizes the likelihood…

Computing frustration and near-monotonicity in deep neural networks

arXiv:2510.05286v1 Announce Type: new Abstract: For the signed graph associated to a deep neural network, one can compute the frustration level, i.e., test how close or distant the graph is to structural balance. For all the pretrained deep convolutional neural…