Archives AI News

Scaling Patterns in Adversarial Alignment: Evidence from Multi-LLM Jailbreak Experiments

arXiv:2511.13788v1 Announce Type: new Abstract: Large language models (LLMs) increasingly operate in multi-agent and safety-critical settings, raising open questions about how their vulnerabilities scale when models interact adversarially. This study examines whether larger models can systematically jailbreak smaller ones –…

November 19, 2025

DeepBlip: Estimating Conditional Average Treatment Effects Over Time

arXiv:2511.14545v1 Announce Type: cross Abstract: Structural nested mean models (SNMMs) are a principled approach to estimating treatment effects over time. A particular strength of SNMMs is to break the joint effect of treatment sequences over time into localized, time-specific…

November 19, 2025

ScoresActivation: A New Activation Function for Model Agnostic Global Explainability by Design

arXiv:2511.13809v1 Announce Type: new Abstract: Understanding the decisions of large deep learning models is a critical challenge for building transparent and trustworthy systems. Although current post hoc explanation methods offer valuable insights into feature importance, they are inherently disconnected…

November 19, 2025

Derivative of the truncated singular value and eigen decomposition

arXiv:2511.14651v1 Announce Type: cross Abstract: Recently developed applications in the field of machine learning and computational physics rely on automatic differentiation techniques that require stable and efficient linear algebra gradient computations. This technical note provides a comprehensive and detailed discussion…

November 19, 2025

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

arXiv:2511.13841v1 Announce Type: new Abstract: Reinforcement learning (RL) post-training has become essential for aligning large language models (LLMs), yet its efficiency is increasingly constrained by the rollout phase, where long trajectories are generated token by token. We identify a major bottleneck: the…

November 19, 2025

High Dimensional Distributed Gradient Descent with Arbitrary Number of Byzantine Attackers

arXiv:2307.13352v3 Announce Type: replace Abstract: Adversarial attacks pose a major challenge to distributed learning systems, prompting the development of numerous robust learning methods. However, most existing approaches suffer from the curse of dimensionality, i.e., the error increases with the number…

November 19, 2025

AnaCP: Toward Upper-Bound Continual Learning via Analytic Contrastive Projection

arXiv:2511.13880v1 Announce Type: new Abstract: This paper studies the problem of class-incremental learning (CIL), a core setting within continual learning where a model learns a sequence of tasks, each containing a distinct set of classes. Traditional CIL methods, which do…

November 19, 2025

Predicting the Performance of Black-box LLMs through Self-Queries

arXiv:2501.01558v3 Announce Type: replace Abstract: As large language models (LLMs) are increasingly relied on in AI systems, predicting when they make mistakes is crucial. While a great deal of work in the field uses internal representations to interpret model behavior,…

November 19, 2025

Tractable Probabilistic Models for Investment Planning

arXiv:2511.13888v1 Announce Type: new Abstract: Investment planning in power utilities, such as generation and transmission expansion, requires decade-long forecasts under profound uncertainty. Forecasting of energy mix and energy use decades ahead is nontrivial. Classical approaches focus on generating a finite…

November 19, 2025

A Survey of Cross-domain Graph Learning: Progress and Future Directions

arXiv:2503.11086v2 Announce Type: replace Abstract: Graph learning plays a vital role in mining and analyzing complex relationships within graph data and has been widely applied to real-world scenarios such as social, citation, and e-commerce networks. Foundation models in computer vision…

November 19, 2025