Archives AI News

OWL: Probing Cross-Lingual Recall of Memorized Texts via World Literature

arXiv:2505.22945v2. Announce Type: replace-cross. Abstract: Large language models (LLMs) are known to memorize and recall English text from their pretraining data. However, the extent to which this ability generalizes to non-English languages or transfers across languages remains unclear. This paper…

RooseBERT: A New Deal For Political Language Modelling

arXiv:2508.03250v2. Announce Type: replace-cross. Abstract: The increasing number of political debates and politics-related discussions calls for the definition of novel computational methods to automatically analyse such content with the final goal of lightening up political deliberation to citizens. However, the…

Do Code Models Suffer from the Dunning-Kruger Effect?

arXiv:2510.05457v1. Announce Type: new. Abstract: As artificial intelligence systems increasingly collaborate with humans in creative and technical domains, questions arise about the cognitive boundaries and biases that shape our shared agency. This paper investigates the Dunning-Kruger Effect (DKE), the tendency…

VAL-Bench: Measuring Value Alignment in Language Models

arXiv:2510.05465v1. Announce Type: new. Abstract: Large language models (LLMs) are increasingly used for tasks where outputs shape human decisions, so it is critical to test whether their responses reflect consistent human values. Existing benchmarks mostly track refusals or predefined safety…

Vul-R2: A Reasoning LLM for Automated Vulnerability Repair

arXiv:2510.05480v1. Announce Type: new. Abstract: The exponential increase in software vulnerabilities has created an urgent need for automatic vulnerability repair (AVR) solutions. Recent research has formulated AVR as a sequence generation problem and has leveraged large language models (LLMs) to…