Archives AI News

Geometry-Guided Adversarial Prompt Detection via Curvature and Local Intrinsic Dimension

arXiv:2503.03502v2 Announce Type: replace-cross Abstract: Adversarial prompts are capable of jailbreaking frontier large language models (LLMs) and inducing undesirable behaviours, posing a significant obstacle to their safe deployment. Current mitigation strategies primarily rely on activating built-in defence mechanisms or fine-tuning…

Can We Ignore Labels In Out of Distribution Detection?

arXiv:2504.14704v2 Announce Type: replace-cross Abstract: Out-of-distribution (OOD) detection methods have recently become more prominent, serving as a core element in safety-critical autonomous systems. One major purpose of OOD detection is to reject invalid inputs that could lead to unpredictable errors…

Graph-based LLM over Semi-Structured Population Data for Dynamic Policy Response

arXiv:2510.05196v1 Announce Type: new Abstract: Timely and accurate analysis of population-level data is crucial for effective decision-making during public health emergencies such as the COVID-19 pandemic. However, the massive input of semi-structured data, including structured demographic information and unstructured human…

Efficient Prediction of Pass@k Scaling in Large Language Models

arXiv:2510.05197v1 Announce Type: new Abstract: Assessing the capabilities and risks of frontier AI systems is a critical area of research, and recent work has shown that repeated sampling from models can dramatically increase both. For instance, repeated sampling has been…

Kaputt: A Large-Scale Dataset for Visual Defect Detection

arXiv:2510.05903v1 Announce Type: cross Abstract: We present a novel large-scale dataset for defect detection in a logistics setting. Recent work on industrial anomaly detection has primarily focused on manufacturing scenarios with highly controlled poses and a limited number of object…