Archives AI News

LLMZ+: Contextual Prompt Whitelist Principles for Agentic LLMs

arXiv:2509.18557v1 Announce Type: new Abstract: Compared to traditional models, agentic AI represents a highly valuable target for potential attackers as they possess privileged access to data sources and API tools, which are traditionally not incorporated into classical agents. Unlike a…

Algorithms for Adversarially Robust Deep Learning

arXiv:2509.19100v1 Announce Type: cross Abstract: Given the widespread use of deep learning models in safety-critical applications, ensuring that the decisions of such models are robust against adversarial exploitation is of fundamental importance. In this thesis, we discuss recent progress toward…

Solving Math Word Problems Using Estimation Verification and Equation Generation

arXiv:2509.18565v1 Announce Type: new Abstract: Large Language Models (LLMs) excel at various tasks, including problem-solving and question-answering. However, LLMs often find Math Word Problems (MWPs) challenging because solving them requires a range of reasoning and mathematical abilities with which LLMs…

Generative Propaganda

arXiv:2509.19147v1 Announce Type: cross Abstract: Generative propaganda is the use of generative artificial intelligence (AI) to shape public opinion. To characterize its use in real-world settings, we conducted interviews with defenders (e.g., factcheckers, journalists, officials) in Taiwan and creators (e.g.,…

MsFIN: Multi-scale Feature Interaction Network for Traffic Accident Anticipation

arXiv:2509.19227v1 Announce Type: cross Abstract: With the widespread deployment of dashcams and advancements in computer vision, developing accident prediction models from the dashcam perspective has become critical for proactive safety interventions. However, two key challenges persist: modeling feature-level interactions among…

TERAG: Token-Efficient Graph-Based Retrieval-Augmented Generation

arXiv:2509.18667v1 Announce Type: new Abstract: Graph-based Retrieval-augmented generation (RAG) has become a widely studied approach for improving the reasoning, accuracy, and factuality of Large Language Models. However, many existing graph-based RAG systems overlook the high cost associated with LLM token…

Measuring Sample Quality with Copula Discrepancies

arXiv:2507.21434v2 Announce Type: replace Abstract: The scalable Markov chain Monte Carlo (MCMC) algorithms that underpin modern Bayesian machine learning, such as Stochastic Gradient Langevin Dynamics (SGLD), sacrifice asymptotic exactness for computational speed, creating a critical diagnostic gap: traditional sample quality…

Packed-Ensembles for Efficient Uncertainty Estimation

arXiv:2210.09184v4 Announce Type: replace-cross Abstract: Deep Ensembles (DE) are a prominent approach for achieving excellent performance on key metrics such as accuracy, calibration, uncertainty estimation, and out-of-distribution detection. However, hardware limitations of real-world systems constrain to smaller ensembles and lower-capacity…

A Neural Difference-of-Entropies Estimator for Mutual Information

arXiv:2502.13085v2 Announce Type: replace Abstract: Estimating Mutual Information (MI), a key measure of dependence of random quantities without specific modelling assumptions, is a challenging problem in high dimensions. We propose a novel mutual information estimator based on parametrizing conditional densities…