Archives AI News

Quantized but Deceptive? A Multi-Dimensional Truthfulness Evaluation of Quantized LLMs

arXiv:2508.19432v1 Announce Type: new Abstract: Quantization enables efficient deployment of large language models (LLMs) in resource-constrained environments by significantly reducing memory and computation costs. While quantized LLMs often maintain performance…

AI-Powered Detection of Inappropriate Language in Medical School Curricula

arXiv:2508.19883v1 Announce Type: cross Abstract: The use of inappropriate language (such as outdated, exclusionary, or non-patient-centered terms) in medical instructional materials can significantly influence clinical training, patient interactions, and health…

MathBuddy: A Multimodal System for Affective Math Tutoring

arXiv:2508.19993v1 Announce Type: cross Abstract: The rapid adoption of LLM-based conversational systems is already transforming the landscape of educational technology. However, the current state-of-the-art learning models do not take into account the…

Tracking World States with Language Models: State-Based Evaluation Using Chess

arXiv:2508.19851v1 Announce Type: new Abstract: Large Language Models (LLMs) exhibit emergent capabilities in structured domains, suggesting they may implicitly internalize high-fidelity representations of world models. While probing techniques have shown…

EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents

arXiv:2505.11717v2 Announce Type: replace-cross Abstract: Multi-modal large language model (MLLM)-based web agents interact with webpage environments by generating actions based on screenshots of the webpages. Environmental prompt injection attacks manipulate the…

CASE: An Agentic AI Framework for Enhancing Scam Intelligence in Digital Payments

arXiv:2508.19932v1 Announce Type: new Abstract: The proliferation of digital payment platforms has transformed commerce, offering unmatched convenience and accessibility globally. However, this growth has also attracted malicious actors,…