Archives AI News

A Qualitative Comparative Evaluation of Cognitive and Generative Theories

arXiv:2510.03453v1 Announce Type: new Abstract: Evaluation is a critical activity associated with any theory. Yet this has proven to be an exceptionally challenging activity for theories based on cognitive architectures. For an overlapping set of reasons, evaluation can also be…

ContraGen: A Multi-Agent Generation Framework for Enterprise Contradictions Detection

arXiv:2510.03418v1 Announce Type: new Abstract: Retrieval-Augmented Generation (RAG) integrates LLMs with external sources, offering advanced capabilities for information access and decision-making. However, contradictions in retrieved evidence can result in inconsistent or untrustworthy outputs, which is especially problematic in enterprise settings…

Know Thyself? On the Incapability and Implications of AI Self-Recognition

arXiv:2510.03399v1 Announce Type: new Abstract: Self-recognition is a crucial metacognitive capability for AI systems, relevant not only for psychological analysis but also for safety, particularly in evaluative scenarios. Motivated by contradictory interpretations of whether models possess self-recognition (Panickssery et al.,…

Cross-Modal Content Optimization for Steering Web Agent Preferences

arXiv:2510.03612v1 Announce Type: new Abstract: Vision-language model (VLM)-based web agents increasingly power high-stakes selection tasks like content recommendation or product ranking by combining multimodal perception with preference reasoning. Recent studies reveal that these agents are vulnerable to attackers who can…

SurGE: A Benchmark and Evaluation Framework for Scientific Survey Generation

arXiv:2508.15658v2 Announce Type: replace-cross Abstract: The rapid growth of academic literature makes the manual creation of scientific surveys increasingly infeasible. While large language models show promise for automating this process, progress in this area is hindered by the absence of…