Archives AI News

STAF: Leveraging LLMs for Automated Attack Tree-Based Security Test Generation

arXiv:2509.20190v1 Announce Type: cross Abstract: In modern automotive development, security testing is critical for safeguarding systems against increasingly advanced threats. Attack trees are widely used to systematically represent potential attack vectors, but generating comprehensive test cases from these trees remains…

DRES: Benchmarking LLMs for Disfluency Removal

arXiv:2509.20321v1 Announce Type: cross Abstract: Disfluencies — such as “um,” “uh,” interjections, parentheticals, and edited statements — remain a persistent challenge for speech-driven systems, degrading accuracy in command interpretation, summarization, and conversational agents. We introduce DRES (Disfluency Removal Evaluation Suite),…

CON-QA: Privacy-Preserving QA using cloud LLMs in Contract Domain

arXiv:2509.19925v1 Announce Type: new Abstract: As enterprises increasingly integrate cloud-based large language models (LLMs) such as ChatGPT and Gemini into their legal document workflows, protecting sensitive contractual information – including Personally Identifiable Information (PII) and commercially sensitive clauses – has…

Embodied AI: From LLMs to World Models

arXiv:2509.20021v1 Announce Type: new Abstract: Embodied Artificial Intelligence (AI) is an intelligent system paradigm for achieving Artificial General Intelligence (AGI), serving as the cornerstone for various applications and driving the evolution from cyberspace to physical systems. Recent breakthroughs in Large…

MACD: Multi-Agent Clinical Diagnosis with Self-Learned Knowledge for LLM

arXiv:2509.20067v1 Announce Type: new Abstract: Large language models (LLMs) have demonstrated notable potential in medical applications, yet they face substantial challenges in handling complex real-world clinical diagnoses using conventional prompting methods. Current prompt engineering and multi-agent approaches typically optimize isolated…