Archives AI News

Efficient Content-based Recommendation Model Training via Noise-aware Coreset Selection

arXiv:2601.10067v1 Announce Type: new Abstract: Content-based recommendation systems (CRSs) utilize content features to predict user-item interactions, serving as essential tools for helping users navigate information-rich web services. However, ensuring the effectiveness of CRSs requires large-scale and even continuous model training…

Permissive Information-Flow Analysis for Large Language Models

arXiv:2410.03055v3 Announce Type: replace Abstract: Large Language Models (LLMs) are rapidly becoming commodity components of larger software systems. This poses natural security and privacy problems: poisoned data retrieved from one component can change the model’s behavior and compromise the entire…

Disco-RAG: Discourse-Aware Retrieval-Augmented Generation

arXiv:2601.04377v3 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) has emerged as an important means of enhancing the performance of large language models (LLMs) in knowledge-intensive tasks. However, most existing RAG strategies treat retrieved passages in a flat and unstructured way,…

Fairness Definitions in Language Models Explained

arXiv:2407.18454v3 Announce Type: replace-cross Abstract: Language Models (LMs) have demonstrated exceptional performance across various Natural Language Processing (NLP) tasks. Despite these advancements, LMs can inherit and amplify societal biases related to sensitive attributes such as gender and race, limiting their…