BiasJailbreak: Analyzing Ethical Biases and Jailbreak Vulnerabilities in Large Language Models
arXiv:2410.13334v5 Announce Type: replace-cross Abstract: Although large language models (LLMs) demonstrate impressive proficiency in various tasks, they present potential safety risks, such as 'jailbreaks', where malicious inputs can coerce LLMs into generating harmful content that bypasses safety alignment. In this paper,…
