Archives AI News

Two ways to knowledge?

arXiv:2509.18131v1 Announce Type: new Abstract: It is shown that the weight matrices of transformer-based machine learning applications to the solution of two representative physical applications show a random-like character which bears no directly recognizable link to the physical and mathematical…

Self-Evolving LLMs via Continual Instruction Tuning

arXiv:2509.18133v1 Announce Type: new Abstract: In real-world industrial settings, large language models (LLMs) must learn continually to keep pace with diverse and evolving tasks, requiring self-evolution to refine knowledge under dynamic data distributions. However, existing continual learning (CL) approaches, such…

A Weighted Gradient Tracking Privacy-Preserving Method for Distributed Optimization

arXiv:2509.18134v1 Announce Type: new Abstract: This paper investigates the privacy-preserving distributed optimization problem, aiming to protect agents’ private information from potential attackers during the optimization process. Gradient tracking, an advanced technique for improving the convergence rate in distributed optimization, has…

LLM-based Vulnerability Discovery through the Lens of Code Metrics

arXiv:2509.19117v1 Announce Type: cross Abstract: Large language models (LLMs) excel in many tasks of software engineering, yet progress in leveraging them for vulnerability discovery has stalled in recent years. To understand this phenomenon, we investigate LLMs through the lens of…

Language Models as Causal Effect Generators

arXiv:2411.08019v2 Announce Type: replace-cross Abstract: In this work, we present sequence-driven structural causal models (SD-SCMs), a framework for specifying causal models with user-defined structure and language-model-defined mechanisms. We characterize how an SD-SCM enables sampling from observational, interventional, and counterfactual distributions…