Archives AI News

Persona Features Control Emergent Misalignment

arXiv:2506.19823v2 Announce Type: replace-cross Abstract: Understanding how language models generalize behaviors from their training to a broader deployment distribution is an important problem in AI safety. Betley et al. discovered that fine-tuning GPT-4o on intentionally insecure code causes “emergent misalignment,”…

Lang-PINN: From Language to Physics-Informed Neural Networks via a Multi-Agent Framework

arXiv:2510.05158v1 Announce Type: new Abstract: Physics-informed neural networks (PINNs) provide a powerful approach for solving partial differential equations (PDEs), but constructing a usable PINN remains labor-intensive and error-prone. Scientists must interpret problems as PDE formulations, design architectures and loss functions,…

Representation Potentials of Foundation Models for Multimodal Alignment: A Survey

arXiv:2510.05184v1 Announce Type: new Abstract: Foundation models learn highly transferable representations through large-scale pretraining on diverse data. An increasing body of research indicates that these representations exhibit a remarkable degree of similarity across architectures and modalities. In this survey, we…

An Algorithmic Information-Theoretic Perspective on the Symbol Grounding Problem

arXiv:2510.05153v1 Announce Type: new Abstract: This paper provides a definitive, unifying framework for the Symbol Grounding Problem (SGP) by reformulating it within Algorithmic Information Theory (AIT). We demonstrate that the grounding of meaning is a process fundamentally constrained by information-theoretic…

Structuring Reasoning for Complex Rules Beyond Flat Representations

arXiv:2510.05134v1 Announce Type: new Abstract: Large language models (LLMs) face significant challenges when processing complex rule systems, as they typically treat interdependent rules as unstructured textual data rather than as logically organized frameworks. This limitation results in reasoning divergence, where…

Optimization Modeling via Semantic Anchored Alignment

arXiv:2510.05115v1 Announce Type: new Abstract: Large language models (LLMs) have opened new paradigms in optimization modeling by enabling the generation of executable solver code from natural language descriptions. Despite this promise, existing approaches typically remain solver-driven: they rely on single-pass…