Archives AI News

HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents

arXiv:2603.00977v2 Announce Type: replace-cross Abstract: Large language model (LLM) agents have recently demonstrated strong capabilities in interactive decision-making, yet they remain fundamentally limited in long-horizon tasks that require structured planning and reliable execution. Existing approaches predominantly rely on flat autoregressive…

On the Invariants of Softmax Attention

arXiv:2605.02907v1 Announce Type: new Abstract: Softmax attention maps every query–key interaction into a probability distribution, but the underlying structure remains largely unexplored. We define the emph{energy field}, the row-centered attention logit, and show that it exhibits invariant properties across models,…

An End-to-End Framework for Building Large Language Models for Software Operations

arXiv:2605.02906v1 Announce Type: new Abstract: In the field of software operations, Large Language Models (LLMs) have attracted increasing attention. However, existing research has not yet achieved efficient and effective end-to-end intelligent operations due to low-quality data, fragmented knowledge and insufficient…

psifx — Psychological and Social Interactions Feature Extraction Package

arXiv:2407.10266v5 Announce Type: replace-cross Abstract: psifx is a plug-and-play multi-modal feature extraction toolkit, aiming to facilitate and democratize the use of state-of-the-art machine learning techniques for human sciences research. It is motivated by a need (a) to automate and standardize…