Archives AI News

Confidence Improves Self-Consistency in LLMs

arXiv:2502.06233v2 Announce Type: replace-cross Abstract: Self-consistency decoding enhances LLMs’ performance on reasoning tasks by sampling diverse reasoning paths and selecting the most frequent answer. However, it is computationally expensive, as sampling many of these (lengthy) paths is required to increase…

Multiplayer Nash Preference Optimization

arXiv:2509.23102v1 Announce Type: new Abstract: Reinforcement learning from human feedback (RLHF) has emerged as the standard paradigm for aligning large language models (LLMs) with human preferences. However, reward-based methods built on the Bradley-Terry assumption struggle to capture the non-transitive and…

InfoDet: A Dataset for Infographic Element Detection

arXiv:2505.17473v4 Announce Type: replace-cross Abstract: Given the central role of charts in scientific, business, and communication contexts, enhancing the chart understanding capabilities of vision-language models (VLMs) has become increasingly critical. A key limitation of existing VLMs lies in their inaccurate…

AttAnchor: Guiding Cross-Modal Token Alignment in VLMs with Attention Anchors

arXiv:2509.23109v1 Announce Type: new Abstract: A fundamental reason for the dominance of attention over RNNs and LSTMs in LLMs is its ability to capture long-range dependencies by modeling direct interactions between all tokens, overcoming the sequential limitations of recurrent architectures.…

Exploring LLM-based Frameworks for Fault Diagnosis

arXiv:2509.23113v1 Announce Type: new Abstract: Large Language Model (LLM)-based systems present new opportunities for autonomous health monitoring in sensor-rich industrial environments. This study explores the potential of LLMs to detect and classify faults directly from sensor data, while producing inherently…