Archives AI News

SNaRe: Domain-aware Data Generation for Low-Resource Event Detection

arXiv:2502.17394v3 Announce Type: replace-cross Abstract: Event Detection (ED) — the task of identifying event mentions from natural language text — is critical for enabling reasoning in highly specialized domains such as biomedicine, law, and epidemiology. Data generation has proven to…

Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment

arXiv:2509.15172v1 Announce Type: new Abstract: Language Models (LMs) are inconsistent reasoners, often generating contradictory responses to identical prompts. While inference-time methods can mitigate these inconsistencies, they fail to address the core problem: LMs struggle to reliably select reasoning pathways leading…

EnCoBo: Energy-Guided Concept Bottlenecks for Interpretable Generation

arXiv:2507.08334v2 Announce Type: replace-cross Abstract: Concept Bottleneck Models (CBMs) provide interpretable decision-making through explicit, human-understandable concepts. However, existing generative CBMs often rely on auxiliary visual cues at the bottleneck, which undermines interpretability and intervention capabilities. We propose EnCoBo, a post-hoc…

Generalizable Geometric Image Caption Synthesis

arXiv:2509.15217v1 Announce Type: new Abstract: Multimodal large language models have various practical applications that demand strong reasoning abilities. Despite recent advancements, these models still struggle to solve complex geometric problems. A key challenge stems from the lack of high-quality image-text…

LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures

arXiv:2509.14252v1 Announce Type: cross Abstract: Large Language Model (LLM) pretraining, finetuning, and evaluation rely on input-space reconstruction and generative capabilities. Yet, it has been observed in vision that embedding-space training objectives, e.g., with Joint Embedding Predictive Architectures (JEPAs), are far…