Archives AI News

Linear Dimensionality Reduction for Word Embeddings in Tabular Data Classification

arXiv:2509.12346v1 Announce Type: new Abstract: The Engineers’ Salary Prediction Challenge requires classifying salary categories into three classes based on tabular data. The job description is represented as a 300-dimensional word embedding incorporated into the tabular features, drastically increasing dimensionality. Additionally,…

MillStone: How Open-Minded Are LLMs?

arXiv:2509.11967v2 Announce Type: replace Abstract: Large language models equipped with Web search, information retrieval tools, and other agentic capabilities are beginning to supplant traditional search engines. As users start to rely on LLMs for information on many topics, including controversial…

SIFThinker: Spatially-Aware Image Focus for Visual Reasoning

arXiv:2508.06259v4 Announce Type: replace-cross Abstract: Current multimodal large language models (MLLMs) still face significant challenges in complex visual tasks (e.g., spatial understanding, fine-grained perception). Prior methods have tried to incorporate visual reasoning; however, they fail to leverage attention correction with…

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

arXiv:2502.12067v3 Announce Type: replace-cross Abstract: Chain-of-Thought (CoT) has been proven effective in enhancing the reasoning capabilities of large language models (LLMs). Recent advancements, such as OpenAI’s o1 and DeepSeek-R1, suggest that scaling up the length of CoT sequences during inference…