Archives AI News

MathTutorBench: A Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors

arXiv:2502.18940v2 Announce Type: replace-cross Abstract: Evaluating the pedagogical capabilities of AI-based tutoring models is critical for making guided progress in the field. Yet, we lack a reliable, easy-to-use, and simple-to-run evaluation that reflects the pedagogical abilities of models. To fill…

October 14, 2025

Spatial Uncertainty Quantification in Wildfire Forecasting for Climate-Resilient Emergency Planning

arXiv:2510.09666v1 Announce Type: new Abstract: Climate change is intensifying wildfire risks globally, making reliable forecasting critical for adaptation strategies. While machine learning shows promise for wildfire prediction from Earth observation data, current approaches lack uncertainty quantification essential for risk-aware decision…

October 14, 2025

SecureBERT 2.0: Advanced Language Model for Cybersecurity Intelligence

arXiv:2510.00240v2 Announce Type: replace-cross Abstract: Effective analysis of cybersecurity and threat intelligence data demands language models that can interpret specialized terminology, complex document structures, and the interdependence of natural language and source code. Encoder-only transformer architectures provide efficient and robust…

October 14, 2025

A Hybrid Computational Intelligence Framework with Metaheuristic Optimization for Drug-Drug Interaction Prediction

arXiv:2510.09668v1 Announce Type: new Abstract: Drug-drug interactions (DDIs) are a leading cause of preventable adverse events, often complicating treatment and increasing healthcare costs. At the same time, knowing which drugs do not interact is equally important, as such knowledge supports…

October 14, 2025

When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models

arXiv:2510.11302v1 Announce Type: cross Abstract: Object detection systems have traditionally relied on supervised learning with manually annotated bounding boxes, achieving high accuracy at the cost of substantial annotation investment. The emergence of Vision-Language Models (VLMs) offers an alternative paradigm enabling…

October 14, 2025

Population synthesis with geographic coordinates

arXiv:2510.09669v1 Announce Type: new Abstract: It is increasingly important to generate synthetic populations with explicit coordinates rather than coarse geographic areas, yet no established methods exist to achieve this. One reason is that latitude and longitude differ from other continuous…

October 14, 2025

Designing Algorithms Empowered by Language Models: An Analytical Framework, Case Studies, and Insights

arXiv:2407.14788v3 Announce Type: replace Abstract: This work presents an analytical framework for the design and analysis of LLM-based algorithms, i.e., algorithms that contain one or multiple calls of large language models (LLMs) as sub-routines and critically rely on the capabilities…

October 14, 2025

A physics-aware deep learning model for shear band formation around collapsing pores in shocked reactive materials

arXiv:2510.09670v1 Announce Type: new Abstract: Modeling shock-to-detonation phenomena in energetic materials (EMs) requires capturing complex physical processes such as strong shocks, rapid changes in microstructural morphology, and nonlinear dynamics of chemical reaction fronts. These processes participate in energy localization at…

October 14, 2025

Why Ask One When You Can Ask $k$? Learning-to-Defer to the Top-$k$ Experts

arXiv:2504.12988v4 Announce Type: replace Abstract: Existing Learning-to-Defer (L2D) frameworks are limited to single-expert deferral, forcing each query to rely on only one expert and preventing the use of collective expertise. We introduce the first framework for Top-$k$ Learning-to-Defer, which allocates…

October 14, 2025

Coupled Data and Measurement Space Dynamics for Enhanced Diffusion Posterior Sampling

arXiv:2510.09676v1 Announce Type: new Abstract: Inverse problems, where the goal is to recover an unknown signal from noisy or incomplete measurements, are central to applications in medical imaging, remote sensing, and computational biology. Diffusion models have recently emerged as powerful…

October 14, 2025