Archives AI News

Decade-long Emission Forecasting with an Ensemble Model in Taiwan

arXiv:2510.05548v1 Announce Type: new Abstract: Taiwan’s high population and heavy dependence on fossil fuels have led to severe air pollution, with the most prevalent greenhouse gas being carbon dioxide (CO2). There-fore, this study presents a reproducible and comprehensive case study…

Cross-Embodiment Dexterous Hand Articulation Generation via Morphology-Aware Learning

arXiv:2510.06068v1 Announce Type: cross Abstract: Dexterous grasping with multi-fingered hands remains challenging due to high-dimensional articulations and the cost of optimization-based pipelines. Existing end-to-end methods require training on large-scale datasets for specific hands, limiting their ability to generalize across different…

MetaVLA: Unified Meta Co-training For Efficient Embodied Adaption

arXiv:2510.05580v1 Announce Type: new Abstract: Vision-Language-Action (VLA) models show promise in embodied reasoning, yet remain far from true generalists-they often require task-specific fine-tuning, and generalize poorly to unseen tasks. We propose MetaVLA, a unified, backbone-agnostic post-training framework for efficient and…

Multi-Task Reinforcement Learning with Language-Encoded Gated Policy Networks

arXiv:2510.06138v1 Announce Type: cross Abstract: Multi-task reinforcement learning often relies on task metadata — such as brief natural-language descriptions — to guide behavior across diverse objectives. We present Lexical Policy Networks (LEXPOL), a language-conditioned mixture-of-policies architecture for multi-task RL. LEXPOL…

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

arXiv:2510.05592v1 Announce Type: new Abstract: Outcome-driven reinforcement learning has advanced reasoning in large language models (LLMs), but prevailing tool-augmented approaches train a single, monolithic policy that interleaves thoughts and tool calls under full context; this scales poorly with long horizons…

Oblivious Stochastic Composite Optimization

arXiv:2306.17470v2 Announce Type: replace-cross Abstract: In stochastic convex optimization problems, most existing adaptive methods rely on prior knowledge about the diameter bound $D$ when the smoothness or the Lipschitz constant is unknown. This often significantly affects performance as only a…

Learning to Price Bundles: A GCN Approach for Mixed Bundling

arXiv:2509.22557v2 Announce Type: replace Abstract: Bundle pricing refers to designing several product combinations (i.e., bundles) and determining their prices in order to maximize the expected profit. It is a classic problem in revenue management and arises in many industries, such…

Modulation Discovery with Differentiable Digital Signal Processing

arXiv:2510.06204v1 Announce Type: cross Abstract: Modulations are a critical part of sound design and music production, enabling the creation of complex and evolving audio. Modern synthesizers provide envelopes, low frequency oscillators (LFOs), and more parameter automation tools that allow users…