Portable vLLM Model Inference Kernels in HelionJune 10, 2026 2026-06-10 08:00 GMT · 3 days ago aimagpro.com vendor source link TL;DR Helion kernels were integrated into vLLM for FP8 inference using Qwen3 models and evaluated across NVIDIA H100 and B200 GPUs. The experiments show that Helion provides a productive PyTorch-native…