Black-Box On-Policy Distillation of Large Language Models
arXiv:2511.10643v1 Announce Type: cross Abstract: Black-box distillation creates student large language models (LLMs) by learning from a proprietary teacher model’s text outputs alone, without access to its internal logits or parameters. In this work, we introduce Generative Adversarial Distillation (GAD),…
