amazon/GPT-OSS-20B-P-EAGLE
Updated
•
33
Scalable Artificial Intelligence
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers
Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training