Pull requests: NVIDIA/Model-Optimizer

Pull requests list

Fix skip softmax defaults
#923 opened Feb 24, 2026 by rohansjoshi
Diffusers 2:4 Sparse Attention
#921 opened Feb 23, 2026 by jingyu-ml Draft
Add 2:4 Sparse Attention
#916 opened Feb 22, 2026 by kaix-nv Draft
fix
#908 opened Feb 19, 2026 by h-guo18 Draft
PTQ and QAD with Qwen Image
#905 opened Feb 18, 2026 by AliesTaha
Support mbridge distillation for any_model
#904 opened Feb 18, 2026 by danielkorzekwa
Enable Qwen3.5-MoE PTQ
#897 opened Feb 16, 2026 by Edwardf0t1 Draft
Add Qwen3VL
#895 opened Feb 16, 2026 by hychiang-git
gpt-oss 20b support
#889 opened Feb 13, 2026 by chochowski
Implicit Gemm NVFP4 on Conv3D
#886 opened Feb 13, 2026 by jingyu-ml
update qwen quant
#880 opened Feb 11, 2026 by zhewenl Draft
SpecDec Bench: February Update
#875 opened Feb 10, 2026 by IzzyPutterman
hardcode: use mrope for qwen3vl
#873 opened Feb 10, 2026 by h-guo18 Draft