-
Notifications
You must be signed in to change notification settings - Fork 450
Pull requests: NVIDIA/Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(launcher): add Megatron-Bridge quantize/generate/export wrappers
#1767
opened Jun 17, 2026 by
yueshen2016
Contributor
Loading…
launcher: package as modelopt_launcher; mcp: call console script directly
#1766
opened Jun 17, 2026 by
ChenhanYu
Collaborator
Loading…
5 tasks
fix(puzzletron): correct val_dataset_name from 'valid' to 'validation'
#1765
opened Jun 17, 2026 by
TheSabari07
Contributor
Loading…
launcher: add Qwen3-8B/specdec_bench_dflash_vllm.yaml parent (OMNIML-5057)
#1764
opened Jun 17, 2026 by
ChenhanYu
Collaborator
Loading…
2 of 3 tasks
feat(recipes): add nvfp4_mlp_only-kv_fp8-novit (exclude VL vision tower)
#1760
opened Jun 17, 2026 by
Edwardf0t1
Contributor
Loading…
refactor(examples): rename llm_ptq → hf_ptq (symlink for back-compat)
#1759
opened Jun 17, 2026 by
Edwardf0t1
Contributor
Loading…
Add p quantization to our triton fa kernel
#1757
opened Jun 16, 2026 by
sychen52
Contributor
Loading…
[OMNIML-5003] Support non-gated fused MoE experts (NemotronH) in HF PTQ
#1756
opened Jun 16, 2026 by
jenchen13
Contributor
Loading…
DFlash for MiniMax-M3 (WIP): synthesis thinking-mode mix
#1749
opened Jun 16, 2026 by
yeyu-nvidia
Contributor
•
Draft
[minor] Pre-build CUDA quantization extensions before unit tests run
#1748
opened Jun 16, 2026 by
shengliangxu
Collaborator
Loading…
Add Minitron pruning support for GatedDeltaNet, MLA, and latent MoE
#1747
opened Jun 16, 2026 by
kevalmorabia97
Collaborator
Loading…
fix(autocast): add missing guard for use_standalone_type_inference
#1743
opened Jun 15, 2026 by
agkphysics
Loading…
fix(megatron): handle MambaMixer conv1d refactor in importer/exporter
#1730
opened Jun 15, 2026 by
AAnoosheh
Contributor
Loading…
2 tasks done
[DRAFT] Add heterogeneous AnyModel distillation example for Puzzletron.
#1725
opened Jun 15, 2026 by
chochowski
Contributor
Loading…
Fix autotune warm restart to retry failed schemes
#1722
opened Jun 15, 2026 by
willg-nv
Contributor
Loading…
[OMNIML-5072] Triton fakequant adapter for N-quantizer per-expert path
#1717
opened Jun 14, 2026 by
hychiang-git
Contributor
•
Draft
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-14.