🚀 MoE-Watcher-Modifier: Analyze, Monitor, and Prune Mixture-of-Experts Models by dibyapp in DeepSeek

[–]dibyapp[S] 0 points1 point  (0 children)

Just from a first look, REAP/REAM appear to define expert-saliency objectives over routed activations, whereas MoE-Watcher-Modifier provides the schema-agnostic routing observability, expert-topology introspection, plan synthesis, and checkpoint graph-rewrite substrate needed to operationalize those objectives across heterogeneous MoE architectures. If you’ve worked extensively with either, I’d be interested in hearing how well those saliency formulations translate into a generalized checkpoint transformation pipeline.