Shwai-He

Pinned

  1. CASE-Lab-UMD/LLM-Drop (Public)

    The official implementation of the paper "Uncovering the Redundancy in Transformers via a Unified Study of Layer Dropping" (TMLR).

    Python, 190 stars, 24 forks

  2. CASE-Lab-UMD/Unified-MoE-Compression (Public)

    The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques" (TMLR).

    Python, 91 stars, 6 forks

  3. CASE-Lab-UMD/Router-Tuning-Mixture-of-Depths (Public)

    The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for Enabling Dynamic Depth in Transformers" (EMNLP 2025).

    Python, 31 stars, 3 forks

  4. CASE-Lab-UMD/Capacity-Aware-MoE (Public)

    The official implementation of the paper "Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts" (ICLR 2026).

    Python, 16 stars, 2 forks

  5. CASE-Lab-UMD/Pruning-on-Representations (Public)

    The official implementation of the paper "Demystifying When Pruning Works via Representation Hierarchies".

    Python, 10 stars, 2 forks

  6. SparseUnifiedModel (Public)

    The official implementation of the paper "Understanding and Harnessing Sparsity in Unified Multimodal Models".

    Python, 20 stars, 1 fork