Skip to content
@mit-han-lab

MIT HAN Lab

Efficient AI Computing. PI: Song Han

Pinned Loading

  1. streaming-llm streaming-llm Public

    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks

    Python 6.4k 355

  2. smoothquant smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python 1.1k 127

  3. llm-awq llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python 2.1k 154

  4. bevfusion bevfusion Public

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    Python 2.2k 383

  5. once-for-all once-for-all Public

    [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment

    Python 1.9k 332

  6. temporal-shift-module temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    Python 2k 415

Repositories

Showing 10 of 51 repositories
  • torchquantum Public

    A PyTorch-based framework for Quantum Classical Simulation, Quantum Machine Learning, Quantum Neural Networks, Parameterized Quantum Circuits with support for easy deployments on real quantum computers.

    mit-han-lab/torchquantum’s past year of commit activity
    Jupyter Notebook 1,244 MIT 186 56 (4 issues need help) 7 Updated Jul 16, 2024
  • llm-awq Public

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    mit-han-lab/llm-awq’s past year of commit activity
    Python 2,136 MIT 154 112 7 Updated Jul 16, 2024
  • hardware-aware-transformers Public

    [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

    mit-han-lab/hardware-aware-transformers’s past year of commit activity
    Python 323 48 3 0 Updated Jul 14, 2024
  • smoothquant Public

    [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    mit-han-lab/smoothquant’s past year of commit activity
    Python 1,108 MIT 127 58 1 Updated Jul 12, 2024
  • spatten Public

    [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

    mit-han-lab/spatten’s past year of commit activity
    Scala 56 MIT 3 1 0 Updated Jul 12, 2024
  • spvnas Public

    [ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

    mit-han-lab/spvnas’s past year of commit activity
    Python 578 MIT 109 2 1 Updated Jul 11, 2024
  • bevfusion Public

    [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

    mit-han-lab/bevfusion’s past year of commit activity
    Python 2,153 Apache-2.0 383 60 4 Updated Jul 11, 2024
  • lite-transformer Public

    [ICLR 2020] Lite Transformer with Long-Short Range Attention

    mit-han-lab/lite-transformer’s past year of commit activity
    Python 598 80 9 0 Updated Jul 11, 2024
  • torchsparse Public

    [MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

    mit-han-lab/torchsparse’s past year of commit activity
    Cuda 1,156 MIT 132 28 1 Updated Jul 11, 2024
  • temporal-shift-module Public

    [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding

    mit-han-lab/temporal-shift-module’s past year of commit activity
    Python 2,034 MIT 415 93 6 Updated Jul 12, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.