[feature]: support FP8 communication in pipeline parallelism #5885

BurkeHulk · 2024-07-04T12:46:23Z

📌 Checklist before creating the PR

I have created an issue for this PR for traceability
The title follows the standard format: [doc/gemini/tensor/...]: A concise description
I have added relevant tags if possible for us to better distinguish different PRs
I have installed pre-commit: pip install pre-commit && pre-commit install

🚨 Issue number

📝 What does this PR do?

Implement per-channel scaling (in PyTorch) for FP8 quantization.
Support PyTorch native FP8 formats.
Refer to:
https://pytorch.org/docs/stable/tensors.html#id7
https://arxiv.org/pdf/2209.05433

BurkeHulk added 2 commits July 1, 2024 13:44

fp8 operators for compressed communication

f5a52e1

cast_to_fp8, cast_from_fp8, all_reduce_fp8

Merge branch 'hpcaitech:main' into feature/fp8_comm

6991819

BurkeHulk requested a review from a team as a code owner July 4, 2024 12:46

[pre-commit.ci] auto fixes from pre-commit.com hooks

e17f835

for more information, see https://pre-commit.ci

GuangyaoZhang reviewed Jul 8, 2024

colossalai/quantization/fp8.py (review comment, outdated and resolved)

fix typo

dbfa7d3

ver217 reviewed Jul 10, 2024

colossalai/quantization/fp8.py (three review comments, outdated and resolved)

BurkeHulk and others added 5 commits July 12, 2024 15:23

fix scaling algorithm in FP8 casting

1e19594

support fp8 communication in pipeline parallelism

e881901

add fp8_communication flag in the script

6601874

Merge remote-tracking branch 'origin/feature/fp8_comm' into feature/f…

1f1b856

…p8_comm # Conflicts: # colossalai/quantization/fp8.py

[pre-commit.ci] auto fixes from pre-commit.com hooks

51f916b

for more information, see https://pre-commit.ci

BurkeHulk enabled auto-merge July 16, 2024 03:21

BurkeHulk changed the title ~~Feature/fp8 comm~~ [feature]: support FP8 communication in pipeline parallelism Jul 16, 2024

This was linked to issues Jul 16, 2024

[FEATURE]: [PyTorch] per-channel FP8 quantization #5873

Open

[PyTorch] FP8 all-reduce using all-to-all and all-gather #5886

Open

ver217 approved these changes Jul 16, 2024


BurkeHulk merged commit 9470701 into hpcaitech:feature/fp8_comm Jul 16, 2024
5 of 6 checks passed
