Hybrid-Precision Analysis on CG Solver (H.A.C.S). Merging single and double precision to generate a fast yet accurate CG solver.
C++ · Updated May 29, 2020
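The hybrid single/double-precision idea behind solvers like the one above can be sketched with classic mixed-precision iterative refinement: do the expensive inner solve in float32, but compute residuals and accumulate the solution in float64. This is a generic NumPy sketch of the technique, not the H.A.C.S code (the direct `np.linalg.solve` stands in for the repo's CG inner solver):

```python
import numpy as np

def mixed_precision_refine(A, b, iters=5):
    """Iterative refinement: single-precision inner solves,
    double-precision residuals and accumulation."""
    A32 = A.astype(np.float32)            # cheap low-precision copy
    x = np.zeros_like(b, dtype=np.float64)
    for _ in range(iters):
        r = b - A @ x                     # residual in double precision
        d = np.linalg.solve(A32, r.astype(np.float32))  # correction in single
        x += d.astype(np.float64)         # accumulate update in double
    return x

rng = np.random.default_rng(0)
A = rng.standard_normal((50, 50)) + 50.0 * np.eye(50)  # well-conditioned system
x_true = rng.standard_normal(50)
b = A @ x_true
x = mixed_precision_refine(A, b)
err = np.max(np.abs(x - x_true))          # error reaches double-precision level
```

Each refinement sweep shrinks the error by roughly the single-precision solve accuracy, so a few iterations recover double-precision results at near single-precision cost.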
Fast SGEMM emulation on Tensor Cores
Experiments to accelerate PyTorch training on GPU devices
PyTorch RNet implementation with Distributed and Mixed-Precision training support.
A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation for Efficient Hardware Acceleration on Edge Devices
You Only Look Once: Unified, Real-Time Object Detection
Deep learning solution for Cassava Leaf Disease Classification, a Kaggle Research Code Competition, using TensorFlow.
This repository contains notebooks showing how to perform mixed precision training in tf.keras 2.0
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
PyCon SG 2019 Tutorial: Optimizing TensorFlow Performance
Extremely simple and understandable GPT2 implementation with minor tweaks
This is the open-source version of HPL-MXP; its performance has been verified on the Frontier supercomputer.
An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3
Let's train CIFAR-10 in PyTorch with half precision!
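A core pitfall in half-precision training like the CIFAR-10 experiment above is gradient underflow: values below float16's subnormal range silently become zero, which is why loss scaling is standard practice. A minimal NumPy sketch (no PyTorch; the gradient value is illustrative) of the problem and the fix:

```python
import numpy as np

# A tiny gradient that is representable in float64 but below
# float16's smallest subnormal (~6e-8):
grad = np.float64(1e-8)

naive = np.float16(grad)                 # direct cast underflows to 0.0

scale = np.float64(1024.0)               # loss scale factor
scaled = np.float16(grad * scale)        # scale first, then cast: survives
recovered = np.float64(scaled) / scale   # unscale back in higher precision
```

Multiplying the loss (and hence all gradients) by a constant before the float16 cast shifts small values back into representable range; the optimizer then divides the scale back out in higher precision.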
PDPU: An Open-Source Posit Dot-Product Unit for Deep Learning Applications
CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices
🎯 Accumulated Gradients for TensorFlow 2
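Gradient accumulation, as in the TensorFlow 2 repo above, sums gradients over several micro-batches and applies one optimizer step, emulating a larger batch than fits in memory. A framework-free sketch (plain NumPy; `sgd_step` and the gradient values are illustrative, not the repo's API):

```python
import numpy as np

def sgd_step(w, micro_grads, lr=0.1):
    """Average accumulated micro-batch gradients, then take one SGD step."""
    g = np.mean(micro_grads, axis=0)     # combine the accumulated gradients
    return w - lr * g

w = np.array([1.0, -2.0])
micro_grads = [np.array([0.2, 0.4])] * 4  # 4 identical micro-batches for clarity
w_accum = sgd_step(w, micro_grads)

# One step on the equivalent full batch produces the same update:
w_full = w - 0.1 * np.array([0.2, 0.4])
same = np.allclose(w_accum, w_full)
```

For plain SGD the accumulated update is mathematically identical to a full-batch step; with stateful optimizers (momentum, Adam) the point of accumulation is precisely to keep that equivalence by updating optimizer state only once per accumulated step.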
Pre-trained model invocation based on TensorFlow 1.x, supporting single-machine multi-GPU training, gradient accumulation, XLA acceleration, and mixed precision. Flexible training, validation, and prediction.
BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.
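The storage saving behind tools like BitPack comes from packing sub-byte quantized values densely instead of wasting a full byte each. A pure-Python sketch of the idea for 2-bit weights (hypothetical `pack2`/`unpack2` helpers, not BitPack's actual API), fitting four values per byte:

```python
def pack2(values):
    """Pack 2-bit integers (0..3) into bytes, four per byte, LSB first."""
    out = bytearray()
    for i in range(0, len(values), 4):
        byte = 0
        for j, v in enumerate(values[i:i + 4]):
            byte |= (v & 0b11) << (2 * j)   # place each value in its 2-bit slot
        out.append(byte)
    return bytes(out)

def unpack2(packed, n):
    """Recover n 2-bit values from the packed byte string."""
    return [(packed[i // 4] >> (2 * (i % 4))) & 0b11 for i in range(n)]

vals = [0, 3, 1, 2, 2, 1]
packed = pack2(vals)                        # 2 bytes instead of 6
restored = unpack2(packed, len(vals))
```

The same shift-and-mask scheme generalizes to any bit width that divides 8, which is what makes mixed low-precision layouts (2/4/8-bit per layer) compact on disk.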