Bridge the gap between deep learning training and serving
deployment
inference
pytorch
ray
serve
tensorrt
serving
pipeline-parallelism
torch2trt
triton-inference-server
ray-serve
cvcuda
-
Updated
Jul 16, 2024 - C++