inference
Here are 1,227 public repositories matching this topic...
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Updated Jul 16, 2024 - C++
A high-throughput and memory-efficient inference and serving engine for LLMs
Updated Jul 16, 2024 - Python
A universal scalable machine learning model deployment solution
Updated Jul 16, 2024 - Java
Utilities to use the Hugging Face Hub API
Updated Jul 16, 2024 - TypeScript
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Updated Jul 16, 2024 - Python
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory, etc.) and ready to deploy on Qualcomm® devices.
Updated Jul 16, 2024 - Python
🔮 SuperDuper: Bring AI to your database! Build, deploy, and manage any AI application directly with your existing data infrastructure, without moving your data. Includes streaming inference, scalable model training, and vector search.
Updated Jul 16, 2024 - Python
A MATLAB suite for Psycho-Physiological Modelling
Updated Jul 16, 2024 - MATLAB
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Updated Jul 16, 2024 - C
A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Updated Jul 16, 2024 - Python
Large Language Model Text Generation Inference
Updated Jul 16, 2024 - Python
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
Updated Jul 16, 2024 - Python
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you can run inference with any open-source language model, speech recognition model, or multimodal model, whether in the cloud, on-premises, or even on your laptop.
Updated Jul 16, 2024 - Python
Exploration of interactive inference-rule application and proofs using a unification approach.
Updated Jul 16, 2024 - TypeScript
📖 A small curated list of awesome SD/DiT/ViT/Diffusion distributed inference (multi-GPU) papers with code, such as DistriFusion, PipeFusion, AsyncDiff, DeepCache, etc.
Updated Jul 16, 2024
Updated Jul 16, 2024 - Java
Pre-trained Deep Learning models and demos (high quality and extremely fast)
Updated Jul 16, 2024 - Python
Cross-platform, customizable ML solutions for live and streaming media.
Updated Jul 16, 2024 - C++
📚 Jupyter notebook tutorials for OpenVINO™
Updated Jul 16, 2024 - Jupyter Notebook