llm-serving

Star

Here are 58 public repositories matching this topic...

valyu-network / Stitch

Star

Stitch simplifies and scales LLM application deployment, reducing infrastructure complexity and costs.

llm-serving llm-inference llm-framework llmstack

Updated Jun 2, 2024
Python

friendliai / lm-evaluation-harness

Star

A framework for few-shot evaluation of autoregressive language models.

llms generative-ai llm-serving llm-inference

Updated Sep 30, 2023
Python

ivynya / illm

Sponsor

Star

internet llm - access your ollama (or any other local llm) instance from across the internet

llm-serving ollama ollama-interface

Updated Jun 25, 2024
Go

France-Travail / benchmark_llm_serving

Star

A library to benchmark LLMs via their API exposure. For now, it is vLLM oriented

benchmark llm llm-serving vllm

Updated Jul 5, 2024
Python

InquestGeronimo / horizon-takeoff

Star

Automating the deployment of the Takeoff Server on AWS for LLMs

aws machine-learning cloud deep-learning ec2 server artificial-intelligence llmops llm-serving llm-inference

Updated Jan 16, 2024
Python

CentML / llm-inference-bench

Star

Lightweight and extensible LLM Inference serving benchmark tool written in Rust.

benchmarking llm-serving llm-inference

Updated Apr 4, 2024
Rust

liux2 / DL_env_Setups

Star

Deep learning environment setups

docker deep-learning environments llm-serving

Updated Jan 24, 2024
Shell

diverged / tavily-go

Star

An unofficial Go port of the official Tavily API Python Wrapper.

search-engine rag answer-engine chatgpt llm-serving retrieval-augmented-generation

Updated Jul 2, 2024
Go

suleymansevimli / run-llm-model-locally

Star

You can run any large language model on your local machine with this repository.

python git-lfs huggingface llm llm-serving

Updated Dec 19, 2023
Python

biosfood / intel-llm-guide

Star

A guide on how to run LLMs on intel CPUs

setup machine-learning tutorial guide intel setup-development-environment llm llm-serving llm-inference

Updated Jan 23, 2024
Python

ray-project / anyscale-berkeley-ai-hackathon

Star

Ray and Anyscale for UC Berkeley AI Hackathon!

hackathon berkeley-ai ray-distributed anyscale llm llm-serving llm-inference

Updated Jun 17, 2023
Jupyter Notebook

Neural-Dragon-AI / Cynde

Star

A Framework For Intelligence Farming

xgboost autoscaling pydantic openai-api polars llm-serving llm-inference modal-labs pydantic-logfire intelligence-farming

Updated May 18, 2024
Python

okikorg / okik

Star

Okik is serving framework to deploy LLMs and much more.

python machine-learning deeplearning model-serving llm llmops llm-serving llm-inference

Updated Jun 14, 2024
Python

azminewasi / Awesome-LLMs-ICLR-24

Star

It is a comprehensive resource hub compiling all LLM papers accepted at the International Conference on Learning Representations (ICLR) in 2024.

pretrained-models pretrained-weights pretrained-language-model large-language-models llm llms llmops large-language-model llm-serving llm-prompting llm-agent llm-security llm-training llm-inference llm-framework llm-privacy llm-evaluation large-language-models-for-graph-learning large-language-models-and-translation-systems

Updated Apr 4, 2024

France-Travail / happy_vllm

Star

A REST API for vLLM, production ready

production api-rest llm llm-serving vllm

Updated Jul 16, 2024
Python

biomchen / llm-serving

Star

Basic APIs for serving LLMs locally.

llm-serving llama2-7b llama2-13b

Updated Oct 29, 2023
Python

EmbeddedLLM / embeddedllm

Star

EmbeddedLLM: API server for Embedded Device Deployment. Currently support IpexLLM/DirectML./CPU

windows cpu llama gemma mistral directx-12 npu aipc directml llm model-inference llm-serving llm-inference open-source-llm phi-3 ipexllm

Updated Jul 16, 2024
Python

george-mountain / LLM-Local-Streaming

Star

Streaming of LLM responses in realtime using Fastapi and Streamlit.

ai fastapi streamlit llm llm-serving llm-streaming

Updated Jan 21, 2024
Python

george-mountain / web-app-builder--LLM

Star

Building Static Web Applications using Large Language Model. From hand sketched documents, images and screenshots to proper web pages.

ai pypi pypi-package streamlit llm llm-serving

Updated Mar 12, 2024
Python

Jason-cs18 / HetServe-LLMs

Star

A Overview of Efficiently Serving Large Language Models across Edge Devices

survey llm-serving

Updated Jun 15, 2024

Improve this page

Add a description, image, and links to the llm-serving topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-serving topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm-serving

Here are 58 public repositories matching this topic...

valyu-network / Stitch

friendliai / lm-evaluation-harness

ivynya / illm

France-Travail / benchmark_llm_serving

InquestGeronimo / horizon-takeoff

CentML / llm-inference-bench

liux2 / DL_env_Setups

diverged / tavily-go

suleymansevimli / run-llm-model-locally

biosfood / intel-llm-guide

ray-project / anyscale-berkeley-ai-hackathon

Neural-Dragon-AI / Cynde

okikorg / okik

azminewasi / Awesome-LLMs-ICLR-24

France-Travail / happy_vllm

biomchen / llm-serving

EmbeddedLLM / embeddedllm

george-mountain / LLM-Local-Streaming

george-mountain / web-app-builder--LLM

Jason-cs18 / HetServe-LLMs

Improve this page

Add this topic to your repo