mkrz5h4u8 / gpustack
Python · 0 stars · 0 forks

Simple, scalable AI model deployment on GPU clusters

Topics: llm-inference, vllm, rocm, qwen, openai, mindie, metal, maas, local-ai, llm-serving, ascend, llm, llamacpp, llama, inference, heterogeneous-cluster, genai, distributed-inference, deepseek, cuda

Updated 2 weeks ago
