Commit Graph

987 Commits (b4bc99802c378cdd952a314dfc19443afb4507a3)
 

Author SHA1 Message Date
gitlawr b4bc99802c feat: add benchmark
10 months ago
gitlawr 291ff064f0 fix: update size test
10 months ago
thxCode 5e8a0b561a feat: support mindie
10 months ago
gitlawr d1e8dc22d0 chore: update exception message for empty architectures
11 months ago
gitlawr 887f41e9c4 chore: bump box
11 months ago
gitlawr 59bf544908 chore: update vram claim env
11 months ago
gitlawr db4b780b7c feat: add evaluation cache
11 months ago
gitlawr 0a5facdb40 fix: update non-LLM vllm claim
11 months ago
gitlawr 95916b82b5 feat: evaluated resource claim
11 months ago
gitlawr 592723e291 chore: update cosyvoice models
11 months ago
gitlawr d50e2b95f3 feat: add model info in profiling
11 months ago
gitlawr f01ee2e267 fix: migrate legacy hf cache
11 months ago
gitlawr cb8d75c4e0 chore: update box
11 months ago
gitlawr 32721e754c feat: add deepseek-v3-0324
11 months ago
gitlawr 00e53c9a00 chore: update evaluate message
11 months ago
gitlawr ef3cbac12e fix: use soft filelock
11 months ago
gitlawr 69c6fac56f feat: validation for dist vllm limit per worker
11 months ago
gitlawr 5f312e9ed3 feat: add model evaluations
11 months ago
gitlawr 160fefa8de refactor: rename claim dataclass
11 months ago
gitlawr 1f20e208b8 chore: update delaying restart log level
11 months ago
gitlawr d85a861eee chore: bump dependencies
11 months ago
gitlawr 25930f2075 refactor: use local dir in hf downloader
11 months ago
gitlawr f386508e13 refactor: add tini
11 months ago
gitlawr 80ad177a21 Revert "fix: In the terminate_process method, adding wait after a failed kill operation helps reclaim the child process and prevents the creation of zombie processes"
11 months ago
gitlawr e72c0a45ab fix: resolved paths contains
11 months ago
gitlawr ecc45a3b69 fix: restart ray on exit
11 months ago
gitlawr 7e700545c5 chore: bump dependencies
11 months ago
gitlawr 0d2423e0bb fix: none file exception
11 months ago
gitlawr e30747e66c fix: bool gpustack env
11 months ago
gitlawr 4e5e954803 fix: compute allocatable vram
11 months ago
gitlawr ffedf70089 refactor: move gpu filtering from worker filter to gguf resource selector
11 months ago
gitlawr 1fefb68a34 feat: get sharded file paths for localpath files
11 months ago
gitlawr 699e863325 fix: remove instances on worker deletion
11 months ago
gitlawr ed47e61939 feat: support restart on errors
11 months ago
gitlawr b0d7d08bdf fix: powershell lint
11 months ago
gitlawr 7af01766a1 fix: empty vram claim
11 months ago
gitlawr d1efdf5730 fix: match existing model files for local path models
11 months ago
gitlawr 8c0e3ef8c0 fix: reverse local verify
11 months ago
gitlawr 6f05dd9f0f feat: detect reranker architectures
11 months ago
gitlawr a0df56059d fix: missing token usage in vllm reranker
11 months ago
gitlawr f6451f1ef4 fix: avoid repatching tqdm
11 months ago
gitlawr 56e618d1be fix: no file match on empty filename
11 months ago
gitlawr b90f88fb59 feat: cleanup option on model file deletion
11 months ago
gitlawr a1638344a7 fix: download task exception
11 months ago
gitlawr 9409565daf fix: get config attr
11 months ago
gitlawr 164375f4a7 feat: add worker name label
11 months ago
Jiangzhou Li 8796ca44b4 fix: In the terminate_process method, adding wait after a failed kill operation helps reclaim the child process and prevents the creation of zombie processes
11 months ago
Jiangzhou Li edfbd61c6f feat: Add an option to use hf_transfer for HF Hub downloads
11 months ago
gitlawr 47412f9cf7 fix: docker ci
11 months ago
Jiangzhou Li d9f409ee4f fix: avoid rpc server and inference server use same name
11 months ago