-
Notifications
You must be signed in to change notification settings - Fork 50
Pull requests: ServerlessLLM/ServerlessLLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: example dtype hardcode
ready-for-gpu-testing
This PR is ready for testing on self-hosted GPU runners
#287
opened Jul 11, 2025 by
drunkcoding
Loading…
2 of 7 tasks
fix: quantization on multiple gpus
ready-for-gpu-testing
This PR is ready for testing on self-hosted GPU runners
#284
opened Jul 2, 2025 by
dalongbao
Loading…
2 of 7 tasks
feat: endpoints for getting worker and query information
ready-for-gpu-testing
This PR is ready for testing on self-hosted GPU runners
#281
opened Jun 25, 2025 by
dalongbao
Loading…
2 of 7 tasks
feat+refactor: continuous worker discovery + offline worker pruning + worker recovery
#280
opened Jun 17, 2025 by
dalongbao
Loading…
2 of 7 tasks
feat(backend): add SGLang backend implementation
#278
opened Jun 16, 2025 by
FionaY728
Loading…
3 of 7 tasks
feat: Separate fine-tuning backend from the transformers backend
#251
opened May 29, 2025 by
MartinRepo
•
Draft
2 of 7 tasks
feat(cli): unify sllm-cli and sllm-serve into one CLI entrypoint
#245
opened May 1, 2025 by
FionaY728
Loading…
4 of 7 tasks
feat: support stream generate for transformers backend
#225
opened Mar 11, 2025 by
dblate
Loading…
2 of 7 tasks
ProTip!
What’s not been updated in a month: updated:<2025-06-15.