Skip to content

Pull requests: ServerlessLLM/ServerlessLLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: example dtype hardcode ready-for-gpu-testing This PR is ready for testing on self-hosted GPU runners
#287 opened Jul 11, 2025 by drunkcoding Loading…
2 of 7 tasks
fix: quantization on multiple gpus ready-for-gpu-testing This PR is ready for testing on self-hosted GPU runners
#284 opened Jul 2, 2025 by dalongbao Loading…
2 of 7 tasks
feat: endpoints for getting worker and query information ready-for-gpu-testing This PR is ready for testing on self-hosted GPU runners
#281 opened Jun 25, 2025 by dalongbao Loading…
2 of 7 tasks
feat(backend): add SGLang backend implementation
#278 opened Jun 16, 2025 by FionaY728 Loading…
3 of 7 tasks
feat(cli): unify sllm-cli and sllm-serve into one CLI entrypoint
#245 opened May 1, 2025 by FionaY728 Loading…
4 of 7 tasks
feat: support stream generate for transformers backend
#225 opened Mar 11, 2025 by dblate Loading…
2 of 7 tasks
draft: RAG demo with serverlessLLM
#132 opened Oct 31, 2024 by SecretSettler Draft
4 of 7 tasks
Colab quickstart example New feature
#111 opened Oct 20, 2024 by asda488 Draft
2 of 5 tasks
ProTip! What’s not been updated in a month: updated:<2025-06-15.