Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Remove Qwen Omni workaround that's no longer necessary qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#21057 opened Jul 16, 2025 by hmellor Loading…
[Feature][EPLB] Add EPLB support for MiniMax-01
#21056 opened Jul 16, 2025 by haveheartt Loading…
1 of 4 tasks
[fix] fix qwen image_embeds input qwen Related to Qwen models ready ONLY add when PR is ready to merge/full CI is needed
#21049 opened Jul 16, 2025 by h-avsha Loading…
Add test case for compiling multiple graphs
#21044 opened Jul 16, 2025 by sarckk Loading…
3 of 4 tasks
[Model] Reasoning Parser for Nemotron Models
#21041 opened Jul 16, 2025 by schoennenbeck Loading…
Fix minor docs issues and fix metric requests documentation Improvements or additions to documentation v1
#21040 opened Jul 16, 2025 by SriRangaTarun Draft
[XPU] Add xpu support for fusion and collective fusion
#21036 opened Jul 16, 2025 by chaojun-zhang Loading…
4 tasks
[Feature][EPLB] Add eplb support for Qwen2 qwen Related to Qwen models
#21035 opened Jul 16, 2025 by lengrongfu Draft
1 of 4 tasks
[bugfix][WIP] Fix auto thread-binding when world_size > 1 in CPU backend and refactor code ci/build documentation Improvements or additions to documentation v1
#21032 opened Jul 16, 2025 by bigPYJ1151 Loading…
1 of 4 tasks
Add the instruction to run e2e validation manually before release documentation Improvements or additions to documentation
#21023 opened Jul 16, 2025 by huydhn Loading…
4 tasks done
[Misc] Minor comment reorganization in capture_model() v1
#21015 opened Jul 15, 2025 by ruisearch42 Loading…
3 of 4 tasks
[Docker] Allow FlashInfer to be built in the ARM CUDA Dockerfile ci/build ready ONLY add when PR is ready to merge/full CI is needed
#21013 opened Jul 15, 2025 by mgoin Loading…
4 tasks
Update PyTorch to torch==2.7.1 for CUDA ci/build ready ONLY add when PR is ready to merge/full CI is needed
#21011 opened Jul 15, 2025 by mgoin Loading… v0.10.0
[protocol] Add request_id to the Request object so they can be controlled better via external load balancers frontend ready ONLY add when PR is ready to merge/full CI is needed
#21009 opened Jul 15, 2025 by kouroshHakha Loading…
4 tasks
[Not for merge] Unshift eagle prefill documentation Improvements or additions to documentation llama Related to Llama models needs-rebase new-model Requests to new models speculative-decoding v1
#21008 opened Jul 15, 2025 by morgendave Draft
4 tasks
Start using py3.12 for TPU. ci/build documentation Improvements or additions to documentation tpu Related to Google TPUs
#21000 opened Jul 15, 2025 by vanbasten23 Loading…
3 of 4 tasks
ProTip! Type g i on any issue or pull request to go back to the issue listing page.