Pull requests: triton-inference-server/server
Pull requests list
fix: async model load/unload to prevent evhtp thread starvation
#8737
opened Apr 14, 2026 by
itsnothuy
11 of 22 tasks
build(deps): bump pytest from 8.1.1 to 9.0.3 in /python/openai
dependencies
Pull requests that update a dependency file
python
Python related, whether backend, in-process API, client, etc
#8735
opened Apr 13, 2026 by
dependabot
Bot
ci: Automated document links and anchors validation
PR: ci
Changes to our CI configuration files and scripts
#8638
opened Feb 4, 2026 by
yinggeh
Contributor
5 of 11 tasks
sagemaker: restrict model repository paths to configured root
#8630
opened Feb 1, 2026 by
HyperPS
Contributor
feat: Update build.py to skip libnvshmem3-cuda-13 for cpu only build.
#8528
opened Nov 20, 2025 by
Sunidhi-Gaonkar1
4 of 22 tasks
fix: Fix gRPC handler thread stall on completion queue shutdown
#8495
opened Nov 6, 2025 by
TheRobotCarlson
9 of 22 tasks
Feat: revamp build.py CLI to improve usability and maintainability
#8437
opened Oct 2, 2025 by
kpedro88
Contributor
9 of 22 tasks
feat: Minor improvements to build.py
Build
Issues pertaining to builds
Enhancement
New feature or request
#8362
opened Aug 19, 2025 by
kpedro88
Contributor
6 of 22 tasks
Support tokenizer override per model for multi-model Triton + vLLM serving with OpenAI-Compatible
#8321
opened Jul 31, 2025 by
JunmooByun
11 of 13 tasks
fix: Fix the server runtime errors on cpu only platform and with pytorch backend
#8272
opened Jun 27, 2025 by
snadampal
6 of 21 tasks
docs: fix capitalization of Triton Inference Server
#8252
opened Jun 13, 2025 by
ShriyashP
5 of 13 tasks
docs: update the link formats for additional security networking guides
#8229
opened Jun 2, 2025 by
xander-aphe-hatschi
22 tasks
test: L0_orca_trtllm fixed
#8191
opened May 7, 2025 by
indrajit96
Contributor
7 of 20 tasks
refactor: replace tf model with onnx model for L0_response_cache
#8114
opened Apr 2, 2025 by
ziqifan617
Contributor
Draft