-
Notifications
You must be signed in to change notification settings - Fork 742
Issues: sgl-project/sglang
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Question About Model Integration and Parameter Updates (update_weight) in Sglang
#3101
opened Jan 24, 2025 by
davidlvxin
[Bug] The batch decoding speed of DeepSeek V3 is too slow.
#3100
opened Jan 24, 2025 by
SonChoulJun
5 tasks
[Feature] Support InterVL
good first issue
Good for newcomers
#3092
opened Jan 24, 2025 by
zhaochenyang20
2 tasks done
[Feature] Add support for Phi4
help wanted
Extra attention is needed
#3090
opened Jan 23, 2025 by
Stealthwriter
2 tasks
[Feature] docs: Improve documentation on how to use EAGLE speculative docoding
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
#3077
opened Jan 23, 2025 by
daviddl9
2 tasks done
[Feature] Support service discovery on Kubernetes in router
router
#3073
opened Jan 23, 2025 by
gaocegege
2 tasks done
[Bug]ImportError: undefined symbol: cuModuleGetFunction when using lmsysorg/sglang:v0.4.1.post7-cu124
help wanted
Extra attention is needed
#3065
opened Jan 23, 2025 by
aooxin
5 tasks done
[Bug] Problems with logit_bias.
help wanted
Extra attention is needed
#3059
opened Jan 22, 2025 by
cinjon
5 tasks done
[Bug] Decode Throughput Inconsistency Between bench_serving and Engine Logs
help wanted
Extra attention is needed
#3050
opened Jan 22, 2025 by
leepoly
5 tasks done
[Help wanted] CANN'T capture GPU activities using
nsight system
#3049
opened Jan 22, 2025 by
sleepwalker2017
[Feature] Reasoning model API support
help wanted
Extra attention is needed
#3043
opened Jan 22, 2025 by
lambert0312
2 tasks done
[Bug] Qwen2-VL-7B with sglang Performance Degradation
high priority
#3041
opened Jan 22, 2025 by
yileld
5 tasks done
[Feature] batch concurrent requests while streaming responses
#3040
opened Jan 22, 2025 by
moxiegushi
2 tasks
[Feature] Support Beam Search
enhancement
New feature or request
#3032
opened Jan 21, 2025 by
laixinn
2 of 4 tasks
[Feature] FP8 weight only w8a16 quantization native support
quant
LLM Quantization
#3007
opened Jan 20, 2025 by
arunpatala
2 tasks done
what is the most efficient way to do with a 72b model and 8 * A100 ?
#3002
opened Jan 20, 2025 by
Chandler-Bing
[Feature] Add docs for Offline Engine token-in token-out
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
RLHF
Using SGLang for post training
#2968
opened Jan 18, 2025 by
zhaochenyang20
2 tasks
[Feature] remove vllm _custom_ops
good first issue
Good for newcomers
help wanted
Extra attention is needed
high priority
#2965
opened Jan 18, 2025 by
zhyncs
7 tasks
[Bug] Regex isn't precluding parentheticals. And maybe more.
help wanted
Extra attention is needed
#2957
opened Jan 17, 2025 by
cinjon
5 tasks done
[Bug] JSONResponse fails if the probability distribution is very spiky.
#2955
opened Jan 17, 2025 by
cinjon
5 tasks done
[Feature] Add docs for local accuracy tests
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
#2953
opened Jan 17, 2025 by
zhaochenyang20
2 tasks
[Feature] Enhancement on Sparse Attention and KV-Cache Compression
#2946
opened Jan 17, 2025 by
shadowpa0327
2 tasks done
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.