GitHub / InseeFrLab / auto-tuning-vllm
Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)
JSON API: https://ecosystem.code.gouv.fr/api/v1/hosts/GitHub/repositories/InseeFrLab%2Fauto-tuning-vllm
Stars: 3
Forks: 0
Open issues: 13
License: apache-2.0
Language: Python
Size: 2.92 MB
Dependencies parsed at: Pending
Created at: about 1 month ago
Updated at: 6 days ago
Pushed at: 6 days ago
Last synced at: 3 days ago
Loading...