chore: initial public snapshot for github upload

Passthrough routes make it easier to pass requests straight through to the underlying LLM APIs.

E.g., route to vLLM's `/classify` endpoint:

## SDK (Basic)

```python
import litellm

response = litellm.llm_passthrough_route(
    model="hosted_vllm/papluca/xlm-roberta-base-language-detection",
    method="POST",
    endpoint="classify",
    api_base="http://localhost:8090",
    api_key=None,
    json={
        "model": "swapped-for-litellm-model",  # placeholder; LiteLLM swaps in the model name from `model` above
        "input": "Hello, world!",
    },
)

print(response)
```

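The call returns the provider's response unmodified. Assuming it comes back as a raw `httpx.Response` (worth confirming against your litellm version), you can inspect it like this:

```python
# Assumption: `response` is an httpx.Response returned by llm_passthrough_route
print(response.status_code)  # HTTP status from the vLLM server
print(response.json())       # the /classify payload exactly as vLLM returned it
```
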
## SDK (Router)

```python
import asyncio

from litellm import Router

router = Router(
    model_list=[
        {
            "model_name": "roberta-base-language-detection",
            "litellm_params": {
                "model": "hosted_vllm/papluca/xlm-roberta-base-language-detection",
                "api_base": "http://localhost:8090",
            },
        }
    ]
)

request_data = {
    "model": "roberta-base-language-detection",
    "method": "POST",
    "endpoint": "classify",
    "api_base": "http://localhost:8090",
    "api_key": None,
    "json": {
        "model": "roberta-base-language-detection",
        "input": "Hello, world!",
    },
}


async def main():
    response = await router.allm_passthrough_route(**request_data)
    print(response)


if __name__ == "__main__":
    asyncio.run(main())
```

## Proxy

1. Set up `config.yaml`

```yaml
model_list:
  - model_name: roberta-base-language-detection
    litellm_params:
      model: hosted_vllm/papluca/xlm-roberta-base-language-detection
      api_base: http://localhost:8090
```

2. Run the proxy

```bash
litellm --config config.yaml

# RUNNING on http://localhost:4000
```

3. Use the proxy

```bash
curl -X POST http://localhost:4000/vllm/classify \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer <your-api-key>" \
  -d '{"model": "roberta-base-language-detection", "input": "Hello, world!"}'
```

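For reference, the same request from Python; this is a direct translation of the curl call above (`<your-api-key>` is your LiteLLM proxy key):

```python
import requests

# Direct translation of the curl example above
resp = requests.post(
    "http://localhost:4000/vllm/classify",
    headers={"Authorization": "Bearer <your-api-key>"},  # LiteLLM proxy key
    json={"model": "roberta-base-language-detection", "input": "Hello, world!"},
)
print(resp.status_code, resp.text)
```
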
# How to add a provider for passthrough

See [VLLMModelInfo](https://github.com/BerriAI/litellm/blob/main/litellm/llms/vllm/common_utils.py) for an example.

1. Inherit from `BaseLLMModelInfo`

```python
from litellm.llms.base_llm.base_utils import BaseLLMModelInfo


class VLLMModelInfo(BaseLLMModelInfo):
    # Implement the BaseLLMModelInfo methods your provider needs;
    # `pass` marks where that implementation goes.
    pass
```

2. Register the provider in `ProviderConfigManager.get_provider_model_info`

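The registration itself is a provider branch inside that function in `litellm/utils.py`. A rough sketch, assuming the usual dispatch-per-provider structure (the real function handles many providers and may differ in detail):

```python
# Sketch of the branch added inside ProviderConfigManager.get_provider_model_info;
# the surrounding function signature and other branches are elided.
if LlmProviders(provider) == LlmProviders.VLLM:
    return VLLMModelInfo()
```

You can then check that the provider resolves:
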
```python
from litellm.types.utils import LlmProviders
from litellm.utils import ProviderConfigManager

provider_config = ProviderConfigManager.get_provider_model_info(
    model="my-test-model", provider=LlmProviders.VLLM
)

print(provider_config)  # your model info object; None means the provider isn't registered
```