> ## Documentation Index > Fetch the complete documentation index at: https://docs.camel-ai.org/llms.txt > Use this file to discover all available pages before exploring further. # Camel.models.sglang model ## SGLangModel ```python theme={"system"} class SGLangModel(BaseModelBackend): ``` SGLang service interface. **Parameters:** * **model\_type** (Union\[ModelType, str]): Model for which a backend is created. * **model\_config\_dict** (Optional\[Dict\[str, Any]], optional): A dictionary that will be fed into:obj:`openai.ChatCompletion.create()`. If :obj:`None`, :obj:`SGLangConfig().as_dict()` will be used. (default: :obj:`None`) * **api\_key** (Optional\[str], optional): The API key for authenticating with the model service. SGLang doesn't need API key, it would be ignored if set. (default: :obj:`None`) * **url** (Optional\[str], optional): The url to the model service. If not provided, :obj:`"http://127.0.0.1:30000/v1"` will be used. (default: :obj:`None`) * **token\_counter** (Optional\[BaseTokenCounter], optional): Token counter to use for the model. If not provided, :obj:`OpenAITokenCounter( ModelType.GPT_4O_MINI)` will be used. (default: :obj:`None`) * **timeout** (Optional\[float], optional): The timeout value in seconds for API calls. If not provided, will fall back to the MODEL\_TIMEOUT environment variable or default to 180 seconds. (default: :obj:`None`) * **max\_retries** (int, optional): Maximum number of retries for API calls. (default: :obj:`3`) * **client** (Optional\[Any], optional): A custom synchronous OpenAI-compatible client instance. If provided, this client will be used instead of creating a new one. Note: When using custom clients with SGLang, server auto-start features will be disabled. (default: :obj:`None`) * **async\_client** (Optional\[Any], optional): A custom asynchronous OpenAI-compatible client instance. If provided, this client will be used instead of creating a new one. (default: :obj:`None`) \*\*kwargs (Any): Additional arguments to pass to the client initialization. Ignored if custom clients are provided. * **Reference**: [https://sgl-project.github.io/backend/openai\_api\_completions](https://sgl-project.github.io/backend/openai_api_completions). html ### **init** ```python theme={"system"} def __init__( self, model_type: Union[ModelType, str], model_config_dict: Optional[Dict[str, Any]] = None, api_key: Optional[str] = None, url: Optional[str] = None, token_counter: Optional[BaseTokenCounter] = None, timeout: Optional[float] = None, max_retries: int = 3, client: Optional[Any] = None, async_client: Optional[Any] = None, **kwargs: Any ): ``` ### \_start\_server ```python theme={"system"} def _start_server(self): ``` ### \_ensure\_server\_running ```python theme={"system"} def _ensure_server_running(self): ``` Ensures that the server is running. If not, starts the server. ### \_monitor\_inactivity ```python theme={"system"} def _monitor_inactivity(self): ``` Monitor whether the server process has been inactive for over 10 minutes. ### token\_counter ```python theme={"system"} def token_counter(self): ``` **Returns:** BaseTokenCounter: The token counter following the model's tokenization style. ### \_run ```python theme={"system"} def _run( self, messages: List[OpenAIMessage], response_format: Optional[Type[BaseModel]] = None, tools: Optional[List[Dict[str, Any]]] = None ): ``` Runs inference of OpenAI chat completion. **Parameters:** * **messages** (List\[OpenAIMessage]): Message list with the chat history in OpenAI API format. **Returns:** Union\[ChatCompletion, Stream\[ChatCompletionChunk]]: `ChatCompletion` in the non-stream mode, or `Stream[ChatCompletionChunk]` in the stream mode. ### stream ```python theme={"system"} def stream(self): ``` **Returns:** bool: Whether the model is in stream mode. ### **del** ```python theme={"system"} def __del__(self): ``` Properly clean up resources when the model is destroyed. ### cleanup ```python theme={"system"} def cleanup(self): ``` Terminate the server process and clean up resources. ## \_terminate\_process ```python theme={"system"} def _terminate_process(process): ``` ## \_kill\_process\_tree ```python theme={"system"} def _kill_process_tree( parent_pid, include_parent: bool = True, skip_pid: Optional[int] = None ): ``` Kill the process and all its child processes. ## \_execute\_shell\_command ```python theme={"system"} def _execute_shell_command(command: str): ``` Execute a shell command and return the process handle **Parameters:** * **command**: Shell command as a string (can include \ line continuations) **Returns:** subprocess.Popen: Process handle ## \_wait\_for\_server ```python theme={"system"} def _wait_for_server(base_url: str, timeout: Optional[float] = 30): ``` Wait for the server to be ready by polling the /v1/models endpoint. **Parameters:** * **base\_url** (str): The base URL of the server * **timeout** (Optional\[float]): Maximum time to wait in seconds. (default: :obj:`30`)