> ## Documentation Index
> Fetch the complete documentation index at: https://docs.camel-ai.org/llms.txt
> Use this file to discover all available pages before exploring further.

# Camel.models.sglang model

<a id="camel.models.sglang_model" />

<a id="camel.models.sglang_model.SGLangModel" />

## SGLangModel

```python theme={"system"}
class SGLangModel(BaseModelBackend):
```

SGLang service interface.

**Parameters:**

* **model\_type** (Union\[ModelType, str]): Model for which a backend is created.
* **model\_config\_dict** (Optional\[Dict\[str, Any]], optional): A dictionary that will be fed into:obj:`openai.ChatCompletion.create()`. If :obj:`None`, :obj:`SGLangConfig().as_dict()` will be used. (default: :obj:`None`)
* **api\_key** (Optional\[str], optional): The API key for authenticating with the model service. SGLang doesn't need API key, it would be ignored if set. (default: :obj:`None`)
* **url** (Optional\[str], optional): The url to the model service. If not provided, :obj:`"http://127.0.0.1:30000/v1"` will be used. (default: :obj:`None`)
* **token\_counter** (Optional\[BaseTokenCounter], optional): Token counter to use for the model. If not provided, :obj:`OpenAITokenCounter( ModelType.GPT_4O_MINI)` will be used. (default: :obj:`None`)
* **timeout** (Optional\[float], optional): The timeout value in seconds for API calls. If not provided, will fall back to the MODEL\_TIMEOUT environment variable or default to 180 seconds. (default: :obj:`None`)
* **max\_retries** (int, optional): Maximum number of retries for API calls. (default: :obj:`3`)
* **client** (Optional\[Any], optional): A custom synchronous OpenAI-compatible client instance. If provided, this client will be used instead of creating a new one. Note: When using custom clients with SGLang, server auto-start features will be disabled. (default: :obj:`None`)
* **async\_client** (Optional\[Any], optional): A custom asynchronous OpenAI-compatible client instance. If provided, this client will be used instead of creating a new one. (default: :obj:`None`) \*\*kwargs (Any): Additional arguments to pass to the client initialization. Ignored if custom clients are provided.
* **Reference**: [https://sgl-project.github.io/backend/openai\_api\_completions](https://sgl-project.github.io/backend/openai_api_completions). html

<a id="camel.models.sglang_model.SGLangModel.__init__" />

### **init**

```python theme={"system"}
def __init__(
    self,
    model_type: Union[ModelType, str],
    model_config_dict: Optional[Dict[str, Any]] = None,
    api_key: Optional[str] = None,
    url: Optional[str] = None,
    token_counter: Optional[BaseTokenCounter] = None,
    timeout: Optional[float] = None,
    max_retries: int = 3,
    client: Optional[Any] = None,
    async_client: Optional[Any] = None,
    **kwargs: Any
):
```

<a id="camel.models.sglang_model.SGLangModel._start_server" />

### \_start\_server

```python theme={"system"}
def _start_server(self):
```

<a id="camel.models.sglang_model.SGLangModel._ensure_server_running" />

### \_ensure\_server\_running

```python theme={"system"}
def _ensure_server_running(self):
```

Ensures that the server is running. If not, starts the server.

<a id="camel.models.sglang_model.SGLangModel._monitor_inactivity" />

### \_monitor\_inactivity

```python theme={"system"}
def _monitor_inactivity(self):
```

Monitor whether the server process has been inactive for over 10
minutes.

<a id="camel.models.sglang_model.SGLangModel.token_counter" />

### token\_counter

```python theme={"system"}
def token_counter(self):
```

**Returns:**

BaseTokenCounter: The token counter following the model's
tokenization style.

<a id="camel.models.sglang_model.SGLangModel._run" />

### \_run

```python theme={"system"}
def _run(
    self,
    messages: List[OpenAIMessage],
    response_format: Optional[Type[BaseModel]] = None,
    tools: Optional[List[Dict[str, Any]]] = None
):
```

Runs inference of OpenAI chat completion.

**Parameters:**

* **messages** (List\[OpenAIMessage]): Message list with the chat history in OpenAI API format.

**Returns:**

Union\[ChatCompletion, Stream\[ChatCompletionChunk]]:
`ChatCompletion` in the non-stream mode, or
`Stream[ChatCompletionChunk]` in the stream mode.

<a id="camel.models.sglang_model.SGLangModel.stream" />

### stream

```python theme={"system"}
def stream(self):
```

**Returns:**

bool: Whether the model is in stream mode.

<a id="camel.models.sglang_model.SGLangModel.__del__" />

### **del**

```python theme={"system"}
def __del__(self):
```

Properly clean up resources when the model is destroyed.

<a id="camel.models.sglang_model.SGLangModel.cleanup" />

### cleanup

```python theme={"system"}
def cleanup(self):
```

Terminate the server process and clean up resources.

<a id="camel.models.sglang_model._terminate_process" />

## \_terminate\_process

```python theme={"system"}
def _terminate_process(process):
```

<a id="camel.models.sglang_model._kill_process_tree" />

## \_kill\_process\_tree

```python theme={"system"}
def _kill_process_tree(
    parent_pid,
    include_parent: bool = True,
    skip_pid: Optional[int] = None
):
```

Kill the process and all its child processes.

<a id="camel.models.sglang_model._execute_shell_command" />

## \_execute\_shell\_command

```python theme={"system"}
def _execute_shell_command(command: str):
```

Execute a shell command and return the process handle

**Parameters:**

* **command**: Shell command as a string (can include \ line continuations)

**Returns:**

subprocess.Popen: Process handle

<a id="camel.models.sglang_model._wait_for_server" />

## \_wait\_for\_server

```python theme={"system"}
def _wait_for_server(base_url: str, timeout: Optional[float] = 30):
```

Wait for the server to be ready by polling the /v1/models endpoint.

**Parameters:**

* **base\_url** (str): The base URL of the server
* **timeout** (Optional\[float]): Maximum time to wait in seconds. (default: :obj:`30`)
