NemotronModel
- model_type (Union[ModelType, str]): Model for which a backend is created.
- api_key (Optional[str], optional): The API key for authenticating with the Nvidia service. (default: :obj:
None
) - url (Optional[str], optional): The url to the Nvidia service. (default: :obj:
https://integrate.api.nvidia.com/v1
) - timeout (Optional[float], optional): The timeout value in seconds for API calls. If not provided, will fall back to the MODEL_TIMEOUT environment variable or default to 180 seconds. (default: :obj:
None
) - max_retries (int, optional): Maximum number of retries for API calls. (default: :obj:
3
) **kwargs (Any): Additional arguments to pass to the client initialization.