NemotronRewardModel

class NemotronRewardModel(BaseRewardModel):

Reward model based on the Nemotron model with OpenAI compatibility.

Parameters:

  • model_type (Union[ModelType, str]): Model for which a backend is created.
  • api_key (Optional[str], optional): The API key for authenticating with the model service. (default: :obj:None)
  • url (Optional[str], optional): The url to the model service.

Note:

The Nemotron model does not support model config.

init

def __init__(
    self,
    model_type: Union[ModelType, str],
    api_key: Optional[str] = None,
    url: Optional[str] = None
):

evaluate

def evaluate(self, messages: List[Dict[str, str]]):

Evaluate the messages using the Nemotron model.

Parameters:

  • messages (List[Dict[str, str]]): A list of messages where each message is a dictionary format.

Returns:

Dict[str, float]: A dictionary mapping score types to their values.

get_scores_types

def get_scores_types(self):

Returns:

List[str]: A list of score types that the reward model can return.

_parse_scores

def _parse_scores(self, response: ChatCompletion):

Parse the scores from the response.

Parameters:

  • response (ChatCompletion): A ChatCompletion object with the scores.

Returns:

Dict[str, float]: A dictionary mapping score types to their values.