NemotronRewardModel

class NemotronRewardModel(BaseRewardModel):
Reward model based on the Nemotron model with OpenAI compatibility. Parameters:
  • model_type (Union[ModelType, str]): Model for which a backend is created.
  • api_key (Optional[str], optional): The API key for authenticating with the model service. (default: :obj:None)
  • url (Optional[str], optional): The url to the model service.
Note: The Nemotron model does not support model config.

init

def __init__(
    self,
    model_type: Union[ModelType, str],
    api_key: Optional[str] = None,
    url: Optional[str] = None
):

evaluate

def evaluate(self, messages: List[Dict[str, str]]):
Evaluate the messages using the Nemotron model. Parameters:
  • messages (List[Dict[str, str]]): A list of messages where each message is a dictionary format.
Returns: Dict[str, float]: A dictionary mapping score types to their values.

get_scores_types

def get_scores_types(self):
Returns: List[str]: A list of score types that the reward model can return.

_parse_scores

def _parse_scores(self, response: ChatCompletion):
Parse the scores from the response. Parameters:
  • response (ChatCompletion): A ChatCompletion object with the scores.
Returns: Dict[str, float]: A dictionary mapping score types to their values.