MisalignmentPromptTemplateDict

class MisalignmentPromptTemplateDict(TextPromptDict):

A dictionary containing :obj:TextPrompt used in the Misalignment task.

Attributes: DAN_PROMPT (TextPrompt): A prompt for jail breaking. GENERATE_TASKS (TextPrompt): A prompt to list unique malicious that the AI assistant can assist AI user with. TASK_SPECIFY_PROMPT (TextPrompt): A prompt to specify a task in more detail. ASSISTANT_PROMPT (TextPrompt): A system prompt for the AI assistant that outlines the rules of the conversation and provides instructions for completing tasks. USER_PROMPT (TextPrompt): A system prompt for the AI user that outlines the rules of the conversation and provides instructions for giving instructions to the AI assistant.

init

def __init__(self, *args: Any, **kwargs: Any):