If `0`, the pipeline will generate an initial trace without any improvement iterations. (default: :obj:`3`)
`{"correctness": 0.8, "coherence": 0.7}`. If a reward model is used and the threshold for a dimension is not specified, the default value 0.7 is used. (default: :obj:`0.7`)
(default: :obj:`None`)
(default: :obj:`None`)
If `None`, uses Agent self-evaluation. (default: :obj:`None`)
If `None`, results will only be returned without saving to file. (default: :obj:`None`)
(default: :obj:`None`)
(default: :obj:`None`)
(default: :obj:`None`)
(default: :obj:`r'\\boxed{(.*?)}'`)
If `None`, uses the same pattern as `solution_pattern`. (default: :obj:`None`)
(default: :obj:`None`)
(default: :obj:`None`)
(default: :obj:`False`)
(default: :obj:`False`)
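
Three of the defaults described above can be illustrated with a minimal sketch: per-dimension thresholds that fall back to 0.7, the `\boxed{...}` solution pattern, and a second pattern that falls back to `solution_pattern` when it is `None`. This is not the pipeline's actual code. The helper names `passes_thresholds`, `resolve_patterns`, and `extract_solution` are hypothetical, and the `>=` comparison is an assumption about how a score is checked against its threshold.

```python
import re
from typing import Dict, Optional, Tuple

DEFAULT_THRESHOLD = 0.7  # fallback stated in the parameter description
DEFAULT_SOLUTION_PATTERN = r'\\boxed{(.*?)}'  # default shown in the parameter list


def passes_thresholds(scores: Dict[str, float],
                      thresholds: Dict[str, float]) -> bool:
    """Hypothetical helper: every scored dimension must meet its threshold.

    Dimensions missing from `thresholds` fall back to DEFAULT_THRESHOLD.
    Using `>=` is an assumption, not confirmed by the source.
    """
    return all(
        score >= thresholds.get(dim, DEFAULT_THRESHOLD)
        for dim, score in scores.items()
    )


def resolve_patterns(solution_pattern: str = DEFAULT_SOLUTION_PATTERN,
                     trace_pattern: Optional[str] = None) -> Tuple[str, str]:
    """Hypothetical helper: a `None` trace pattern reuses the solution pattern."""
    if trace_pattern is None:
        trace_pattern = solution_pattern
    return solution_pattern, trace_pattern


def extract_solution(text: str,
                     pattern: str = DEFAULT_SOLUTION_PATTERN) -> Optional[str]:
    """Return the first captured group, or None when the pattern does not match."""
    match = re.search(pattern, text)
    return match.group(1) if match else None


if __name__ == "__main__":
    thresholds = {"correctness": 0.8, "coherence": 0.7}
    # "clarity" has no explicit threshold, so 0.7 applies to it.
    print(passes_thresholds(
        {"correctness": 0.85, "coherence": 0.75, "clarity": 0.72}, thresholds))
    print(resolve_patterns())
    print(extract_solution(r"The final answer is \boxed{42}."))
```

Because the capture group is non-greedy, the default pattern stops at the first closing brace, so an answer with nested braces such as `\boxed{\frac{1}{2}}` is captured only up to `\frac{1`. A custom pattern is one way to handle such cases.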