"auto" resolution model will be treated as :obj:"high". All images with :obj:"low" detail cost 85 tokens each. Images with :obj:"high" detail are first scaled to fit within a 2048 x 2048 square, maintaining their aspect ratio. Then, they are scaled such that the shortest side of the image is 768px long. Finally, we count how many 512px squares the image consists of. Each of those squares costs 170 tokens. Another 85 tokens are always added to the final total. For more details please refer to OpenAI vision docs.
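The arithmetic described above can be sketched as a small standalone function. This is only an illustration, not the library's actual implementation: the name `estimate_image_tokens` and its signature are hypothetical, and the sketch assumes that images are only ever scaled down (an image already smaller than the limits is left at its original size), which the description above does not state explicitly.

```python
import math


def estimate_image_tokens(width: int, height: int, detail: str = "auto") -> int:
    """Estimate the token cost of one image (hypothetical helper)."""
    base_tokens = 85        # flat cost, also the full cost of a "low" detail image
    tokens_per_tile = 170   # cost of each 512px square
    tile_size = 512

    if detail == "low":
        return base_tokens

    # "auto" is treated the same as "high".
    w, h = float(width), float(height)

    # Step 1: fit within a 2048 x 2048 square, keeping the aspect ratio.
    # Assumption: never upscale, so the factor is capped at 1.
    scale = min(1.0, 2048 / max(w, h))
    w, h = w * scale, h * scale

    # Step 2: shrink so the shortest side is 768px.
    # Assumption: again only downscale.
    scale = min(1.0, 768 / min(w, h))
    w, h = w * scale, h * scale

    # Step 3: count the 512px squares needed to cover the image.
    tiles = math.ceil(w / tile_size) * math.ceil(h / tile_size)

    return tiles * tokens_per_tile + base_tokens
```

For example, a 1024 x 1024 image with high detail is scaled to 768 x 768, which is covered by 4 squares, giving 4 * 170 + 85 = 765 tokens. A 2048 x 4096 image is first scaled to 1024 x 2048, then to 768 x 1536, which needs 6 squares, giving 6 * 170 + 85 = 1105 tokens.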
Parameters: