``` ValueError: size must contain 'shortest_edge' and 'longest_edge' keys. ``` related code is https://github.com/Alpha-Innovator/OmniCaptioner/blob/4c07d277d02f326973da8d15c3625d526ed4ac82/src/inference_single_image.py#L191-L195 transformer version is 4.51.2 how to fix that?