Is the CogAgent described in the paper the same model as the open-source CogAgent9B-20241220? I haven't been able to find the corresponding lightweight high-resolution image encoder in the open-source model. It also seems that CogAgent9B-20241220 is based on ChatGLM, whereas the CogAgent in the paper is built on CogVLM.