Hello, thank you for your impressive work on GPGait/GPGait++ and for sharing the code.
I am currently trying to train GPGait/GPGait++ on the OUMVLP-Pose dataset.
However, I encountered a shape mismatch error:
ValueError: could not broadcast input array from shape (17,3) into shape (18,3)
It seems that the error is caused by the discrepancy between the number of keypoints in the dataset (18 keypoints, AlphaPose format) and the model/code expectation (17 keypoints, COCO format).
The GPGait paper states:
"For OUMVLP-Pose, the sequences are generated by AlphaPose [7] consisting of 18 keypoints for each frame. In our experiments, we transform the keypoints into the COCO2017 format with 17 keypoints for the cross-domain evaluation."
However, I checked the data loading code and could not find the implementation for this specific transformation (18 -> 17 keypoints). It seems the code attempts to process the data without converting the format, leading to the broadcast error.
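For reference, here is a minimal sketch of the kind of 18-to-17 remapping I would expect, not the authors' actual implementation. It assumes the 18-keypoint OUMVLP-Pose layout follows the OpenPose-style order (nose, neck, right arm, left arm, right leg, left leg, eyes, ears), which I have not confirmed against the dataset documentation. The names `ALPHAPOSE18_TO_COCO17` and `convert_18_to_17` are my own and do not exist in the repository.

```python
import numpy as np

# ASSUMPTION: the 18-keypoint source order is OpenPose-style:
# 0 nose, 1 neck, 2 r_shoulder, 3 r_elbow, 4 r_wrist, 5 l_shoulder,
# 6 l_elbow, 7 l_wrist, 8 r_hip, 9 r_knee, 10 r_ankle, 11 l_hip,
# 12 l_knee, 13 l_ankle, 14 r_eye, 15 l_eye, 16 r_ear, 17 l_ear.
# Verify this against the dataset before relying on the mapping.
#
# COCO 17-keypoint target order:
# 0 nose, 1 l_eye, 2 r_eye, 3 l_ear, 4 r_ear, 5 l_shoulder, 6 r_shoulder,
# 7 l_elbow, 8 r_elbow, 9 l_wrist, 10 r_wrist, 11 l_hip, 12 r_hip,
# 13 l_knee, 14 r_knee, 15 l_ankle, 16 r_ankle.
#
# Entry i is the source index that fills COCO keypoint i.
# The neck (source index 1) has no COCO counterpart and is dropped.
ALPHAPOSE18_TO_COCO17 = np.array(
    [0, 15, 14, 17, 16, 5, 2, 6, 3, 7, 4, 11, 8, 12, 9, 13, 10]
)


def convert_18_to_17(pose):
    """Reorder (..., 18, C) keypoints into (..., 17, C) COCO order."""
    pose = np.asarray(pose)
    if pose.ndim < 2 or pose.shape[-2] != 18:
        raise ValueError(f"expected shape (..., 18, C), got {pose.shape}")
    # Fancy indexing on the keypoint axis reorders and drops the neck.
    return pose[..., ALPHAPOSE18_TO_COCO17, :]


if __name__ == "__main__":
    # Dummy sequence: 30 frames, 18 keypoints, (x, y, confidence).
    seq = np.random.rand(30, 18, 3)
    print(convert_18_to_17(seq).shape)
```

If the actual source order differs, only the index array should need to change.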
Question & Requests
- Missing Keypoint Transformation: Could you advise on how to handle this mismatch?
  - Is there a specific preprocessing script or a flag I missed that handles the 18-to-17 conversion?
  - If not, could you share the code snippet used for this transformation in your experiments?
- Pretrained Checkpoints: I noticed that releasing model checkpoints is on the TODO list (or commonly requested).
  - Could you provide the pretrained checkpoints (.pth or .ckpt) for GPGait++?
  - Having the checkpoints would be extremely helpful for verifying my environment and reproducing the results reported in the paper.
Thank you in advance for your help!