FasterRCNN bug with grayscale input #10338
Unanswered
adeschemps
asked this question in
Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 2 comments
-
It seems the input is automatically converted to 3 channels during normalization (see below). That is not necessarily the desired behavior: one could instead copy the single channel 3 times, or change the architecture.
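A minimal sketch of what is probably going on, assuming the model normalizes with a 3-element per-channel mean and std as torchvision's detection transform does: subtracting a `(3, 1, 1)` mean from a `(1, H, W)` image broadcasts the result to 3 channels, so no shape error is raised. The `normalize` helper and the ImageNet statistics are illustrative assumptions, not code copied from the library.

```python
import torch

# Assumed per-channel statistics (the usual ImageNet values); illustrative only.
image_mean = torch.tensor([0.485, 0.456, 0.406])
image_std = torch.tensor([0.229, 0.224, 0.225])


def normalize(image, mean, std):
    # Hypothetical helper mirroring a per-channel normalization:
    # mean[:, None, None] has shape (3, 1, 1), so a (1, H, W) image
    # is silently broadcast to (3, H, W).
    return (image - mean[:, None, None]) / std[:, None, None]


torch.manual_seed(0)
gray = torch.rand(1, 64, 64)  # single-channel (grayscale) image

normalized = normalize(gray, image_mean, image_std)
print(normalized.shape)  # 3 channels, each scaled with a different mean/std

# Alternative mentioned above: copy the single channel 3 times explicitly.
repeated = gray.repeat(3, 1, 1)
print(repeated.shape)
```

Under these assumptions, the broadcast result is not three identical copies: each channel is the same grayscale image shifted and scaled by a different mean and std, whereas `repeat` makes the duplication explicit before any normalization.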
-
Dear @adeschemps, I believe you should open this issue on Bolts directly. Best,
-
The following code doesn't fail, even though I believe it should:
as `print(model)` returns:
showing that the first convolution of the backbone expects an input with 3 channels instead of one. This is confirmed by https://github.com/pytorch/vision/blob/3300692c6e7c2023d2f2356a69ec22ca91e38790/torchvision/models/resnet.py#L323:
Because of this, I don't understand what the model is actually computing, which is very confusing. Any help would be much appreciated.