Why use nonempty bins rather than all bins? #13

DHPO · 2019-03-14T07:00:27Z

Why you divide weights by nonempty bins (n) rather than all bins(self.bins)?

GHM_Detection/mmdetection/mmdet/core/loss/ghm_loss.py

Line 54 in 3647287

weights = weights / n

I think M is the amount of all bins in the paper. Am I missing something?

The text was updated successfully, but these errors were encountered:

libuyu · 2019-03-14T09:17:38Z

@DHPO You are right. In the paper, we define the M as the number of all bins. And in the latest version of our code, we choose the number of valid (non-empty) bins.

Suppose that you have 100 bins, and all the examples have the same gradient norm of 0.8 (although this is impossible in practice). Then each example will get a harmonizing parameter of 1/100 according to the original equation. And when the bin number is 10000, the parameter will become 1/10000. But in these cases, we would like to use a harmonizing parameter of 1 for all examples since they should be equally treated and should not be down-weighted. And the harmonizing parameters should not depend on the bin numbers. So we think the number of valid bins is more reasonable.

Thank you for reading the code and paper so carefully.

DHPO mentioned this issue Nov 22, 2019

bin_count * nonempty_bins DHPO/GHM_Loss.pytorch#1

Closed

DovahCoding mentioned this issue Dec 5, 2019

closed #28

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why use nonempty bins rather than all bins? #13

Why use nonempty bins rather than all bins? #13

DHPO commented Mar 14, 2019

libuyu commented Mar 14, 2019 •

edited

Loading

Why use nonempty bins rather than all bins? #13

Why use nonempty bins rather than all bins? #13

Comments

DHPO commented Mar 14, 2019

libuyu commented Mar 14, 2019 • edited Loading

libuyu commented Mar 14, 2019 •

edited

Loading