Skip to content

Figures

Dean Sumner edited this page Mar 21, 2022 · 5 revisions

Covariate Shift:

Plot showing the covariate shift for labels in provided train and validation data:

image

The distribution in GEX (labels) values across all chromosomes is not drastically different, given a randomly balanced training subset vs full validation set. The data extends in the negative range which is something to keep in mind. Also remember that the data is a ZINB distribution - may cause issues with the Gaussian input assumption of deep-neural nets. - Dean

Clone this wiki locally