We have some Groovy code that runs sanity checks, e.g. that the phenotype file is valid. These checks require the data to be on the local file system, but some users want to keep their data in S3.