This can be used for: * downloading the dataset (only for once, to avoid the HF network issue) * starting a local search engine * etc.