You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm from https://huggingface.co/datasets/U4R/GeoX-data The data and pretrain files have been downloaded from this website. May I ask which 100 million token dataset was used to train geoLLM in the first stage of the paper?