Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using OSM GPX planet as a data-source #32

Open
laurentg opened this issue Apr 15, 2015 · 2 comments
Open

Using OSM GPX planet as a data-source #32

laurentg opened this issue Apr 15, 2015 · 2 comments

Comments

@laurentg
Copy link
Collaborator

Announcing the GPX Planet. 2.6 Trillion GPX points.

It could be nice to be able to use this (or part of it) as a GPS source.

@kpwebb
Copy link
Contributor

kpwebb commented Apr 22, 2015

Let's ingest it! Just need to get the traces into a time sorted format with and we're good! We just need:

trace_id,time,lat,lon

2.6 trillion points will take a bit of crunching but we're currently getting about 15k points/sec/thread (and lots of room still for optimization) so that's 2000 days on a single thread. Scaling up is linear by throwing more CPU power at this.

@laurentg
Copy link
Collaborator Author

There exists regional extracts that can simplify things where only a small region is needed (GPX files in each extracts are stored by lat/lon bins). Also lots of those files are walk / bike / boat / planes etc... traces, a pre-filtering by mode would help reduce the amount of data to process (altough some heuristics are needed as the traces do not have mode information attached to them).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants