-
Notifications
You must be signed in to change notification settings - Fork 85
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Persistent Index? #98
Comments
To be able to use indexes, RDBMS have to have control over modification of indexed data, so changes to it could be reflected on indexes as well. Since this application doesn't control who changes files on your disk and how, i'm not sure how it would work. Perhaps adding FS watcher (like with Or populating CSV file with |
Maybe using third-party indices is a more viable solution for |
Sounds like you're looking more for a cache? (I suppose this is one of those things where SQL & Filesystem terms kinda clash... because It would seem ideal and fitting to store cached results in some kind of database... (Perhaps SQLite or otherwise?) Maybe even two modes:
I think it would be important to actually benchmark how long it takes to scan videos, etc. Using the sha256 (or ) as the primary key would allow the file path to change to anywhere and still have metadata on the media file... But whether or not this is actually faster would be down to [Time to Scan Video File Metadata] vs. [Time to Hash File]. On systems with SHA processor extensions, it'd probably be faster to hash the file, but that's just a guess. (Storing the filepath alongside in the DB would have the advantage that you could easily add verification of files at zero performance penalty - if you're already hashing the file, that is.) On the file name front, maybe it's possible to use an existing mlocate/plocate database to speed up name searches? (Pretty sure those two don't store file metadata though.) |
Any plans to allow for the creation + update of a persistent index (perhaps SQLite backed)?
I'd love to be able to query over a large amount of data, where realtime queries for things like video width / height is extremely slow.
Thanks
The text was updated successfully, but these errors were encountered: