
GitLotus is a tool for extracting RDF data from Git repositories.
Git-based project data holds valuable insights beyond version control, including collaboration patterns and project management details. However, the heterogeneity of Git hosting platforms complicates unified analysis. GitLotus addresses this challenge by transforming Git repositories into RDF, enabling a structured and platform-independent representation. Hence, our approach is generating an easy-to-use bridge from the source code world to the semantic web; just like a Lotus flower floats on a lake, GitLotus floats on Git data and creates new offsprings (views). It extracts repository data via the Git protocol, integrates metadata from GitHub while handling API limitations, and exposes results through persistent SPARQL endpoints.
-
Extracts Git repository data via the Git protocol
-
Integrates metadata from GitHub while handling API limitations
-
Exposes results through persistent SPARQL endpoints (one for each stored Git repository transformation task) or as downloadable RDF/Turtle files
GitLotus is a Docker-based application that can be installed on a server or a local machine via Docker compose (for installation instructions see project-deployment-compose) or as a standalone applications where four Spring Boot components need to be installed separately:
-
GitLotus listener service for accepting Git repositories and providing status information
-
GitLotus worker service for scheduling conversion of Git repositories to RDF files
-
GitLotus SPARQL query endpoint for querying the extracted RDF data from the different instances
-
GitLotus data sink (we use by default a PostgreSQL database, but other data sinks can be used)
The GitLotus RDF vocabulary defines the classes and properties used to represent Git repository data in RDF. It is available at gitlotus-vocabulary.
We are happy to receive your contributions. Please create a pull request or an issue in the corresponding repository. As this tool is published under the MIT license, feel free to fork it and use it in your own projects.