This package provides tools for working with the Semantic Scholar Open Research Corpus data and the Semantic Scholar API.
Work in progress
go get -u github.com/max-planck-innovation-competition/go-semanticscholar
NEO4J=TRUE // enables the Neo4j-specific CSV header formatting for database ingestion
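As a rough illustration of what such a switch might do, the sketch below reads the NEO4J environment variable and picks a CSV header row accordingly. It is not taken from this package: the function names `neo4jEnabled` and `paperHeader`, the column names, and the case-insensitive comparison are all assumptions. The typed header (`id:ID`, `year:int`, `:LABEL`) follows the header convention used by Neo4j's bulk CSV import.

```go
package main

import (
	"encoding/csv"
	"fmt"
	"os"
	"strings"
)

// neo4jEnabled reports whether the NEO4J environment variable is set to TRUE.
// Comparing case-insensitively is an assumption made for this sketch.
func neo4jEnabled() bool {
	return strings.EqualFold(os.Getenv("NEO4J"), "TRUE")
}

// paperHeader returns the header row for a hypothetical paper CSV file.
// With neo4j set, the columns carry Neo4j import annotations.
func paperHeader(neo4j bool) []string {
	if neo4j {
		return []string{"id:ID", "title", "year:int", ":LABEL"}
	}
	return []string{"id", "title", "year"}
}

func main() {
	w := csv.NewWriter(os.Stdout)
	w.Write(paperHeader(neo4jEnabled()))
	w.Flush()
	if err := w.Error(); err != nil {
		fmt.Fprintln(os.Stderr, err)
	}
}
```

Running it as `NEO4J=TRUE go run .` prints the annotated header, while running it without the variable prints the plain one.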
The corpus contains Semantic Scholar's records for research papers published in all fields, provided as an easy-to-use JSON archive.
The corpus can be downloaded from: http://s2-public-api-prod.us-west-2.elasticbeanstalk.com/corpus/download/
The program can handle compressed (.gz) and uncompressed files.
results, err := semanticscholar.ParseFile("/PATH/TO/BULK/DATA/DIRECTORY/FILE")
With compressed data:
results, err := semanticscholar.ParseFile("/PATH/TO/BULK/DATA/DIRECTORY/FILE.gz")
results, err := semanticscholar.ReadFromDirectory("/PATH/TO/BULK/DATA/DIRECTORY")
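The transparent handling of compressed and uncompressed input can be sketched with the Go standard library alone. This is a minimal illustration, not this package's implementation: it assumes the corpus files hold one JSON object per line, and the names `paper`, `gzipFile`, `openMaybeGzip`, `parseRecords`, `parseFile`, and `demo`, as well as the three fields decoded here, are hypothetical.

```go
package main

import (
	"bufio"
	"compress/gzip"
	"encoding/json"
	"fmt"
	"io"
	"os"
	"path/filepath"
	"strings"
)

// paper holds a small, assumed subset of the fields in a corpus record.
type paper struct {
	ID    string `json:"id"`
	Title string `json:"title"`
	Year  int    `json:"year"`
}

// gzipFile closes both the gzip stream and the underlying file.
type gzipFile struct {
	*gzip.Reader
	file *os.File
}

func (g *gzipFile) Close() error {
	zerr := g.Reader.Close()
	ferr := g.file.Close()
	if zerr != nil {
		return zerr
	}
	return ferr
}

// openMaybeGzip opens path and decompresses it when the name ends in ".gz".
func openMaybeGzip(path string) (io.ReadCloser, error) {
	f, err := os.Open(path)
	if err != nil {
		return nil, err
	}
	if !strings.HasSuffix(path, ".gz") {
		return f, nil
	}
	zr, err := gzip.NewReader(f)
	if err != nil {
		f.Close()
		return nil, err
	}
	return &gzipFile{Reader: zr, file: f}, nil
}

// parseRecords decodes one JSON object per non-empty line.
func parseRecords(r io.Reader) ([]paper, error) {
	var out []paper
	sc := bufio.NewScanner(r)
	sc.Buffer(make([]byte, 0, 64*1024), 16*1024*1024) // allow long lines
	for sc.Scan() {
		if strings.TrimSpace(sc.Text()) == "" {
			continue
		}
		var p paper
		if err := json.Unmarshal(sc.Bytes(), &p); err != nil {
			return nil, err
		}
		out = append(out, p)
	}
	return out, sc.Err()
}

// parseFile reads a plain or gzip-compressed file of records.
func parseFile(path string) ([]paper, error) {
	rc, err := openMaybeGzip(path)
	if err != nil {
		return nil, err
	}
	defer rc.Close()
	return parseRecords(rc)
}

// demo writes two sample records to a temporary file, optionally
// gzip-compressed, and parses them back.
func demo(compress bool) ([]paper, error) {
	dir, err := os.MkdirTemp("", "corpus")
	if err != nil {
		return nil, err
	}
	defer os.RemoveAll(dir)
	data := `{"id":"a1","title":"First paper","year":2018}` + "\n" +
		`{"id":"b2","title":"Second paper","year":2019}` + "\n"
	path := filepath.Join(dir, "sample.json")
	if compress {
		path += ".gz"
	}
	f, err := os.Create(path)
	if err != nil {
		return nil, err
	}
	if compress {
		zw := gzip.NewWriter(f)
		io.WriteString(zw, data)
		err = zw.Close()
	} else {
		_, err = io.WriteString(f, data)
	}
	f.Close()
	if err != nil {
		return nil, err
	}
	return parseFile(path)
}

func main() {
	for _, compress := range []bool{false, true} {
		papers, err := demo(compress)
		fmt.Println(compress, len(papers), err)
	}
}
```

The sketch chooses the decoder by file extension. Checking the gzip magic bytes instead would also handle compressed files that lack the `.gz` suffix.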
- Sebastian Erhardt