Avoid loading EFO ontology for each PMID in Data Release #1589

ala-ebi · 2025-03-12T10:37:52Z

In the DR, each PMID job loads the ontology into memory before indexing starts. This takes time, and sometimes takes longer than the indexing itself. We can eliminate this in different ways; one is to bundle multiple PMIDs in the same job, which the indexer already supports using -p pmid1,pmid2,pmid3...
Ideally we need some intelligence to figure out which PMIDs to bundle together in the same job; we don't want multiple big PMIDs in the same job.
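
One possible way to do the bundling is a greedy "largest first" assignment: sort PMIDs by size and always give the next one to the job with the smallest total so far, which keeps the big PMIDs apart. This is only a sketch. The `sizes` mapping (for example, a record count per PMID), the PMID values, and the helper names `bundle_pmids` and `to_p_argument` are all hypothetical; the only thing taken from the indexer is the comma-separated `-p` format.

```python
import heapq


def bundle_pmids(sizes, n_jobs):
    """Split PMIDs into at most n_jobs bundles with roughly equal total size.

    sizes: dict mapping PMID (str) to a size estimate (hypothetical metric,
           e.g. number of records to index for that PMID).
    """
    if n_jobs < 1:
        raise ValueError("n_jobs must be at least 1")

    # Min-heap of (total size so far, job index); the index breaks ties.
    heap = [(0, i) for i in range(n_jobs)]
    heapq.heapify(heap)
    bundles = [[] for _ in range(n_jobs)]

    # Largest PMIDs first, each one going to the currently lightest job.
    for pmid, size in sorted(sizes.items(), key=lambda kv: (-kv[1], kv[0])):
        total, idx = heapq.heappop(heap)
        bundles[idx].append(pmid)
        heapq.heappush(heap, (total + size, idx))

    # Drop empty bundles when there are fewer PMIDs than jobs.
    return [b for b in bundles if b]


def to_p_argument(bundle):
    """Format a bundle as the value passed to the indexer's -p option."""
    return ",".join(bundle)


# Made-up PMIDs and sizes, for illustration only.
sizes = {"111": 100, "222": 90, "333": 10, "444": 5, "555": 5}
bundles = bundle_pmids(sizes, n_jobs=2)
for bundle in bundles:
    print("-p", to_p_argument(bundle))
```

With these example sizes the two big PMIDs ("111" and "222") end up in different jobs, and the small ones fill in around them. The ontology is then loaded once per bundle instead of once per PMID.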
