Gemma propagation doesn't recognise substantia nigra as part of the brain and #842
Comments
Thanks for pointing this out. Substantia nigra should definitely be included in "brain" as it is a child term. But for GSE161045, UBERON_0002038 is not putamen, it's substantia nigra.
Yep. Pasted the wrong term. It is UBERON_0001874
The annotations do appear to be ontology terms and not free text. UBERON_0002038 does appear in the propagated filter despite missing the dataset annotated with it, similar to the putamen term. UBERON_0001965, UBERON_0002661 and UBERON_0001966 however do not appear in the propagated filter at all https://gemma.msl.ubc.ca/rest/v2/datasets/GSE168496/annotations
Ok, substantia nigra is inferred, but not substantia nigra pars compacta, which is a child as per the "part of" relation. I'll debug that...
just to make sure, while substantia nigra is inferred, the dataset annotated with it still isn't within the results
It looks like a bug in the inference code. I can reproduce this. With Uberon base OWL, I get none of the terms and regular Uberon only returns "substantia nigra"
This could be due to the fact we're only using the transitive inference instead of the full OWL inference.
I'll try using the micro OWL instead of the transitive inference.
I did some additional tests and it turns out that for our version of Jena, inference with [...]. I've introduced a workaround in baseCode to revisit parents/children that might have been missed due to limited inference capabilities: PavlidisLab/baseCode@09d7b20. We won't need to use a newer version of Jena for now as per PavlidisLab/baseCode#36. I'll push the fix on the development branch for testing.
Update baseCode to 1.1.19, which adds support for setting language levels and inference mode. Allow language level and inference mode to be fine-tuned in OntologyServiceFactory. The combination of the two has a significant impact on loading time, so they must be set carefully. Add support for setting additional property URIs to be used for inferring parents and children. Add tests to ensure that "substantia nigra pars compacta" is inferred from "brain".
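For readers following along, here is a rough sketch of the idea behind the workaround: when the reasoner only yields direct children, revisit each child in turn until no new terms show up, following a configurable set of properties. This is not the baseCode implementation (which is Java on top of Jena). The edge list, the property names, and the functions `direct_children` and `infer_children` are all made up for illustration, and the toy graph does not reproduce the real Uberon axioms.

```python
from collections import deque

# Toy graph of (child, property, parent) triples.
# Hypothetical and simplified; NOT the actual Uberon axioms.
EDGES = [
    ("substantia nigra", "part_of", "brain"),
    ("substantia nigra pars compacta", "part_of", "substantia nigra"),
]


def direct_children(term, properties):
    """One hop only: children linked to `term` through one of `properties`."""
    return {
        child
        for child, prop, parent in EDGES
        if parent == term and prop in properties
    }


def infer_children(term, properties):
    """Revisit every child found so far until no new term appears."""
    seen = set()
    queue = deque([term])
    while queue:
        current = queue.popleft()
        for child in direct_children(current, properties):
            if child not in seen:
                seen.add(child)
                queue.append(child)
    return seen


one_hop = direct_children("brain", {"part_of"})
revisited = infer_children("brain", {"part_of"})
print(sorted(one_hop))
print(sorted(revisited))
```

In this toy setup the one-hop lookup stops at "substantia nigra", while revisiting also reaches "substantia nigra pars compacta". Passing a property set that does not include "part_of" finds nothing, which is why the set of properties used for inference matters.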
None of the examples above are returned with the 'allCharacteristics.valueUri = http://purl.obolibrary.org/obo/UBERON_0000955' filter in the development version as of now.
I can confirm that both "substantia nigra" and "substantia nigra pars compacta" are inferred from brain. I will investigate why those specific datasets are not coming up.
I see! The tag is on the sample itself, but we only index the FactorValue of the BioMaterial. This is utterly useless because those factors are already declared in the experimental design which we add separately.
Fully fixed in 570a7c6
When using the term for brain (http://purl.obolibrary.org/obo/UBERON_0000955) to fetch datasets from all brain regions, the propagation seems to be missing a few terms. Most, but not all, of the missing items can be explained by terms missing from the propagated filter that the endpoint returns.
Attempted query: https://gemma.msl.ubc.ca/rest/v2/datasets/?filter=allCharacteristics.valueUri%20%3D%20http%3A%2F%2Fpurl.obolibrary.org%2Fobo%2FUBERON_0000955
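For anyone wanting to reproduce the query with a different term, a minimal sketch of how the URL above can be assembled is shown here. It only builds the string and makes no request; the helper name `build_filter_url` is hypothetical, and the filter syntax is simply copied from the attempted query.

```python
from urllib.parse import quote

# Base URL taken from the attempted query above.
GEMMA_DATASETS_URL = "https://gemma.msl.ubc.ca/rest/v2/datasets/"


def build_filter_url(prop, value):
    """Return a datasets URL whose filter is `prop = value`, percent-encoded."""
    expression = f"{prop} = {value}"
    # safe="" so that ':' and '/' inside the term URI are encoded as well.
    return f"{GEMMA_DATASETS_URL}?filter={quote(expression, safe='')}"


BRAIN_URI = "http://purl.obolibrary.org/obo/UBERON_0000955"
url = build_filter_url("allCharacteristics.valueUri", BRAIN_URI)
print(url)
```

Swapping `BRAIN_URI` for a more specific term URI gives the equivalent query for that term, which makes it easier to compare what the propagated filter returns against a direct lookup.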
Missing results that should have been in there