-
Notifications
You must be signed in to change notification settings - Fork 58
Web VEP: fix incorrect variant end #1059
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: postreleasefix/112
Are you sure you want to change the base?
Web VEP: fix incorrect variant end #1059
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @nuno-agostinho ,
Thanks for the changes. I can see it has updated the END value of the variant to be same as what we get in the CLI. I tested with two variants, and from CLI I get -
rs72549353: 22:42128251-42128254
rs5030656: 22:42128176-42128178
In the sandbox they gave the same END value - http://wp-np2-11.ebi.ac.uk:7070/Homo_sapiens/Tools/VEP/Results?tl=smKQK4ocAyXPdUac-1
But you will notice that the start value does not match. Is that something you think we should consider? Notice that in the CLI we minimise the allele and gets the position.
|
The first variant got position - The second variant got - |
|
Hey @nakib, do you think we should output a
|
Hey there, @nuno-agostinho! Nakib here, albeit a different one. :) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @nuno-agostinho,
Yes it makes sense to use $vf->{original_start}. We have UPLOADED_ALLELE and REF_ALLELE in the web VEP output. Than it would at least be matched with the UPLOADED_ALLELE. Furthermore this is also what is reported by the Variant Table page for location.
Currently it is not matching either of the allele from those two column values.
User-requested bug fix: This PR fixes the incorrect
ENDlocation for short variants in web VEP by storing this information into theINFO/ENDfield by default, as already done for SVs.Motivation
Web VEP is returning incorrect
ENDlocation for multiple variants when using rsIDs. Given that web VEP output is in VCF format, noENDlocation is available by default. To prepare the variant location, web VEP first checks if it's stored inINFO/ENDfield, otherwise it calculates based on the length of the reference allele: https://github.com/Ensembl/public-plugins/blob/main/tools/modules/EnsEMBL/Web/TmpFile/VcfTabix.pm#L254-L264VEP stores the correct
ENDposition in the VariantFeature object. As such, we simply need to save this into theINFO/ENDfield of the VCF output, like in SVs. As this is done for SVs by default, this could also be the default for short variants.Testing
Compare results for the following variants between:
ENDlocation that is incorrect when compared with VEP CLI