norm split multiallelics does not split SVLEN (v4.2 Number=.) #2371
Comments
The program splits fields declared as Number=A; otherwise there is no guarantee which value belongs to which allele. I see the VCF specification suggests Number=., but it is unclear why, as it also says there should be "one value for each ALT allele", so Number=A would make more sense. As a quick workaround, the header can be modified with
Thanks. Looks like the spec was changed from Number=. in 4.3 to Number=A in 4.4. With the NovaSeq X having DRAGEN built in, there will be a lot of these files.
OK. Definitely the best solution is to use
I understand not wanting special-case code that does something not technically correct. I think this may bite a lot of people, though, so as a compromise, how about a warning when splitting multiallelics, emitted the first time multiple values are found for an SVLEN declared as Number=.?
Cheers @pd3 - slight typo in your change - "INFO/SVLE"
Is split into:
I think it should be:
Full example
Command line:
Input file:
Output:
Version:
Workaround:
Post process VCF, use index at the end of old-rec-tag to pick which SVLEN to take
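A rough sketch of that post-processing step, in case it helps anyone else. It works on the INFO column only and assumes the split was run with `--old-rec-tag` so that each record carries the original record with the ALT index as the last `|`-separated field. The tag name `OLD_REC`, the helper name `pick_svlen`, and the 1-based index are my assumptions, not something confirmed in this thread, so check them against your own output first.

```python
def pick_svlen(info: str, old_rec_tag: str = "OLD_REC", one_based: bool = True) -> str:
    """Keep only the SVLEN value that matches this record's ALT allele.

    `old_rec_tag` is whatever name was passed to --old-rec-tag (hypothetical
    default here). `one_based` is an assumption about how the ALT index at
    the end of the tag is counted; flip it if your output differs.
    """
    fields = info.split(";")
    # Flags (no "=") are skipped here but kept in the output below.
    kv = dict(f.split("=", 1) for f in fields if "=" in f)
    if old_rec_tag not in kv or "SVLEN" not in kv:
        return info  # record was not split, or has no SVLEN

    svlens = kv["SVLEN"].split(",")
    if len(svlens) < 2:
        return info  # already a single value

    try:
        # The index is the last "|"-separated field of the old-record tag.
        alt_idx = int(kv[old_rec_tag].rsplit("|", 1)[1])
    except (IndexError, ValueError):
        return info  # tag not in the expected shape, leave untouched

    pos = alt_idx - 1 if one_based else alt_idx
    if not 0 <= pos < len(svlens):
        return info  # index out of range, leave untouched

    chosen = svlens[pos]
    return ";".join(f"SVLEN={chosen}" if f.startswith("SVLEN=") else f for f in fields)


example = "SVTYPE=DEL;SVLEN=-100,-200;OLD_REC=chr1|1000|A|<DEL>,<DUP>|2"
print(pick_svlen(example))
```

Records that were never split, or whose tag does not look as expected, are passed through unchanged, so it should be safe to run over the whole file line by line.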