Problem with protein/transcript identifiers

Hi Hesham,

this is not a real problem but solving it would make life easier :-D
I successfully ran VCF2PROT v0.1.4 but I add to correct the transcript identifier in the VCF file as well as in the reference fasta file.

My bcftools-annotated VCF file has `Solyc02g062560.3|Solyc02g062560.3.1` identifiers in the BCSQ fields and my protein file header is `Solyc02g062560.3.1`.
It seems that the `.` in the sequence name is causing some problem and in this case, the output `.fasta` file was empty. After removing the end of the sequence name (moving to  `Solyc02g062560.3|Solyc02g062560` in the VCF and to  `Solyc02g062560` in the reference fasta), vcf2prot finally succeeded in writing the proper corrected sequences.

Hope it will help for the future!

Best regards,

Thomas


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem with protein/transcript identifiers #2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Problem with protein/transcript identifiers #2

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions