pdbfixer is mislabeling built residues with negative res IDs #175

nitroamos · 2018-10-03T19:47:50Z

In this line

        newResidue = chain.topology.addResidue(residueName, chain, "%d" % ((firstIndex+i)%10000))

PDBFixer is wrapping negative residue numbers around 10000, meaning that a residue whose number is supposed to be -4 is ending up as 9996.

One fix would look like this:

        newResId = firstIndex+i
        if len(str(newResId)) >= 5:
	   newResId = (firstIndex+i)%10000
        newResidue = chain.topology.addResidue(residueName, chain, "%d" % (newResId))

which is closer to what happens in OpenMM

Or even simpler would be to not do the modulo in PDBFixer since OpenMM does it.

The text was updated successfully, but these errors were encountered:

peastman · 2018-10-03T19:55:57Z

Where did you find a PDB file with negative residue numbers? Residue numbers are supposed to be the index within the SEQRES section, which by definition can never be negative.

nitroamos · 2018-10-04T01:49:43Z

In my test case, it's coming from a REMARK 465 section which is integrated with PDBFixer as outlined here. For example, here's a random one Google found for me, take a look here

REMARK 465                                                                      
REMARK 465 MISSING RESIDUES                                                     
REMARK 465 THE FOLLOWING RESIDUES WERE NOT LOCATED IN THE                       
REMARK 465 EXPERIMENT. (RES=RESIDUE NAME; C=CHAIN IDENTIFIER;                   
REMARK 465 SSSEQ=SEQUENCE NUMBER; I=INSERTION CODE.)                            
REMARK 465     RES C SSSEQI                                                     
REMARK 465     MET A   -19                                                      
REMARK 465     GLY A   -18                                                      
REMARK 465     SER A   -17                                                      
REMARK 465     SER A   -16
...

I think the scientific origin of this is when people want to number their residues based on a pre-existing sequence alignment.

peastman · 2018-10-04T21:24:57Z

Ok, that makes sense. Your solution looks fine. Note that when PDBFixer calls PDBFile.writeFile(), it specifies keepIds=True. That's why it needs to do the modulo itself instead of relying on PDBFile to do it.

nitroamos · 2018-10-05T01:25:30Z

oops, didn't mean to close it. 😄

peastman · 2018-10-05T20:59:58Z

Good point. So PDBFixer really doesn't need to do this.

nitroamos mentioned this issue Oct 4, 2018

Some bug fixes related to building missing residues #178

Closed

nitroamos closed this as completed Oct 5, 2018

nitroamos reopened this Oct 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdbfixer is mislabeling built residues with negative res IDs #175

pdbfixer is mislabeling built residues with negative res IDs #175

nitroamos commented Oct 3, 2018

peastman commented Oct 3, 2018

nitroamos commented Oct 4, 2018

peastman commented Oct 4, 2018

nitroamos commented Oct 5, 2018

peastman commented Oct 5, 2018

pdbfixer is mislabeling built residues with negative res IDs #175

pdbfixer is mislabeling built residues with negative res IDs #175

Comments

nitroamos commented Oct 3, 2018

peastman commented Oct 3, 2018

nitroamos commented Oct 4, 2018

peastman commented Oct 4, 2018

nitroamos commented Oct 5, 2018

peastman commented Oct 5, 2018