Question
Asked 11th Jul, 2017
  • National Centre for Biological Sciences (NCBS-TIFR)

How to calculate PBC box size while doing protein unfolding MD simulation

I am trying to do protein folding studies for approx 70aa length protein using Gromacs. However, when I perform simulations, my protein goes out of the box and interacts with its images when simulated at 1.00nm box.
Can anyone please guide me as to how to calculate (if there is a formula or so) how big the PBC box should be such that no interactions are allowed between the periodic images?

Most recent answer

Dolly Mehta
National Centre for Biological Sciences (NCBS-TIFR)
Thank you very much, I have changed the thickness value to 15 and now my simulation works fine. Thank you very much.

Popular answers (1)

Hiqmet Kamberaj
National Institute of Physics
You should determine the water box sizes based on the protein unfolded structure, using for example one of the unfolded configurations. Based on this unfolded structure, you can determine (x_max, x_min), (y_max, y_min) and (z_max,z_min) of protein. Then, construct a water layer around the protein with thickness at least the cutoff of non-bonded interactions size: let's say if the cutoff=11-12 Angstrom, then the thickness of the layer is 15 Angstrom (assuming that the water box may shrink a little if the water box is not initially equilibrated at normal conditions, i.e., density = 1 gr/cm^3). Then, the big water box sizes are defined as:
big_sizeX = x_max-x_min + 2*thickness
big_sizeY = y_max-y_min + 2*thickness
big_sizeZ = z_max-z_min + 2*thickness
Then, after building a water box with these dimensions (big_sizeX, big_sizeY, big_sizeZ), you can solvate the folded protein structure in this 'big' water box. With this, then, you are sure that even if the protein gets unfolded, the images will not see each other.
3 Recommendations

All Answers (6)

Hiqmet Kamberaj
National Institute of Physics
You should determine the water box sizes based on the protein unfolded structure, using for example one of the unfolded configurations. Based on this unfolded structure, you can determine (x_max, x_min), (y_max, y_min) and (z_max,z_min) of protein. Then, construct a water layer around the protein with thickness at least the cutoff of non-bonded interactions size: let's say if the cutoff=11-12 Angstrom, then the thickness of the layer is 15 Angstrom (assuming that the water box may shrink a little if the water box is not initially equilibrated at normal conditions, i.e., density = 1 gr/cm^3). Then, the big water box sizes are defined as:
big_sizeX = x_max-x_min + 2*thickness
big_sizeY = y_max-y_min + 2*thickness
big_sizeZ = z_max-z_min + 2*thickness
Then, after building a water box with these dimensions (big_sizeX, big_sizeY, big_sizeZ), you can solvate the folded protein structure in this 'big' water box. With this, then, you are sure that even if the protein gets unfolded, the images will not see each other.
3 Recommendations
Dolly Mehta
National Centre for Biological Sciences (NCBS-TIFR)
Thank you Hiqmet Kamberaj.
But where do I get this information from? How can I determine (x_max, x_min), (y_max, y_min) and (z_max,z_min) of protein from trajectory files?
Hiqmet Kamberaj
National Institute of Physics
You can use vmd, for example, with command 'measure minmax' in TkConsole.
Dolly Mehta
National Centre for Biological Sciences (NCBS-TIFR)
Thank you very much.
I calculated minmax using VMD as you suggested and calculated big_sizeX, big_sizeY and big_sizeZ taking the thickness of 10. How do I correlate these values to size of the cubic box (-d option) I need to use in the following command:
gmx editconf -f protein.gro -o protein_newbox.gro -c -d 1.0 -bt cubic
Hiqmet Kamberaj
National Institute of Physics
Thickness 10 angstrom with a cutoff of at least 10 angstrom (as usually suggested for biomolecular systems) is a little small. It is recommended that the thickness should be greater than cutoff of non-bonded interactions. As I said in the previous post,  first, 
big_sizeX = x_max-x_min + 2*thickness
big_sizeY = y_max-y_min + 2*thickness
big_sizeZ = z_max-z_min + 2*thickness
then,
d = max(big_sizeX, big_sizeY, big_sizeZ)
where (x_min,x_max,y_min,y_max,z_min,z_max) are calculated using the most unfolded protein structure that you may have to see during the simulations.
Dolly Mehta
National Centre for Biological Sciences (NCBS-TIFR)
Thank you very much, I have changed the thickness value to 15 and now my simulation works fine. Thank you very much.

Similar questions and discussions

Related Publications

Article
Replica-exchange molecular dynamics (REMD) method is one of the enhanced conformational sampling algorithms widely applied in computational biophysics and biochemistry. In the method, molecular dynamics (MD) simulations for multiple replicas of a target system are performed simultaneously and independently. Every few steps, temperatures or other pa...
Article
In this paper we investigate the possibility of combining a cluster Monte Carlo method with molecular dynamics simulations in order to study the true dynamics of a mechanosensitive channel protein from Mycobacterium Tuberculosis (Tb-MscL) in its native lipid environment. Our goal was to explore, at an atomistic level, protein-lipid interactions tha...
Article
Biological functions are intimately rooted in biopolymer dynamics. It is commonly accepted that proteins can be considered as complex systems, but the origin of such complexity is still not fully understood. Moreover, it is still not really clear if proteins are true complex systems or complicated ones. Here, molecular dynamics simulations on a two...
Got a technical question?
Get high-quality answers from experts.