Trajectory analysis: MDAnalysis (Part II)

In this lecture, we will perform some analyses on a trajectory of the T4 lysozyme. You can find the necessary material here.



Analysis with Gromacs

Download the folder.

The output of an MD calculation performed using Gromacs is usually the trajectory in .xtc format. As we saw in lecture n.2, the trajectory is usually processed before starting the data analysis. Indeed, we remove the effects of the Periodic Boundary Conditions and the diffusive dynamics by using the trjconv command:

gmx trjconv -s md_tpr.tpr -f md_trajectory.xtc -o output_trajectory_noPBC.xtc -pbc mol -center

a further usefull step is to remove the rotations and translation of the protein with respect to the initial condidions:

gmx trjconv -s md_tpr.tpr -f output_trajectory_noPBC.xtc -o output_trajectory_fitted.xtc -fit rot+trans

Now we can start the data analysis using the fitted trajectory output_trajectory_fitted.xtc that we can rename in traj_fitted.xtc.

Apply the commands above to process the 3 trajectories in the folder

Pay attention to specify the name of the system in the output file names! Keep the input names also in the output files.



RMSD

The RMSD is defined as following:

The RMSD can be computed by using the program gmx rms. It compares two structures by computing the root mean square deviation (RMSD). Each structure from a trajectory (-f) is compared to a reference structure. The reference structure is taken from the structure file (-s)

gmx rms -f traj_fitted.xtc -s tpr_file.tpr -o output_file.xvg

option: -n index_file.ndx


Calculate the RMSD of the two trajectories (about p53 protein) and plot the results.


We can plot the data using gnuplot (type it in a new shell), and then:

>plot “file.dat” w lp
>replot “file2.dat” w lp

To quit from gnuplot, type q and Enter.

Which information can you extrapolate about the systems from the plots?


Create an index file and calculate the RMSD for a subset of residues

Suggestion:
to create the index file start by using the command gmx make_ndx

gmx make_ndx -f tpr_file.tpr -o index_file.ndx



RMSF

The RMSF is defined as following:

The RMSF (Root mean square fluctuation) can be computed by the program gmx rmsf.

gmx rmsf computes the root mean square fluctuation (RMSF, i.e. standard deviation) of atomic positions in the trajectory (supplied with -f) after (optionally) fitting to a reference frame (supplied with -s).

gmx rmsf -f trajectory_file.xtc -s tpr_file.tpr -o output_file.xvg

option: -n index_file.ndx

Calculate the RMSF of the trajectory (p53_wt) and plot the results.

Which information can you extrapolate about the system from the plots?



Radius of Gyration

The radius of gyration is calculated as:

It is a quantity that quantifies you how compact your protein is.

gmx gyrate -f trajectory_file.xtc -s tpr_file.tpr -o output_file.xvg


Which processes can be monitored by calculating the radius of gyration in time?

Calculate the radius of gyration for the trajectory (GB1), and plot the results over time.

Calculate the RMSD for the trajectory (GB1), and plot the results over time.

How we can interpret these two results? Visualize the trajectory (GB1) in VMD.



Chimera

Now we can try to download another useful software for molecular visualization: Chimera.

Visualize the initial structure of each trajectory (in .pdb format)

In order to be able to extract the first frame of the trajectory and to save it in .pdb format we have to use the gmx trjconv command:

gmx trjconv -f traj_fitted.xtc -s tpr_file.tpr -o first_frame.pdb -dump 0

Pay attention to specify the name of the system in the output file names! keep the input names also for the output files.



Notes