Preparing a Small Molecule from PDB to Simulation

Ryan Snyder¹ and Darrin M. York¹

¹ Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA

Learning Objectives

Introduction

This tutorial will demonstrate how to prepare a small-molecule with coordinates in PDB format for simulation in AMBER using AMBERTools. Eventually, we would like to be able to use this molecule to perform an absolute solvation free energy calculation in a later tutorial; As such, we will prepare an acetic acid molecule for simulation in both the gas and aqueous phases. The aqueous phase system will be solvated using the 4 point OPC water model [ Izadi 2014], which demonstratably shows improved agreement of hydration free energies of small molecules relative to experiment compared to other water models. The coordinates of the acetic acid are taken from the FreeSolv database [ Mobley 2014] (FreeSolv ID: 3034976).

Initial Setup

To begin, let us prepare some directories. First, make a directory called SmallMolecules. With-in that directory, create a second directory called AceticAcid. You can download our starting PDB file for acetic acid here and put the downloaded file in the /path/to/SmallMolecules/AceticAcid directory. Generally, preparing small molecules follows the steps of :

Antechamber

First, we will assign atomic charges and atom types. These properties, anlong with the coordinates will be stored in a mol2 file. The atom types will be assigned using the GAFF2 forcefield which is parameterized for RESP charges; consequently, we will assign chargers using the RESP method, which requires the Gaussian QM software package. We will do this in two steps. First, we will generate the inputfiles needed for Gaussian using antechamber:

antechamber -i acetic_acid.pdb -fi pdb -o acetic_acid.g16 -fo gcrt -nc 0 -gk 'p HF/6-31G(d) nosymm Pop=MK iop(6/33=2) iop(6/42=6) iop(6/50=1)' -gv 1 -ge acetic_acid.resp -gn '%nproc=1' -gm '%mem=8000MB'

For more information about the antechamber executable, you can type "antechamber -h" into the command line. This command will produce acetic_acid.g16.

We will now load the Gaussian module and set the environmental variable for the scratch directory.

module purge
module load cpu/0.15.4
module load gaussian/16.C.01
export GAUSS_SCRDIR=/scratch/$USER/job_$SLURM_JOBID

Now that we have the Gaussian files and enviornment needed to calculate the RESP charges, we can pass the file to Gaussian :

g16 acetic_acid.g16 > acetic_acid.log

If Gaussian is unavailable, the log file is available at /expanse/projects/qstore/amber_ws/tutorials/Param_acetic_acid/antechamber/intermediates_io. Now reload the modules needed for Amber :

module purge
module use /expanse/projects/qstore/amber_ws/packages/modulefiles
module load workshop/software

We can now use antechamber to extract the charges from the log file and assign atom types:

antechamber -i acetic_acid.log -fi gout -gv 0 -o acetic_acid.mol2 -fo mol2 -c resp -nc 0 -rn LIG -at gaff2

This command will produce the acetic_acid.mol2 file. This file can also be found at /expanse/projects/qstore/amber_ws/tutorials/Param_acetic_acid/antechamber/outputs/acetic_acid.mol2.

parmchk2

Now that we have the mol2 file, we can generate the frcmod file using the parmchk2 executable:

parmchk2 -i acetic_acid.mol2 -o acetic_acid.frcmod -f mol2 -s 2

Like antechamber, you can learn more about the executable options for parmchk2 using "parmchk2 -h". The frcmod file is available at /expanse/projects/qstore/amber_ws/tutorials/Param_acetic_acid/parmchk/outputs/acetic_acid.frcmod

tleap

Next, we will use the mol2 and frcmod files to generate our AMBER formated parameters and restart files. We will start by preparing the gas phase system. Open a file acetic_acid_vac_tleap.in in a text editior and write the following:

source leaprc.protein.ff19SB
loadamberparams frcmod.ff19SB
source leaprc.gaff2
loadamberparams acetic_acid.frcmod

acetic_acid = loadmol2 "acetic_acid.mol2"
saveOff acetic_acid acetic_acid.lib
saveAmberParm acetic_acid acetic_acid_vac.parm7 acetic_acid_vac.rst7

quit

Then run the command:

tleap -f acetic_acid_vac_tleap.in

This command will generate a acetic_acid_vac.parm7 and acetic_acid_vac.rst7, which contain the parameters and coordiates for the gas phase system respectively. Note that in addition to the parm7 and rst7 files, we are saving a library file which will be used in later tutorials. Now we will make some small changes to the tleap procedure to solvate the sytem using the OPC water model Open a file acetic_acid_aq_tleap.in in a text editor and write the following :

source leaprc.protein.ff19SB
loadamberparams frcmod.ff19SB
source leaprc.water.opc
loadamberparams frcmod.opc
source leaprc.gaff2
loadamberparams acetic_acid.frcmod

acetic_acid = loadmol2 "acetic_acid.mol2"
solvateBox acetic_acid OPCBOX 20.0
saveAmberParm acetic_acid acetic_acid_aq.parm7 acetic_acid_aq.rst7

quit

Then run the command:

tleap -f acetic_acid_aq_tleap.in

This command will generate a acetic_acid_aq.parm7 and acetic_acid_aq.rst7, which contain the parameters and coordiates for the aqueous phase system respectively. All parm7 and rst7 files are available in the expanse/projects/qstore/amber_ws/tutorials/Param_acetic_acid/tleap/outputs directory.

References