Gaussian Tutorial
Gaussian Tutorial
Gaussian Tutorial
AN
INTRODUCTION TO
COMPUTATIONAL CHEMISTRY
AVOGADRO
USING
G09W
AND
SOFTWARE
Anna Tomberg
[email protected]
This is a quick tutorial that will help you to make your way through the
first steps of computational chemistry using Gaussian 09W software (G09).
The tutorial is oriented to beginners and describes in detail the most used
calculations done using G09. However, the theoretical basis of these calculations will not be covered here. If you are interested in understading
the details, please refer to textbooks targeting this subject. I found [1] and
[2] very helpful, and strongly recommend to take a look at these wonderful
books.
G09 Tutorial - AT
page 1 of 34
CONTENTS
1 Program specifications
1.1 Input . . . . . . . . . .
1.2 Output . . . . . . . . .
1.3 Visualization . . . . .
1.4 Computation model .
.
.
.
.
3
3
5
5
5
5
5
7
7
7
8
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
4 Theoretical Models
4.1 What is a theoretical model? . . . . . . . . .
4.2 Ab initio methods . . . . . . . . . . . . . . . .
4.2.1 Examples of Ab initio methods . . .
4.3 Semiempirical methods . . . . . . . . . . . .
4.3.1 Examples of semiempirical methods
4.4 Density Functional Theory . . . . . . . . . . .
4.4.1 Examples of DFT methods . . . . . .
4.5 Molecular Mechanics . . . . . . . . . . . . . .
4.5.1 Examples of MM methods . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
8
8
9
9
10
10
10
10
10
10
5 Basis Sets
5.1 What is a basis set and why is its selection important?
5.2 A bit of theory: Slater VS Gaussian . . . . . . . . . . . .
5.3 Types of Basis Sets and Notation . . . . . . . . . . . . . .
5.3.1 Minimal . . . . . . . . . . . . . . . . . . . . . . . .
5.3.2 Split Valence . . . . . . . . . . . . . . . . . . . . .
5.3.3 Correlation-consistent . . . . . . . . . . . . . . . .
5.3.4 Useful Tips from David Sherrill [5] . . . . . . . .
5.3.5 Comparison between Pople and CC basis sets .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
11
11
11
11
11
12
13
13
13
6 Types of calculation
6.1 Geometry Optimization . . . . . . . . . . . . . . . . . . . . . . .
6.1.1 What is it? . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.1.2 For your Job . . . . . . . . . . . . . . . . . . . . . . . . . .
6.1.3 What information do you get out of this calculation? .
6.2 Single point energy . . . . . . . . . . . . . . . . . . . . . . . . . .
6.2.1 What is it? . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.2.2 For your Job . . . . . . . . . . . . . . . . . . . . . . . . . .
6.2.3 What information do you get out of this calculation? .
6.3 Frequencies and Thermochemistry . . . . . . . . . . . . . . . .
6.3.1 What is it? . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.3.2 For your Job . . . . . . . . . . . . . . . . . . . . . . . . . .
6.3.3 What information do you get out of this calculation? .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
13
14
14
14
15
15
15
15
15
16
16
16
17
G09 Tutorial - AT
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
page 2 of 34
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
17
18
18
18
18
18
19
20
20
20
20
21
21
21
21
21
22
22
22
22
22
22
22
22
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
22
23
23
23
23
23
23
24
24
24
24
25
9 Examples
9.1 EXAMPLE 1:
9.2 EXAMPLE 2:
9.3 EXAMPLE 3:
9.4 EXAMPLE 4:
25
25
25
26
29
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
SP of H2O . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Opt of Ethanol . . . . . . . . . . . . . . . . . . . . . . . . .
Molecular Orbitals Calculation and Visualization of HF
The Energy of Stereisomers and Scaling Factors (butene)
10 TroubleShooting
10.1 Convergence cannot be achieved? . . . . . . . . . . . . . . . . . . . .
10.2 My job froze, what can I do to avoid restarting from scratch? . . .
10.3 A double bond disappeared from the structure after computation?
10.4 My batch file does not run properly, why? . . . . . . . . . . . . . . .
G09 Tutorial - AT
.
.
.
.
.
.
.
.
30
30
30
30
30
page 3 of 34
11 Appendix
31
12 References
33
G09 Tutorial - AT
page 4 of 34
PROGRAM
SPECIFICATIONS
Gaussian 09W (G09) is a computational chemistry program that runs on any modern Windows 32-bit PC. If you want to install G09 on a 64bit PC, there is a special
procedure you must follow:
1. Insert the CD with G09 and copy its content onto you computer. Any folder
will do; I copied directly into the :C\ directory.
2. Open directory containing G09
3. Find the g09w.exe file
4. Right click on exe file, select Properties, a new window should appear
5. Go into the Compatibility menu
6. Put a checkmark next to : Run as administrator (this should enable other
checkboxes)
7. Put a checkmark next to : Run this program in compatibility with (select
Windows version that you are using)
The installation requires the Gaussian CD and a registration key.
VISUALIZATION SOFTWARE ChemDraw (ChemBio 3D Ultra) and Avogadro (v.1.0.3)
softwares can be used for visualization.
VIDEO TUTORIALS If you prefer watching video tutorials for a better understanding, please see below:
Part 1: http://www.youtube.com/watch?v=LOZswm07j0U&feature=
plcp
Part 2: http://www.youtube.com/watch?v=6O4DEoETAnU&feature=
plcp
Part 3: http://www.youtube.com/watch?v=_REn2MsqUc4&feature=
plcp
Part 4: http://www.youtube.com/watch?v=WpyqqDwspoo&feature=
plcp
Part 5: http://www.youtube.com/watch?v=AScn1OuyBoY&feature=
plcp
1.1
INPUT
page 5 of 34
The input can be done manually or come from another software, such as ChemDraw (3D) or Avogadro. G09s input consists of the following parts, shown on
Figure 1:
G09 Tutorial - AT
page 6 of 34
1.2
OUTPUT
To run the G09 job, click on the RUN button, on the right panel of the Job Entry
window (Figure 1). The output has only one extension: .out. You will be prompt
to save the output file before closing G09 program.
1.3
VISUALIZATION
This tutorial will use Avogadro software for visualization of the G09 output. Open
the .out file in Avogadro, your input molecule should appear on the view screen.
A detailed demostration of Avogadro is available in the first two parts of the video
tutorials.
NOTE If you try to visualize things you have not calculated using G09, the program
will freeze!
1.4
COMPUTATION
MODEL
The mathematical models used to do the computations are called FEM (Final
Element Method) and Symplex method. Using matrices, this method cuts the Ndimensional space into small sub-systems that can be described by N linear equations. These equations can be solved as soon as one of them is solved. Therefore,
one must take a guess of the solution and then, recursively solve all the other ones.
Once all solutions are obtained, the initial guess can be modified and the calculations repeated. This process is run until the new solution outputs the same result
as in the previous iteration. This is called convergence.
Because the initial guess can be very far off the real value, many thousands of
iterations are often needed. The basis set selected influences, among other, the
quality of the guesses, while the theoretical model influences the type of calculations that the matrices will be subjected to.
The calculation stops as soon as the result converges, however convergence does
not mean that the system reached its minimum. It is very likely that the minimum
outputted is actually only a saddle point on the potential surface, but the program
will not be aware of this. The user must be careful with this, and always perform
a check of stability of the system. This is done by perturbing the "stable" system
and re-calculating the minimum. If the output is the same, one can assume that
the energy obtained is the total minimum.
FOR
FIRST
[3]
LOOK AT A COMPUTATION
This section will describe how to do a simple calculation in G09 using Avogadro.
All the parts of this procedure (and more) are explained more in details in corresponding sections. Lets take a look.
2.1
OF
WATER
Lets calculate the Single point energy of water. For this, we will first open Avogadro and draw the molecule (Figure 2).
If you think you will reuse this molecule in Avogadro, you can save it with an
.cml extention.
NOTE It is good practice to save all parts of your job along the way!
G09 Tutorial - AT
page 7 of 34
page 8 of 34
Figure 3: (A) Name of job, (B) type of calculation, (C) theoretical model, (D) Basis
set, (E) charge and multiplicity, (F) the route section that will be given to G09. (G)
the molecular coordinates.
GAUSSIAN INPUT
3.1
DETAILS
LINK 0
This very first line of the input usally contains the name of the checkpoint file to
save additional information in. This tutorial will not cover the use of .Chk files, but
you can learn about them from the reference manual.
3.2
ROUTE SECTION
This section contains the instructions for a job you want G09 to execute. The input
for this section is as follows:
#X Theoretical model/Basis set Type of calculation Options
# : mandatory sign to begin route section.
X : specifies the amount of detail you want to acquire: X = T (terse output);
X = P (max output); X = N (normal output).
Theoretical model : keyword that tells G09 which theoretical model to use
(ex: RHF) See more detail on theoretical models in section 4.
Basis set : specifies the basis set to use (ex: 6-31G(d)). See more detail on
basis sets in section 5.
Type of calculation : specifies one or more keywords for G09 jobs to do
separated by a space. See more detail on types of calculations in section 6.
G09 Tutorial - AT
page 9 of 34
3.3
MOLECULAR STRUCTURE
This section is usually composed of the atoms and Cartesian or Z-matrix coordinates. It is possible to calculate and input the coordinates manually, but it is much
easier to obtain them through software such as Avogadro or ChemBio 3D Ultra.
ChemDraw can be used solely to make the input molecule. Once the molecule
is created and optimized, save it as a .gjf file. You will have to alter the input the
other fields manually once the file will be loaded into G09. ChemDraw provides
the coordinates, but nothing else.
Avogadro can easily generate the most popular G09 inputs (Opt, Freq, SP). To see
how, refer to the video tutorials (links above) or to the example section. In short,
once the molecule is drawn and optimized, click on Extentions (on top)Gaussian....
To create a G09 input, click on Generate... and save as a .com file (shown on
Figure 5). This file should then be loaded into G09 for further processing.
4
4.1
THEORETICAL MODELS
WHAT
IS A THEORETICAL MODEL ?
In short, a theoretical model or method is a way to model a system using a specific set of approximations. These approximations are combined with a calculation
algorithm and are applied to atomic orbitals, defined by the basis set (see section
5), in order to compute molecular orbitals and energy. In general, the methods
G09 Tutorial - AT
page 10 of 34
4.2 Ab initio
METHODS
This type of computation is based only on theoretical principles, using no experimental data. The numerous methods have the same basic approach, but differ in
the mathematical approximations used. These are the most popular type of models,
despite the fact that the calculations take unbelievably long time.
4.2.1
EXAMPLES
OF
Ab initio
METHODS
G09 Tutorial - AT
page 11 of 34
MPN Moller-Plesset perturbation theory are denoted as MPn (n=2,...,6). In practice, MP2 and MP4 are the only methods used, since the other ns are either too
computationally expensive or do not significantly improve the results compared
with a lower level of complexity.
CI Configuration Interaction calculations are most often used for excited states.
CI can be very accurate, but are also very CPU expensive.
4.3
SEMIEMPIRICAL
METHODS
EXAMPLES
OF SEMIEMPIRICAL METHODS
4.4
DFT methods are becoming more and more popular because the results obtained
are comparable to the ones obtained using ab initio methods, however CPU time is
drastically reduced. DFT differs from methods based on HF calculations in the way
that it is the electron density that is used to compute the energy instead of a wave
function.
4.4.1
EXAMPLES
OF
DFT
METHODS
B3LYP This is the most popular DFT model. This method is called to be a hybrid,
because is uses corrections for both gradient and exchage correlations.
PW91
Gradient-corrected method.
4.5
MOLECULAR MECHANICS
What is the system you are working with is giant? No panic, you can still model
it! This is possible by using molecular mechanics. MM methods approximate atoms
as spheres and bonds as springs. They use an algebraic equation for the energy calculation, not a wave function or electron density. The constants in the equation
are obtained from experimental data or other calculations and are stored in a data
library. The combination of constants and equations is called a force field. These
calculations are so simple that you dont even need to perform them in a complicated software such as G09. You can run your calculation right in Avogadro!
4.5.1
EXAMPLES
OF
MM
METHODS
UFF Universal Force Field is the method used in Avogadro by default. It can be
used on organic and inorganic molecules.
MMFF Merk Molecular Force Field is another general-purpose model, used mainly
with organic systems.
G09 Tutorial - AT
page 12 of 34
BASIS SETS
5.1
WHAT
A basis set is a set of wave functions that describes the shape of atomic orbitals
(AOs). The molecular orbitals (MOs) are computed using the selected theoretical
model by linearly combining the AOs (LCAO). Not all theoretical models require
the user to choose a basis set to work with. For example, PMn (n=3,...,6) models
use an internal basis set, while ab initio or density functional theory require a basis
set specification. The level of approximation of your calculation is directly related
to the basis set used. The choice to make is a trade-off between accuracy of results
and CPU time.
5.2
BIT OF THEORY:
SLATER VS GAUSSIAN
Both Slater Type Orbitals (STOs) and Gaussian Type Orbitals (GTOs) are used to
describe AOs. STOs describe the shape of AOs more closely than GTOs, but GTOs
have an unbeatable advantage: they are much easier to compute. In fact, it is faster
to compute several GTOs and combine them to describe an orbital than to compute
one STO! This is why combinations of GTOs are commonly used to describe STOs,
which in turn, describe AOs. Yes, a bit complicated, but the computers dont mind.
There are other differences between STOs and GTOs, but they will not be covered
here.
5.3
5.3.1
TYPES
OF
BASIS SETS
AND
NOTATION
MINIMAL
These basis sets use only one function for each AO. STO-nG (n=2,...,6) means
that n GTOs are used to decribe one STO, and only one STO is used to describe
an AO (single Zeta). Usally n < 3 gives too poor results, so STO-3G is called the
G09 Tutorial - AT
page 13 of 34
minimal basis set. Minimal basis sets are used for either qualitative results, very
large molecules or quantitaive results for very small molecules (atoms).
5.3.2
SPLIT VALENCE
These basis sets are also called Pople basis sets and allow to specify the number
of GTOs to use for core and valence electrons separately (size adjustable). These
are double Zeta (2 functions per AO) or triple Zeta. The notation is as follows:
K-LMG, where
K = number of sp-type inner shell GTOs
L = number of inner valence s- and p-type GTOs
M = number of outer valence s- and p-type GTOs
G = indicates that GTOs are used
POPLE
3-21G : 3 GTOs for inner shell, 2 GTOs for inner valence, 1 GTO for outer
valence
4-31G
4-22G
6-21G
6-31G
6-311G : 6 GTOs for core orbital, 3 GTOs for inner valence, 2 different GTOs
for outer valence (triple zeta)
7-41G
POLARIZED Pople basis sets can be modified to obtain an approximation that better
describes the system you are working with. This can be done, for example, by
letting the AOs distort from original shape (get polarized under the influence of
the surroundings). Polarization can be added as * or (d).
(d) or * type : d-type functions added on to atoms other than Hydrogens and
f-type functions added on to transition metals
(d,p) or ** type : p-type functions added on to Hydrogens, d-type functions
added on to all other atoms, f-type functions added on to transition metals
EX:
6-31G(d) or 6-31G**
DIFFUSE Pople basis sets can also be modified by letting the electron move far
away from the nucleus, creating diffuse orbitals. This modification is useful when
working with anions, excited states and molecules with lone pairs. Diffuse functions can be added as + or ++ in front of the G.
+ : diffuse functions added on to atoms other than Hydrogens
++ : diffuse functions added on to all atoms
G09 Tutorial - AT
page 14 of 34
EX:
6-31+G(d) or 6-31++G(d)
5.3.3
CORRELATION-CONSISTENT
All of the basis sets described until now were optimized at a Hartree-Fock level.
However, it is legitimate to doubt that this optimization might not be the best for
correlated computations. Thom Dunning created a set of basis sets optimized using
correlated (CISD) wavefunctions. They are denoted as cc-pVXZ, where:
cc = indicates that it is a correlation-consistent basis
pV = indicates that it is a polarized valence basis
XZ = indicates the zeta number (X= D for double, T for triple, Q for quadruple, 5,6,7)
The prefix aug- can be used to add diffuse functions.
EX:
cc-pVDZ or aug-cc-pVTZ
5.3.4
USEFUL TIPS
FROM
a bigger basis set is not always the best! (ex: cc-pVQZ is overkill for RHF)
STO-3G should be used only on very big systems
anions must have diffuse functions
usually cc-pVDZ is equivalent or worse than 6-31G(d,p)
usually cc-pVTZ is better than 6-311G(d,p) or similar
convergence of ab initio methods is very slow
5.3.5
COMPARISON
BETWEEN
POPLE
AND
CC
BASIS SETS
TYPES
OF CALCULATION
page 15 of 34
6.1
GEOMETRY OPTIMIZATION
KEYWORD:
Opt
6.1.1
IS IT ?
WHAT
FOR
YOUR
JOB
page 16 of 34
WHAT
DETAILS HERE :
[10]
6.2
SINGLE
POINT ENERGY
KEYWORD:
SP
6.2.1
IS IT ?
WHAT
FOR
YOUR
JOB
This keyword is the default one, and is assumed if no keyword is specified in the
route section. G09 was found to make systematic errors in the frequencies and
Zero-Point Energies. The scaling factors vary according to the different theoretical models and basis sets used for the calculation. Table 1 (Scaling factors for
Freq and ZPE/Thermal calculations), which can be found in the Appendix, lists the
recommended correction factors for these calculations.[10, 11]
6.2.3
WHAT
page 17 of 34
6.3
FREQUENCIES
KEYWORD:
Freq
6.3.1
IS IT ?
WHAT
AND
THERMOCHEMISTRY
This keyword will indicate to G09 to calculate the vibrations of the molecule in
question, along with other parameters that you can choose to include or erase from
the output. Each vibrational analysis also includes a thermochemistry section by
default. If you want to learn how G09 computes these values, please read [12-14].
If you generate the input for G09 using Avogadro, then the Opt keyword will be
automatically added in front of Freq, and the calculation will run in 2 steps.
Because the molecular frequencies depend on the second derivative of the energy
with respect to the nuclear positions, you should make sure that the theoretical
model you chose for the calculation computes second derivatives (ex: HF, B3LYP,
MP2, CASSCF). Ab initio and DFT theoretical models are known to give the best
results (based on experimental values).
NOTE: You must use the same combination of theoretical model and basis set for
both the Opt and Freq calculations.
6.3.2
FOR
YOUR
JOB
Most likely, you would do a vibrational analysis to assign the peaks of an experimentally obtained spectrum to specific vibrations. Visualization using Avogadro
software can be extremely helpful, however as a rule of thumb, the following can
be used: bond stretches> bond bends > bond torsions (in order of decreasing
energy).
SCALING The frequencies generated must be scaled according the the theoretical
method and basis set used. A short list of the scaling factors is collected in Table
1 of the Appendix, but a complete list can be found at [11]. Note that the scaling
factors for frequencies and for zero point energies are different! You can input the
scale factor to be used for thermochemistry analysis using the Scale keyword as an
option (ex: "Scale=0.95").
STABILITY The stability of your molecule can easily be checked by verifying whether
there exists imaginary (negative) frequencies. Imaginary frequencies indicate instability of molecular geometry. Also, knowing that each molecule should have
either 3N-6 or 3N-5 vibrational modes, you can notice that there are more frequencies than supposed to be! These are the translational and rotational modes and
their intensities should close to zero (+/- 10). Big intensities for these frequencies
is another way to verify the stability of you molecule. See more on p.72 of [10].
IMAGINARY FREQUENCIES Negative frequencies indicate instability in the molecule
or, in other words, saddle points on the potential energy surface. A stable molecule
should have no imaginary frequencies, a transition state should have one (1st order
saddle point), while more than one imaginary frequency means that there is a
problem with your molecules geometry.
ANHARMONIC FREQUENCIES By default, G09 computes frequencies based on harmonic oscillator approximation (second order derivative with respect to the nuclei
movement). If you wish to obtain more accurate results, you might want to consider using anharmonic corrections, i.e. computing higher-order derivatives. For
this, add "Anharmonic" keyword into the route section.
G09 Tutorial - AT
page 18 of 34
RAMAN If you wish to add Raman intensities to the output, add "Raman" keyword
to the Route section.
THERMOCHEMSITRY By default, several thermodynamic values are computed during a vibrational analysis. You can change the thermochemistry parameters (temperature, pressure, ect) by writing Freq=ReadIsotopes in the Route section. Tha
values obtained are based on ideal gas assumptions.
See examples in the video tutorial Part 5.
6.3.3
WHAT
6.4
STABILITY
KEYWORD:
CHECK
Stable
G09 Tutorial - AT
page 19 of 34
6.4.1
WHAT
IS IT ?
The stability calculation determines whether the wavefunction computed for the molecular system is stable or not: in other words, whether
there is a lower energy wavefunction corresponding to a different solution of the SCF equations. If the wavefunction is unstable, then
whatever calculation you are performing is not being done on the expected/desired state of the molecule. The stability of SCF solutions
for unknown systems should always be tested. Stability considerations apply and may be tested for in calculations using DFT methods as
well.[10]
SCF = self consistent field
DFT
6.4.2
6.4.3
FOR
YOUR
JOB
The output will then contain a message telling you whether the system is stable or
not under the "stability analysis":
Stable geometry message
The wavefunction is stable under the perturbations considered.
Unstable geometry messages
The wavefunction has an RHF UHF instability.
The wavefunction has internal instability.
6.5
MOLECULAR ORBITALS
AND
POPULATION ANALYSIS
KEYWORD:
6.5.1
IS IT ?
WHAT
POPULATION
ANALYSIS
page 20 of 34
CHARGES
FOR
YOUR
JOB
Pop=X keyword
X=NONE
X=REG
X=FULL
X=NBO
X=MK, CHELP,
OR
CHELPG
G09 uses Mulliken analysis by default. Remember that if you need orbitals energies, select a bigger basis set, if you need partial charges, select minimum basis
set. It is fine to use the obtained results from Mulliken analysis for a qualitative
overview of partial charges, but these values are not reliable and cannot be compared with literature ones! If you need quantitative results, you should do for
example a Natural Bond-Order analysis (NBO) or use Electrostatic charges (ESP).
NBO
NBO uses eigenfunctions of the first order reduced density matrix, localized and
orthogonalized.
ESP This method uses electrostatic potential to compute charges on nuclei. ESP
is usually the best to describe charge interactions with other species, but require
significant amount of CPU time.
G09 Tutorial - AT
page 21 of 34
6.5.3
WHAT
6.6
UV-VIS
AND
ELECTRONIC
KEYWORD:
TD or CIS or ZINDO
6.6.1
IS IT ?
WHAT
TRANSITIONS
FOR
YOUR
JOB
Depending on what your system is, you need to choose to work with either vertical or adiabatic excitation energies. Vertical excitations are obtained by taking the
difference between the excited state potential energy curve and the ground state
potential energy curve using the optimized geometry of ground state. This is done
automatically for you using the methods listed below. Vertical excitations energies
are used for a system that has no time to get to equilibrium before returning to
ground state. Adiabatic excitations energies are obtained by first optimizing the
geometry of the excited electronic state and then taking the difference between
excited and ground energies. This is only possible if the excited states are long
lived.
ZINDO This is a semiempirical method that was optimized to compute electronic
transitions. Also applicable to inorganic systems.
G09 Tutorial - AT
page 22 of 34
CIS / CIS(D) Single-excitation configuration interaction is the easiest way to obtain excited states energies. CIS calculation uses the optimized MOs (for ground
state) and calculates the energy to move one electron to one of the virtual orbitals.
CIS(D) also includes correlation of excited states, and is a quick and relatively good
method. (Always use CIS(D) instead of CIS)
TD This Time Dependent calculation is based on a DFT computational method.
TD-DFT obtains the wave functions of MOs that oscillate between ground state and
the first excited states.
You can select the number of excited states to compute (ex: TD=NStates=5).
The default is 3. You can also select what type of transitions to look for (ex:
TD=Singlets).
For more details of TD: [20].
6.6.3
WHAT
6.7
KEYWORD:
Scan
6.7.1
IS IT ?
WHAT
The potential energy surface (PES) is a central concept in computational chemistry. A PES is the relationship, mathematical or graphical, between the energy of a molecule (or a collection of molecules)
and its geometry. the Born-Oppenheimer approximation says that in
a molecule the nuclei are essentially stationary compared to the electrons. This is one of the cornerstones of computational chemistry because it makes the concept of molecular shape (geometry) meaningful,
makes possible the concept of a PES, and simplifies the application of
the Schrodinger equation to molecules by allowing us to focus on the
electronic energy and add in the nuclear repulsion energy later. [21]
See also: [22]
6.7.2
FOR
YOUR
JOB
The PES Scan is somewhat complicated to setup since you need to specify which
coordinate to "freeze" for the calculation. Please refer to Chapter 8 in [10].
6.8
SOLVATION
KEYWORD:
EFFECT
SCRF
G09 Tutorial - AT
page 23 of 34
6.8.1
WHAT
IS IT ?
It is crucial to remember that by default G09 uses gas phase for any calculation.
Most probably, you will be interested to compute molecular properties in solution.
For this purpose, you have a choice between implicitly or explicitly adding solvation
effects. Explicitly adding solvent molecules in your system, by drawing them in
Avogadro, is extremely computationally expensive. As an alternative, you can use
the SCRF keyword, which will create a solvent cavity around your molecule and
perform an given calculation.
See also: [23]
6.8.2
FOR
YOUR
JOB
It is important to first optimize the geometry in the solvent of choice before doing
any other calculation. You can also compute the excited states in solvent to see how
solvation affects the transition energies. For a list of solvents that are available in
G09 check the reference manual.
EX1: To compute geometry optimization in methanol, using RHF/6-31G(d), the
Route section will look like this:
#n RHF/6-31G(d) Opt SCRF=(Solvent=Methanol)
EX2: To compute transition states in methanol, using RHF/6-31G(d), the Route
section will look like this:
#n RHF/6-31G(d) TD SCRF=(Solvent=Methanol)
6.9
6.9.1
POLARIZABILITY
KEYWORD:
6.9.2
FORCES
KEYWORD:
6.9.3
NMR
KEYWORD:
6.9.5
NUCLEI
Force
Volume
ANALYSIS
NMR
ELECTROSTATIC
KEYWORD:
ON
MOLECULAR VOLUME
KEYWORD:
6.9.4
Polar
FOR
POTENTIAL AND
ELECTRON
DENSITY
cubegen
A SUCCESSFULL ANALYSIS
This section will describe in short the main steps to perform a good study of a
given system. There are many ways to go about doing computations, but a systematic approach will give you more chances to stay on track and obtain reasonable
results.
G09 Tutorial - AT
page 24 of 34
7.1
7.1.1
MAIN
STEP 1: DEFINE
THE BOUNDRIES
You should start by defining how much time, energy and CPU are available for
your study. There are some steps that could be skipped if results must be obtained
in little time. The choice of theoretical models and basis sets will also heavly depend on time and CPU resources. If your computer is not powerful enough, you
will need much more computational time than when a more powerful computer is
used.
7.1.2
STEP 2: SET
UP YOU COMPUTER
To speed up computations, it is strongly recommended to turn off all the programs (except G09) on your PC. Programs like Skype, open folders, Internet browsers,
etc., all demand CPU power. Shut them off! If you know how to, also turn off all
the other services (if youre not sure, skip this). Make sure your computer does not
go into sleeping or hibernating mode automatically after some period of time, this
might kill a G09 job.
7.1.3
STEP 3: DEFINE
It is crucial to have a strict notation for all of your G09 jobs. For example, you
could use something like this:
trans_azo_UFF(MP2_6-31G(d))_Opt
This notation puts the name of the molecule followed by the MM method that
was used initially to optimize the geometry, followed by the theoretical model,
followed by the basis set and finally by the type of calculation performed.
7.1.4
STEP 4: DEFINE
A PARENT MOLECULE
STEP 5: SUCCESFULL
SEQUENCE OF CALCULATIONS
You should perform the following sequence of calculations for any molecule:
1. Geometry Opt in Avogadro using MM (UFF or MMFF).
2. Geometry Opt and Freq calculation in G09 .
3. Check for imaginary frequencies (none for ground state, 1 for TS, etc).
4. If frequencies are alright, continue, otherwise restart the geometry opt.
5. Compute any property that you need.
6. Re-opt and re-compute frequencies in solvent (if needed).
7. Re-compute any property that you need in solvent (if needed).
G09 Tutorial - AT
page 25 of 34
7.1.6
STEP 6: READ
THE OUTPUT
You can use a script called "G09\output\interpreter" that can be found on the
desktop of the Gaussian workstation (PC). This program lists all the .out files found
in C:\G09W\IO\COMPUTED\BATCH\done directory, asks which file you want to
work with and lets you extract only the pertinent information out of it. This avoids
scrolling up and down to find the right values, and saves you lots of time. Dont forget that the .out file must be copied into the directory mentioned above, otherwise
the list will be empty!
BATCH
This section will explain how to use batch file to speed up you computational
project.
8.1
WHAT
ARE
BATCH FILES?
A Batch file is a text file that contains a specially formatted list of G09 jobs that
will be completed automatically one after the other without the need for the user
to physically launch the calculation. Such a file has an extension .bcf recognized by
G09. When you create a batch file, you list all the jobs you want G09 to do and the
names of the corresponding output files, then open the .bcf file in G09, and start
the computation! That easy!
8.2
HOW
To make a new batch file, you fist must prepare the input files separately using
Avogadro. Ideally, your input files should be independent of each other (to avoid
errors that will be carried on). This means that the output of one of the files in the
batch should not be used as input for some other file in the same batch file.
Once you create the input (.com) files, save them all in the same folder named
"Batch". If you dont have this folder already, you should create it in the G09
directory.
Now, open a blank (new) file in Notepad. The first two lines of this file must be
exactly as follows:
! Name of the batch file
! Start=1
Then follow the list of input files that will be in this batch file followed by the
name of their corresponding output files, separated by a comma. Each pair of input
and output names must be on a new line. (See Figure 7)
When you have listed all the files you want to include in this batch, save this file
with .bcf extension in the Batch folder. Open your batch file in G09. This will open
a new window called "Edit Batch Control List" where your jobs should be listed.
Make sure they are the right ones by verifying that the output files correspond to
the input files. When you are ready, click on Exit in the right bottom of this window.
This will bring you back to the main G09 screen. To start the calculations, select
"Process" on the top task bar and click on "Begin Processing". Your batch file should
start executing.
G09 Tutorial - AT
page 26 of 34
8.3
EXAMPLES
Here are some detailed examples that use Avogadro and G09. If you wish to see
more examples, refer to [10] or watch the video tutorials listed in section 1.
9.1
EXAMPLE 1: SP
OF
H2O
See section 2.
9.2
EXAMPLE 2: OPT
OF
ETHANOL
Lets optimize the geometry of ethanol, and compare the results obtained by
Avogadro, and the one obtained from the Opt in G09. For this, we will first open
Avogadro and draw the molecule. Now, click on Extensions > Optimize Geometry.
(shortcut: Ctrl+Alt+O) Save the result as ethanol.cml. This will be our result (1).
Now, generate the G09 input by selecting Extensions Guassian...; change the
title to "ethanolOpt", and the calculation to "Geometry Optimization", click on generate and save as "ethanolOpt.com". Open "ethanolOpt.com" in G09, and run the
job. Save the output as "ethanolOpt.out".
Finally, in Avogadro, open "ethanolOpt.out" and compare this result (2) with
what you obtained initially, using the Avogadro Optimization (result 1).
G09 Tutorial - AT
page 27 of 34
Figure 8: Optimization of ethanol: (1) Avogadros Opt structure (2) Gaussian final
Opt structure
9.3
AND
VISUAL-
Lets calculate the molecular orbitals of hydrogen fluoride. First, draw the molecule
in Avogadro, optimize its geometry and generate the G09 input file. To see how
these steps are done in details, refer to examples 1 and 2. The change now comes
in the Route section: you have to change the calculation to "Pop=Reg". This command will display the higher five occupied and the lowest five virtual molecular
orbitals and other pertinent information. If you wish to see more orbitals, type
"Pop=Full". Your G09 job should look like the window in Figure 9.
page 28 of 34
orbitals, which will be plotted later. Press RUN and start the calculation. When the
process is done, the output will contain several results, among which you can find
the sequence shown on Figure 10.
page 29 of 34
G09 Tutorial - AT
page 30 of 34
9.4
OF
STEREISOMERS
AND
SCALING FAC -
Lets compare the output of a geometry optimization for the cis and trans isomers
of butene. First, draw cis-butene in Avogadro, optimize its geometry. Generate
the G09 input file selecting Geometry Optimization, using B3LYP model with a 631G(d) basis set. To see how these steps are done in details, refer to example
1 and 2. Save as cis_butene.com. Repeat the same for trans-butene. Save as
trans_butene.com.
Run the jobs in G09. Save as their respective names with the .out extension. You
can visualize the molecules in Avogadro, but we will focus on their SCF energies:
Cis-butene: SCF Done: E(RB3LYP) = -157.224311885 A.U. after 12
cycles
Trans-butene: SCF Done: E(RB3LYP) = -157.226083227 A.U. after
12 cycles
We have to use the correction factor listed in Table1 for the method and basis set
(B3LYP/6-31G(d)) we used for the calculation: 0.9804. Therefore the real energies
are actually:
Cis-butene: SCF Done: E(RB3LYP) = -154.1427154 Hartrees
Trans-butene: SCF Done: E(RB3LYP) = -154.144452 Hartrees
As proved by the energies, the trans configuration is more stable than the cis one.
G09 Tutorial - AT
page 31 of 34
10
10.1
TROUBLESHOOTING
CONVERGENCE
CANNOT BE ACHIEVED ?
SCF=Tight is the default for G09. You should retry your computation after including SCF=QC in the Route section.
10.2
MY
SCRATCH ?
Check out the in the Gaussian reference manual whether the "Restart" option is
available for the type of calculation the you were doing. This option will read the
parameters from the checkpoint file (aka you can use this only if you made a .Chk
file for the job that froze), and continue on.
10.3
G09 has a visualization bug that sometimes happens after an "Opt Freq" calculation: the double bond looks like a single bond, but no hydrogens are added to
compensate the valency of atoms involved in that bond. This is a visual problem,
and if you continue on with our computations of some property, you will obtain
results as if you molecule was of good geometry (i.e. with all correct bonds).
10.4
MY
G09 Tutorial - AT
page 32 of 34
11
APPENDIX
Very good online notes are available on the web. I found computational courses
by Dr Ephraim Eliav particularly good for beginners! Google him =)
G09 Tutorial - AT
page 33 of 34
G09 Tutorial - AT
page 34 of 34
12
REFERENCES
1. Young, D. Computational Chemistry: A Practical Guide for Applying Techniques to Real World Problems; John Wiley & Sons, inc: USA, 2001.
2. Jensen, F. Introduction to computational Chemistry, 2nd ed.; John Wiley &
Sons, inc: England, 2007.
3.
http://math.nist.gov/mcsd/savg/tutorial/ansys/FEM/
http://en.wikipedia.org/wiki/Finite_element_method
http://http://www.britannica.com/EBchecked/topic/430575/
optimization/235508/The-simplex-method
http://www.math.cornell.edu/~web401/matt.simplex.pdf
G09 Tutorial - AT
page 35 of 34
G09 Tutorial - AT
page 36 of 34