Wednesday, July 8, 2015
Gamess (US) frequently asked questions. Part 7: How to distinguish alpha from beta orbitals in the $VEC deck
1st) the number of the orbital to which the coefficients belong (written with at most two characters, so that 1 means orbital 1, .. , 99 means orbital 99, 00 means orbital 100) . This number is repeated in the beginning of every line, until all coefficients for that orbital have been written
2nd) this number tells the program how to assign the coefficients to the basis functions. "1" means that the coefficients are for basis functions 1-5, "2" means that the coefficients are for basis functions 5-10, etc. In general , that number "n" directs the program to assign the five coefficients present in the line to basis functions 5*(n-1)+1 to 5*n.
3rd to 7th) coefficients of five basis functions
BETA orbitals are punched as a group immediately after all ALPHA orbitals.
This format entails that in molecules with more than 100 orbitals the $VEC group contains several blocks with the same 1st number. For example, in a molecule with 200 orbitals, alpha orbital 27 is described by the first block of lines beginning with "27", and alpha orbital 127 is described by the SECOND block of lines beginning with "27".
I usually find the beginning of the BETA orbitals by repeating a search for the string " 1 1" : if that string is preceded by a block beginning with "00 1", it usually refers to orbitals 101, or 201, etc. (the exception being those systems with exactly 100, 200, etc. orbitals). If string " 1 1" is NOT preceded by a block beginning with "00 1", you are sure to have found the beginnning of the BETA orbitals
Thursday, March 19, 2015
My new preprint is up
Addendum: the paper has been published
Monday, September 8, 2014
Making good on my "Open Access" pledge
My most recent paper has just been published in PeerJ . It was a LONG time in the making, to the point that my 12-yo daughter once told me (only half-in-jest), that I should "cut my losses and forget about it". I am quite happy about how it turned out: besides describing an analysis of a reaction mechanism and the influence of the redox state of a hard-to-converge Fe-S cluster , it also contains the first computations including the weighed contributions of 1.2*1013 protonations states of a protein on the reaction it catalyzes. The computational approach described here is relatively simple to perform provided that one has a good estimate of the relative abundances of those protonation states, which can be obtained through Monte Carlo sampling once the site-site interactions have been computed with a Poisson-Boltzmann solver. To my mind, this is clearly superior to the usual approach of considering only the "most likely" protonation state (which may often not be the state with the most significant influence on the electrostatic field surrounding the active site). What do you think of it?
Programs needed to use this approach:
MCRP, by Baptista et al., ITQB, Lisbon
MEAD, by Don Bashford, currently at St. Jude Children's research hospital
Any molecular mechanics code, to compute the change of the total electrostatic energy as each individual amino acid is protonated/deprotonated
Thursday, April 24, 2014
Gamess (US) frequently asked questions part 6: Obtaining proper SCF convergence (Anti-)ferromagnetic coupled Fe-S clusters
- Fe1 and Fe2 up-spin, Fe3 and Fe4 down-spin; or
- Fe1 and Fe4 up-spin, Fe2 and Fe3 down-spin; or
- Fe1 and Fe3 up-spin, Fe2 and Fe4 down-spin;
- obtain orbitals for bare Fe2+, Fe3+, S2-, and isolated ligands, with proper spins on the Fe atoms (5/2 for Fe3+, 2 for Fe2+)
- Manually split the "alpha/up" and "beta/down" portions of the resulting $VEC groups. For example, assuming you have a system with three Fe atoms (two Fe2+ and one Fe3+) with total spin S=5/2 and the $VEC groups for bare Fe2+ and bare Fe3+, you should cut the $VEC groups of Fe2+ and Fe3+ as:
$VEC for the alpha (up) electrons of Fe3+ (let's call it "Fe3+_5_d_electrons")
$VEC for the beta (down) electrons of Fe2+ (let's call it "Fe2+_1_d_electron")
$VEC for the beta (down) electrons of Fe3+ (let's call it "Fe3+_0_d_electrons")
"Fe2+_5_d_electrons" for one of the Fe2+ ions,
"Fe2+_1_d_electrons" for the other Fe2+,
"Fe3+_5_d_electrons" for the Fe3+
Building the new guess for the "down" electrons should include:
"Fe2+_1_d_electrons" for the FIRST Fe2+ ions,
"Fe2+_5_d_electrons" for the other Fe2+,
"Fe3+_0_d_electrons" for the Fe3+
- combine the orbitals using the small utility called combo, which you may obtain from Alex Granovsky's Firefly website.
- Manually paste the "alpha" and "beta" guesses into a single $vec group, which would be the proper guess.
- cross all your fingers and toes, and expect it to converge into the proper state. If it does not converge, change convergers (SOSCF=.T. DIIS=.F.), onset of SOSCF (SOGTOL=1e-3) , etc.
- after SCF optimization using this guess, manually scramble the ordering of Fe atoms in your input, to ascertain whether a lower energy solution can be obtained with a different spin distribution.
Good Luck!
Friday, July 5, 2013
Gamess (US) frequently asked questions Part 5: "THE VIBRATIONAL ANALYSIS IS NOT VALID"
$STATPT OPTTOL=
It is well known that the vibrational analysis is strictly valid mathematically when the Hessian is computed in true stationary points (i.e when the gradient is exactly equal to zero). If the maximum gradient is sufficiently close to zero, the vibrational analysis (although not absolutely correct) is still close enough to the "true" solution for all practical purposes.
This introduction brings us to today's FAQ. A recurring question in both the Gamess-US list and the Firefly forums concerns the message often printed by the program after a vibrational analysis:
*THIS IS NOT A STATIONARY POINT ON THE MOLECULAR PES THE VIBRATIONAL ANALYSIS IS NOT VALID*
This message arises from the way gradients are analyzed by Gamess: gradients are originally computed in one set of coordinates (cartesian coordinates, I believe) , and then transformed into the coordinate system specified by the user. Optimizations stop when the "transformed gradient" lies below OPTTOL, but Gamess uses the original, non-transformed, gradient to decide whether to consider the geometry as a stationary point on the molecular PES. Therefore, if the geometry is converged, the scary message in capital letters above may be safely disregarded. When in doubt, simply decrease your OPTTOL value, continue the optimization and re-compute the hessian.
Wednesday, June 19, 2013
Gamess (US) frequently asked questions Part 1: SCF convergence
When your SCF does not converge, you should re-run the job including a $guess guess=moread $end line, as well as the complete $VEC group present in the output PUNCH file (usually called <jobname>.dat, and present in you scratch directory).
- Addendum:
Whenever you read a $VEC group from a UHF run you must assign NORB in the $GUESS group. An additional problem is that by default the $VEC group only includes the occupied orbitals, and this means that in UHF runs the $VEC group does not include equal numbers of alpha and beta orbitals (e.g., a run with 41 electrons and MULT=2) will have 21 alpha orbitals and 20 beta orbitals. Therefore, if you include
$guess guess=moread NORB=21 $end
Gamess will crash because there are not 21 beta orbitals, and if you input
$guess guess=moread NORB=20 $end
there will be another error, since there are more than 20 alpha orbitals. In these cases, you should check the number of alpha and beta orbitals. Then , copy the coefficients of the extra alpha orbitals to the end of the beta orbitals. In my example above
$guess guess=moread NORB=21 $end
will yield no problems, since the modification of the VEC group yields equal numbers of alpha and beta orbitals. There is also an option to PUNCH every orbital (occupied+virtuals) at every step. In this case, Gamess always punches a full $VEC group, making it very easy to assign NORB as one can simply inspect the output file to learn the number of orbitals. However, this yields gigantic PUNCH files, and may therefore not be feasible.
You should also experiment with changing convergers, damping, etc. Some systems are notoriously hard to converge, and may require several re-iterations of the whole process.
Thursday, March 15, 2012
QM/MM vs. QM-only studies of large cluster models
Walter Thiel has now published a QM/MM analysis of the reaction mechanism of acetylene hydratase (previously studied by Fahmi Himo using increasingly large QM-only models). Inclusion of the surrounding protein dramatically changed the results for the largest model studied by Himo, due to the absence (in the "cluster model") of two negatively charged phosphate groups adjacent to the active site. Although these charges are quite "shielded" from the active site because of neighbouring positively-charged amino acids, they originate local charge assymmetries that interact differently with the active site during each step of the catalytic cycle. This effect is quite similar to the major influence of the internal protein dipoles on enzyme catalysis expounded by Arieh Warshel, and should be kept in mind by all of us who tend to prefer the QM-only approach: a polarizable-continuum model assumes a homogeneous environment surrounding the QM system, and in proteins "it ain't necessarily so".
Tuesday, November 29, 2011
The limits of homology modeling
Recently, two small proteins with very high homology (>95%) but widely differing structure have been designed and studied. Starting from a pair of proteins with < 20 % identity and different 3D structures, the authors gradually mutated one sequence into the other, and ended up generating two sequences differing only in one amino acid, but with different folds. Attempts to unravel the precise mechanisms governing the selection of one fold over the other have however been inconclusive, because current molecular dynamics protocols and force fields are not accurate enough to measure the small energy differences involved.
Monday, October 17, 2011
Limitations of PCM
Tuesday, September 20, 2011
QM molecular dynamics
Ab initio molecular simulations (e.g. Car-Parrinello MD) are much more expensive, and are generally limited to (at most) a few dozen atoms and <100 ps. Two papers from Prof. Shogo Sakai's group show that QM molecular simulations can be performed with considerable time-savings if the system is partitioned into several smaller systems. They have not yet developed the theory to the point where one can attempt bond-breaking, but theirs seems a fruitful approach to the problem.
Thursday, July 14, 2011
Fe-S clusters
The large number of electrons in Fe and the complexity of the possible couplings between spin states make the theoretical analysis of the electronic structures in Fe-S clusters quite difficult.
Takano et al. have recently published a paper on the differences between a Cys3Asp ligated 4Fe-4S cluster and the "regular" (all Cys) 4Fe-4S cluster. The authors nicely analyze the influence of the Asp (and other) ligands on the electronic structure of the 4Fe-4S cluster, observe a -0.10 V difference in redox potential (vs. normal 4Fe-4S) in high dielectric constants, and offer this observation as the reason for the low potential of this cluster.
I do not accept this last conclusion for two reasons:
Thursday, July 1, 2010
Computing redox potentials
a) the intrinsic error of the theoretical level used to compute the electronic energies
b) the error associated with the continuum method itself.
Whereas the first error may be rigorously quantified by comparison with experimental gas phase values and made very small with the choice of an appropriate basis set/theory level combination , most continuum methods yield less predictable errors (especially when the redox-active portion of the solute is present in a very heterogenous environment, like an enzyme active site).
Dejun Si and Hui Li have now improved the continuum solvation methods by including the possibility of assigning different dielectric constants to different parts of the solute cavity surface, thus improving the description of heterogeneous environments. These authors have also shown this approach to correctly predict the relative redox potentials of the type I copper centers (optimized in vacuo) in eleven different proteins with maximum errors < 0.1 V (provided that the systems include approximately 100 protein atoms around the Cu Center). The error can be minimized to < 0.05 V by optimizing the geometries using the newly-developed heterogenuous polarizable continuum.
This new continuum method is implemented in the latest release of GAMESS, a free and very powerful quantum chemistry package available from Mark Gordon's group, at Iowa State University.