… but were afraid to ask
How I learned to stop worrying and not caring that much about hybridization.
The math behind orbital hybridization is fairly simple as I’ll try to show below, but first let me give my praise once again to the formidable Linus Pauling, whose creation of this model built a bridge between quantum mechanics and chemistry; I often say Pauling was the first Quantum Chemist (Gilbert N. Lewis’ fans, please settle down). Hybrid orbitals are therefore a way to create a basis that better suits the geometry formed by the bonds around a given atom and not the result of a process in which atomic orbitals transform themselves for better sterical fitting, or like I’ve said before, the C atom in CH4 is sp3 hybridized because CH4 is tetrahedral and not the other way around. Jack Simmons put it better in his book:
The atomic orbitals we all know and love are the set of solutions to the Schrödinger equation for the Hydrogen atom and more generally they are solutions to the hydrogen-like atoms for which the value of Z in the potential term of the Hamiltonian changes according to each element’s atomic number.
Since the Hamiltonian, and any other quantum mechanical operator for that matter, is a Hermitian operator, any given linear combination of wave functions that are solutions to it, will also be an acceptable solution. Therefore, since the 2s and 2p valence orbitals of Carbon do not point towards the edges of a tetrahedron they don’t offer a suitable basis for explaining the geometry of methane; even more so these atomic orbitals are not degenerate and there is no reason to assume all C-H bonds in methane aren’t equal. However we can come up with a linear combination of them that might and at the same time will be a solution to the Schrödinger equation of the hydrogen-like atom.
Ok, so we need four degenerate orbitals which we’ll name ζi and formulate them as linear combinations of the C atom valence orbitals:
ζ1= a12s + b12px + c12py + d12pz
ζ2= a22s + b22px + c22py + d22pz
ζ3= a32s + b32px + c32py + d32pz
ζ4= a42s + b42px + c42py + d42pz
to comply with equivalency lets set a1 = a2 = a3 = a4 and normalize them:
a12 + a22 + a32 + a42 = 1 ∴ ai = 1/√4
Lets take ζ1 to be directed along the z axis so b1 = c1 = 0
ζ1 = 1/√4(2s) + d12pz
since ζ1 must be normalized the sum of the squares of the coefficients is equal to 1:
1/4 + d12 = 1;
d1 = √3/2
Therefore the first hybrid orbital looks like:
ζ1 = 1/√4(2s) +√3/2(2pz)
We now set the second hybrid orbital on the xz plane, therefore c2 = 0
ζ2 = 1/√4(2s) + b22px + d22pz
since these hybrid orbitals must comply with all the conditions of atomic orbitals they should also be orthonormal:
〈ζ1|ζ2〉 = δ1,2 = 0
1/4 + d2√3/2 = 0
d2 = –1/2√3
our second hybrid orbital is almost complete, we are only missing the value of b2:
ζ2 = 1/√4(2s) +b22px +-1/2√3(2pz)
again we make use of the normalization condition:
1/4 + b22 + 1/12 = 1; b2 = √2/√3
Finally, our second hybrid orbital takes the following form:
ζ2 = 1/√4(2s) +√2/√3(2px) –1/√12(2pz)
The procedure to obtain the remaining two hybrid orbitals is the same but I’d like to stop here and analyze the relative direction ζ1 and ζ2 take from each other. To that end, we take the angular part of the hydrogen-like atomic orbitals involved in the linear combinations we just found. Let us remember the canonical form of atomic orbitals and explicitly show the spherical harmonic functions to which the 2s, 2px, and 2pz atomic orbitals correspond:
ψ2s = (1/4π)½R(r)
ψ2px = (3/4π)½sinθcosφR(r)
ψ2pz = (3/4π)½cosθR(r)
we substitute these in ζ2 and factorize R(r) and 1/√(4π)
ζ2 = (R(r)/√(4π))[1/√4 + √2 sinθcosφ –√3/√12cosθ]
We differentiate ζ2 respect to θ, and set it to zero to find the maximum value of θ respect to the z axis we get the angle between the first to hybrid orbitals ζ1 and ζ2 (remember that ζ1 is projected entirely over the z axis)
dζ2/dθ = (R(r)/√(4π))[√2 cosθ –√3/√12sinθ] = 0
sinθ/cosθ = tanθ = -√8
θ = -70.53°,
but since θ is measured from the z axis towards the xy plane this result is equivalent to the complementary angle 180.0° – 70.53° = 109.47° which is exactly the angle between the C-H bonds in methane we all know! and we didn’t need to invoke the unpairing of electrons in full orbitals, their promotion of any electron into empty orbitals nor the ‘reorganization‘ of said orbitals into new ones. Orbital hybridization is nothing but a mathematical tool to find a set of orbitals which comply with the experimental observation and that is the important thing here!
To summarize, you can take any number of orbitals and build any linear combination you want, in order to comply with the observed geometry. Furthermore, no matter what hybridization scheme you follow, you still take the entire orbital, you cannot take half of it because they are basis functions. That is why you should never believe that any atom exhibits something like an sp2.5 hybridization just because their bond angles lie between 109 and 120°. Take a vector v = xi+yj+zk, even if you specify it to be v = 1/2i that means x = 1/2, not that you took half of the unit vector i, and it doesn’t mean you took nothing of j and k but rather than y = z = 0.
This was a very lengthy post so please let me know if you read it all the way through by commenting, liking, or sharing. Thanks for reading.
The concept of electronic orbital has become such a useful and engraved tool in understanding chemical structure and reactivity that it has almost become one of those things whose original meaning has been lost and replaced for a utilitarian concept, one which is not bad in itself but that may lead to some wrong conclusions when certain fundamental facts are overlooked.
Last week a wrote -what I thought was- a humorous post on this topic because a couple of weeks ago a viewpoint in JPC-A was published by Pham and Gordon on the possibility of observing molecular orbitals through microscopy methods, which elicited a ‘seriously? again?‘ reaction from me, since I distinctly remember the Nature article by Zuo from the year 2000 when I just had entered graduate school. The article is titled “direct observation of d-orbital holes.” We discussed this paper in class and the discussion it prompted was very interesting at various levels: for starters, the allegedly observed d-orbital was strikingly similar to a dz2, which we had learned in class (thanks, prof. Carlos Amador!) that is actually a linear combination of d(z2-x2) and d(z2-y2) orbitals, a mathematical -lets say- trick to conform to spectroscopic observations.
Pham and Gordon are pretty clear in their first paragraph: “The wave function amplitude Ψ*Ψ is interpreted as the probability density. All observable atomic or molecular properties are determined by the probability and a corresponding quantum mechanical operator, not by the wave function itself. Wave functions, even exact wave functions, are not observables.” There is even another problem, about which I wrote a post long time ago: orbitals are non-unique, this means that I could get a set of orbitals by solving the Schrödinger equation for any given molecule and then perform a unit transformation on them (such as renormalizing them, re-orthonormalizing them to get a localized version, or even hybridizing them) and the electronic density derived from them would be the same! In quantum mechanical terms this means that the probability density associated with the wave function internal product, Ψ*Ψ, is not changed upon unit transformations; why then would a specific version be “observed” under a microscope? As Pham and Gordon state more eloquently it has to do with the Density of States (DOS) rather than with the orbitals. Furthermore, an orbital, or more precisely a spinorbital, is conveniently (in math terms) separated into a radial, an angular and a spin component R(r)Ylm(θ,φ)σ(α,β) with the angular part given by the spherical harmonic functions Ylm(θ,φ), which in turn -when plotted in spherical coordinates- create the famous lobes we all chemists know and love. Zuo’s observation claim was based on the resemblance of the observed density to the angular part of an atomic orbital. Another thing, orbitals have phases, no experimental observation claims to have resolved those.
Now, I may be entering a dangerous comparison but, can you observe a 2? If you say you just did, well, that “2” is just a symbol used to represent a quantity: two, the cardinality of a set containing two elements. You might as well depict such quantity as “II” or “⋅⋅” but still cannot observe “a two”. (If any mathematician is reading this, please, be gentle.) I know a number and a function are different, sorry if I’m just rambling here and overextending a metaphor.
Pretending to having observed an orbital through direct experimental methods is to neglect the Born interpretation of the wave function, Heisenberg’s uncertainty principle and even Schrödinger’s cat! (I know, I know, Schrödinger came up with this gedankenexperiment in order to refute the Copenhagen interpretation of quantum mechanics, but it seems like after all the cat is still not out of the box!)
So, the take home message from the viewpoint in JPC is that molecular properties are defined by the expected values of a given wave function for a specific quantum mechanical operator of the property under investigation and not from the wave function itself. Wave functions are not observables and although some imaging techniques seem to accomplish a formidable task the physical impossibility hints to a misinterpretation of facts.
I think I’ll write more about this in a future post but for now, my take home message is to keep in mind that orbitals are wave functions and therefore are not more observable (as in imaging) than a partition function is in statistical mechanics.
Communication of scientific findings is an essential skill for any scientist, yet it’s one of those things some students are reluctant to do partially because of the infamous blank page scare. Once they are confronted to writing their thesis or papers they make some common mistakes like for instance not thinking who their audience is or not adhering to the main points. One of the the highest form of communication, believe it or not, is gossip, because gossip goes straight to the point, is juicy (i.e. interesting) and seldom needs contextualization i.e. you deliver it just to the right audience (that’s why gossiping about friends to your relatives is almost never fun) and you do it at the right time (that’s the difference between gossips and anecdotes). Therefore, I tell my students to write as if they were gossiping; treat your research in a good narrative way, because a poor narrative can make your results be overlooked.
I’ve read too many theses in which conclusions are about how well the methods work, and unless your thesis has to do with developing a new method, that is a terrible mistake. Methods work well, that is why they are established methods.
Take the following example for a piece of gossip: Say you are in a committed monogamous relationship and you have the feeling your significant other is cheating on you. This is your hypothesis. This hypothesis is supported by their strange behavior, that would be the evidence supporting your hypothesis; but be careful because there could also be anecdotal evidence which isn’t significant to your own as in the spouse of a friend had this behavior when cheating ergo mine is cheating too. The use of anecdotal evidence to support a hypothesis should be avoided like the plague. Then, you need an experimental setup to prove, or even better disprove, your hypothesis. To that end you could hack into your better half’s email, have them followed either by yourself or a third party, confronting their friends, snooping their phone, just basically about anything that might give you some information. This is the core of your research: your data. But data is meaningless without a conclusion, some people think data should speak for itself and let each reader come up with their own conclusions so they don’t get biased by your own vision and while there is some truth to that, your data makes sense in a context that you helped develop so providing your own conclusions is needed or we aren’t scientists but stamp collectors.
This is when most students make a terrible mistake because here is where gossip skills come in handy: When asked by friends (peers) what was it that you found out, most students will try to convince them that they knew the best algorithms for hacking a phone or that they were super conspicuous when following their partners or even how important was the new method for installing a third party app on their phones to have a text message sent every time their phone when outside a certain area, and yeah, by the way, I found them in bed together. Ultimately their question is left unanswered and the true conclusion lies buried in a lengthy boring description of the work performed; remember, you performed all that work to reach an ultimate goal not just for the sake of performing it.
Writers say that every sentence in a book should either move the story forward or show character; in the same way, every section of your scientific written piece should help make the point of your research, keep the why and the what distinct from the how, and don’t be afraid about treating your research as the best piece of gossip you’ve had in years because if you are a science student it is.
Out of some +1000 twitter accounts I follow about a quarter are related computational chemistry. The following public list isn’t comprehensive and prone to errors and contains researchers, programmers, students, journals, products and companies who gravitate around the use of in silico methods for the understanding and design of chemical and biochemical compounds.
As is the case of proteins, the functioning of DNA is highly dependent on its 3D structure and not just only on its sequence but the difference is that protein tertiary structure has an enormous variety whereas DNA is (almost) always a double helix with little variations. The canonical base pairs AT, CG stabilize the famous double helix but the same cannot be guaranteed when non-canonical -unnatural- base pairs (UBPs) are introduced.
When I first took a look at Romesberg’s UBPS, d5SICS and dNaM (throughout the study referred to as X and Y see Fig.1) it was evident that they could not form hydrogen bonds, in the end they’re substituted naphtalenes with no discernible ways of creating a synton like their natural counterparts. That’s when I called Dr. Rodrigo Galindo at Utah University who is one of the developers of the AMBER code and who is very knowledgeable on matters of DNA structure and dynamics; he immediately got on board and soon enough we were launching molecular dynamics simulations and quantum mechanical calculations. That was more than two years ago.
Our latest paper in Phys.Chem.Chem.Phys. deals with the dynamical and structural stability of a DNA strand in which Romesberg’s UBPs are introduced sequentially one pair at a time into Dickerson’s dodecamer (a palindromic sequence) from the Protein Data Bank. Therein d5SICS-dNaM pair were inserted right in the middle forming a trisdecamer; as expected, +10 microseconds molecular dynamics simulations exhibited the same stability as the control dodecamer (Fig.2 left). We didn’t need to go far enough into the substitutions to get the double helix to go awry within a couple of microseconds: Three non-consecutive inclusions of UBPs were enough to get a less regular structure (Fig. 2 right); with five, a globular structure was obtained for which is not possible to get a proper average of the most populated structures.
X and Y don’t form hydrogen bonds so the pairing is pretty much forced by the scaffold of the rest of the DNA’s double helix. There are some controversies as to how X and Y fit together, whether they overlap or just wedge between each other and according to our results, the pairing suggests that a C1-C1′ distance of 11 Å is most stable consistent with the wedging conformation. Still much work is needed to understand the pairing between X and Y and even more so to get a pair of useful UBPs. More papers on this topic in the near future.
As if I didn’t have enough things to do I’m launching a new blog inspired by the #365papers hashtag on Twitter and the naturalproductman.wordpress.com blog. In it I’ll hopefully list, write a femto-review of all the papers I read. This new effort is even more daunting than the actual reading of the huge digital pile of papers I have in my Mendeley To-Be-Read folder, the fattest of them all. The papers therein wont be a comprehensive review of Comp.Chem. must-read papers but rather papers relevant to our lab’s research or curiosity.
Maybe I’ll include some papers brought to my attention by the group and they could do the review. The whole endeavor might flop in a few weeks but I want to give it a shot; we’ll see how it mutates and if it survives or not. So far I haven’t managed to review all papers read but maybe this post will prompt to do so if only to save some face. The domain of the new blog is compchemdigest.wordpress.com but I think it should have included the word MY at the beginning so as to convey the idea that it is only my own biased reading list. Anyway, if you’re interested share it and subscribe, those post will not be publicized.
Ever since I read the highly praised article by Floyd Romesberg in Nature back in 2013 I got really interested in synthetic biology. In said article, an unnatural base pair (UBP) was not only inserted into a DNA double strand in vivo but the organism was even able to reproduce the UBPs present in subsequent generations.
Inserting new unnatural base pairs in DNA works a lot like editing a computer’s code. Inserting a couple UBPs in vitro is like inserting a comment; it wont make a difference but its still there. If the DNA sequence containing the UBPs can be amplified by molecular biology techniques such as PCR it means that a polymerase enzyme is able to recognize it and place it in site, this is equivalent to inserting a ‘hello world’ section into a working code; it will compile but it’s pretty much useless. Inserting these UBPs in vivo means that the organism is able to thrive despite the large deformation in a short section of its genetic code, but having it replicated by the chemical machinery of the nucleus is an amazing feat that only a few molecules could allow.
The ultimate goal of synthetic biology would be to find a UBP which codes effectively and purposefully during translation of DNA.This last feat would be equivalent to inserting a working subroutine in a program with a specific purpose. But not only could the use of UBPs serve for the purposes of expanding the genetic code from a quaternary (base four) to a senary (base six) system: the field of DNA origami could also benefit from having an expansion in the chemical and structural possibilities of the famous double helix; marking and editing a sequence would also become easier by having distinctive sections with nucleotides other than A, T, C and G.
It is precisely in the concept of double helix that our research takes place since the available biochemical machinery for translation and replication can only work on a double helix, else, the repair mechanisms get activated or the DNA will just stop serving its purpose (i.e. the code wont compile).
My good friend, Dr. Rodrigo Galindo and I have worked on the simulation of Romesberg’s UBPs in order to understand the underlying structural, dynamical and electronic causes that made them so successful and to possibly design more efficient UBPs based on a set of general principles. A first paper has been accepted for publication in Phys.Chem.Chem.Phys. and we’re very excited for it; more on that in a future post.