Tight Binding
Tight Binding
Tight Binding
Mervyn Roy
May 7, 2015
When the number of atoms and electrons is very small we can use an exact
method like configuration interaction to calculate the true many-electron
wavefunction. However, beyond about 10-electrons we hit the exponential
wall and such calculations become impossible.
For larger systems containing up to a few hundred or a few thousand atoms
we can use density functional theory (DFT) techniques to find the true
ground state density and ground state energy of the interacting system
without explicitly calculating the many-electron wavefunction. In a DFT
calculation we calculate approximate single-particle energies that, in practice, often give a reasonable approximation to the actual band structure of
the crystal.
In even larger systems, with around 10,000 or more atoms, we can no longer
use self-consistent DFT calculations to take into account the full interaction.
To calculate the band structure and a set of approximate single particle
states we instead try to include the effects of the interaction in a semiempirical way, using parameters that we can adjust to match experiment.
The starting point for all semi-empirical approaches is the physics. In metals,
for example, the electrons are almost free and so we can treat the single
1
particle states in terms of plane waves (leading to the Central equation, and
nearly free electron theory). We could also take a very different approach and
assume the states in a crystal look like combinations of the wavefunctions
of isolated atoms. We might imagine this is more likely to be the case in
insulators or semiconductors.
Here, we will solve the single particle Schrodinger equation for the states in
a crystal by expanding the Bloch states in terms of a linear combination of
atomic orbitals.
2
2.1
(1)
where Hat is the hamiltonian for a single atom and U encodes all the
differences between the true potential in the crystal and the potential of an
isolated atom. We assume U 0 at the centre of each atom in the crystal.
The single particle states in the crystal are then nk (r), where
Hnk (r) = Enk nk (r),
(2)
2.2
(3)
i (r)j (r + R)dr =
(4)
0 otherwise.
2
0.5
0
-1
x (a0)
2.3
Blochs theorem
(5)
(6)
where there are N lattice sites in the crystal and the factor of 1/ N ensures
the Bloch state is normalised (see appendix A).
2.3.1
Proof that
Re
ikR
n (r
Re
ikR
n (r
R)
1 X ikR
e
n (r (R R0 )).
N R
e
n (r (R00 )),
N R00
1 X ikR00
0
= eikR
e
n (r (R00 )),
N R00
nk (r + R0 ) =
= eikR nk (r),
P
so that the nk (r) = R eikR n (r R)/ N from Eq. (6) satisfies Blochs
theorem (Eq. (5)).
3
3.1
Imagine a crystal with translation vectors R, that has one atom in the unit
cell, and where only atomic s-orbitals s (r) contribute to the crystal states.
Then there is only 1 band (n = 1) and there is only one Bloch state we can
construct,
1 X ikR
k (r) =
e
s (r R).
(7)
N R
In this simple case we can find the dispersion relation (the relation between
energy and wavevector) simply by calculating the expectation value of the
energy,
Z
E(k) =
k (r)Hk (r)dr,
(8)
where the integrals are over all space. Then, substituting for k (r) from Eq.
(7) we find,
Z
1 X X ik(R0 R)
E(k) =
e
s (r R)Hs (r R0 )dr
N
0
R R
Z
1 X X ik(R0 R)
=
e
s (x)Hs (x (R0 R))dx,
(9)
N
0
R
and, because each of the terms in the sum over R is now identical, this sum
simply gives us a factor of N , one term for each of the N possible values of
R. Then,
Z
X
ikR00
E(k) =
e
s (x)Hs (x R00 )dx.
(10)
R00
We can now separate out different terms in the sum over R00 by considering
the range of the atomic s-orbitals, s (r). The atomic orbitals are tightly
localised: they are large when |r| is small and decay rapidly away from
r = 0.
First, if R00 = 0 then the integral in Eq. (10) becomes
Z
Z
s (x)Hs (x)dx = s (x)s s (x)dx = s ,
(11)
because Hs (x) = s s (x) and the atomic states s (x) are normalised. So,
the R00 = 0 simply gives s , the energy of the atomic s-orbital in an isolated
atom.
R
Next, if |R00 | is large we expect that the integral s (x)Hs (x R00 )dx 0
because the overlap between wavefunctions separated by large R00 is very
small. Typically, in a semi-empirical tight binding calculation we therefore only include terms where |R00 | is small, for example if R00 = is the
translation vector between an atom and its nearest neighbours (see Fig. 1).
Then,
Z
X
ik
s (x)Hs (x )dx.
(12)
e
E(k) = s +
eik (| |).
(14)
Often empirical relations are also used to scale the overlap integrals with the
2
separation | |. For example, in silicon the relation (| |) = Ae| | /| |2
gives the approximate scaling of the overlap integral with near neighbour
separation | |. By using approximate scaling relations, we can investigate
the effect on the band structure of straining or deforming a crystal.
3.2
= s +
eik (| |),
(15)
This is the dispersion relation for a single s-band in a 1D crystal. It describes how the energy varies with crystal momentum, k. It also tells us the
bandwidth (see Fig. 2).
In the 1D crystal the length of the unit cell in real space is a0 , so the length
of the unit cell in reciprocal space is 2/a0 . In figure 2 we plot E(k) inside
the unit cell in reciprocal space, from k = 0 to k = 2/a0 . At large k the
dispersion relation simply repeats.1 The bandwidth, 4, is marked on the
plot. This is the difference between the maximum and minimum allowed
energy of the band.
Energy
s+2
s
4
s-2
1
k (2/a0)
Usually we would plot E(k) within the first Brillouin zone (BZ). This is an equivalent
cell (the Wigner-Seitz cell) in reciprocal space. The BZ length is still 2/a0 but the BZ
extends from /a0 to /a0 rather than 0 to 2/a0 .
3.3
s-band in a 2D crystal
(16)
2/b
a
2/a
unit cell in reciprocal space (see Fig. 4). The cell lengths are 2/a and 2/b
(see Fig. 3) and the BZ extends from /a to /a and /b to /b. The
bandwidth is 4(a) + 4(b).
4
0.6
0.4
0.2
0.2
2
1
1
0
1
0
0.3
0.2
0.1
0
0.1
kx (1/)
0.2
0.3
0.6
0.4
0.2
0
0.2
ky (1/)
0.4
0.6
0.4
0.6
1
0.3 0.2 0.1 0
0.1
kx (1/)
0.2
Energy (eV)
ky (1/)
Energy (eV)
Energy (eV)
0.3
3.4
s-band in a 3D crystal
P
Generalising to 3D is now very easy. We still have E(k) = s + eik (| |)
but now k = (kx , ky , kz ) and will, in general, have x, y, and z components.
For example, in a face centred cubic crystal the 12 nearest neighbour vectors
are = (1, 1, 0)a/2, (1, 0, 1)a/2, (0, 1, 1)a/2. With a little algebra
this eventually gives
ky a
kx a
E(kx , ky , kz ) = s + 4(| |) cos
cos
+
2
2
ky a
kz a
kz a
kx a
cos
,
(17)
cos
+ cos
cos
2
2
2
2
where | | = a/ 2.
3.5
The parameter controls the bandwidth and the curvature of the bands.
We can adjust to match experiment. Or, if we know how scales with, for
example, atomic separation, we can see how the bandwidth will change when
we strain the crystal. Figure 5 shows the dispersion relation E(/2a, ky )
from Eq. (16) for 3 different values of (b). As (b) decreases, the bandwidth
and the curvature of the band also decrease. The effective mass is inversely
proportional to the curvature so, if is small the relevant effective masses
tend to be large.
5
4
Energy (eV)
2
1
0
1
0.6
3.6
0.4
0.2
0
0.2
ky (1/)
0.4
0.6
Now that we have seen how the bandwidth scales with the overlap integral,
, we can gain some insight into the origin of the bands in a crystal.
Energy levels
r
1/atomic spacing
n=3
n=2
n=1
N-fold
degenerate levels
Figure 6: Schematic diagram illustrating the origin of bands in a tight binding picture.
Left: the undegenerate electronic levels in a single atom. Right: energy levels for the N
atoms in a crystal, plotted as a function of overlap integral or the inverse of the atomic
spacing. Reproduced from Solid State Physics, N.W. Ashcroft and N.D. Mermin, Holt
Saunders International edition (1981).
3.7
Nb -atom basis
In general, a material will have more than one atom in the unit cell and,
in modern applications, the tight-binding method only comes into its own
when there are a very large number of atoms in the unit cell.
The chain molecule trans-polyacetylene shown in Fig. 7 is an example of a
material with 2 atoms in the unit cell.
In a crystal with an Nb atom basis (and where only one type of atomic
orbital contributes to the band states) we can make Nb linear combinations
of atomic orbitals that satisfy Blochs theorem,
1 X ikRi
ik (r) =
e
(r Ri ).
N R
i
10
(18)
a/2
a/2
Figure 7: Trans-polyacetylene. Top: plan view, with the two non-identical atom sites
in the basis labelled A and B. Bottom: side view showing a schematic of the pz orbitals
located on the central 4 atoms. The filled black circles label the carbon atoms and the
boxes denoted with a dashed line show the
unit cell. The unit cell contains two atoms,
one at (0, 0) and one at RAB = (a/2, a/2 3).
The i = 1, 2, . . . , Nb label each of the different atoms in the basis and the Ri
are translation vectors that take us between atoms of type i. For example,
in trans-polyacetylene (see Fig. 7) i would label an A or a B atom, and the
translation vectors would be RA = ai, 2ai, . . . with RB = RAB ai, RAB
2ai, . . ., where RAB is a vector between the A and B lattice sites in the basis,
and, as before, i and j are unit vectors in the x and y directions.
The crystal states can be expanded as a linear combination of the Nb Bloch
states,
X
nk (r) =
cik ik (r)
i
X
i
cik
eikRi (r Ri )/ N .
(19)
Ri
From the variational theorem, the best set of states we can find are the
ones with the lowest energy. So, to find the nk , we must minimise the
expectation value of the energy with respect to the coefficients, cik . This
11
where Hij = hik |H|jk i. This only has non-trivial solutions if the determinant,
|H E(k)I| = 0,
(21)
where H is a matrix of elements, Hij , and I is the unit matrix.
3.7.1
2-atom basis
(22)
With two atoms in the unit cell we get 2 valid solutions at each k. This
means two bands.
We can calculate the hamiltonian matrix elements in essentially the same
way as we did for the single s-band. For example, in trans-polyacetylene
(see Fig. 7) each carbon atom contribute a single p-orbital (of energy p)
to the conduction and valence bands. In this case we have,
Z
1 X X ik(R0 RA )
A
HAA =
e
s (r RA )Hs (r R0A )dr,
N
RA R0A
Z
X
ikR00
A
=
e
s (x)Hs (x R00A )dx,
RA 00
= p +
eimka (|ma|),
(24)
m6=0
12
Next, for simplicity we look at a case where the overlap integrals fall off
quickly with distance2 , and restrict the overlap integrals to distances a.
Then (ma) = 0 for |m| > 1, and we simply have that,
HAA = HBB = p + 2(a) cos(ka).
We follow the same procedure to calculate HAB ,
Z
1 X X ik(RA RB )
HAB =
e
s (r RA )Hs (r RB )dr,
N
RA RB
Z
X
0
=
eik(RAB +RA ) s (x)Hs (x (RAB + R0A ))dx.
RA
(25)
(26)
(28)
Using the results for HAA , HBB , and HAB we can explicitly calculate the
dispersion relation from Eq. (23). In trans-polyacetylene,
E(k) = p + 2(a) cos(ka) 2 cos(ka/2)(| |).
(29)
This dispersion relation is plotted in Fig. 8. The figure shows two different
possible dispersion relations, calculated with different parameters.
The famous graphene band structure near the Dirac point can be obtained
from an approach almost identical to that used for trans-polyacetylene. The
main difference is that graphene is a 2-dimensional crystal so k and the real
space lattice vectors vary in both x and y directions.
2
Of course we could retain terms from atoms further away and use the general result
for HAA from Eq. 24.
13
1.5
Figure 8:
Energy (eV)
0.5
0.5
1
0.3
3.8
0.2
0.1
0
k (1/)
0.1
0.2
0.3
In general, bands will contain contributions from more than one type of
orbital. In graphene, for example, the lowest energy valence bands (which
form the bonds between atoms) and the highest energy conduction bands,
are constructed from a mixture of s, px and py orbitals (this is known as sp2
hybridisation).
However, it is very easy to generalise the formalism in Eq.s (19) and (20) to
deal with multiple types of orbital. All we need to do is use the index i to
label both different basis sites and different orbital types.
For example, imagine a Nb = 2 basis crystal in which s, px , py and pz orbitals
all contribute to the bands of interest. We would then expect 24 = 8 bands
(from Nb = 2 atoms per unit cell, with 4 orbitals per atom) and we would
have to solve an 8 8 matrix eigenvalue equation to find the energies at each
k. In this case, if i = 1 labels an s-orbital on basis site A, and j = 5 labels
a px orbital on basis site B the relevant hamiltonian matrix element would
be,
Z
1 X X ik(RA RB )
Hij =
e
s (r RA )Hpx (r RB )dr.
(30)
N
RA RB
14
10
Figure 9:
Energy (eV)
0
-5
-10
-15
-20
-25
-30
wavevector
(r)nk (r)dr
I =
nk
Z
1 XX
0
=
eikR n (r R)eikR n (r R0 )dr
N
R R0
Z
X
X
1
ik(R0 R)
e
n (r R)n (r R0 )dr,
=
N
0
R
where each of the sums over R and R0 go over the N possible translation
vectors in the crystal.
R
However, from Eq. (4), n (r)n (r R)dr = (R), where (R) is equal to
one if R = 0 and zero otherwise. Hence,
1 X X ik(R0 R)
I =
e
(R R0 )
N
R R0
X
1
=
N
R
= 1,
(31)
and the linear combination of atomic orbitals in Eq. (6) is correctly normalised.
15