## Why Schrödinger’s equation?

**“Why this equation?”**

I recently overheard someone ask this about Schrödinger’s equation. The answer they received was, for me, unsatisfying. “Because it agrees with experiment.” Of course, that answers perfectly why the equation was adopted by future generations of physicists and indeed the calculation of the spectrum of atomic hydrogen from the energy eigenvalues of the Schrödinger operator is one of the most convincing and wholesome computations a young physicist can do. But the question that was left unanswered, the question I believe was being asked, was: “Why did Schrödinger write this equation down? Why not something else?” I don’t believe for a second that Schrödinger sat down with an array of different equations and worked out what each of them predicted about hydrogen before he found the one that fit…

I turned to one of Schrödinger’s (eminently readable) original papers on the subject:

and was overjoyed to find that Schrödinger had a very definite picture in mind when he derived his equation. The idea was this: some ninety-nine years previously, William Rowan Hamilton had presented his general theory of geometric optics to the Royal Irish Academy, a mathematical description of light rays. This theory is a good approximation to reality when the light a has very short wavelength (like violet) but doesn’t account for various optical phenomena like diffraction, which require the finer description of light as an electromagnetic wave. This description was put on a mathematical footing by Maxwell who proved that electromagnetic fields propagate in free space according to the wave equation.

Hamilton later applied his formalism to describe classical mechanics. Schrödinger and de Broglie thought that one might reasonably expect there to be a wave theory underlying classical mechanics and reducing to it in the short wavelength limit. The difficulty was how to guess a wave equation that would give the right short wavelength limit.

Schrödinger took as his starting point the Hamilton-Jacobi equation, so let’s review this. I’ll assume you’re happy with the usual Hamiltonian/Lagrangian formulation of classical mechanics.

**Hamilton-Jacobi theory.**

For any pair of points consider the space of paths between and . Let be a Lagrangian and

be the action of that path. A classical path of time joining and is a solution to the corresponding Euler-Lagrange equation. Let’s suppose we’re in an ideal situation: for any and any pair of points , there is a unique classical path of time joining and .

Fix , but allow and to vary. Define the function

The Hamilton-Jacobi equation is a PDE satisfied by this function. Let’s first compute the derivatives of with respect to . Replace by and suppose that . Then, for small , writing , we have

by the usual Euler-Lagrange argument for computing the first variation of . Since is the classical path, the first term vanishes. Since (the point is fixed) the only remaining term is . Since , this means that the first variation of W is

By Hamilton’s equations, so this says that is the th component of momentum at the endpoint of the path.

Now, by the fundamental theorem of calculus:

but by the chain rule

This gives

where is the momentum at the endpoint. Since the Hamiltonian and Lagrangian are related by a Legendre transform, we have

so we see that satisfies the Hamilton-Jacobi equation

**Autonomous case.**

If we assume that the Hamiltonian is of the form

(in particular time-independent) then we know by energy conservation that

so

for some constant , and the Hamilton-Jacobi equation reduces to

We also have the momentum from our earlier computation so the speed is .

**Schrödinger’s idea.**

In going from geometric optics to wave optics you imagine little sine waves travelling along your rays and you imagine that the phase of the sine wave changes linearly with the optical path length. Via the optical/mechanical analogy (the direct correspondence between the Hamiltonian formalism of optics and of mechanics) one translates optical path length into the action , so Schrödinger’s guess was to replace classical trajectories by sine waves whose phase is proportional to the function . In other words, he postulates a wavefunction

for some constant . This constant has units of action so that the argument of sine is dimensionless.

The frequency of this sine wave is so, comparing with the empirical relationship coming from the Einstein/Planck analyses of the photoelectric effect/black body radiation formula, Schrödinger guessed

.

I want to quickly recall the notion of phase velocity. This is different to the velocity of our classical particles – rather it is the speed of a crest of the underlying wave. The crest is a surface of constant phase , that is at each instant the crest is a level surface of the function , that is . Let be the vector field and let be an integral curve of starting at time 0 on the crest. Then

so the integral curve keeps up with the crest. We think of as the phase velocity vector, so the phase speed is

Crucially, the phase speed depends on which depends on the frequency, so the wave equation which underlies Schrödinger’s matter waves must be dispersive (which means exactly that the phase speed depends on the frequency – in other words, different frequencies will disperse because they travel with different speeds). Schrödinger next made the simplest guess as to what the equation should be governing waves of a fixed frequency, namely he guessed the usual wave equation

for waves whose time dependence is through a factor . The usual (light) wave equation replaces u by the constant . Here is given by the dispersion relation

Since has time dependence this means that

Now subsituting this and the dispersion relation into the wave equation gives

or the more familiar

which is Schrödinger’s equation.

For me, this route to Schrödinger’s equation seems extremely natural when compared to Dirac’s magic with Poisson brackets.

## Leave a Reply