Photons

Here are some things you probably know about photons.

  • A photon is a particle of light.

  • A photon is a quantized excitation of the electromagnetic field.

  • Photons were first proposed by Einstein, who used them to explain the photoelectric effect.

All of these statements are half true. They are not entirely wrong, but also not entirely right. If you take them too literally, they will mislead you.

Let’s start at the beginning. What exactly is a photon? Here we immediately run into our first source of confusion, because the word “photon” is commonly used to mean two entirely different things. Writers often do not bother to clarify which meaning they are using. Worse, they may even jump back and forth between the two meanings without giving any indication they have just changed definitions. In some cases, they may not even realize they are doing it.

First, the word “photon” is commonly used to refer to a plane wave excitation of the electromagnetic field. This is just a mathematical abstraction. There is nothing fundamental about plane waves. You can describe the field in terms of any basis functions you choose. For a particular problem, it might be more convenient to work in a different basis, and sometimes authors refer to excitations of those other basis functions as “photons”. But most often, the word is reserved for plane wave excitations.

This is the definition that is used when people refer to photons as “particles”. Notice, however, that it is totally different from what you probably think of when you hear the word “particle”. In classical mechanics, a particle is a point-like object. It exists at a particular location. In fact, that is probably part of your intuitive definition of what it means for something to be a particle: it is highly localized. It is over here, not over there. But a plane wave is completely non-localized. It exists with constant amplitude at every point in the universe. It is the exact opposite of what the word “particle” means to most people.

Of course, this leads to much confusion. When you are told that a photon is a “particle of light”, you probably imagine it as a tiny glowing ball. When you are told that a beam of light is “made up of photons”, you imagine a steady rain of glowing balls. Some authors even explicitly describe it in that way. But this picture is totally wrong. It has nothing to do with the actual definition being used.

Also keep in mind that true plane waves do not exist in the real world. Over a limited region, an excitation might be well approximated as a plane wave, but all real excitations are finite in extent.

To summarize, a photon (in this sense of the word) is a mathematical abstraction corresponding to something that does not exist in the real world, and that is the exact opposite of what you probably think of when you hear the word “particle”. How could that possibly be confusing?

There is a second common definition of the word “photon”. When an electron in an atom or molecule transitions between states, energy must be conserved. If the initial and final states have different energies, it must absorb or emit energy in some form to make up the difference. Often that takes the form of electromagnetic radiation. A “photon” is the radiation absorbed or emitted in this process to conserve energy.

This definition is the one relevant to the photoelectric effect. Einstein showed that he could correctly predict experiment if he assumed a photoelectric material always absorbs light in units of \(\hbar \omega\). 1 He postulated that this was due to a fundamental property of the electromagnetic field: that radiation is made up of discrete “light quanta”. (The word “photon” was not introduced until many years later.)

We no longer need to make that assumption. Quantum mechanics gives a completely different explanation. One can easily show that if a system is subject to an oscillating perturbation (for example, an external electromagnetic field) of angular frequency \(\omega\), it will tend to induce transitions between states whose energies are \(\hbar \omega\) apart. The derivation uses a completely classical treatment of the perturbation. In the case of the photoelectric effect, the electrons in the material are described with quantum mechanics, but the radiation is described as a classical field. Radiation of frequency \(\omega\) will tend to be absorbed in units of \(\hbar \omega\) because those are the transitions it stimulates. One need not assume anything about the field being quantized.

The photoelectric effect is the first in a long line of phenomena that have been widely cited as evidence for the electromagnetic field being quantized, only to later be shown to be entirely consistent with a classical field. Other examples include the blackbody spectrum, the Compton effect, the Lamb shift, spontaneous emission, antibunching, and the Hong-Ou-Mandel dip. At this time, I am not aware of any compelling evidence proving the electromagnetic field must be quantized.

When reviewing the history of this subject, it is hard to escape the impression that many people approach the question backwards. They begin by assuming the electromagnetic field must be quantized, then search for evidence they can interpret as supporting that assumption. The number of incorrect answers that have been widely accepted within the community, only to later be disproven, is remarkable. It seems likely that if the supposed evidence had contradicted, not supported, what everyone “knew” to be true, it would have received a much more critical reception.

There is an important lesson here. As scientists and as rationalists, we all must be constantly vigilant against confirmation bias. When a new argument supports our beliefs, that is when we must be most on our guard. We must not relax our standards for rigor just because it produces what we think is the correct result.


1

Einstein, A. “Über einen die Erzeugung und Verwandlung des Lichtes betreffenden heuristischen Gesichtspunkt.” Annalen der Physik 17(6): 132-148 (1905).