Sean Carroll

New Course: The Many Hidden Worlds of Quantum Mechanics

2023-11-27T18:37:36Z

In past years I’ve done several courses for The Great Courses/Wondrium (formerly The Teaching Company): Dark Matter and Dark Energy, Mysteries of Modern Physics:Time, and The Higgs Boson and Beyond. Now I’m happy to announce a new one, The Many Hidden Worlds of Quantum Mechanics.

Wondrium (streaming)
The Great Courses (DVD)

This is a series of 24 half-hour lectures, given by me with impressive video effects from the Wondrium folks.

The content will be somewhat familiar if you’ve read my book Something Deeply Hidden — the course follows a similar outline, with a few new additions and elaborations along the way. So it’s both a general introduction to quantum mechanics, and also an in-depth exploration of the Many Worlds approach in particular. It’s meant for absolutely everybody — essentially no equations this time! — but 24 lectures is plenty of time to go into depth.

Check out this trailer:

As I type this on Monday 27 November, I believe there is some kind of sale going on! So move quickly to get your quantum mechanics at unbelievably affordable prices.

Thanksgiving

2023-11-23T13:41:52Z

This year we give thanks for a feature of nature that is frequently misunderstood: quanta. (We’ve previously given thanks for the Standard Model Lagrangian, Hubble’s Law, the Spin-Statistics Theorem, conservation of momentum, effective field theory, the error bar, gauge symmetry, Landauer’s Principle, the Fourier Transform, Riemannian Geometry, the speed of light, the Jarzynski equality, the moons of Jupiter, space, black hole entropy, electromagnetism, and Arrow’s Impossibility Theorem.)

Of course quantum mechanics is very important and somewhat misunderstood in its own right; I can recommend a good book if you’d like to learn more. But we’re not getting into the measurement problem or the reality problem just now. I want to highlight one particular feature of quantum mechanics that is sometimes misinterpreted: the fact that some things, like individual excitations of quantized fields (“particles”) or the energy levels of atoms, come in sets of discrete numbers, rather than taking values on a smooth continuum. These discrete chunks of something-or-other are the “quanta” being referred to in the title of a different book, scheduled to come out next spring.

The basic issue is that people hear the phrase “quantum mechanics,” or even take a course in it, and come away with the impression that reality is somehow pixelized — made up of smallest possible units — rather than being ultimately smooth and continuous. That’s not right! Quantum theory, as far as it is currently understood, is all about smoothness. The lumpiness of “quanta” is just apparent, although it’s a very important appearance.

What’s actually happening is a combination of (1) fundamentally smooth functions, (2) differential equations, (3) boundary conditions, and (4) what we care about.

This might sound confusing, so let’s fix ideas by looking at a ubiquitous example: the simple harmonic oscillator. That can be thought of as a particle moving in one dimension, x, with a potential energy that looks like a parabola: . In classical mechanics, there is a lowest-energy state where the particle just sits at the bottom of its potential, unmoving, so both its kinetic and potential energies are zero. We can give it any positive amount of energy we like, either by kicking it to impart motion or just picking it up and dropping it in the potential at some point other than the origin.

Quantum mechanically, that’s not quite true (although it’s truer than you might think). Now we have a set of discrete energy levels, starting from the ground state and going upward in equal increments. Quanta!

But we didn’t put the quanta in. They come out of the above four ingredients. First, the particle is described not by its position and momentum, but by its wave function, . Nothing discrete about that; it’s a fundamentally smooth function. But second, that function isn’t arbitrary; it’s going to obey the Schrödinger equation, which is a special differential equation. The Schrödinger equation tells us how the wave function evolves with time, and we can solve it starting with any initial wave function we like. Still nothing discrete there. But there is one requirement, coming from the idea of boundary conditions: if the wave function grows (or remains constant) as , the potential energy grows along with it. (It actually has to diminish at infinity just to be a wave function at all, but for the moment let’s think about the energy.) When we bring in the fourth ingredient, “what we care about,” the answer is that we care about low-energy states of the oscillator. That’s because in real-world situations, there is dissipation. Whatever physical system is being modeled by the harmonic oscillator, in reality it will most likely have friction or be able to give off photons or something like that. So no matter where we start, left to its own devices the oscillator will diminish in energy. So we generally care about states with relatively low energy.

Since this is quantum mechanics after all, most states of the wave function won’t have a definite energy, in much the same way they will not have a definite position or momentum. (They have “an energy” — the expectation value of the Hamiltonian — but not a “definite” one, since you won’t necessarily observe that value.) But there are some special states, the energy eigenstates, associated with a specific, measurable amount of energy. It is those states that are discrete: they come in a set made of a lowest-energy “ground” state, plus a ladder of evenly-spaced states of ever-higher energy.

We can even see why that’s true, and why the states look the way they do, just by thinking about boundary conditions. Since each state has finite energy, the wave function has to be zero at the far left and also at the far right. The energy in the state comes from two sources: the potential, and the “gradient” energy from the wiggles in the wave function. The lowest-energy state will be a compromise between “staying as close to as possible” and “not changing too rapidly at any point.” That compromise looks like the bottom (red) curve in the figure: starts at zero on the left, gradually increases and then decreases as it continues on to the right. It is a feature of eigenstates that they are all “orthogonal” to each other — there is zero net overlap between them. (Technically, if you multiply them together and integrate over , the answer is zero.) So the next eigenstate will first oscillate down, then up, then back to zero. Subsequent energy eigenstates will each oscillate just a bit more, so they contain the least possible energy while being orthogonal to all the lower-lying states. Those requirements mean that they will each pass through zero exactly one more time than the state just below them.

And that is where the “quantum” nature of quantum mechanics comes from. Not from fundamental discreteness or anything like that; just from the properties of the set of solutions to a perfectly smooth differential equation. It’s precisely the same as why you get a fundamental note from a violin string tied at both ends, as well as a series of discrete harmonics, even though the string itself is perfectly smooth.

One cool aspect of this is that it also explains why quantum fields look like particles. A field is essentially the opposite of a particle: the latter has a specific location, while the former is spread all throughout space. But quantum fields solve equations with boundary conditions, and we care about the solutions. It turns out (see above-advertised book for details!) that if you look carefully at just a single “mode” of a field — a plane-wave vibration with specified wavelength — its wave function behaves much like that of a simple harmonic oscillator. That is, there is a ground state, a first excited state, a second excited state, and so on. Through a bit of investigation, we can verify that these states look and act like a state with zero particles, one particle, two particles, and so on. That’s where particles come from.

We see particles in the world, not because it is fundamentally lumpy, but because it is fundamentally smooth, while obeying equations with certain boundary conditions. It’s always tempting to take what we see to be the underlying truth of nature, but quantum mechanics warns us not to give in.

Is reality fundamentally discrete? Nobody knows. Quantum mechanics is certainly not, even if you have quantum gravity. Nothing we know about gravity implies that “spacetime is discrete at the Planck scale.” (That may be true, but it is not implied by anything we currently know; indeed, it is counter-indicated by things like the holographic principle.) You can think of the Planck length as the scale at which the classical approximation to spacetime is likely to break down, but that’s a statement about our approximation schemes, not the fundamental nature of reality.

States in quantum theory are described by rays in Hilbert space, which is a vector space, and vector spaces are completely smooth. You can construct a candidate vector space by starting with some discrete things like bits, then considering linear combinations, as happens in quantum computing (qubits) or various discretized models of spacetime. The resulting Hilbert space is finite-dimensional, but is still itself very much smooth, not discrete. (Rough guide: “quantizing” a discrete system gets you a finite-dimensional Hilbert space, quantizing a smooth system gets you an infinite-dimensional Hilbert space.) True discreteness requires throwing out ordinary quantum mechanics and replacing it with something fundamentally discrete, hoping that conventional QM emerges in some limit. That’s the approach followed, for example, in models like the Wolfram Physics Project. I recently wrote a paper proposing a judicious compromise, where standard QM is modified in the mildest possible way, replacing evolution in a smooth Hilbert space with evolution on a discrete lattice defined on a torus. It raises some cosmological worries, but might otherwise be phenomenologically acceptable. I don’t yet know if it has any specific experimental consequences, but we’re thinking about that.

Proposed Closure of the Dianoia Institute at Australian Catholic University

2023-09-18T21:17:58Z

Just a few years ago, Australian Catholic University (ACU) established a new Dianoia Institute of Philosophy. They recruited a number of researchers and made something of a splash, leading to a noticeable leap in ACU’s rankings in philosophy — all the way to second among Catholic universities in the English-speaking world, behind only Notre Dame.

Now, without warning, ACU has announced plans to completely disestablish the institute, along with eliminating 35 other academic positions in other fields. This leaves the faculty, some of which left permanent jobs elsewhere to join the new institute, completely stranded.

I sent the letter below to the Vice-Chancellor of ACU and other interested parties. I hope the ongoing international outcry leads the administration to change its mind.

Thanksgiving

2022-11-24T16:45:12Z

This year we give thanks for Arrow’s Impossibility Theorem. (We’ve previously given thanks for the Standard Model Lagrangian, Hubble’s Law, the Spin-Statistics Theorem, conservation of momentum, effective field theory, the error bar, gauge symmetry, Landauer’s Principle, the Fourier Transform, Riemannian Geometry, the speed of light, the Jarzynski equality, the moons of Jupiter, space, black hole entropy, and electromagnetism.)

Arrow’s Theorem is not a result in physics or mathematics, or even in physical science, but rather in social choice theory. To fans of social-choice theory and voting models, it is as central as conservation of momentum is to classical physics; if you’re not such a fan, you may never have even heard of it. But as you will see, there is something physics-y about it. Connections to my interests in the physics of democracy are left as an exercise for the reader.

Here is the setup. You have a set of voters {1, 2, 3, …} and a set of choices {A, B, C, …}. The choices may be candidates for office, but they may equally well be where a group of friends is going to meet for dinner; it doesn’t matter. Each voter has a ranking of the choices, from most favorite to least, so that for example voter 1 might rank D first, A second, C third, and so on. We will ignore the possibility of ties or indifference concerning certain choices, but they’re not hard to include. What we don’t include is any measure of intensity of feeling: we know that a certain voter prefers A to B and B to C, but we don’t know whether (for example) they could live with B but hate C with a burning passion. As Kenneth Arrow observed in his original 1950 paper, it’s hard to objectively compare intensity of feeling between different people.

The question is: how best to aggregate these individual preferences into a single group preference? Maybe there is one bully who just always gets their way. But alternatively, we could try to be democratic about it and have a vote. When there is more than one choice, however, voting becomes tricky.

This has been appreciated for a long time, for example in the Condorcet Paradox (1785). Consider three voters and three choices, coming out as in this table.

Voter 1	Voter 2	Voter 3
A	B	C
B	C	A
C	A	B

Then simply posit that one choice is preferred to another if a majority of voters prefer it. The problem is immediate: more voters prefer A over B, and more voters prefer B over C, but more voters also prefer C over A. This violates the transitivity of preferences, which is a fundamental postulate of rational choice theory. Maybe we have to be more clever.

So, much like Euclid did a while back for geometry, Arrow set out to state some simple postulates we can all agree a good voting system should have, then figure out what kind of voting system would obey them. The postulates he settled on (as amended by later work) are:

Nobody is a dictator. The system is not just “do what Voter 1 wants.”
Independence of irrelevant alternatives. If the method says that A is preferred to B, adding in a new alternative C will not change the relative ranking between A and B.
Pareto efficiency. If every voter prefers A over B, the group prefers A over B.
Unrestricted domain. The method provides group preferences for any possible set of individual preferences.

These seem like pretty reasonable criteria! And the answer is: you can’t do it. Arrow’s Theorem proves that there is no ranked-choice voting method that satisfies all of these criteria. I’m not going to prove the theorem here, but the basic strategy is to find a subset of the voting population whose preferences are always satisfied, and then find a similar subset of that population, and keep going until you find a dictator.

It’s fun to go through different proposed voting systems and see how they fall short of Arrow’s conditions. Consider for example the Borda Count: give 1 point to a choice for every voter ranking it first, 2 points for second, and so on, finally crowning the choice with the least points as the winner. (Such a system is used in some political contexts, and frequently in handing out awards like the Heisman Trophy in college football.) Seems superficially reasonable, but this method violates the independence of irrelevant alternatives. Adding in a new option C that many voters put between A and B will increase the distance in points between A and B, possibly altering the outcome.

Arrow’s Theorem reflects a fundamental feature of democratic decision-making: the idea of aggregating individual preferences into a group preference is not at all straightforward. Consider the following set of preferences:

Voter 1	Voter 2	Voter 3	Voter 4	Voter 5
A	A	A	D	D
B	B	B	B	B
C	D	C	C	C
D	C	D	A	A

Here a simple majority of voters have A as their first choice, and many common systems will spit out A as the winner. But note that the dissenters seem to really be against A, putting it dead last. And their favorite, D, is not that popular among A’s supporters. But B is ranked second by everyone. So perhaps one could make an argument that B should actually be the winner, as a consensus not-so-bad choice?

Perhaps! Methods like the Borda Count are intended to allow for just such a possibility. But it has it’s problems, as we’ve seen. Arrow’s Theorem assures us that all ranked-voting systems are going to have some kind of problems.

By far the most common voting system in the English-speaking world is plurality voting, or “first past the post.” There, only the first-place preferences count (you only get to vote for one choice), and whoever gets the largest number of votes wins. It is universally derided by experts as a terrible system! A small improvement is instant-runoff voting, sometimes just called “ranked choice,” although the latter designation implies something broader. There, we gather complete rankings, count up all the top choices, and declare a winner if someone has a majority. If not, we eliminate whoever got the fewest first-place votes, and run the procedure again. This is … slightly better, as it allows for people to vote their conscience a bit more easily. (You can vote for your beloved third-party candidate, knowing that your vote will be transferred to your second-favorite if they don’t do well.) But it’s still rife with problems.

One way to avoid Arrow’s result is to allow for people to express the intensity of their preferences after all, in what is called cardinal voting (or range voting, or score voting). This allows the voters to indicate that they love A, would grudgingly accept B, but would hate to see C. This slips outside Arrow’s assumptions, and allows us to construct a system that satisfies all of his criteria.

There is some evidence that cardinal voting leads to less “regret” among voters than other systems, for example as indicated in this numerical result from Warren Smith, where it is labeled “range voting” and left-to-right indicates best-to-worst among voting systems.

On the other hand — is it practical? Can you imagine elections with 100 candidates, and asking voters to give each of them a score from 0 to 100?

I honestly don’t know. Here in the US our voting procedures are already laughably primitive, in part because that primitivity serves the purposes of certain groups. I’m not that optimistic that we will reform the system to obtain a notably better result, but it’s still interesting to imagine how well we might potentially do.

The Biggest Ideas in the Universe: Space, Time, and Motion

2022-09-21T13:10:04Z

Just in case there are any blog readers out there who haven’t heard from other channels: I have a new book out! The Biggest Ideas in the Universe: Space, Time, and Motion is Volume One of a planned three-volume series. It grew out of the videos that I did in 2020, trying to offer short and informal introductions to big ideas in physics. Predictably, they grew into long and detailed videos. But they never lost their informal charm, especially since I didn’t do that much in the way of research or preparation.

For the book, by contrast, I actually did research and preparation! So the topics are arranged a bit more logically, the presentation is a bit more thorough and coherent, and the narrative is sprinkled with fun anecdotes about the philosophy and history behind the development of these ideas. In this volume, “these ideas” cover classical physics, from Aristotle through Newton up through Einstein.

The gimmick, of course, is that we don’t shy away from using equations. The goal of this book is to fill the gap between what you generally get as a professional physics student, who the teacher can rely on to spend years of study and hours of doing homework problems, and what you get as an interested amateur, where it is assumed that you are afraid of equations or can’t handle them. I think equations are not so scary, and that essentially everyone can handle them, if they are explained fully along the way. So there are no prerequisites, but we will teach you about calculus and vectors and all that stuff along the way. Not enough to actually solve the equations and become a pro, but enough to truly understand what the equations are saying. If it all works, this will open up a new way of looking at the universe for people who have been denied it for a long time.

The payoff at the end of the book is Einstein’s theory of general relativity and its prediction of black holes. You will understand what Einstein’s equation really says, and why black holes are an inevitable outcome of that equation. Something most people who get an undergraduate university degree in physics typically don’t get to.

Table of contents:

Introduction
1. Conservation
2. Change
3. Dynamics
4. Space
5. Time
6. Spacetime
7. Geometry
8. Gravity
9. Black Holes
Appendices

Available wherever books are available: Amazon * Barnes and Noble * BAM * IndieBound * Bookshop.org * Apple Books.

Johns Hopkins

2022-03-06T23:25:00Z

As far as I remember, the first time I stepped onto a university campus was in junior high school, when I visited Johns Hopkins for an awards ceremony for the Study of Mathematically Precocious Youth. (I grew up in an environment that didn’t involve spending a lot of time on college campuses, generally speaking.) The SMPY is a longitudinal study that looks for kids who do well on standardized math tests, encourages them to take the SATs at a very young age, and follows the progress of those who do really well. I scored as “pretty precocious” but “not precocious enough to be worth following up.” Can’t really argue. My award was a slim volume on analytic geometry, which — well, the thought was nice.

But the campus made an impression. It was elegant and evocative in a way that was new to me and thoroughly compelling. Grand architecture, buildings stuffed with books and laboratories, broad green commons criss-crossed by students and professors talking about ideas. (I presumed that was what they were talking about). Magical. I was already committed to the aspiration that I would go to university, get a Ph.D., and become a theoretical physicist, although I had very little specific concept of what that entailed. Soaking in the campus atmosphere redoubled my conviction that this was the right path for me.

So it is pretty special to me to announce that I am going to become a professor at Hopkins. This summer Jennifer and I will move from Los Angeles to Baltimore, and I will take up a position as Homewood Professor of Natural Philosophy. (She will continue writing about science and culture at Ars Technica, which she can do from any geographic location.)

The title requires some explanation. Homewood Professors are a special category at Hopkins. There aren’t many of them. Some are traditional academics like famous cosmologist Joseph Silk; others are not traditional academics, like former Senator Barbara Mikulski, musician Thomas Dolby, or former UK Poet Laureate Andrew Motion. The official documentation states that a Homewood Professor should be “a person of high scholarly, professional, or artistic distinction whose appointment brings luster to the University.” (You see why I waited to announce until my appointment was completely official, so nobody could write in objecting that I don’t qualify. Too late!)

It’s a real, permanent faculty job — teaching, students, grant proposals, the whole nine yards. Homewood Professors are not tenured, but in some sense it’s better — the position floats freely above any specific department lines, so administrative/committee obligations are minimized. (They told me they could think about a tenure process if I insisted. Part of me wanted to, for purely symbolic reasons. But once all the ins and outs were explained, I decided not to bother.)

In practice, my time will be split between the Department of Physics and Astronomy and the Department of Philosophy. I will have offices in both places, and teach roughly one course/year in each department. The current plan is for me to teach two classes this fall: a first-year seminar on the Physics of Democracy, and an upper-level seminar on Topics in the Philosophy of Physics. (The latter will probably touch on the arrow of time, philosophy of cosmology, and the foundations of quantum mechanics, but all is subject to change.) And of course I’ll be supervising grad students and eventually hiring postdocs in both departments — let me know if you’re interested in applying!

You’ll note that both departments have recently been named after William Miller. That’s because Bill Miller, who was a graduate student in philosophy at Hopkins and became a successful investment banker, has made generous donations both to philosophy and to physics. (He’s also donated to, and served as board chair for, the Santa Fe Institute, where I will continue to be Fractal Faculty — our interests have considerable overlap!) Both departments are already very high-quality; physics and astronomy includes friends and colleagues like Adam Riess, Marc Kamionkowski, and David Kaplan, not to mention benefitting from association with the Space Telescope Science Institute. But these gifts will allow us to grow in substantial ways, which makes for a very exciting time.

One benefit of being a Homewood Professor is that you get to choose what you will be designated a professor “of.” I asked that it be Natural Philosophy, harkening back to the days before science and philosophy split into distinct disciplines. (Resisted the temptation to go with a Latin version.) This is what makes this opportunity so special. I’ve always been interdisciplinary, between physics and philosophy and other things, and also always had an interest in reaching out to wider audiences. But there was inevitably tension with what I was supposed to be doing as a theoretical physicist and cosmologist. My predilections don’t fit comfortably with the academic insistence on putting everyone into a silo and encouraging them to stay there.

Now, for the first time in my life, all that stuff I want to do will be my job, rather than merely tolerated. (Or not tolerated, as the case may be.) The folks at JHU want me to build connections between different departments, and they very much want me to both keep up with the academic work, and with the podcasts and books and all that. Since that’s exactly what I want to do myself, it’s a uniquely good fit.

I’ve had a great time at Caltech, and have nothing bad to say about it. I have enormous fondness for my colleagues and especially for the many brilliant students and postdocs who I’ve been privileged to interact with along the way. But a new adventure awaits, and I can’t wait to dive in. I have a long list of ideas I want to pursue in cosmology, quantum mechanics, complexity, statistical mechanics, emergence, information, democracy, origin of life, and elsewhere. Maybe we’ll start up a seminar series in Complexity and Emergence that brings different people together. Maybe it will grow into a Center of some kind. Maybe I’ll write academic papers on moral philosophy! Who knows? It’s all allowed. Can’t ask for more than that.

Thanksgiving

2021-11-25T17:01:03Z

This year we give thanks for something we’ve all heard of, but maybe don’t appreciate as much as we should: electromagnetism. (We’ve previously given thanks for the Standard Model Lagrangian, Hubble’s Law, the Spin-Statistics Theorem, conservation of momentum, effective field theory, the error bar, gauge symmetry, Landauer’s Principle, the Fourier Transform, Riemannian Geometry, the speed of light, the Jarzynski equality, the moons of Jupiter, space, and black hole entropy.)

Physicists like to say there are four forces of nature: gravitation, electromagnetism, the strong nuclear force, and the weak nuclear force. That’s a somewhat sloppy and old-fashioned way of talking. In the old days it made sense to distinguish between “matter,” in the form of particles or fluids or something like that, and “forces,” which pushed around the matter. These days we know it’s all just quantum fields, and both matter and forces arise from the behavior of quantum fields interacting with each other. There is an important distinction between fermions and bosons, which almost maps onto the old-fashioned matter/force distinction, but not quite. If it did, we’d have to include the Higgs force among the fundamental forces, but nobody is really inclined to do that.

The real reason we stick with the traditional four forces is that (unlike the Higgs) they are all mediated by a particular kind of bosonic quantum field, called gauge fields. There’s a lot of technical stuff that goes into explaining what that means, but the basic idea is that the gauge fields help us compare other fields at different points in space, when those fields are invariant under a certain kind of symmetry. For more details, check out this video from the Biggest Ideas in the Universe series (but you might need to go back to pick up some of the prerequisites).

All of which is just throat-clearing to say: there are four forces, but they’re all different in important ways, and electromagnetism is special. All the forces play some kind of role in accounting for the world around us, but electromagnetism is responsible for almost all of the “interestingness” of the world of our experience. Let’s see why.

When you have a force carried by a gauge field, one of the first questions to ask is what phase the field is in (in whatever physical situation you care about). This is “phase” in the same sense as “phase of matter,” e.g. solid, liquid, gas, etc. In the case of gauge theories, we can think about the different phases in terms of what happens to lines of force — the imaginary paths through space that we would draw to be parallel to the direction of the force exerted at each point.

The simplest thing that lines of force can do is just to extend away from a source, traveling forever through space until they hit some other source. (For electromagnetism, a “source” is just a charged particle.) That corresponds to field being in the Coulomb phase. Infinitely-stretching lines of force dilute in density as the area through which they are passing increases. In three dimensions of space, that corresponds to spheres we draw around the source, whose area goes up as the distance squared. The magnitude of the force therefore goes as the inverse of the square — the famous inverse square law. In the real world, both gravity and electromagnetism are in the Coulomb phase, and exhibit inverse-square laws.

But there are other phases. There is the confined phase, where lines of force get all tangled up with each other. There is also the Higgs phase, where the lines of force are gradually absorbed into some surrounding field (the Higgs field!). In the real world, the strong nuclear force is in the confined phase, and the weak nuclear force is in the Higgs phase. As a result, neither force extends farther than subatomic distances.

So there are four gauge forces that push around particles, but only two of them are “long-range” forces in the Coulomb phase. The short-range strong and weak forces are important for explaining the structure of protons and neutrons and nuclei, but once you understand what stable nuclei there are, there work is essentially done, as far as accounting for the everyday world is concerned. (You still need them to explain fusion inside stars, so here we’re just thinking of life here on Earth.) The way that those nuclei come together with electrons to make atoms and molecules and larger structures is all explained by the long-range forces, electromagnetism and gravity.

But electromagnetism and gravity aren’t quite equal here. Gravity is important, obviously, but it’s also pretty simple: everything attracts everything else. (We’re ignoring cosmology etc, focusing in on life here on Earth.) That’s nice — it’s good that we stay attached to the ground, rather than floating away — but it’s not a recipe for intricate complexity.

To get complexity, you need to be able to manipulate matter in delicate ways with your force. Gravity isn’t up to the task — it just attracts. Electromagentism, on the other hand, is exactly what the doctor ordered. Unlike gravity, where the “charge” is just mass and all masses are positive, electromagnetism has both positive and negative charges. Like charges repel, and opposite charges attract. So by deftly arranging collections of positively and negatively charged particles, you can manipulate matter in whatever way you like.

That pinpoint control over pushing and pulling is crucial for the existence of complex structures in the universe, including you and me. Nuclei join with electrons to make atoms because of electromagnetism. Atoms come together to make molecules because of electromagnetism. Molecules interact with each other in different ways because of electromagnetism. All of the chemical processes in your body, not to mention in the world immediately around you, can ultimately be traced to electromagnetism at work.

Electromagnetism doesn’t get all the credit for the structure of matter. A crucial role is played by the Pauli exclusion principle, which prohibits two electrons from inhabiting exactly the same state. That’s ultimately what gives matter its size — why objects are solid, etc. But without the electromagnetic interplay between atoms of different sizes and numbers of electrons, matter would be solid but inert, just sitting still without doing anything interesting. It’s electromagnetism that allows energy to move from place to place between atoms, both via electricity (electrons in motion, pushed by electromagnetic fields) and radiation (vibrations in the electromagnetic fields themselves).

So we should count ourselves lucky that we live in a world where at least one fundamental force is both in the Coulomb phase and has opposite charges, and give appropriate thanks. It’s what makes the world interesting.

The Zombie Argument for Physicalism (Contra Panpsychism)

2021-11-18T01:43:57Z

The nature of consciousness remains a contentious subject out there. I’m a physicalist myself — as I explain in The Big Picture and elsewhere, I think consciousness is best understood as weakly-emergent from the ordinary physical behavior of matter, without requiring any special ontological status at a fundamental level. In poetic-naturalist terms, consciousness is part of a successful way of talking about what happens at the level of humans and other organisms. “Being conscious” and “having conscious experiences” are categories that help us understand how human beings live and behave, while corresponding to goings-on at more fundamental levels in which the notion of consciousness plays no role at all. Nothing very remarkable about that — the same could be said for the categories of “being alive” or “being a table.” There is a great deal of work yet to be done to understand how consciousness actually works and relates to what happens inside the brain, but it’s the same kind of work that is required in other questions at the science/philosophy boundary, without any great metaphysical leaps required.

Not everyone agrees! I recently went on a podcast hosted by philosophers Philip Goff (former Mindscape guest) and Keith Frankish to hash it out. Philip is a panpsychist, who believes that consciousness is everywhere, underlying everything we see around us. Keith is much closer to me, but prefers to describe himself as an illusionist about consciousness.

Obviously we had a lot to disagree about, but it was a fun and productive conversation. (I’m nobody’s panpsychist, but I’m extremely impressed by Philip’s willingness and eagerness to engage with people with whom he seriously disagrees.) It’s a long video; the consciousness stuff starts around 17:30, and goes to about 2:04:20.

But despite the length, there was a point that Philip raised that I don’t think was directly addressed, at least not carefully. And it goes back to something I’m quite fond of: the Zombie Argument for Physicalism. Indeed, this was the original title of a paper that I wrote for a symposium responding to Philip’s book Galileo’s Error. But in the editing process I realized that the argument wasn’t original to me; it had appeared, in somewhat different forms, in a few previous papers:

Balog, K. (1999). “Conceivability, Possibility, and the Mind-Body Problem,” The Philosophical Review, 108: 497-528.
Frankish, K. (2007). “The Anti-Zombie Argument,” The Philosophical Quarterly, 57: 650-666.
Brown, R. (2010). “Deprioritizing the A Priori Arguments against Physicalism,” Journal of Consciousness Studies, 17 (3-4): 47-69.
Balog, K. (2012). “In Defense of the Phenomenal Concept Strategy,” Philosophy and Phenomenological Research, 84: 1-23.
Campbell, D., J. Copeland and Z-R Deng 2017. “The Inconceivable Popularity of Conceivability Arguments,” The Philosophical Quarterly, 67: 223—240.

So the published version of my paper shifted the focus from zombies to the laws of physics.

Carroll, S.M. (2021). “Consciousness and the Laws of Physics,” Journal of Consciousness Studies, 28 (9-10): 16-31.

The idea was not to explain how consciousness actually works — I don’t really have any good ideas about that. It was to emphasize a dilemma that faces anyone who is not a physicalist, someone who doesn’t accept the view of consciousness as a weakly-emergent way of talking about higher-level phenomena.

The dilemma flows from the following fact: the laws of physics underlying everyday life are completely known. They even have a name, the “Core Theory.” We don’t have a theory of everything, but what we do have is a theory that works really well in a certain restricted domain, and that domain is large enough to include everything that happens in our everyday lives, including inside ourselves. I won’t rehearse all the reasons we have for believing this is probably true, but they’re in The Big Picture, and I recently wrote a more technical paper that goes into some of the details:

Carroll, S.M. (2021). “The Quantum Field Theory on Which the Everyday World Supervenes.” Submitted to Levels of Reality: A Scientific and Metaphysical Investigation (Jerusalem Studies in Philosophy and History of Science), eds. O. Shenker, M. Hemmo, S. Iannidis, and G. Vishne.

Given that success, the dilemma facing the non-physicalist about consciousness is the following: either your theory of consciousness keeps the dynamics of the Core Theory intact within its domain of applicability, or it doesn’t. There aren’t any other options! I emphasize this because many non-physicalists are weirdly cagey about whether they’re going to violate the Core Theory. In our discussion, Philip suggested that one could rely on “strong emergence” to create new kinds of behavior without really violating the CT. You can’t. The fact that the CT is a local effective field theory completely rules out the possibility, for reasons I talk about in the above two papers.

That’s not to say we are certain the Core Theory is correct, even in its supposed domain of applicability. As good scientists, we should always be open to the possibility that our best current theories will be proven inadequate by future developments. It’s absolutely fine to base your theory of consciousness on the idea that the CT will be violated by consciousness itself — that’s one horn of the above dilemma. The point of “Consciousness and the Laws of Physics” was simply to emphasize the extremely high standard to which any purported modification should be held. The Core Theory is extraordinarily successful, and to violate it within its domain of applicability means not only that we are tweaking a successful model, but that we are somehow contradicting some extremely foundational principles of effective field theory. And maybe consciousness does that, but I want to know precisely how. Show me the equations, explain what happens to energy conservation and gauge invariance, etc.

Increasingly, theorists of consciousness appreciate this fact. They therefore choose the other horn of the dilemma: leave the Core Theory intact as a theory of the dynamics of what happens in the world, but propose that a straightforward physicalist understanding fails to account for the fundamental nature of the world. The equations might be right, in other words, but to account for consciousness we should posit that Mind (or something along those lines) underlies all of the stuff obeying those equations. It’s not hard to see how this strategy might lead one to a form of panpsychism.

That’s fine! You are welcome to contemplate that. But then we physicalists are welcome to tell you why it doesn’t work. That’s precisely what the Zombie Argument for Physicalism does. It’s not precisely an argument for physicalism tout court, but for the superiority of physicalism over a non-physicalist view that purports to explain consciousness while leaving the behavior of matter unaltered.

Usually, of course, the zombie argument is deployed against physicalism, not for it. I know that. We find ourselves in the presence of irony.

The intuition behind the usual zombie argument stems from a conviction from introspection — from our first-person experience of the world, inaccessible in principle to outsiders — that there is something going on other than the mere physical behavior of physical stuff. And if that’s true, we can imagine the same behavior of physical stuff with or without consciousness. A (philosophical) zombie is a creature that behaves exactly as an ordinary person would in every way, but lacks the inner experience of consciousness — the qualia that characterize “what it is like” to be something. The argument is then that, if we can conceive of precisely the same physical behavior with and without consciousness, consciousness must be something other than a way of talking about physical behavior. It’s a bit reminiscent of Descartes’s argument for mind-body dualism: I can imagine my body not existing, but I can’t imagine my mind not existing, so the mind and body must be different things. But the conclusion here is not supposed to be that the mind must be a distinct substance from the body, merely the somewhat weaker conclusion that our conscious experiences cannot be reduced to the behavior of physical matter.

Let me stress the radicalness of the zombie concept, because I think people sometimes underestimate it, even some proponents of the usual zombie argument. When first presented with the idea of a philosophical zombie, it is natural to conjure up something like a Vulcan from Star Trek: humanoid in appearance, rational, and indisputably alive, but lacking some kind of affect or emotion. That is not right. The zombie, to reiterate, behaves exactly as a conscious creature would behave. If you interacted with a zombie, it would exhibit all the features of love and joy and sadness and anxiety that an ordinary person would. Zombies would cry of heartbreak, compose happy songs, giggle while rolling around on the ground with puppies, and write densely-argued books against the idea that consciousness could be entirely physical. If you asked a zombie about its inner conscious experiences, it would earnestly assure you that it had them, and would describe “what it was like” to experience this or that, on the basis of its introspection. The difference is that, unlike conscious creatures who are purportedly accurate when they make those claims, the zombie is wrong. You would never be able to convince the zombie they were wrong, but too bad for them.

Nobody is claiming the zombies actually exist or even are possible in our world, only that they are conceivable. And that if we can conceive of them, our notion of “consciousness” must be distinct from our notion of the behavior of matter.

But if there is an intuition that our conscious experience is something more than the motion of physical stuff, there is also a countervailing intuition: surely my consciousness affects my behavior! To a person on the street, rather than a highly-trained philosopher, it’s pretty obvious that your conscious experiences have some effect on your behavior. Such intuitions aren’t really reliable — a lot of people are intuitive dualists about the mind. But they provide pointers for us to dig into an issue and understand it better.

Taking a cue from our intuition that consciousness surely affects our behavior, and a suspicion that zombie advocates aren’t really thinking through the implications of the thought experiment, leads us to flip the usual argument on its head. The zombie scenario is actually a really good argument for physicalism (at least by contrast to the kind of passive panpsychism that doesn’t affect physical behavior in any way).

To make things clear, consider a very explicit version of the zombie scenario. We imagine two possible worlds (or at least conceivable, or at least maybe-conceivable). We have P-world (for “physical”), which consists solely of physical stuff, and that stuff obeys the Core Theory in its claimed domain of applicability. Then we have Ψ-world (for “psychist”), which behaves in precisely the same way, but which is fundamentally based on consciousness. The physical properties and behavior of Ψ-world should be thought of as aspects (emanations? not sure what the preferred vocabulary is here) of an underlying mentality.

(Note our use of “behavior” here means all of the behavior of all physical stuff, down to individual electrons and photons; not just the macroscopic behavior of human beings. There’s no connection to “behaviorism” in psychology.)

The starting point of the zombie argument for physicalism is that, when we sit down to compare P-world and Ψ-world, we realize that the purported “consciousness” that is central to Ψ-world is playing no explanatory role whatsoever. It might be there, ineffably in the background, but it has no impact at all on what human beings do or say. As Keith put it in our conversation, it offers no “differential” explanatory power to discriminate between the two scenarios.

And — here is an important point — whatever that background, causally-inert stuff is, it’s not what I have in mind when I’m trying to explain “consciousness.” The consciousness I have in mind absolutely does play an explanatory role in accounting for human behavior. The fact that someone is conscious of some inner experience (falling in love, or having the feeling they are being watched) manifestly affects their behavior. So the consciousness of Ψ-world isn’t the consciousness I care about, and I might as well be a physicalist.

Aha, says the panpsychist, but you’re leaving out something important. The behavior of which you speak can be seen by the outside world. But I also, personally, have access to my inner experience: the first-person perspective that cannot be witnessed by outsiders. Science is used to explaining objective third-person-observable behavior, but not this. I therefore have a reason — based on data, even if it’s not publicly-available — to prefer Ψ-world over P-world.

That move doesn’t work, as we can see if we think a bit more carefully about what’s going on in Ψ-world. How should I interpret someone’s claim that they have inner conscious experiences of the kind a zombie wouldn’t have? The claim itself — the utterance “I have conscious experience” — is a behavior. They said it, or wrote it, or whatever. The matter in their bodies acts in certain ways so as to form those words. And that matter, within either P-world or Ψ-world, exactly obeys the equations of motion of the Core Theory. That theory, in turn, is causally closed: you tell me the initial conditions, there is an equation that unambiguously describes how the universe evolves forward in time.

So the utterance claiming that a person has inner conscious experiences has precisely the same causal precursors in either P-world or Ψ-world: a certain configuration of particles and forces in the person’s brain and body. But we’ve agreed that non-physical consciousness plays no role in explaining those things within the context of P-world. Therefore, consciousness cannot play any role in explaining those utterances in Ψ-world, either.

Thus: you are welcome to claim that you have access to inner first-person experiences of some non-physical conscious experiences, but that claim bears no relationship whatsoever to whether or not you actually do have such experiences. So there is no “data” at all, in the ordinary sense.

Said another way: the claim is that we have a certain kind of knowledge based on introspection. But a zombie would make exactly the same claim, and you are arguing that the zombie is wrong. The lesson is that this kind of introspection is completely unreliable. And therefore there is no reason to favor Ψ-world over P-world. (The point is not that introspection itself is completely unreliable, just that if you think zombies are conceivable, you have to admit that introspection gives us no evidence for the non-physical nature of consciousness.)

Of course philosophers are very clever people, and they can invent different categories of “introspection” and “experience” and “evidence” in an attempt to make it all work out. But the essential point is clear and robust: by sequestering off “consciousness” from playing any causal role in the world, you’ve turned it into something very different from what we were originally trying to explain. Time to turn to some other strategy.

There is one dangling thread here, which is what Philip brought up in the conversation and I don’t think we did justice to. Sure, you might say, there is no differential explanatory role being played by consciousness in the comparison between P-world and Ψ-world. They both behave in the same way, even though one has consciousness and the other doesn’t. But that doesn’t mean there is no explanatory role being played within Ψ-world itself. In other words, maybe consciousness doesn’t distinguish between what happens in the two worlds, but surely it is crucial to Ψ-world considered by its own lights. That world is literally made of consciousness!

Nice try, but this move also fails. Consider an analogy: two identical coffee cups sitting on two tables. The tables themselves are identical in form, except that one table is made of wood and the other of iron. You can’t distinguish between the two worlds just by the fact that the coffee cup is being held up by the two tables (analogous to the behavior of matter in P-world and Ψ-world); in either case, the table holds up the up, despite them being made of different materials. But surely the iron is playing a role in the world where that’s what the table is made of!

Well, yes, the iron is “playing a role.” But it’s not a role that is relevant to understanding what keeps the cup from falling. If you had a “hard problem of coffee cups,” which involved understanding why cups sit peacefully on a table rather than falling to the ground, nobody would think that a table made of iron provided a better solution than a table made of wood. The explanation is material-independent. It’s the table-ness that matters, not the substance of which the table is made.

The actual analogy that Philip used in a post-discussion Twitter thread was to software, and the substrate-independence of computer algorithms.

Likewise, the thesis that human behavioural functions could be realised in non-conscious zombie stuff doesn't entail that human consciousness doesn't do anything. 2/2
— Philip Goff (@Philip_Goff) November 13, 2021

The same response applies here. Sure, you could run the same software on different hardware. But the entire point of substrate independence is that you cannot then say that the nature of the substrate influenced the outcome of the calculation in any way! Analogously, the panpsychist who wants to differentiate between the software of reality running on physical vs. mental hardware cannot claim that consciousness gets any credit at all for our behavior in the world.

I get why non-physicalists about consciousness are reluctant to propose explicit ways in which the dynamics of the Core Theory might be violated. Physics is really strong, very well-understood, and backed by enormous piles of experimental data. It’s hard to mess with that. But the alternative of retreating to a view where consciousness “explains” things in the world, while exhibiting precisely the same behaviors that the world would have if there were no consciousness, pretty clearly fails. It’s better to be a physicalist who works to understand consciousness as a higher-level description of ordinary physical stuff doing its ordinary physical things. If you’re not willing to go there, face up to the challenge and explain exactly how our physical understanding needs to be modified. You’ll probably be wrong, but if you turn out to be right, it will all be worth it. That’s how science goes.

Energy Conservation and Non-Conservation in Quantum Mechanics

2021-01-28T17:40:11Z

Conservation of energy is a somewhat sacred principle in physics, though it can be tricky in certain circumstances, such as an expanding universe. Quantum mechanics is another context in which energy conservation is a subtle thing — so much so that it’s still worth writing papers about, which Jackie Lodman and I recently did. In this blog post I’d like to explain two things:

In the Many-Worlds formulation of quantum mechanics, the energy of the wave function of the universe is perfectly conserved. It doesn’t “require energy to make new universes,” so that is not a respectable objection to Many-Worlds.
In any formulation of quantum mechanics, energy doesn’t appear to be conserved as seen by actual observers performing quantum measurements. This is a not-very-hard-to-see aspect of quantum mechanics, which nevertheless hasn’t received a great deal of attention in the literature. It is a phenomenon that should be experimentally observable, although as far as I know it hasn’t yet been; we propose a simple experiment to do so.

The first point here is well-accepted and completely obvious to anyone who understands Many-Worlds. The second is much less well-known, and it’s what Jackie and I wrote about. I’m going to try to make this post accessible to folks who don’t know QM, but sometimes it’s hard to make sense without letting the math be the math.

First let’s think about energy in classical mechanics. You have a system characterized by some quantities like position, momentum, angular momentum, and so on, for each moving part within the system. Given some facts of the external environment (like the presence of gravitational or electric fields), the energy is simply a function of these quantities. You have for example kinetic energy, which depends on the momentum (or equivalently on the velocity), potential energy, which depends on the location of the object, and so on. The total energy is just the sum of all these contributions. If we don’t explicitly put any energy into the system or take any out, the energy should be conserved — i.e. the total energy remains constant over time.

There are two main things you need to know about quantum mechanics. First, the state of a quantum system is no longer specified by things like “position” or “momentum” or “spin.” Those classical notions are now thought of as possible measurement outcomes, not well-defined characteristics of the system. The quantum state — or wave function — is a superposition of various possible measurement outcomes, where “superposition” is a fancy term for “linear combination.”

Consider a spinning particle. By doing experiments to measure its spin along a certain axis, we discover that we only ever get two possible outcomes, which we might call “spin-up” or “” and “spin-down” or “.” But before we’ve made the measurement, the system can be in some superposition of both possibilities. We would write , the wave function of the spin, as

where and are numerical coefficients, the “amplitudes” corresponding to spin-up and spin-down, respectively. (They will generally be complex numbers, but we don’t have to worry about that.)

The second thing you have to know about quantum mechanics is that measuring the system changes its wave function. When we have a spin in a superposition of this type, we can’t predict with certainty what outcome we will see. All we can predict is the probability, which is given by the amplitude squared. And once that measurement is made, the wave function “collapses” into a state that is purely what is observed. So we have

At least, that’s what we teach our students — Many-Worlds has a slightly more careful story to tell, as we’ll see.

We can now ask about energy, but the concept of energy in quantum mechanics is a bit different from what we are used to in classical mechanics. Classically, a single particle has a constant energy, given by the sum of its potential energy (which depends on its position) and its kinetic energy (which depends on its momentum). But in quantum mechanics, the state of the particle isn’t specified by position and velocity; those are just possible measurement outcomes. The state of the system is given by the wave function.

There are, however, special states called eigenstates, in which some particular observable has a definite value. So we have “position eigenstates,” for which the position is exactly defined, “momentum eigenstates,” for which momentum is exactly defined, and so on. There are no states for which both position and momentum are exactly defined — that would violate the Heisenberg uncertainty principle. And indeed, in most states neither one of them is exactly defined. But we can think of any state as a superposition of position eigenstates, or as a superposition of momentum eigenstates (but not both).

The same goes for energy, which is an observable quantity just like position or momentum. There are energy eigenstates, where the energy has a definite value, but neither position nor momentum do. And if you happen to be in an energy eigenstate, “energy conservation” is trivially true — the energy stays the same. But that’s a much less interesting statement than in classical mechanics, because energy eigenstates don’t evolve at all! A system with a definite energy just sits there, stationary and unevolving.

Fortunately, most states don’t have a definite energy, but rather are superpositions of different energy eigenstates. That’s good, because the system as a whole can then evolve. All the interesting evolution of quantum systems can actually be thought of as different energy eigenstates combining to give time-dependent answers to questions we could ask about other quantities like position or momentum.

But what can we say about energy conservation if a quantum state doesn’t even have a definite energy? Well, we can still associate an average energy to any particular quantum state, even if specific measurements might give answers that fluctuate around that central value. (For experts: the expectation value of the Hamiltonian.) If we think of an arbitrary quantum state as a weighted superposition of various specific-energy eigenstates, the average energy is just what it sounds like: the weighted average of the energies of all those eigenstates.

Let’s imagine that we’re in the state described above, a superposition of spin-up and spin-down. And let’s further imagine that the spin-up state is a state with definite energy (i.e. it’s an energy eigenstate) , and the spin-down state has a definite energy . Then the average energy is just a combination of both these values, weighted by the squares of the amplitudes:

As long as the quantum system obeys the Schrödinger equation, you will be happy to hear that the average energy is precisely conserved. It doesn’t change over time. That’s the notion of “energy conservation” you have in quantum mechanics: the average or expected value stays constant, as long as you obey the Schrödinger equation.

Alas, there is a famous case in which quantum systems do not obey the Schrödinger equation, or at least they appear not to: when they are being measured. As we said above, what we teach our students is that wave functions collapse when they are observed; this collapse process is unpredictable, and doesn’t obey the Schrödinger equation. As a result, the average energy is not conserved in the process of quantum measurement. Indeed, as we can quickly see by comparing with the equations we started with, after we do the measurement the system will either have energy (if we measured spin-up) or it will have energy (if we measured spin-down). And in general, if those two values are unequal (and both and are non-zero), neither one of those will be the same as our original average .

This is all pretty straightforward, almost trivial! And indeed, I wouldn’t object if you thought that. But people like energy conservation, deep in their bones. So what I suspect is that, if you asked most working quantum physicists what was going on here, they would guess that the total energy of the universe actually is conserved, but you just weren’t keeping track of it accurately. After all, there needs to be some apparatus and observer who interact with the system in order to measure it. Perhaps whenever the energy changes in the system we observe, there is a compensating change in energy in the apparatus or the rest of the world, so that the total is conserved.

Not so, or at least not in quantum mechanics as we generally understand it. That’s what we show in the paper Jackie and I recently submitted.

Energy Non-Conservation in Quantum Mechanics
Sean M. Carroll and Jackie Lodman
We study the conservation of energy, or lack thereof, when measurements are performed in quantum mechanics. The expectation value of the Hamiltonian of a system can clearly change when wave functions collapse in accordance with the standard textbook (Copenhagen) treatment of quantum measurement, but one might imagine that the change in energy is compensated by the measuring apparatus or environment. We show that this is not true; the change in the energy of a state after measurement can be arbitrarily large, independent of the physical measurement process. In Everettian quantum theory, while the expectation value of the Hamiltonian is conserved for the wave function of the universe (including all the branches), it is not constant within individual worlds. It should therefore be possible to experimentally measure violations of conservation of energy, and we suggest an experimental protocol for doing so.

Basically what we do is to construct a complete toy model of both a system and a measuring apparatus, one sufficiently simple that we can keep track of the energy exactly. And we verify that the change in energy of the system has no necessary connection at all to the change in energy of the rest of the world. (As we explain in the paper, other people have pointed to this phenomenon before, but usually in the context of trying to avoid it; we are more celebratory, and suggest that people should be looking for this experimentally.)

So, if you’re a textbook/Copenhagen kind of person, the punchline is simple: energy is not conserved in quantum measurements. Really the only way out is to refuse to accept the “average energy” of a state as representing the true energy at all. That’s fine, as far as it goes. But in that case almost no states (i.e., no states other than energy eigenstates) will have a well-defined energy. And as we say in the paper, the average energy is a rigorous energy-like quantity that would be perfectly conserved if it weren’t for measurements. So the fact that measurements violate that conservation law is pretty interesting.

Now we can come to the Everettian perspective, which puts a very attractive spin on things. I won’t go too deeply into the Everettian formulation itself; see this delightful book, or this somewhat shorter blog post. The point is that in Everett, wave functions never collapse; all they ever do is obey the Schrödinger equation. What you and I think of as a “measurement” is just when a quantum system in a superposition becomes entangled with some macroscopic object (the “measuring apparatus”), which in turn becomes entangled with its environment (“decoherence”). When that happens, the different parts of the superposition become parts of separate worlds. So rather than our above superposition of spin-up and spin-down suddenly collapsing into one or the other, the state evolves smoothly into one describing two non-interacting copies of reality, one where the spin is up and the other where the spin is down.

The nice thing about this is: energy is completely conserved! Individual observers think that they witness the average energy changing, because they only live in one branch at a time. But in the “wave function of the universe” (the quantum state describing all branches at once), the average energy is a constant, since that wave function obeys the Schrödinger equation. The energy simply gets divided up between branches slightly differently as time goes on.

This story is very different than what you might often hear, namely that it’s Everett, not Copenhagen, that has a problem with energy conservation. After all, where does the energy come from to make all those worlds?

Hopefully this worry has been completely dissipated by the discussion above. The point is that there are two different senses of the word “energy”: the energy that observers within any branch (world) might attribute to the reality they see, and the total energy of all the branches combined. If a wave function describes a collection of many branches labeled , with amplitudes and average energies , the average energy of the whole shebang is

So even though there are more and more branches as time evolves, the contribution of each branch to the total energy is weighted by the factors , and those numbers go down over time as branches split. The effects precisely cancel, so that the total energy of the universe (all branches included) is constant. It’s just that individual branches get “thinner” over time (their amplitudes get smaller), so they make smaller and smaller contributions to the total.

This “thinning” process is completely invisible from inside. You have no way of knowing what the amplitude of your particular branch is; it’s invisible to you. The fact that the amplitudes go down doesn’t mean that the world around you looks somehow less tangible or energetic. The energy you would calculate by adding up the individual energies of all the stuff in the universe (stars, planets, black holes, dark matter, etc) goes into the energy of your particular branch; there’s no reason for that number to systematically diminish over time. (Given the tiny changes in average energy that can happen at measurement events, the energy of your world as seen from inside will undergo a gradual random walk of gradually-diminishing steps, but honestly the changes are so incredibly tiny that you’d never notice.)

Is this change of the average energy of the universe (as seen by observers on individual branches) potentially observable in experiments? In principle, absolutely yes; in practice, maybe, but it would be hard. Not particle-accelerator-the-size-of-the-galaxy hard, but a challenge. This is the other thing that Jackie and I suggest in our paper. The trick is that (1) it’s extremely hard in practice to construct superpositions of very different energy states, so any hoped-for changes in average energy will be very tiny; and (2) any measurement generally spills a lot of energy all over the place, which is hard to keep track of. I won’t go into details, but we suggest a general protocol, and also a specific implementation where one spinning particle is kept stationary in a trap, while another travels by it and they become entangled. Then by measuring the spin of the moving particle, we can change the spin of the stationary one, hopefully changing its energy in the process.

I’m honestly not sure how feasible this kind of experiment is; that’s above my pay grade. But it’s a nice example of how thinking carefully about the foundations of quantum mechanics can lead to interesting ideas.

Thanksgiving

2020-11-26T17:48:54Z

This year we give thanks for one of the very few clues we have to the quantum nature of spacetime: black hole entropy. (We’ve previously given thanks for the Standard Model Lagrangian, Hubble’s Law, the Spin-Statistics Theorem, conservation of momentum, effective field theory, the error bar, gauge symmetry, Landauer’s Principle, the Fourier Transform, Riemannian Geometry, the speed of light, the Jarzynski equality, the moons of Jupiter, and space.)

Black holes are regions of spacetime where, according to the rules of Einstein’s theory of general relativity, the curvature of spacetime is so dramatic that light itself cannot escape. Physical objects (those that move at or more slowly than the speed of light) can pass through the “event horizon” that defines the boundary of the black hole, but they never escape back to the outside world. Black holes are therefore black — even light cannot escape — thus the name. At least that would be the story according to classical physics, of which general relativity is a part. Adding quantum ideas to the game changes things in important ways. But we have to be a bit vague — “adding quantum ideas to the game” rather than “considering the true quantum description of the system” — because physicists don’t yet have a fully satisfactory theory that includes both quantum mechanics and gravity.

The story goes that in the early 1970’s, James Bardeen, Brandon Carter, and Stephen Hawking pointed out an analogy between the behavior of black holes and the laws of good old thermodynamics. For example, the Second Law of Thermodynamics (“Entropy never decreases in closed systems”) was analogous to Hawking’s “area theorem”: in a collection of black holes, the total area of their event horizons never decreases over time. Jacob Bekenstein, who at the time was a graduate student working under John Wheeler at Princeton, proposed to take this analogy more seriously than the original authors had in mind. He suggested that the area of a black hole’s event horizon really is its entropy, or at least proportional to it.

This annoyed Hawking, who set out to prove Bekenstein wrong. After all, if black holes have entropy then they should also have a temperature, and objects with nonzero temperatures give off blackbody radiation, but we all know that black holes are black. But he ended up actually proving Bekenstein right; black holes do have entropy, and temperature, and they even give off radiation. We now refer to the entropy of a black hole as the “Bekenstein-Hawking entropy.” (It is just a useful coincidence that the two gentlemen’s initials, “BH,” can also stand for “black hole.”)

Consider a black hole whose area of its event horizon is . Then its Bekenstein-Hawking entropy is

where is the speed of light, is Newton’s constant of gravitation, and is Planck’s constant of quantum mechanics. A simple formula, but already intriguing, as it seems to combine relativity (), gravity (), and quantum mechanics () into a single expression. That’s a clue that whatever is going on here, it something to do with quantum gravity. And indeed, understanding black hole entropy and its implications has been a major focus among theoretical physicists for over four decades now, including the holographic principle, black-hole complementarity, the AdS/CFT correspondence, and the many investigations of the information-loss puzzle.

But there exists a prior puzzle: what is the black hole entropy, anyway? What physical quantity does it describe?

Entropy itself was invented as part of the development of thermodynamics is the mid-19th century, as a way to quantify the transformation of energy from a potentially useful form (like fuel, or a coiled spring) into useless heat, dissipated into the environment. It was what we might call a “phenomenological” notion, defined in terms of macroscopically observable quantities like heat and temperature, without any more fundamental basis in a microscopic theory. But more fundamental definitions came soon thereafter, once people like Maxwell and Boltzmann and Gibbs started to develop statistical mechanics, and showed that the laws of thermodynamics could be derived from more basic ideas of atoms and molecules.

Hawking’s derivation of black hole entropy was in the phenomenological vein. He showed that black holes give off radiation at a certain temperature, and then used the standard thermodynamic relations between entropy, energy, and temperature to derive his entropy formula. But this leaves us without any definite idea of what the entropy actually represents.

One of the reasons why entropy is thought of as a confusing concept is because there is more than one notion that goes under the same name. To dramatically over-simplify the situation, let’s consider three different ways of relating entropy to microscopic physics, named after three famous physicists:

Boltzmann entropy says that we take a system with many small parts, and divide all the possible states of that system into “macrostates,” so that two “microstates” are in the same macrostate if they are macroscopically indistinguishable to us. Then the entropy is just (the logarithm of) the number of microstates in whatever macrostate the system is in.
Gibbs entropy is a measure of our lack of knowledge. We imagine that we describe the system in terms of a probability distribution of what microscopic states it might be in. High entropy is when that distribution is very spread-out, and low entropy is when it is highly peaked around some particular state.
von Neumann entropy is a purely quantum-mechanical notion. Given some quantum system, the von Neumann entropy measures how much entanglement there is between that system and the rest of the world.

These seem like very different things, but there are formulas that relate them to each other in the appropriate circumstances. The common feature is that we imagine a system has a lot of microscopic “degrees of freedom” (jargon for “things that can happen”), which can be in one of a large number of states, but we are describing it in some kind of macroscopic coarse-grained way, rather than knowing what its exact state actually is. The Boltzmann and Gibbs entropies worry people because they seem to be subjective, requiring either some seemingly arbitrary carving of state space into macrostates, or an explicit reference to our personal state of knowledge. The von Neumann entropy is at least an objective fact about the system. You can relate it to the others by analogizing the wave function of a system to a classical microstate. Because of entanglement, a quantum subsystem generally cannot be described by a single wave function; the von Neumann entropy measures (roughly) how many different quantum must be involved to account for its entanglement with the outside world.

So which, if any, of these is the black hole entropy? To be honest, we’re not sure. Most of us think the black hole entropy is a kind of von Neumann entropy, but the details aren’t settled.

One clue we have is that the black hole entropy is proportional to the area of the event horizon. For a while this was thought of as a big, surprising thing, since for something like a box of gas, the entropy is proportional to its total volume, not the area of its boundary. But people gradually caught on that there was never any reason to think of black holes like boxes of gas. In quantum field theory, regions of space have a nonzero von Neumann entropy even in empty space, because modes of quantum fields inside the region are entangled with those outside. The good news is that this entropy is (often, approximately) proportional to the area of the region, for the simple reason that field modes near one side of the boundary are highly entangled with modes just on the other side, and not very entangled with modes far away. So maybe the black hole entropy is just like the entanglement entropy of a region of empty space?

Would that it were so easy. Two things stand in the way. First, Bekenstein noticed another important feature of black holes: not only do they have entropy, but they have the most entropy that you can fit into a region of a fixed size (the Bekenstein bound). That’s very different from the entanglement entropy of a region of empty space in quantum field theory, where it is easy to imagine increasing the entropy by creating extra entanglement between degrees of freedom deep in the interior and those far away. So we’re back to being puzzled about why the black hole entropy is proportional to the area of the event horizon, if it’s the most entropy a region can have. That’s the kind of reasoning that leads to the holographic principle, which imagines that we can think of all the degrees of freedom inside the black hole as “really” living on the boundary, rather than being uniformly distributed inside. (There is a classical manifestation of this philosophy in the membrane paradigm for black hole astrophysics.)

The second obstacle to simply interpreting black hole entropy as entanglement entropy of quantum fields is the simple fact that it’s a finite number. While the quantum-field-theory entanglement entropy is proportional to the area of the boundary of a region, the constant of proportionality is infinity, because there are an infinite number of quantum field modes. So why isn’t the entropy of a black hole equal to infinity? Maybe we should think of the black hole entropy as measuring the amount of entanglement over and above that of the vacuum (called the Casini entropy). Maybe, but then if we remember Bekenstein’s argument that black holes have the most entropy we can attribute to a region, all that infinite amount of entropy that we are ignoring is literally inaccessible to us. It might as well not be there at all. It’s that kind of reasoning that leads some of us to bite the bullet and suggest that the number of quantum degrees of freedom in spacetime is actually a finite number, rather than the infinite number that would naively be implied by conventional non-gravitational quantum field theory.

So — mysteries remain! But it’s not as if we haven’t learned anything. The very fact that black holes have entropy of some kind implies that we can think of them as collections of microscopic degrees of freedom of some sort. (In string theory, in certain special circumstances, you can even identify what those degrees of freedom are.) That’s an enormous change from the way we would think about them in classical (non-quantum) general relativity. Black holes are supposed to be completely featureless (they “have no hair,” another idea of Bekenstein’s), with nothing going on inside them once they’ve formed and settled down. Quantum mechanics is telling us otherwise. We haven’t fully absorbed the implications, but this is surely a clue about the ultimate quantum nature of spacetime itself. Such clues are hard to come by, so for that we should be thankful.

Sean Carroll

New Course: The Many Hidden Worlds of Quantum Mechanics

Related Posts:

Thanksgiving

Related Posts:

Proposed Closure of the Dianoia Institute at Australian Catholic University

Related Posts:

Thanksgiving

Related Posts:

The Biggest Ideas in the Universe: Space, Time, and Motion

Related Posts:

Johns Hopkins

Related Posts:

Thanksgiving

Related Posts:

The Zombie Argument for Physicalism (Contra Panpsychism)

Related Posts:

Energy Conservation and Non-Conservation in Quantum Mechanics

Related Posts:

Thanksgiving

Related Posts: