A tuning fork hums at 440 cycles per second. Strike it, hold it up to your ear, and physics says there's nothing in that air but pressure waves — molecules crowding together and spreading apart, 440 times every second. There is no "note" out there. No "A." Just moving air.
And yet somewhere between that vibrating fork and your conscious experience, the air becomes a pitch — a thing you could hum back, recognize, name. That conversion is one of the most beautiful pieces of engineering in the body, and it happens before you've consciously noticed anything at all. So here's the question this whole section turns on: when, exactly, does a vibration in the air become a note in your mind? The answer is that pitch isn't received. It's built — and the place it gets built is not where most people would guess.
Start with the ear, because that's where the air finally runs out of road. Sound funnels down the canal, rattles the eardrum, and the vibration gets passed along three tiny bones into a coiled, fluid-filled tube called the cochlea. Picture a snail shell about the size of a pea. Inside it runs a thin membrane, and here's the trick: that membrane isn't uniform. It's stiff and narrow at one end, floppy and wide at the other. High frequencies make the stiff end vibrate. Low frequencies travel all the way down to the floppy end. So the cochlea takes a single complicated sound and physically smears it out by frequency — like a prism splitting white light into a rainbow, except the rainbow is laid out along a strip of tissue inside your skull.
That layout has a name worth keeping: tonotopy. Tono for tone, topy for place. Frequency gets mapped onto physical position. And here's the part that surprises people the first time they hear it — the brain never throws that map away. It carries it all the way up. From the cochlea, through relay stations in the brainstem, up into the auditory cortex, the orderly arrangement of frequencies survives. In the cortex, that map lives largely in a structure called Heschl's gyrus, a small fold of tissue tucked into the side of each hemisphere. If you could lay a probe across it and play tones from low to high, the active spot would march steadily across the tissue. Low here, higher there, higher still. A piano keyboard, smeared across the surface of the brain.
So far this is a clean story: air vibrates, cochlea sorts by frequency, cortex keeps the map. If that were the whole thing, pitch would just be frequency wearing a costume. But this is exactly where it gets stranger — because frequency and pitch are not the same thing, and the gap between them is where the brain actually shows its hand.
Here's the demonstration that breaks the simple story. Take a low note — say, a male voice singing a deep note. That sound isn't one frequency. It's a whole stack: a fundamental frequency, the lowest one, plus a ladder of higher frequencies riding on top called harmonics. Now do something sneaky. Filter out the fundamental completely. Erase it. By the simple cochlea story, the pitch should jump up, because the lowest frequency is gone. But it doesn't. You still hear the same low note. Engineers call this the missing fundamental, and it's not a party trick — it's the entire reason you can hear a bass line through tinny phone speakers that physically cannot produce those low frequencies at all. Your brain reconstructs the pitch from the pattern of the harmonics. The pitch you hear was never fully in the air. Your brain inferred it.
That single fact forces a split. Frequency is physical — it's out there in the world, measurable with a microphone. Pitch is perceptual — it's the brain's best guess about what's making the sound, and sometimes it guesses a note that no microphone would find. And here's the elegant part the research keeps surfacing: those two jobs are handled by different real estate. The core auditory regions, around Heschl's gyrus, do the raw frequency analysis — they track what's called the spectral content, the actual mix of frequencies present. But the experience of a unified pitch, that single hummable "note," tends to be computed in regions surrounding the core, a bit further along the processing chain. Core areas handle the ingredients. The surrounding areas cook the meal.
Stay with this for one more step, because it's the move that makes melody possible. If your brain only logged raw frequencies, music would be a list of unrelated facts. Instead, what it tracks is relationships — the distance between one pitch and the next. Play "Happy Birthday" in any key, high or low, sped up or slowed down, and everyone recognizes it instantly. Why? Because the actual frequencies are completely different every time. What stays constant is the pattern of intervals — the jumps between notes. A melody isn't a sequence of pitches. It's a shape, a contour of rises and falls, and the brain stores the shape, not the raw numbers. Harmony works the same way one layer up: when several pitches sound together, the brain reads the relationships between them as consonant or tense, settled or unresolved. So if someone stopped you right here and asked what a melody actually is, physically — what would you say? … It's not the notes. It's the spacing between them.
This relational, pattern-finding way of hearing is exactly the territory the 2025 Nature Reviews Neuroscience perspective on neural resonance theory is built to explain. That paper, surveying pitch, harmony, melody and tonality, argues something genuinely contested in the field. The mainstream view treats the brain as a prediction machine, running an internal model that forecasts the next note. The neural resonance camp — led by researchers like Edward Large — pushes back. They argue people anticipate musical structure not because they're running predictive models, but because the brain and body physically embody that structure through resonance, the way a wine glass rings at a particular tone. On their account, those statistically universal features of music across cultures — the ones documented in the work of Patrick Savage and colleagues on the structures of human song — exist because they correspond to stable states of a pattern-forming physical system. It's a real disagreement, and it's not settled. The prediction framework dominates the textbooks. But the resonance view has the advantage that it grounds pitch in the same dynamical principles that govern how vibrating systems behave in the rest of the universe — and that's an elegant kind of explanation, even if the jury's still out. The honest answer right now is that both camps explain real data, and serious people are still arguing about which runs deeper.
Now, one last surprise, and it's the one that ties this section to the larger argument. You might assume speech and music live in totally separate parts of the brain — language over here, melody over there. They don't. They share a remarkable amount of the same auditory real estate. Both lean on Heschl's gyrus, both ride the same tonotopic maps, both are built from pitch and timing. But they don't lean on it the same way. The networks tuned for the fine, fast frequency changes of speech are tilted toward one set of demands; the networks tracking the slower, more sustained pitch relationships of music are tuned toward another. Same neighborhood, different specialists. Which is why a stroke can sometimes rob a person of words while leaving their ability to sing those same words intact — a fact that becomes a treatment later in this course, not just a curiosity.
So strip all of this down to what's actually doing the work. The cochlea sorts sound by frequency and the brain keeps that map all the way up — that's tonotopy, the piano keyboard laid across Heschl's gyrus. But frequency is just the raw material; pitch is something the brain constructs, sometimes conjuring a note that isn't physically there at all. And what music is really made of isn't notes — it's the relationships between them, the intervals and contours your brain extracts and remembers.
Which means the moment a vibration becomes a note is the moment your brain stops measuring the world and starts interpreting it. And once a brain is interpreting — guessing, filling in, building shapes out of patterns — it's only a short step to the brain's deeper habit: not just hearing the note that's playing, but betting on the one that's coming next.