Harnessed: How Language and Music Mimicked Nature and Transformed Ape to Man
Page 17
Figure 30. Distribution of the number of beats for a theme “run” to cover the top or bottom half of the tessitura. The most common number of beats to cross half the tessitura is two, consistent with how quickly people turn 90 degrees. (Runs had three or more notes moving in the same direction, and all within the top or bottom half of the tessitura, and filling at least 80% of its half tessitura.)
This close fit between human and melodic turning rates is, as we will see, helpful in trying to understand regularities in the overall structure of melody, which we delve into next.
Musical Encounters
In light of the theory that music is a story about a fictional mover in our midst, is it possible to address what a typical melody is like? How do melodies usually begin? What is the typical melodic contour shape? How many beats are in a typical melody? Is there even such a thing as a “typical” melody?
To answer whether there is a typical melody, we must ask if there is a typical way in which a mover moves in our midst. Is there such a thing as a typical story of human movement?
Yes, in fact, there is. In a nutshell, the most generic possible story of a human mover consists of the mover noticing you and veering toward you, interacting with you in some way, and then scampering off. The plot is: “Hello. How are you? Good-bye.” Let’s call this an “encounter.” Stories told by composers by no means must be encounters, of course. One can expect to find tremendous variability in the stories told by music. But there is no getting around the fact that Hello–HowAreYou–Good-bye is a common story, whereas, say, Good-bye–Hello–HowAreYou is a strange story. People in stories usually arrive and then depart, not vice versa.
If “encounters” are indeed the most typical story of human movement in our midst, then let’s be more specific about the kinds of movement involved. Figure 31 shows the four qualitatively distinct kinds of movement we discussed earlier in the chapter. Can we say what an encounter is in terms of these movement categories? Yes. In the “Hello” part of the encounter, the mover suddenly notices you and begins veering toward you. This is case A or B in Figure 31, depending on whether the mover was receding or nearing, respectively, when he first noticed you. If A is the start of the story, then B occurs next. By the end of B, the mover has gotten near enough that he must begin veering away, lest he bump into you. This is the segment of the mover’s path that brings him closest to you—where he says, “How are you?”—and is case C in Figure 31. Finally, in the “Good-bye” part of the encounter, the mover veers away, which is case D.
Figure 31. Summary of the movement meaning of pitch for low and high, rising and falling pitch, as shown earlier in Figure 28. The sequence A-B-C-D is a plausible “most generic” movement, which I call an “encounter,” or Hello–HowAreYou–Good-bye movement.
The generic encounter, then, has one of two sequences, B-C-D or A-B-C-D, with the movement meanings of A through D defined in Figure 31. Furthermore, I suggest that the “full” A-B-C-D movement is the more common of the two, because stories of human movement often consist of multiple, repeated encounters, and in these cases A will be the first segment of any encounter after the first. That is to say, if a mover goes away, then turns to come back for another encounter, the sequence must begin at A, not B. We conclude, then, that the generic Hello–HowAreYou–Good-bye movement is thus A-B-C-D. A-B is the “Hello.” C is the “How are you?” And D is the “Good-bye.”
We now have some idea of what a generic movement is, and so the next question is, “What does it sound like?” With Figure 31 in hand, we can immediately say what the Doppler pitches are for this movement. Figure 32 illustrates the overhead view and the pitch contour of the generic A-B-C-D encounter. This generic pitch contour in Figure 32 has two distinctive features that we expect, no matter the specific shape of the mover’s encounter path. First and foremost, the generic pitch contour goes up, and then down. Second, it dwells longer (has a flatter slope) at the minimum and maximum of its pitch range (something I will discuss in the Encore section titled “Home Pitch”). In essence, this pitch contour shares two qualitative features that are crucial to hills: hills are not just lumps of earth on the ground—not domes, not mounds, not pyramids, not wedges—but have gentle slopes toward their bottoms and a gentle flattening at the top.
Figure 32. The most generic sequence of movement, and its pitch over time. One can see it is a “hill.” Furthermore, this path tends to be traversed in around eight steps (because people tend to take about two steps to turn 90 degrees).
We are nearly ready to ask whether melodies tend to look like the “hill” we see in the pitch contour of the generic encounter, but first we must gather one further piece of information about encounters. We need to know how long an encounter lasts. When movers do a Hello–HowAreYou–Good-bye to us, do they do it over 100 footsteps, or four? Our results from the previous section (“Human Curves”) can help us answer. We saw then that human movers tend to take about two steps to turn 90 degrees. Each of the four segments of the generic encounter—A, B, C, and D—is roughly a 90-degree turn, and thus the entire encounter can be carried out in about eight steps. Although Hello–HowAreYou–Good-bye behaviors can take fewer or more than eight footsteps, a plausible baseline expectation is that generic encounters will occupy around eight steps—not two steps, and not 80. The generic story of human movement in our midst—the “encounter”—is not just the sequence of movements A-B-C-D, but also, these movements being enacted in eight or so steps. Accordingly, the generic Doppler pitch contour is not just a hill, but a hill implemented in about eight footsteps. The sound of a generic human mover in our midst is an eight-step pitch hill.
These eight-step-hill encounters are, I claim, a fundamental intermediate-level structure found in the pattern of Doppler pitches from human movers in our midst. Real movers will, of course, often diverge from this, but such instances should be viewed as deviations from this generic baseline. I call the generic encounter an intermediate-level structure because it is hierarchically above the individual “atoms” of movement (A, B, C, and D), and because full stories of movers in our midst may involve hundreds or thousands of footsteps—full stories of human movement are built by combining many short bouts of movement. Because the encounter is the generic short bout of movement, the generic long story of human movement is many eight-step hills—many encounters.
And now we are in a position to figure out whether these generic stories of human movement are found in music. In particular, we want to know if melodies are built out of encounter-like structures. And because generic encounters sound like eight-step pitch hills, we wish to see if melodies have any tendency to be built out of eight-beat pitch hills. Of course, we expect that any such tendency should be weak: the eight-step pitch hill is the expectation for the generic encounter, but you can be sure that composers like to tell nongeneric stories as well. Nevertheless, we hope to see the signs of the generic melody by looking across a great many melodies.
Eric Jordan and I set out to measure the average pitch contour for themes, following the lead of Professor David Huron, who first carried out measurements of this kind and found arches in average pitch contours. Themes were put into groups with other themes having the same number of notes; each theme’s pitches were normalized (so that the bottom and top of the tessitura were 0 and 1, respectively); and the average normalized pitch was computed across the group. Classical themes tend to have fewer than 25 notes, and in order to sample longer melodies, allowing us to better discern signs of eight-beat hills, we also measured from a set of 10,000 Finnish folk songs, which have themes with longer lengths of 25 to 40 notes. Figure 33 shows the average pitch contours for each group. There are, for example, 83 themes having exactly eight notes (the average of their contours is shown at the upper left in Figure 33). If melodies are built from eight-beat hills, as predicted, then we expect to see such hills in these averaged melodies. A casual glance across the average pitch contours for melodies having the same number of notes indicates a mu
ltiple-hill pattern in longer melodies. For themes with eight to 13 notes, only one hill is apparent in the average contour, but by 14 through about 19 notes, there is an apparent bimodality to the plots. Among the plots with 30 or more notes, a multiple-hill contour is strongly apparent. And the hills are very roughly eight notes in length. (These eight-note hills are due to themes with a predominance of quarter notes. Themes with a preponderance of eight notes lead to 16-note hills.)
Average Melodic Pitch Contour
Figure 33. Average melodic contour for melodies with the same number of notes. Thirty-two different plots are shown, for melodies having eight notes through 40 notes, shown by the number label along each x-axis. The “n” values show the number of melodies over which the average is calculated. If melodies are built with eight-beat hills, then because a sizable fraction of melodies consist mostly of beat-long notes, there should be a strong eight-note-hill tendency in these plots. There will also be sizable fraction of melodies consisting mostly of half-beat-long notes, and these will tend to have a sixteen-note hill. The smallest hills we expect to see, then, are eight-note ones.
To get a more quantitative estimate of the typical number of notes per hill in these average-melodic-contour data, Figure 34 plots the approximate number of hills as a function of the number of notes in the average melody. One can see that the number of hills is approximately 1/8 the number of notes, and so there are about eight notes per hill.
Figure 34. Number of hills in the average melodic contour as a function of the number of notes in the melody. The approximate number of arches from each plot in Figure 33 having eight notes through 36 notes (after which the number of hills is less clear) was recorded. The number of hills rises as 0.1256 times the number of notes in the melody. The number of notes per hill is thus 1/0.1256, which is 7.90, or approximately 8, consistent with our expectation from generic human encounters.
The data we have just discussed provide evidence of the “eightness” of melody’s hills. But recall that in order to show that they are hills, and not some other protuberance shape, we must show that the note durations at the start, peak, and end of the protuberance tend to have longer duration, just as hills are flatter on their ends and on top (see Figure 32 again). Focusing now just on classical themes with eight, nine, and ten notes, Eric Jordan and I determined the average duration (in beats) of each note. To generate a hill-shaped pitch contour, we would expect a “W”-shaped plot of how note durations vary over the course of a melodic theme, with longer durations at the start, middle, and end. Indeed, that’s what we found, as shown in Figure 35. Melodies tend to have longer-duration notes at the start and end (when the fictional mover is headed away from the listener), and also in the middle (when the mover is headed directly toward the listener).
Melodies appear to have a tendency to be built out of eight-beat hills, which is just what is expected if melody’s stories are about a fictional mover’s multiple encounters with the listener. Like arrival, interaction, and departure in eight steps, melodies tend to rise and fall over eight beats, and in the nonlinear fashion consistent with a real encounter. Melodies, in other words, seem to have the signature structure of stories built from Hello–HowAreYou–Good-byes.
In this and the previous sections we have seen that melody behaves in some respects like the Doppler pitch modulations of a person moving. But there’s much more to the similarity between Doppler and melody, and I discuss additional similarities in detail in the Encore. Here I will only hint at them, but I encourage you to read the hints, because they are exciting, and they are crucial to the case I am making.
Encore 3: “Fancy Footwork” This section will discuss how when people turn and their Doppler pitch changes, their gait can often become more complex. And I will provide evidence that music behaves in the same way. I referred to Encore 3 earlier in this chapter when we wrapped up the discussion of rhythm, because “Fancy Footwork” concerns how rhythm and pitch interact.
Figure 35. Average duration in beats of each note for classical themes with number of notes near that of the generic encounter (i.e., for 8, 9, and 10 notes). One can see the expected “W” shape, showing that the eight-note-hills in Figure 33 are truly hills.
Encore 5: “Home Pitch” This Encore section will discuss three similarities between melody and Doppler pitch modulations, each concerning how pitch distributes itself. Like Doppler pitch modulations, melodic contours have a fixed home range (called the tessitura); tend to distribute themselves uniformly over their home range; and tend to dwell longer at the edges of the pitch range.
Encore 6: “Fast Tempo, Wide Pitch” Just as faster movers have wider Doppler pitch ranges, faster-tempo music has a wider pitch range.
Encore 7: “Newton’s First Law of Music” If melodic contours are Doppler pitch modulations, then we expect a variety of asymmetries between pitch rises and falls. Pitch changes are not generally expected to have “momentum” (i.e., a tendency to continue going in the same direction) with the exception of small downward changes in pitch. Also, melody is expected to have a tendency to drift gradually down, but to have larger upswings in pitch.
Where’s the Moving Pitch?
Throughout this chapter thus far, I have been suggesting that the Doppler effect is the ecological foundation of melody. One potential stumbling block for this hypothesis that I have until now avoided mentioning is that we’re not generally aware of these Doppler shifts in pitch. We may sometimes consciously notice them coming from skateboards, bikes, cars, and trains, but do we notice them coming from moving people? If our auditory system is listening to the Doppler shifts of people, then wouldn’t we have noticed? Not necessarily. As we discussed in Chapter 1, in the section titled “Under the Radar,” your conscious self not only doesn’t acknowledge these lower-level stimuli; it typically does not even have access to them. Just as your conscious self sees objects, not visual contour junctions, your conscious self hears the behaviors and actions of movers, not the auditory substrate out of which the whole is built. Your conscious self tends to latch onto the invariants out there—a pink room with a variety of colored lights will have a variety of spectra, but you’ll typically see the walls as uniformly pink. Similarly, when a mover in your midst is making sounds, your conscious self will focus on invariants such as what the mover is doing. Your auditory system will hear all of the lower-level structure, including the (detectable) pitch modulations due to the Doppler effect, but all that will tend to stay below the radar because you don’t need to know the details.
It would not, then, be surprising if our auditory system detects (and uses) Doppler shifts from human movers and yet we don’t consciously notice it. But one might wonder whether our auditory system can possibly detect Doppler shifts for movers going human speeds in the first place. If I am walking at one meter per second, then the difference in pitch between the highest and lowest pitch in the Doppler range is small, roughly about a tenth of a semitone on the piano. The fastest sprints possible by humans are about 10 meters per second, and even at these speeds the Doppler pitch range is only about a single semitone on the piano (e.g., from C to C#). Thus, whereas the Doppler pitch modulations for trains, planes, and automobiles are sizable because those vehicles move quickly, the pitch modulations for humans are small. Tiny as these Doppler pitch shifts may be, however, our auditory system is exquisitely sensitive to pitch changes, capable of detecting differences as small as about 0.3 percent of the frequency, or about 5 percent of a semitone—sensitive enough to distinguish even the pitch shifts of walkers.
So our ears are sensitive enough. But if one thinks more carefully about how our feet make sounds, there appears to be a big problem with the suggestion that the sounds of human movers make Doppler shifts at all. The biggest bang your body makes in motion is when your foot takes a step. But notice that when you are walking and your foot hits the ground, it is moving neither forward nor backward relative to the ground. Your feet are stationary when they touch the ground. But if your sound-
making foot is not moving forward when it makes its sound, then its pitch will not get Doppler shifted upward. That is, even though you are moving steadily forward, your feet are standing still at the moments when they are making their footstep sounds, and so your footstep sounds undergo no Doppler shift. (See Figure 36a.) (This is not a problem that applies to between-the-steps gangly bangings, which do Doppler shift.)
Footsteps are, however, more subtle than a simple vertical thud on the ground, and there are multiple avenues in which Doppler shifts may occur. First, even if your footsteps stomp the ground without any forward speed, your body is still moving forward. When the footstep sound waves rise into the air, your big old body bumps into them and reflects them. The waves that your front runs into get reflected forward and are consequently Doppler upshifted, and the waves that reflect off your back get Doppler downshifted, as illustrated in Figure 36b (i). Second, footsteps aren’t the simple vertical ground bangers that Figure 36a would have us believe. For one thing, the dynamics of a footstep are complicated by the fact that the ground is often complicated. Throughout most of our evolutionary history the ground was not smooth as it is today, but covered in grass, brush, and pebbles. When your foot goes in for a landing, it has some forward velocity, and will undergo a sequence of microcollisions with the ground material. This sequence of collisions is a forward-moving sequence, and will sound higher in pitch (or compressed) to a listener toward whom the mover is directed. (See Figure 36b [ii].) Not only is the ground more complicated than the “stomping” in Figure 36a indicates, but the foot dynamics even on smooth ground are more subtle than I have let on. Our heel hits first, and the contact points move forward along the foot toward the toes (see Figure 36b [iii]). Whether on smooth ground or natural terrain, this sequence of microhits underlying a single step is moving in the direction of the mover, and thus will Doppler shift.