Decoding Melodic Intuition through Cognitive Science
by David Fuentes Ph.D.
Executive Summary
This report examines Melodic Figuration Theory (MFT) and its profound alignment with established principles of cognitive science. By mapping the kinetic energy of melody to the mechanisms of human perception, MFT offers a unified framework for understanding how musical lines generate motion and meaning.
Melodic Figuration Theory identifies a shared vocabulary of kinetic, semantic figures found throughout the corpus of Western tonal music, regardless of style, not as inert motifs but as units defined by their characteristic ways of moving and generating expectation. Yet the discovery of this vocabulary is not MFT’s central contribution. MFT is not a theory of static patterns but a theory of melodic behavior. Its true power lies in decoding the agency of these figures—revealing not just how they are constructed, but how they inherently tend to move.
Furthermore, MFT grounds musical expressivity in a robust cognitive paradigm: the distinction between “Normal” and “Modified” behavior. This framework directly operationalizes Leonard Meyer’s foundational insight that musical meaning arises from the fulfillment or thwarting of expectation. MFT posits that “Normal” behaviors represent the brain’s statistical expectation (stability), while “Modified” behaviors represent the deviations that capture attention (expression). By formalizing this dichotomy, MFT supplies a practical toolkit for shaping listener expectation, linking the craft of construction directly to the psychology of emotion.
It is important to note that MFT did not emerge from formal cognitive-science study but from over thirty years of practice-based investigation by a composition professor into observable melodic behavior. The framework was developed with an eye toward implementation, requiring that every inquiry serve compositional practice.
PART ONE: The Methodological Framework of Melodic Figuration Theory
Melodic Figuration Theory builds a holistic understanding of melodic behavior through four interconnected tiers. Tier 1 introduces the “Vocabulary of Melody,” a limited collection melodic figures that make up the building blocks of melody. Tier 2 reveals that melodic figures are hardly inert patterns, but harbor behavioral tendencies—tendencies that manifest in how they harness metric gravity, negotiate harmonic tension, and generate forward motion and expression. Tier 3 explores the sorts of “localized” melodic syntax that build compelling phrase structures. And Tier 4 shows ways to apply the principles and behaviors of MFT to create schemas for larger structures.
TIER 1: The Vocabulary of Melody
Melodic Figuration Theory holds that all tonal melodies are composed of a shared vocabulary of 36 distinct melodic figures. Each figure has unique behavioral tendencies—tendencies that shape how it moves, interacts with other figures, and unfolds in time.
This is new. These aren’t just recurring patterns. They’re specialized agents with intrinsic behavioral characteristics that, when acted upon by five musical forces—localized harmony, metric gravity, connection, register, and shape (the elements that form Tier 2)—generate autonomous motion at the melodic surface. This self-propelling potential operates independently of deeper structural frameworks, addressing a dimension of melody that has remained largely unexplained until now.
Click on any figure to hear how it behaves in a few well-known melodies across eras and genres.
Vocab TablePresented in root position without rhythm or context, the figures can seem almost unmusical—abstract placeholders rather than living melodic material. That changes immediately upon hearing them in action.
This vocabulary has always existed. MFT is the first to write it down.
A central tenet of MFT is that composers don’t acquire knowledge of these figures through deliberate, formal study. Instead, we absorb them through years of singing, playing, and listening until the patterns become second nature. The result is a shared vernacular: figures that surface again and again, not by scholarly decree, but through the collective muscle memory of generations of composers.
It is worth noting that normal and modified behavior—though developed fully across the five dimensions of Tier 2—are not exclusively Tier 2 concepts. They are already present in the figures themselves. Each figure carries a kinetic potential that constitutes its normal behavior: the way it most naturally moves when left to its own logic. Modification begins the moment a composer redirects that potential. In this sense, the normal/modified distinction is not a property of the dimensions acting on figures; it is a property of the figures themselves, which the dimensions then make visible and controllable.
TIER 2: Five Dimensions of Melodic Behavior
In moving from vocabulary to behavior, we encounter a fundamental shift in perspective. MFT posits that melodic figures are not static bricks to be stacked; they are agents with tendencies. They possess a kind of kinetic “will”—an inherent drive to move in specific ways depending on how they interact with metric gravity and harmonic tension. In Tier 2, we map this agency across five specific dimensions.
1. Harmonic melody: While “traditional approaches to music theory” are hardly monolithic, there are some areas where they uniformly concur. Every approach to reckoning the relationship between melody and harmony follows the same either/or formula: categorizing each melodic note based on how it fits (or doesn’t) with the prevailing harmony—as either chord tone or non-chord tone. While traditional approaches excel at vertical/hierarchical analysis, Melodic Figuration Theory prioritizes emergent harmonic coherence within kinetic figures.
By observing characteristic behaviors of notes within each melodic figure, we can predict which ones are most likely to establish a figure’s harmony, even without knowing precisely which chord is in play. Consequently, we can appreciate each figure’s harmonic elements horizontally, in line with melody’s inherently diachronic flow. Distinguishing harmonic elements within the most basic patterns of melody explains how harmony naturally emerges from the figure itself, even without accompaniment. This phenomenon extends to reharmonization, which can involve reinterpretation as well as role reversal (switching chord tone and non-chord tone functions). MFT also addresses “harmonic divergence,” where figures intentionally create dissonance or tension against the prevailing harmony, yet still sound musically compelling due to their internal coherence.
2. Metric Gravity/Placement: MFT defines meter as “music’s essential pulse: a living, dynamic cycle of strong and weak beats that acts as a gravitational engine.” And depending on how the other musical elements interact with this cyclical engine of strong and weak beats, “a composer can generate sensations ranging from deep stability to playful confidence to intense agitation and more.” This is possible because musical elements “take on the characteristic lift and heft of the metric positions they inhabit.” Going further, MFT identifies various metric placements of melodic figures, including “normal alignment” (starting a figure on the beat, ending on an upbeat), “ligatures” (placing the last note on a beat), “pickups” (leading to an upcoming beat), and “straddled alignment” (positioning a figure’s middle note on a beat). This nuanced understanding of metric placement allows composers to intentionally shape momentum and feel of a melody as it unfolds.
3. Connection: MFT not only focuses on a phrase’s overall flight, but also on the distinct effect pathway between beats. Using each figure’s inherent “gait”—its unique kinetic profile—the composer shapes each beat-to-beat pathway with an ear toward controlling a melody’s local and cumulative pacing, flow, momentum, and expressive emphasis.
4. Register: MFT underscores the significant expressive and structural role of register in melody, a dimension often overlooked in traditional approaches. Composers consistently leverage register to create “registral dramaturgy,” establishing a “home” register and then stretching, sidestepping, or abandoning it for expressive effect. MFT also differentiates between “fixed” figures, whose registral span remains constant (between a 2nd and 4th), and “flexible” figures, which can expand or contract. Techniques explored include varying registral slope, isolating distinct registral zones within a phrase, and “overstepping the line”—marking an upper or lower registral boundary and then exceeding it for dramatic impact.
5. Shape: Melodic shape, the aural shape traced by notes, is understood by MFT as a multi-layered process involving macro (phrase and section), mezzo (figure and gesture), and micro (note-to-note) motions. These layers interact to shape the melody’s overall profile. MFT examines how shape is used for contrast between gestures, to maintain continuity through repetition, and through “melodic tailoring.” Tailoring can be “adaptive” (adjusting intervals to blend seamlessly, which often means changing figures) or “expressive” (preserving the contour but making a gesture “bigger” or “smaller” for emotional force). The theory also introduces the presence of “nested figures”—subtle, recurring 3-note figures embedded within 4-note figures that create nuanced resonances during melodic development.
TIER 3: Melodic Syntax
In Melodic Figuration Theory, a melodic figure, all by itself, is not yet music. To come to life, musical speaking, a melodic figure must combine with rhythm to form a “melodic gesture.” As such, the melodic gesture—not the melodic figure—is “the smallest intact unit of melody.” Going further, MFT’s melodic syntax articulates the “responsive logic” inherent in melody, presenting its course as a back-and-forth discourse. An initial melodic “proposition” (a gesture) naturally elicits a “response.” This tier defines the “melodic gesture” as “the smallest complete unit of melody.”
MFT identifies three fundamental ways to respond to any gesture, phrase, or section: repeat it, vary it, or contrast it. Drawing upon its deep understanding of melodic behavior across the five dimensions (Tier 2), MFT has cataloged an extensive repertoire of 75 distinct response strategies—25 ways to repeat, 25 ways to vary, and 25 ways to contrast a melodic gesture. This comprehensive catalog provides composers with a rich array of options (schemas) for developing melodic ideas.
TIER 4: Melodic Schemas
The systematic study of musical schemas has gained significant momentum over the past few decades, largely through the work of Robert Gjerdingen and colleagues, who have cataloged the recurring harmonic-melodic patterns that define the Galant style. MFT extends this tradition downward—from the phrase level to the figure and gesture level—where the same logic of pattern recognition and behavioral expectation operates at its most granular.
MFT defines a musical schema as “a recurring pattern or framework that composers, improvisers, and performers recognize instinctively,” then draw on to create or interpret melodies. Key characteristics of schemas include recognizable patterns, cognitive efficiency, flexible application, and cross-genre validity.
According to MFT, every element of music can be represented as a schema within this paradigm, such that learning music is, in essence, “learning schemas, by book or by ear.” MFT, through its keen awareness of the properties and behaviors of melodic figures, uses schemas in part to shine a light on “overlooked facets of melodies.” In addition to the 75 syntactical schemas mentioned above, others include:
1. MFT-specific schemas for integrating melodic figures within standard musical schemas. For example, using nested figures to embellish a cadential 6/4 melodic formula.
2. MFT-specific schemas for using each melodic figure. Each of the 24 melodic figures brings its own signature behaviors—abilities it excels at—while also sharing other traits with its peers.
3. MFT-specific schemas for using any of the five dimensions of melodic behavior. A thorough review of existing literature shows that composers already employ schemas to manipulate harmony, meter, connection, register, and shape—yet many of these remain undocumented. For example, MFT’s “Anda-2” schema observes how melodies often begin off the down-beat to shift metric weight onto the next strong beat (e.g., “My Funny Valentine”).
4. MFT-specific schemas for building phrases. A unique contribution of MFT in this tier is the concept of building “one-off phrase schemas,” which involves analyzing the distinctive syntactical behaviors of existing melodies and then imprinting these behaviors onto new melodic materials.
5. Schemas for breaking schemas. Resisting the common wisdom that “one learns the rules so one can break them later,” MFT maintains that creative deviation from established “rules” is not random but follows alternate, intentional routes to achieve specific expressive effects. By systematically mapping such deviations, MFT provides composers with structured pathways for innovation.
A concrete example of MFT’s schema approach at work is the Perforated Melody Schema—a pattern built from 2-note iterations (occasionally 3) in a quick upbeat-to-downbeat rhythm, distributed across pauses. What makes this schema theoretically significant is the cognitive arc it produces. The first iteration registers as maximally ambiguous: two notes, no precedent, nothing to compare against. The listener holds them in suspension. The second iteration triggers retrospective pattern recognition—the shape, brevity, and metric position of the first fragment are now identifiable as the opening of a series. By the third iteration, expectation is fully primed. This arc is not incidental. It is chunking, priming, and Meyerian expectation management operating in sequence, at the figure level—precisely the level CGF has argued MFT uniquely formalizes.
PART TWO: The Cognitive Foundation of Melodic Figuration Theory
The previous section outlined the four-tiered framework of Melodic Figuration Theory on its own musical terms. This section now provides scientific validation for that framework, demonstrating point-by-point how MFT’s practical, musically-derived insights reflect the brain’s own fundamental processes for perceiving, learning, and creating music. The following analysis reveals that the core tenets of MFT are not arbitrary but are deeply resonant with established principles of cognitive science.
A. Chunking: The Fundamental Units of Melodic Perception
Chunking is a foundational cognitive process that involves organizing smaller, discrete units of information into larger, more meaningful, and manageable fragments (Song & Cohen, 2014). This concept gained prominence from George A. Miller’s influential 1956 paper, which highlighted the brain’s limited working memory capacity—typically holding between five and nine chunks of information—and how chunking effectively extends this capacity (Miller, 1956). In the context of music, chunking is a pervasive cognitive activity, relevant whenever musical elements are conceived in groups of notes, rhythms, or articulations, whether during improvisation, sight-reading from notation, or passive listening (Chenette, 2022).
Musical chunks are perceived as holistic fragments of action and sound, generally lasting between 0.5 and 5 seconds (Godøy et al., 2010). The formation of these chunks can occur in two primary ways: through the recognition of familiar musical units or established schemata, such as a motif or a metric pattern, even in the absence of obvious breaks in the sound (Godøy et al., 2010). Crucially, the brain leverages pattern recognition to process these familiar structures as unified entities rather than isolated elements, significantly enhancing informational efficiency (Sloboda, 1985).
Melodic Figuration Theory’s melodic figures align with this cognitive model, presenting themselves not merely as potential chunks, but as the natural, pre-formed cognitive units that the human brain already uses to perceive, process, and generate melody. They represent the “bite-size” pieces the brain prefers for efficient processing, suggesting that MFT is not imposing an arbitrary classification system but is formalizing the very units of melodic thought that the human brain naturally employs. Further, MFT defines a melodic gesture as the “smallest intact unit of melody,” formed by combining a melodic figure with rhythm (Fuentes, MFT, Tier 3: From Figure to Gesture). This directly parallels the cognitive process of chunking, which involves breaking down long and complex sequences into more manageable fragments that can be “learned and then concatenated together to acquire the long sequence” (Song & Cohen, 2014). MFT’s progression from figures to gestures and then to phrases mirrors this hierarchical chunk assembly in cognitive processing. For a potential AI system, this would mean operating with the same fundamental cognitive units as a human composer, enabling truly collaborative interaction by directly addressing the composer’s “native language.”
1. Chunking as a Tool for Learning and Performance
Chunking plays a critical role in learning and memory. By segmenting complex musical pieces into smaller, manageable parts, the brain can process and encode information with greater efficiency (Dueck, 2023). In practical applications like sight-reading, chunking enables musicians to perceive notes in groups, which significantly reduces the cognitive load associated with processing each individual note, thereby enhancing both accuracy and musicality (Chenette, 2022), as well as motor skills (Pike, 2012).
The chunk types commonly referenced in studies include arpeggios, neighbor tones, and recurring rhythmic motives (Maddocks, 2010). MFT’s catalog of 36 behavior-rich building blocks constitutes a more intricate and detailed system of chunks, though it similarly segregates them into three main types: arpeggio, scale, and neighbor figures. This aligns with research showing that while chunking is an intuitive process, conscious effort and the explicit identification of patterns can significantly amplify its effectiveness. Deliberately identifying chunks during practice has been shown to strengthen this innate capability (Chenette, 2022), just as proactive pattern recognition before playing enhances chunk utilization (Sloboda, 1985). This synergy between intuition and explicit knowledge mirrors MFT’s pedagogical approach. In fact, MFT functions as a pedagogical framework that structures this process, providing an overt, structured way to identify and practice melodic figures, thereby bridging instinctive musical understanding with controlled technical mastery.
2. Chunking in the Perception of Musical Structure
Beyond memorization and performance, chunking fosters deeper musicality by enabling musicians to grasp the “big picture” of a piece—not merely individual notes, but their contextual relationships, directional momentum, and expressive intent (Chenette, 2022). This is because chunking occurs on various hierarchical levels, which helps musicians organize musical material into a logical progression—the foundation for understanding musical form (Neuhaus, 2013).
This process of perceiving larger structures is supported by direct neurological evidence. The cognitive underpinnings of chunking extend to the perception of phrase structure—a crucial organizational mechanism for auditory processing (Knösche et al., 2005). Phrase boundaries are signaled by various structural markers, such as small breaks, the lengthening of notes, changes in pitch contour, or harmonic cadences (Knösche et al., 2005). Electrophysiological studies using EEG/MEG reveal a “closure positive shift” (CPS) in brain activity at these boundaries, mirroring prosodic phrase processing in speech (Knösche et al., 2005). Source localization implicates limbic structures (e.g., the hippocampus) associated with memory consolidation and attentional shifting (Knösche et al., 2005). This suggests the CPS reflects not just boundary detection, but the cognitive “packaging” of completed phrases into unified chunks while reallocating attention to new material (Knösche et al., 2005). This neurological process of “packaging” phrases provides a direct perceptual parallel to MFT’s compositional syntax. The theory’s “responsive logic”—in which an initial gesture is answered by one that repeats, varies, or contrasts it—provides a practical framework for creating the very structural markers and coherent phrases that the brain’s CPS mechanism is designed to detect and process. Cognitively, this relates directly to melodic expectation, where the brain constantly generates predictions about “what will come next” based on past experiences and recognized patterns (Nguyen & Yildirim, 2024). MFT’s catalog of 75 distinct response strategies, therefore, can be understood as a detailed toolkit for managing these inherent cognitive expectations and guiding the listener’s attention through melodic transitions.
3. Chunking in Compositional Practice
The evidence confirms chunking as a fundamental cognitive process governing both music perception and creation, from auditory intake to artistic output (Chenette, 2022). A bidirectional relationship exists between auditory chunking and the physical gestures of performance, indicating that chunking is a holistic mechanism supporting the entire creative cycle (Godøy et al., 2010). This provides strong support for MFT’s emphasis on “observable melodic behavior” as a basis for writing melody. If chunking is a natural cognitive process for producing music, then a theory that codifies these natural chunks offers a direct, cognitively aligned framework for composition.
For composers, this process often works in reverse from that of an analyst. While analysis often involves the top-down dissection of a composition into sections, phrases, and smaller components, MFT frames the creative process as a bottom-up construction: “arranging notes into figures, figures into gestures, and gestures into phrases” (Fuentes, MFT, Tier 3: Melodic Syntax). By providing a detailed vocabulary of these foundational chunks, MFT supports the composer’s craft in a direct way. It provides a framework for translating intuitive, passively absorbed knowledge into explicit and controllable skills for composers, thereby improving their craft. It is this codification of the creative cycle—from perception to production—that provides the robust foundation needed for an AI partner capable of true musical collaboration.
This cognitive efficiency has profound implications for the computational modeling of music. A raw stream of MIDI notes presents a nearly infinite set of combinatorial possibilities, making it computationally opaque. However, by reducing melody to a finite vocabulary of behavioral figures, MFT transforms an infinite problem into a modelable system. This constraint renders the nebulous art of melody into a computationally sufficient logic. Rather than viewing composition as the statistical probability of the next note (a near-infinite variable), MFT allows us to model the behavioral logic of the next figure (a finite, predictable unit).
B. Gestalt Principles: The Laws of Perceptual Grouping
The question of how our brains naturally group notes into the precise, recurring patterns we call chunks leads us directly to the foundational insights of Gestalt theory. While most associate Gestalt with visual phenomena, its origins are profoundly rooted in music, directly affirming MFT’s core contention that we hear melody in meaningful, unified groups, not as a mere succession of individual notes. Christian von Ehrenfels introduced the concept of Gestaltqualität (“emergent property of form/shape/pattern”) in 1890, arguing that a melody is undeniably more than the sum of its sensory components. It is this emergent quality that allows a melody to be transposed to a new key, using entirely different notes, while still retaining its undeniable identity (von Ehrenfels, 1890, as discussed in Smith, 1988).
MFT’s understanding of melodic figures and gestures perfectly resonates with this principle. While other music researchers have deftly applied Gestalt principles to broader phenomena like phrase grouping, and Eugene Narmour’s Implication-Realization model provides a powerful note-to-note framework, MFT is the first theory to formalize the mezzo-level of intuitive melodic construction. It identifies a vocabulary of kinetic figures and shows how they are combined into gestures. This essential, “pedestrian” perspective illuminates entirely new ways to understand how music “holds together” and “moves” at the primary level of composerly intuition, opening new avenues for research and application.
The overarching principle in Gestalt theory is the Law of Prägnanz, which states that the mind tends to perceive reality in its most regular, orderly, and simple form (Koffka, 1935). At every level, MFT is consistent with this principle by defining every aspect of melody according to its “normal” (most simple and orderly) and “modified” behaviors.
Good Continuation: This principle describes our innate tendency to perceive separate objects as forming simple, continuous patterns rather than disjointed arrangements (Wertheimer, 1938). MFT’s catalog of melodic figures exemplifies this, especially as it emphasizes behavior at every turn. For example, regarding shape, figures like the Run exhibit a consistent direction while others like the Arc form curves. Regarding connection, a figure ending with stepwise motion normally continues moving by step into the next, maintaining a seamless melodic flow. And figures that end with a “gap”—a leap of a third—have a strong proclivity to fill that gap by moving to the next figure by step in the opposite direction to the gap’s motion.
Closure (Reification): This principle explains our predisposition to perceive incomplete forms as complete, mentally synthesizing missing units to create a unified whole (Wertheimer, 1938). In music, this explains how listeners intuit a clear harmonic underpinning when hearing an unaccompanied melody. MFT establishes compelling new criteria for this phenomenon by showing how the inherent shape of each figure makes it clear which notes function as chord tones, allowing the brain to “close” (fill out) what’s missing from the harmonic information.
Figure-Ground: This principle describes our ability to differentiate a dominant figure from its background (Rubin, 1921). One way MFT does this is by precisely identifying figures like the Pendulum and the Pendulum Auxiliary (Fuentes, MFT, Tier 1: Melodic Vocabulary), which inherently contain a fulcrum against which other notes pivot, establishing clear melodic motion against a fixed background. Another is by showing how figures like the Double Neighbor and Double 3rd have apparent commotion “atop” underlying motion.
Proximity: This principle states that elements close to one another in time and space tend to be perceived as a group (Wertheimer, 1938). MFT leverages this fundamental law through its highly developed approach to register. By analyzing the inherent span of individual melodic figures, MFT isolates distinct registral regions within a melody, providing a precise lens to understand how proximity shapes melodic coherence.
Similarity: Elements that share features, especially contour, tend to be perceived as a group (Wertheimer, 1938). In MFT, figures with similar contours stand in correspondence, so listeners readily hear them as related even when they aren’t identical. Every 4-note figure contains a nested 3-note figure, reinforcing these implicit links. Similarity also underpins “tailoring:” substituting one figure for another with a closely matched contour when harmony prevents direct repetition, thereby preserving continuity.
C. The Intuitive Physics Engine: The Perception of Melodic Motion
In his seminal work on musical forces, Steve Larson (2004) proposed a compelling framework for understanding melodic expectation. He argues that we perceive melody through metaphorical forces—gravity, magnetism, and inertia—that are rooted in our intuitive “experience of physical motions.” This frame-work aligns with the concept of an “intuitive physics engine” within cognitive science, which enables humans to make rapid and accurate inferences about the physical world (McCloskey, 1983). MFT’s emphasis on the kinetic and gestural nature of melody finds its scientific explanation in this cognitive capacity, especially through three key concepts:
1. Metric Gravity/Placement: If anything is “structural” in MFT, it’s the beat itself—a point of kinetic contact, like a foot touching the ground mid-stride. The beat isn’t a final destination, but a deliberate point within a cycle that energizes and propels the melody. MFT’s concept of metric gravity posits that the beat acts as a gravitational center. The placement of notes in relation to this metric pulse is a primary source of a melody’s kinetic energy and expressive character. Notes that land squarely on a strong beat feel stable and grounded; notes that arrive early or late create a sense of kinetic energy, like an object in motion that defies simple gravitational pull. This provides a scientific model for why MFT’s focus on normal and modified ways to align melodic figures within the meter is so crucial for shaping melodic character.
2. Connection: MFT’s focus on the figure as a kinetic entity provides a precise “mezzo-level” methodology for controlling the melodic surface. Such distinctions describe the very kind of information an intuitive physics engine processes. Its distinctions between “normal” (direct) pathways for reaching a “target” (the upcoming beat) and “modified” routes have cognitive referents in how humans move from point to point in the physical world. Also, it observes that the effect of linking one figure to the next hinges upon the type of motion between the first figures last two notes. For example, stepwise motion sounds a the end of one figure sounds smoothest when it continues moving by step. What’s more, MFT’s observation that the perceived effect of a leap is not merely about its interval size but about how it navigates the musical forces of metric gravity (the beat with its accompanying metric cycle) and inertia (the melodic line’s particular manner of motion and follow-through). For example, a dissonant leap made after landing solidly on a beat will sound elegant and intentional, whereas the same leap executed across the beat can sound clumsy and disruptive. Thus, the conclusion that for melodic leaps, “timing matters more than distance,” is a profound insight that cognitive science can now help explain (Fuentes, MFT, Tier 2: Connection).
3. Gestures: MFT’s central concept of the “melodic gesture” is overtly put forth to replace more standard terms like fragment, segment, cell, or motive—precisely because these technical terms inadvertently reduce the small parts of a melody to abstraction, stripping them of their inherent kinetic energy. In sharp contrast, “A gesture harnesses motion to convey meaning with immediacy” (Fuentes, MFT, Tier 3: From Motion to Meaning). This is as true for melodic gestures as it is for hand gestures. MFT codifies these kinetic gestures, providing a systematic approach to understanding and manipulating the very forces that drive melodic motion and listener expectation.
D. Implicit Learning and the Priming of Expectation
MFT maintains that composers do not learn this vocabulary through formal study, but internalize it through years of immersion until the patterns become second nature. This aligns perfectly with implicit learning—the cognitive process where complex knowledge is acquired “without the aid of explicit instruction” or “conscious awareness” (Ettlinger et al., 2011). Because the brain is sensitive to “statistical regularities” within auditory sequences (Tillmann & McAdams, 2004), composers naturally absorb the shared behaviors of MFT’s figures simply by listening. This mechanism explains how a standardized melodic vocabulary can arise across diverse genres and eras without ever being written down.
The brain’s inherent reliance on “pattern recognition” (Sloboda, 1985) for efficient information processing further supports this. The human auditory system possesses a remarkable ability to establish “memory traces for invariant features” in the acoustic environment despite continuous variations (Tervaniemi et al., 2001). This process is facilitated by musical expertise, particularly for musicians who rely heavily on auditory information over visual scores (Tervaniemi et al., 2001). This explains how MFT’s “limited set of 36 common, small-scale patterns” becomes deeply ingrained and forms the basis of “observable melodic behavior” in compositional practice.
If implicit learning allows humans to internalize complex musical grammars and regularities without explicit instruction or conscious awareness (Ettlinger et al., 2011), then MFT, by systematically identifying and codifying these “observable melodic behaviors” and “consistent ways that composers shape and refine” figures, is essentially reverse-engineering this implicitly learned musical grammar, making the unconscious rules of melodic intuition explicit and computable. MFT reverse-engineers observable behaviors of melodic intuition into explicit schemas, making implicit patterns accessible for practice and computation while acknowledging intuition’s complex, non-symbolic nature.
This process of intuitive acquisition also sets the stage for priming, a form of implicit memory in which exposure to one musical stimulus automatically prepares the brain to process a subsequent, related stimulus (Ettlinger et al., 2011). Hearing a melodic gesture activates a network of associated concepts, making the mind more attuned to what might follow. This mode of expectation is not a conscious prediction, but a subconscious process that gives the mind a powerful advantage in comprehension (Ettlinger et al., 2011). This is the cognitive mechanism behind MFT’s “responsive logic” (MFT: Tier 3, From Motion to Meaning). An initial melodic gesture (the “proposition”) primes the listener’s brain with a set of expectations. The subsequent gesture (the “response”) must then manage those expectations in a musically meaningful way by choosing to repeat (confirming the expectation), contrast (subverting it), or vary (confirming it with a twist) (Fuentes, MFT, Tier 3: Not Just One Thing After Another). This transforms the act of hearing from a passive experience into a dynamic, anticipatory one, and it provides a compositional model for creating melodies that feel cohesive and compelling (Ettlinger et al., 2011).
E. Expectation and Meaning: MFT as a Meyerian Framework
The concepts of priming and implicit learning lead directly to a highly influential theory of music cognition: Leonard Meyer’s thesis that musical emotion and meaning are generated through the fulfillment and thwarting of expectation (Zangwill, 2021). For Meyer, a musical event has meaning because it points to and makes us expect another musical event. The satisfaction, surprise, or even frustration we feel in response to the music that follows is the basis of our aesthetic experience.
MFT’s understanding of melodic figures and gestures perfectly resonates with this principle. While other music researchers have deftly applied Gestalt principles to broader musical phenomena (e.g., phrases, phrase grouping, rhythm, and form), MFT is the first theory to leverage these laws specifically to systematize the mezzo-level units of intuitive melodic construction—how figures form melodic gestures and how those gestures interact. This gesture-level analysis illuminates entirely new ways to understand how music generates coherence and propels motion at the fundamental level of composerly intuition, opening transformative avenues for research and application.
The brain is a prediction machine. MFT operationalizes this by defining “Normal Behavior” as the brain’s statistical expectation. “Modified Behavior” is the deviation that captures attention and encodes emotion. Without the standard of “Normal,” the “Modified” has no meaning.
This is a crucial distinction. Unlike theories that focus on note-to-note (or “micro-level”) predictions, MFT’s approach is inherently Gestalt-based, operating at the level of the gesture—the smallest recognized perceptual whole. It is here, within the very DNA of the melodic figures, that MFT provides a practical toolkit for manipulating the listener’s expectations, thereby generating emotion and meaning in a way that bridges the gap between the single note and the complete phrase.
This Meyerian framework extends from the intuitive to the conscious. While the manipulation of figures often occurs at an unconscious level, MFT’s schemas (Fuentes, MFT, Tier 4: Melodic Schemas) provide a framework for the deliberate, conscious management of expectation at a higher structural level. The concept of “schemas for breaking schemas,” in particular, formalizes the creative act of “haphazardly” deviating from established patterns. It is Meyerian expectation management writ large, transforming what might otherwise be random “rule-breaking” into a structured, artistic choice. This allows composers to move from the unconscious assimilation of melodic grammar to its deliberate and masterful control, bridging the gap between mere imitation and genuine originality.
F. Musical Energetics: The Historical Precedent for Kinetic Agency
While cognitive science provides the mechanism for how we perceive melodic motion, the history of music theory provides the precedent. MFT does not exist in a vacuum; it stands on the shoulders of Energetics, a school of thought that has long argued music is not a static architecture of pitch, but a dynamic experience of force and motion.
In the early 20th century, theorists like Ernst Kurth argued that melody is driven by kinetic energy—a “psychic motion” that surges through notes (Rothfarb, 1991). Later, Victor Zuckerkandl described tones not by their acoustical frequency, but by their “dynamic quality”—the palpable sense that a note is pointing, leaning, or striving toward another (Zuckerkandl, 1956). More recently, Steve Larson’s theory of “Musical Forces” identified three primary energies—Gravity, Magnetism, and Inertia—that listeners intuitively track in real-time (Larson, 2012).
MFT is the practical operationalization of these concepts.
Where Energetics successfully identified the invisible forces acting upon melody, it often struggled to identify the specific objects being moved. MFT supplies the missing syntax (the figures) that these forces act upon. When we speak of MFT’s “Connection” or “Metric Gravity,” we are effectively mapping the kinetic dimension of melody—the specific ways our auditory system tracks these energetic forces through the “solid” objects of the melodic figures.
In this light, MFT bridges the gap between the abstract philosophy of Energetics and the concrete demands of composition. It takes the “dynamic qualities” that Kurth and Zuckerkandl heard and gives them a name, a shape, and a predictable behavior—effectively completing the unfinished business of Energetics by offering a system that composers can actively wield.
PART THREE: Implications for Future Research
MFT’s central contribution is neither speculative nor forward-looking: it is the first systematic map of the mezzo level—the layer of melodic figures and gestures where melody does its most expressive, agile work. This layer has always been present, always been used, and until now has lacked both a name and a framework. The cognitive science surveyed in this document does not merely support MFT; it explains why the mezzo level behaves the way it does, and why composers have always navigated it by ear rather than by rule.
The detailed validation in this report—from the neurological basis of chunking and the perceptual laws of Gestalt, to the intuitive physics of melodic motion and the Meyerian creation of meaning—provides a robust scientific foundation for Melodic Figuration Theory. This foundation validates MFT’s central claim: by discovering and cataloging the finite vocabulary of melodic figures, MFT has formalized a previously unconscious dimension of musical craft, offering an unprecedented window into the mechanics of composerly intuition.
It is from this unique vantage point that we can appreciate the profound implications for future research. By formalizing the “hidden grammar” of melody, MFT opens distinct avenues for advancement in Artificial Intelligence, Music Cognition, and Computational Musicology.
- Explainable AI (XAI) and Symbolic Modeling
Current music AI largely operates as a probabilistic “black box,” analyzing finished works to statistically guess the next note. MFT offers a crucial counter-paradigm: a “White Box” model based on generative principles.
Research into MFT-based systems can help solve the problem of “explainability” in AI. Because MFT is built on an understanding of how melody is created rather than just why it is structured, it offers a framework for Neuro-symbolic AI—systems that combine the learning power of neural networks with the explainable, elegant internal logic of MFT’s syntax. This supports the development of agents capable of “reasoning” about melody, offering principled, creative possibilities rather than algorithmic guesswork. - Quantifying “Relevance” and Musical Semantics
MFT allows researchers to address one of the most elusive challenges in music theory: “Relevance.” As Aaron Copland noted, composers write to “summarize in some permanent form your most basic feelings about being alive… to set down some sort of permanent statement about the way it feels to live now, today” (Copland, 1952).
Traditionally, this connection between shape and feeling has been vague. However, MFT’s “Normal vs. Modified” paradigm offers a mechanism to map specific melodic behaviors to the profound human implications they embody. Future research can leverage MFT to create a rigorous Semantics of Melody, analyzing how specific deviations in “figural behavior” correlate with listener perceptions of tension, yearning, or resolution—effectively decoding the link between “how it sounds” and “how it feels.” - Modeling Compositional Intent and Development
Musical development requires transforming material in ways that feel intentional—a nuanced process where current statistical models often fail, resulting in arbitrary mutation or “gambling.” MFT spells out a far more extensive range of techniques for development than previously available, such as varying the connection of adjacent gestures or utilizing nested figures.
This provides a roadmap for Computational Creativity research. By encoding these high-level strategies, researchers can model “Compositional Intent,” creating algorithms that do not merely mimic style, but emulate the strategic decision-making process of a skilled composer. - Ethical Generative Frameworks
In an era marked by legal challenges over data scraping, MFT offers a theoretical foundation for Ethical AI. A generative system based on MFT requires no opaque training data and runs no risk of “lifting” copyrighted material, as its knowledge is derived from the foundational principles of melodic behavior rather than a dataset of finished works. Research into such “principle-based” generation offers a pathway toward “clean,” legally sound, and genuinely original computational systems. - Empirical Cognition Studies
Finally, MFT serves as a powerful hypothesis generator for cognitive science. Researchers can now empirically test the reality of “Metric Gravity” and “Figural Agency.” Do listeners process “Normal” figures faster than “Modified” ones? Does the brain track the “connection” of a melody as it would a physical event? MFT provides the controlled variables and finite vocabulary necessary to design precise experiments, further bridging the gap between music theory and brain science.
References
Baker, D. J. “Chapter 3: Individual Differences.” In Modeling Melodic Dictation. Retrieved July 24, 2025. https://davidjohnbaker1.github.io/document/individual-differences.htm
Chenette, T.. 2022. “Formalizing chunks.” In Foundations of Aural Skills. Retrieved July 24, 2025. https://uen.pressbooks.pub/auralskills/chapter/formalizing-chunks/
Chenette, T. 2022. “Sight reading in chunks.” In Foundations of Aural Skills. Retrieved July 24, 2025. https://uen.pressbooks.pub/auralskills/chapter/sight-reading-in-chunks/
Copland, A. (1952). Music and imagination. Harvard University Press. https://archive.org/details/musicimagination0000copla
Dueck, R. W.. 2023. Developing a Progressive Chunking Technique in Music Literacy: An Analysis of the Method Books and Sight‑Reading Materials for Beginner Piano Students. [Master’s thesis, Université d’Ottawa]. / University of Ottawa. hdl.handle.net/10393/45642
Ettlinger, M., Margulis, E. H., & Wong, P.C.M. (2011). Implicit memory in music and language. Frontiers in Psychology, 2, 211. https://doi.org/10.3389/fpsyg.2011.00211
Fuentes, D. (2025). A brief introduction to melodic figuration theory. Figuring Out Melody. Retrieved July 24, 2025. https://figuringoutmelody.com/melodic-figuration-theory.pdf
Fuentes, D. (2020). The building blocks of melody. Figuring Out Melody. Retrieved July 24, 2025. https://figuringoutmelody.com/the-building-blocks-of-melody/
Godøy, R. I., Jensenius, A. R. and Nymoen K. 2010. “Chunking in Music by Coarticulation.” Acta Acustica united with Acustica 96 (4): 690–700. https://www.duo.uio.no/bitstream/handle/10852/36826/Acoustica-Godoy.pdf
Knösche, T. R., Neuhaus, C., Haueisen, J., Alter, K., Maess, B., Witte, O. W., & Friederici, A. D. (2005). Perception of phrase structure in music. Human Brain Mapping, 24(4), 259–273. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6871182/
Larson, Steve. Musical Forces: Motion, Metaphor, and Meaning in Music. Bloomington: Indiana University Press, 2012, 82–83.
Maddocks, R. 2010. “Effective Memorization In Music.” Getting There for Musicians (blog), July 2010. Retrieved July 24, 2025. https://intenseproductions.blogspot.com/2010/07/effective-memorization-in-music.html
Miller, G. A. (1956). The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychological Review, 63(2), 81–97. https://psychclassics.yorku.ca/Miller/
Neuhaus, C. 2013. “Processing Musical Form – Behavioral and Neurocognitive Approaches.” Musicae Scientiae 17 (1): 109–127. https://journals.sagepub.com/doi/abs/10.1177/1029864912468998
Nguyen, B. K., & Yildirim, I. (2024). An intuitive physics approach to modeling melodic expectation. In Proceedings of the 46th Annual Meeting of the Cognitive Science Society. UC Merced. https://escholarship.org/content/qt8ds971xf/qt8ds971xf.pdf
Ochwani, J. (2025, May 2). Chunking: Learn its psychology and examples. Octet Design Solutions Inc. Updated May 2, 2025. Retrieved July 24, 2025, from https://octet.design/journal/chunking/
Pike, P. D. (2012). Sight-reading strategies for the beginning and intermediate piano student: A fresh look at a familiar topic. American Music Teacher. Retrieved July 24, 2025. https://test.mtna.org/downloads/Informed/AMT/AMT%20AOY/2013AOY.pdf
Rothfarb, Lee A. Ernst Kurth: Selected Writings. Cambridge: Cambridge University Press, 1991, 37–41.
Sloboda, J. A. (1985). The Musical Mind: The Cognitive Psychology of Music. Oxford University Press. https://archive.org/details/musicalmindcogni00slob
Smith, B. (1988). Gestalt theory: An essay in philosophy. In B. Smith (Ed.), Foundations of Gestalt Theory (pp. 11–81). Philosophia Verlag. http://ontology.buffalo.edu/smith/articles/gestalt.pdf
Song, S., & Cohen, L. (2014). Impact of conscious intent on chunking during motor learning. Learning & Memory, 21(9), 449–451. https://doi.org/10.1101/lm.035824.114
Tillmann, B., & McAdams, S. (2004). Implicit learning of musical timbre sequences: Statistical regularities confronted with acoustical (dis)similarities. Journal of Experimental Psychology: Learning, Memory, and Cognition, 30(5), 1131–1142. https://doi.org/10.1037/0278-7393.30.5.1131
Zangwill, N. (2021). Music and emotion. In E. N. Zalta (Ed.), The Stanford Encyclopedia of Philosophy (Winter 2021 ed.). Stanford University. https://plato.stanford.edu/archives/win2021/entries/music/
Zuckerkandl, Victor. Sound and Symbol: Music and the External World. Translated by Willard R. Trask. Princeton: Princeton University Press, 1956, 11–24.







































































































































































