In this study we introduce a symbolic dataset composed of non-metric Iranian classical music, and algorithms for structural parsing of this music, and generation of variations.
The corpus comprises MIDI files and data sheets of Dastgah Shour from Radif Mirza Abdollah, the foundational repertoire of Iranian classical music.
Furthermore, we apply our previously-introduced algorithm for parsing melodic structure [1]to the dataset. Unlike much Western music, this type of non-metric music does not follow bar-centric organisation. The non-metric organisation can be captured well by our parsing algorithm. We parse each tune (Gusheh) into a grammar to identify motifs and phrases. These grammar representations can be useful for educational and ethnomusicological purposes.
We also further develop a previously-introduced method of creating melodic variations [1]. After parsing an existing tune to produce a grammar, by applying mutations to this grammar, we generate a new grammar. Expanding this new version yields a variation of the original tune. Variations are assessed by a domain-expert listener. Additionally, we conduct a statistical analysis of mutation with different representation setups for our parsing and generation algorithms. The overarching conclusion is that the system successfully produces acceptable variations post-mutation. While our case study focuses on Iranian classical music, the methodology can be adapted for Arabic or Turkish classical music.
A substantial portion of the classical music culture in Iran, Turkey, Arabic regions, and parts of India is characterized by non-metric or free rhythm music. Unlike Alap in Indian music or Taqsim in Arab and Makam in Turkish music, the main body of Iranian classical music, Avaz, is non-metric [2][3]. This distinctive feature prompted us to focus on Iranian classical music as a case study for our exploration of musical structure and structural mutation in non-metric musical cultures. There exists a structured repertoire within this musical tradition called Radif, which serves as a pedagogical framework for learning improvisation through various Dastgahs (musical systems) and Gushehs (melodies within these systems). The pieces in Radif are divided to 12 subsets: 7 Dastgahs and 5 Avazes, each comprising numerous smaller music pieces referred to as Gushehs. The term Avaz has three connotations: broadly, it signifies singing; it denotes the segments of Radif smaller than a Dastgah; and specifically, it refers to the non-metric Iranian music style. In this study, our focus is on the latter meaning of Avaz, excluding its associations with singing or smaller Dastgahs.
In this study we introduce the first symbolic dataset of such music.
This type of music necessitates a flexible micro-level analysis rather than a bar-centred analysis which can be suitable for Western compositions [4]. Avaz generally avoids the formal structures (such as Sonata, Theme and Variation, or Rondo Forms) prevalent in Western music [5], often initiating with a musical idea that is developed sequentially without adhering to predefined forms.
Gushehs in Iranian classical music are divided into three types: modal, melodic, and rhythmic. Melodic Gushehs, like Hazin, and rhythmic ones, such as Kereshmeh, have distinct melodies or rhythms that are fixed and cannot be altered. This means variations on these Gushehs may not be acceptable as examples of the same Gusheh.
Radif has been taught by various artists over the last century. The most renowned version is Radif Mirza Abdollah. We’ve chosen to use a recent edition of this Radif by Dariush Talai [6] for our study because it includes both a transcription and an explanation of the music’s hierarchical structure.
Representing this music accurately involves tackling two main challenges: non-metric rhythm and micro-tonal pitch. These challenges are addressed in Sec. 3.
The sequential melodic data for each piece in the collection is parsed by a previously-introduced algorithm 3, extended to be suitable for Iranian classical music. The result is a grammar which represents the hierarchical repetition structure of the piece, which is potentially valuable for educational and ethnomusicological research.
We then also applied mutations to these grammars, in order to generate variations of the piece which respect overall structural properties, such as complexity and reuse. It preserves the hierarchical structure while introducing significant variations compared to the original tune.
We are not looking to generate new music but rather variations. This approach to generating variations is useful and exciting for musicians, researchers, and listeners. For instance, a musician with several variations of a Gusheh can improve their improvisation skills in this style of music. Researchers can benefit when they need several versions of a Gusheh but lack enough real recorded samples. Finally, listeners can use the algorithm to obtain variations of the tunes they like, providing them with new versions of their favorite music. Additionally, this avenue of study provides some evidence that our approach to representing hierarchical structure is valid.
Above, we created variations at the Gusheh (piece) level. Playing all of the Gushehs in order simulates a Dastgah performance, which is traditionally taught to musicians. The next step for a music student is to create variations of a Dastgah for their own performances. This gives us the motivation to also try our mutations on the entire Dastgah to observe the variation results.
In [5] Farhat has a wide research on form and style in Iranian classical music. He traces the classical form back to ancient times, noting its separation from the broader Islamic musical tradition in the 16th century. This shift gave Persian music its unique character, despite retaining basic similarities with Turkish and Arabic music. He outlines the Dastgah system and modal structures [5][7] and compares form in western music and Iranian music. He believes Persian music’s expressiveness and flexibility allow it to be shaped by the individual tastes and temperaments of its performers.
Several authors have discussed Iranian non-metric music. Tsuge explains the concept of non-metric and how it is more important in Iranian music than others [2]. He notes that the rhythmic organization of Avaz primarily relies on the poetic meter system, with a recurring cycle of short and long syllables forming the basis of its rhythmic structure. This structure is deeply bound up with the nature of the Persian (Farsi) language and its classical versification system, which plays a significant role in how the melody is formed and perceived. In [8] the authors introduce the main melodic figures that are usually used in non-metric Avaz music.
Laudan Nooshin broadly discussed improvisation in Iranian and Indian music [9]. Her work on improvisation is mostly constructed on how improvisation is shaped in non-metric Iranian music based on Radif by different prominent performers. In another place she explores what creativity means in this music [10]. Jean During has done substantial work in Middle Eastern music [11][12][13] and his role is very important as the first published transcription of Radif Mirza Abdollah was by him. However we used [6] by Dariush Talai for our work because he provided a more understandable explanation of Radif for readers.
At the start of the study we tried to find a symbolic corpus that covers middle eastern classical music but we didn’t find any. Turkish Makams have a symbolic corpus called SymbTr [14]. According to our knowledge there is no symbolic corpus for Iranian and Arabic music. However there were some audio datasets for these types of music like Dunya corpus which includes Turkish Makam, Carnatic, Hindustani, Beijing Opera, Arab-Andalusian. Its Turkish part is connected to our research area [15]. KUG Dastgāhi [16] and [17] are two audio corpora for Iranian music. Nava [18] is an audio dataset for Iranian instrument recognition task and Ar-MGC: Arabic Music Genre Classification Dataset [19]. Therefore, we have decided to create our own dataset representing a subset (Dastgah Shour) of the repertoire (Radif), for use in our algorithms and for use by others. In the future, we plan to expand the dataset.
Hakkı Parlak and Kösemen discuss [20] an approach to automatic music generation for Turkish Makams using random numbers. The system utilizes these random numbers to determine the notes, note lengths, tempo, instruments, and percussion for compositions within chosen Turkish Makams. While the system aims to generate music that can be completely random or conform to musical rules based on user preferences, it does not seek to replicate the works of past composers but rather to introduce new musical ideas and provide entertainment and inspiration for enthusiasts of Turkish Makam music.
Şentürk and Chordia [21] explore the application of Variable-Length Markov Models (VLMM) to predict melodies in the Uzun Hava form of Turkish folk music. This study introduces the first symbolic machine-readable database of Uzun Havas and represents the first attempt at predictive modeling in Turkish folk music. The research highlights the predictive power of VLMMs in modeling non-Western and non-metric musical styles, suggesting their broader applicability beyond Western music traditions.
In the realm of non-metric music, Radif stands as the foundational repertoire of Iranian classical music. With numerous eminent performers having crafted their interpretations, various versions of Radif exist. However, the version by Mirza Abdollah has emerged as the predominant choice in music education throughout the current century. Among the various transcriptions of Mirza Abdollah’s Radif, we have chosen to work with “Radif Analysis - based on the notation of Mirza Abdollah’s Radif with annotated visual description” by Dariush Talai [6]. This particular edition offers a detailed hierarchical structure for each Gusheh, accompanied by a recorded performance that aligns with the musical scores and our corpus. Radif is traditionally divided into 12 subsets: Shour, Bayat-e-tork, Dashti, Abu-ata, Afshari, Segah, Nava, Homayoun, Bayat-e-Esfahan, Chahargah, Mahur, and Rast-Panjgah. Our corpus focuses on Shour, one of the primary and most extensive subsets, comprising 29 non-metric tunes (Gushehs). The total number of notes in the MIDI files is 7001, and it takes 1965.81 seconds to be played. The figure [fig:transcript] presents a transcription example from the book.
In our dataset, we represented each musical piece as a sequence of notes, where for each note we store note name, note duration, MIDI note number, interval, and pitch bending, as illustrated in Figure 1.
Notes are denoted by the symbols
While these intervals suggest a chromatic scale with 24 quarter notes in one octave, all of which may be used in contemporary compositions, Iranian classical instruments typically play only 18 of these notes: C, Db, Dk, D, Eb, Ek, E, F, Fs, F
We represent quarter-tones using regular MIDI note numbers by performing an adjustment on them. Koron and Sori are two types of quarter-tones, respectively, a quarter lower and a quarter higher than the natural note. To represent Koron, we use a semitone (flat) lower than the natural note, and for Sori, we use the same number as the natural note. Then, we employ pitch-bending to make Koron and Sori distinguishable from flat and natural notes. The bend column is used to represent micro-tonal sounds in MIDI outputs by setting the bend for micro-tones at 2048 units: a flat note is transformed into the note koron variant, and a natural note is altered to its sori. This value is 0 for standard MIDI tones. In our algorithms, we represent pitches using just MIDI pitch numbers. This is sufficient because, similar to Western diatonic music, no Gusheh will include more than one altered note of the same scale degree, e.g. if D is present then Ds will not be present.
The lowest note in the first Gusheh is the start note for the main octave. For example in the first tune of our corpus the lowest note is F3 so the main octave is from F3 to F4. The music in the main octave is represented only by its symbols and accidental signs (if any are present). For notes in octaves other than the main one, we use ‘+’ or ‘-’ followed by a number to indicate the number of octaves above or below the main octave. For example, an Ak from one octave higher than the main octave would be written as Ak+1.
The Intervals column quantifies the pitch difference between consecutive notes, where “1" signifies a quarter-tone.
Note durations are inexact in this form of music but can be categorized into four main types: very short, short, long, and very long, corresponding to sixteenth, eighth, quarter, and half notes [6], and are represented numerically as 1, 2, 4, and 8.
We also notated the hierarchical structure of the piece, which is given in the original printed source (see Figure [fig:transcript]). In our notation, brackets signify hierarchical relationships, giving a tree structure. An open bracket “[” marks the beginning of a node (a section or subsection), and each close bracket “]” signifies the end of that node. In each tune, the entire tune is enclosed between brackets, giving the root node. Every other pair of brackets represents a child node, and a child node can itself have other child nodes (brackets) that represent sections and subsections. Taking Fig. [fig:transcript] as an example, the abstract structure is [ [ ] [ ] [ [ ] ] ]
. The outer brackets frame the tune. The second open and close brackets contain the first section that includes the first three lines in Fig. [fig:transcript]. The third open/close brackets surround the section from line three to 6. The next open bracket is accompanied by another open bracket which means it has a subsection. That is for line 8 and nine and we can see the subsection at the last line highlighted. Due to the fact that these subsections are not repeated in the Gusheh, our algorithm (described below) does not detect them. However, this is not what we are seeking in this paper: our algorithm is focused on detecting repetition structure. Since some of these subsections are repeated in some other Gushehs, running the algorithm over the entire concatenated Radif will detect many of them. The hierarchical structure notation will be used in future research.
Based on MDL (Minimum Description Length) ideas, the shorter representation of an object captures its internal structure better [22][23][24]. In our previous work we operationalised this by representing sequences as grammars, as described in Section 4.2 [1]. Thus, we propose multiple representations for musical data motivated by finding a representation which gives short encoding. In this subsection we will see the integer/tuple, chromatic/diatonic and interval/pitch possibilities for the representations.
We opted to represent each note as a tuple, including the note’s pitch and its duration. In our previous work, we represented notes as integers (MIDI note numbers), because in Irish folk music most notes have the same duration.
Another critical decision in our approach is choosing between pitches or intervals for the representation. The initial preference for a pitch-based representation is that it tends to have a shorter encoding than those based on intervals for the Iranian music corpus. However, given the hypothesis that Iranian music frequently employs repetition with transposition, exploring interval-based approaches seemed promising.
The example shown in Figure 1 depicts a sequence of chromatic MIDI note numbers and their durations:
[(53,2), (55,2), (56,2), (58,2), (56,2), (55,2), (56,2), (55,2), (56,1), (56,2), (55,8)]
The unique set of pitches for the entire tune is {53, 55, 56, 58, 60}. When we consider these pitches as degrees within a diatonic modal framework, they can be mapped as {53:1, 55:2, 56:3, 58:4, 60:5}. Based on this mapping, the diatonic interval list for the sequence would be represented as:
[(0,2), (1,2), (1,2), (1,2), (-1,2), (-1,2), (1,2), (-1,2), (1,1), (0,2), (-1,8)]
where the first elements reflect the relative pitch change between notes according to their position within the modal framework. As there is no previous note there is no interval there for the first note so we dedicate 0 to the first tuple to make it possible to represent the duration (that is 2 here).
To capture the musical tune structures, we employ the concept of Pathway Assembly (PA) [25]. PA is a method that detects hierarchical repetition structure, in our case in sequences.
Given a sequence such as abracadabra
, the elementary units are identified as {a, b, c, d, r}
, and the PA unfolds by iterative concatenation as follows:
{a, b, c, d, r} -> ca -> ab -> ra -> cad
-> abra -> cadabra -> abracadabra
[25]
The PA Index (PAI) quantifies the minimum length of the PA required for any given object, with the PAI for ’abracadabra’ determined to be 7, as shown by the 7 concatenation steps, above.
To understand the structure of a piece of music and determine its PAI, we employ the Sequitur algorithm [26]. The algorithm identifies hierarchical patterns within data by iteratively substituting consecutive symbol pairs with a new non-terminal symbol. This method of redefining sequences enables a clear representation of the tune’s structure, illustrated through the example of abracadabra
below:
p0 -> p1 c a d p1
p1 -> p3 p2
p2 -> r a
p3 -> a b
Although Sequitur rules may combine several items and PA only permits binary concatenation, the two are equivalent as the rules’ right-hand sides can be segmented into multiple binary joins. This allows for the PAI calculation, where, for ’abracadabra’, the sum of joins within the rules equals 7. The algorithm translates a sequence into tune structures, enabling understanding musical patterns. We can also reverse this process later, after mutation of the grammar, to expand it and produce a sequence that represents our generated tune.
Fig. 2 is the grammar based on Fig. [fig:transcript]. Each note is represented as a tuple: pitch and duration.
In our previous work [1][27] we introduce 19 distinct types of mutations (Table1) that can be applied to the grammatical representation of a tune, encompassing operations such as addition, deletion, swapping, and inversion of grammar parts. These mutations are designed to ensure that the resultant musical variations retain the connections to the original piece, while introducing novel elements that enrich the musical experience. Fig. 3 is an example of how a single mutation can affect a tune.
Definition | Definition |
---|---|
Insert a primitive in an RHS | Delete a primitive from an RHS |
Move a primitive – in an RHS | Move a primitive – to another RHS |
Swap two primitives - in an RHS | Swap two primitives – between two RHSs |
Change a primitive in an RHS | |
Insert an existing rule to an RHS | Delete a rule from an RHS |
Move a rule - in an RHS | Move a rule – between two RHSs |
Swap two rules – in a single RHS | Swap two rules – between two RHSs |
Swap a rule and a primitive - in an RHS | Swap a rule and a primitive - in two RHSs |
Reverse an RHS | Reverse a sub-sequence of an RHS |
Swap two RHSs | |
Delete a rule from the grammar |
Mutation types - the abbreviation “RHS" stands for “right-hand side".
If we use intervals for the representation, in the expanding step, a starting point is needed to convert these intervals back into pitches, thus creating the variant tune. This approach draws from the understanding that mutations in intervals might drastically alter the tonality unless carefully managed. One approach could be to retain the first note of the original tune as the starting point and sequentially apply the mutated intervals to derive the subsequent pitches. However, this method results in variations that always begin with the same note, which is not typically the case in natural variations. In practice, variations crafted by musicians often share the same concluding note rather than the starting one. Therefore, There is another possibility to preserve the last note of the tune and run the Sequitur algorithm on the reversed sequence. To this end, we experimented with a interval backward generation setup, ensuring to preserve the tonality by fixing the last note of each sequence.
To reduce the potential for tonal deviation post-mutation, we adhered to the practice of maintaining the final note of each tune—a common anchor across different versions of a Gusheh—and then reconstructing the melody backwards from this note. This method, however, introduced a new challenge: ensuring the melody remains within the original modal framework. The term “modal framework" refers to the set of notes that are utilized within a particular section or Gusheh, defining its unique scale or mode [28][29]. Repair is applied when a note’s pitch falls outside the predefined modal framework’s range, defined by a unique list of pitches. We addressed the challenge by adjusting the generated intervals to match an ordered set of pitches present in the original tune, always selecting the closest pitch within the modal framework boundaries. This type of repair is called clamp.
There is another type of repair called mirror. If a generated note is beyond the highest or lowest pitch in this list, the mirroring method is employed to adjust the note back within the acceptable range. This adjustment is made by calculating the difference between the out-of-range note and the closest boundary (either the highest or lowest pitch). This difference is then applied in the opposite direction from the opposing boundary, effectively “mirroring” the note back into the modal framework. This is done at the diatonic level, so that the mirrored note is within the framework.
Figure 4 presents the potential setups that can be achieved by selecting one option from each category of the 5 discussed above. We selected 5 specific setups out of 72 potential setups. Differing from our prior work focused on Irish music, which employed a setup characterized as “pitches, chromatic, forward, note, non-repair”, our exploration into Iranian music evaluates five distinct setups: one pitch-based and four interval-based.
The pitch-based setup is categorized as “pitches, chromatic, forward, note + duration (tuple), non-repair”. The first of the interval-based setups is described as “intervals, chromatic, forward, note + duration (tuple) and clamp”. Second one, “intervals, chromatic, backward, tuple and clamp”. The third interval-based setup is characterized as “intervals, diatonic, backward, tuple and clamp”. Finally, the last interval-based setup repeats the configuration of the third but is distinguished by mirror repair instead of clamp.
As we mentioned in the introduction, middle eastern non-metric music necessitates a micro-level analysis rather than a bar-centred analysis which can be suitable for Western compositions. Figure [fig:trip] illustrates the topology of “Mother’s Delight," an Irish tune. In this topography, nodes represent the rules in the grammar for that piece of music, and the arrows branching from a node indicate other rules that are part of that rule’s definition. where ’0’ signifies the entire tune, and ‘1’ and ‘2’ mark two principal sections, comparable to what might be termed ‘A’ and ‘B’ sections in the Irish tradition. Such sectional divisions are common in Western music.
Conversely, Figure 6 depicts “Hazin”, one of the Gushehs from our Iranian corpus, where ‘0’ — the entire tune — interlinks directly with eight nodes, indicating the absence of distinct large sections. This structural distinction is crucial; we know from previous work that our grammatical mutation method produces results with structures more similar to that shown on the right. This shows us something important: when making new versions of tunes, we don’t need to worry about following strict, large-scale structures. Our method for creating new music works well because it doesn’t rely on these large structures either. It can make new and interesting versions of tunes, which is perfect for Avaz or other similar music styles. It suggests our algorithm is a good fit for creating new variations of these tunes.
We calculated the PAI for all tunes in the corpus, as depicted in Figure 7. Longer tunes tend to have higher PAI values. The PAI calculation was performed on 4 representations of each tune 1) the pitch, 2) the interval, 3) pitch and length tuples 4) interval and length tuples. Typically, PAIs based solely on pitch are lower compared to those incorporating intervals or length.
Figure 8 presents our findings from applying different metrics to the five setups previously outlined(pitches-chromatic-forward-tuple-non-repair (setup_1), intervals-chromatic-forward-tuple-clamp (setup_2), intervals-chromatic-backward-tuple-clamp (setup_3), intervals-diatonic-backward-tuple-clamp (setup_4) and intervals-diatonic-backward-tuple-mirror (setup_5)). This analysis encompasses all Gushehs in our corpus, with each undergoing 100 mutations, represented by individual thin lines. The thick line illustrates the aggregate average across all mutations. We focused on different metrics for this evaluation:
Length: Shows how the lengths of the Gushehs have been changed by mutations.
Edit Distance (ED): ED quantifies the changes a Gusheh undergoes after each mutation, comparing each version to the original. An initial ED of 0 indicates no change, as the original tune and itself are identical. A higher ED value per step signifies more substantial modifications, while a gradual increase in ED may be preferred for minor adjustments.
Pathway Assembly Index (PAI): PAI measures the complexity of the music from the original tune, where closeness to the original composition is considered advantageous. It is a common observation that music often occupies a “middle ground” or “goldilocks zone”: it should not be too simple, nor too complex [30]. The ideal complexity can be estimated by the distribution of complexity values observed in the original corpus. Therefore, if our mutations are successful, they should give results in the same range.
Normalized PAI: Given that longer tunes naturally exhibit higher PAI values, we normalized these figures by dividing PAI by the tune length (PAI/L) to compare tunes of varying lengths on an equal footing.
Upon reviewing the results displayed in Figure 8, derived from the examination of all Gushehs in our corpus across the five setups, we observe the following trends based on 100 mutations per Gusheh:
Keeping the length around the original length is desirable, and none of the setups have a significant change in their length. The PAI values across all setups did not undergo large changes. PAI/L exhibits interesting behavior across the setups. For setup_1, it gradually increases, indicating a steady rise in complexity relative to length. Setup_2 had a drop on PAI/L and then started to slightly increase over mutations. In the case of setup_3, there is an initial increase followed by a decrease. Both setup_4 and setup_5 show an increase followed by stability, indicating that after reaching a certain level of complexity, the mutations no longer significantly alter the tune’s complexity relative to its length. ED changing behavior is generally similar across all setups. There is a very fast increase in the early mutations and then a slow increase.
In summary, setup_1 and setup_2 are more consistent in maintaining the original length of tunes. setup_1’s PAI values remain closer to the original, suggesting a lesser degree of complexity change. While the interval-based versions reach higher ED values sooner, it does not necessarily mean they offer the desired change more quickly. These setups should be audibly evaluated to determine if the tunes remain recognizable after mutations. The PAI/L metric suggests that all setups maintain the complexity of the music well except setup_2, indicating that each possesses valuable features for mutation.
In conducting the analysis presented within this section, it is important to comment the viewpoint brought by the first author, who is a classically-trained Iranian musician. While the insights derived are deeply informed, they are subjective and influenced by the individual’s specific training. We are in the process of designing more objective listening tests for future work.
Our evaluation involved listening to variations of Gushehs after 5, 10, 15, and 20 mutations across the entire corpus, utilizing the five setups introduced earlier. We concentrated on mid-sized Gushehs, avoiding extremely long or short pieces.
Early Mutations (5 and 10): After 5 mutations, pitch-based variations remained closely recognizable compared to their original counterparts, with changes being noticeable yet coherent. The interval-based versions, while sometimes close to the original tune, often resulted in more pronounced alterations. By the 10th mutation, pitch-based adaptations provided a musically suitable (based on the author’s experience) new version of the Gushehs, while maintaining the essence of Gushehs. However, jumps that are not regular in this style of music were observed, attributed to a broader modal framework in some tunes, which is less typical in traditional Iranian classical music. Interval-based versions at this stage diverged more significantly, offering a distinctly different interpretation of the original Gusheh.
Mid to Late Mutations (15 and 20): The transition through the 15th mutation did not introduce major changes, serving more as an intermediate step. Upon reaching 20 mutations, pitch-based versions evolved into distinctly different interpretations, suggesting potential avenues for novel improvisation. Yet, they retained musical integrity, suggesting creativity within bounds. Conversely, 20 mutations in interval-based setups often resulted in confusion, with changes deemed excessive for considering them as mere variations. Among the interval-based approaches, the mirror repair setup frequently yielded the most musically pleasing outcomes, suggesting its effectiveness in preserving musicality while introducing novelty.
Pitch vs. Interval-Based Mutations: Pitch-based mutations generally led to more musically coherent variations, possibly due to their ability to retain recognizable patterns. Interval-based mutations, while maintaining patterns, shifted them in ways that could start from unexpected notes, leading to confusion regarding the original tune’s essence.
Special Note on Alterations: Mutations involving altered notes, such as altered notes in Salmak, occasionally produced dissonant sounds. That happens notably on the fifth degree of the tonal centre, which in our dataset is D. Bringing D, then some other notes, and then Dk is common in Iranian classical music, but the reverse order is not and results in a dissonant interval. Such alterations clashed with the traditional rules of Iranian classical music, highlighting areas where the algorithm might benefit from incorporating musical context and rules to avoid unintended dissonance.
In summary, our listening experience revealed that pitch-based approach generally offered more suitable variations.
One of the main objectives when a musician plays the entire concatenated Dastgah is to preserve Seyr, or melodic movement [28]. In the context of traditional Iranian music, Seyr or“melodic movement” encapsulates the progression of melodies within a piece, guiding the overall direction of the melody through its initiation, development, climax, and resolution. Observing the order of introducing frameworks is one of the main features of Seyr. Seyr emphasizes the importance of transitional notes and melodic phrases in establishing the identity and modal character of the piece. These elements are crucial in directing the melodic flow from one segment to another, ensuring a coherent and expressive musical journey. Here, we carry out a new experiment on the concatenated Dastgah.
In our corpus, PAI is 1982 for the sum of individual PAIs, while that is 1406 for considering the Dastgah as one piece of music. This is a common principle in MDL. If there are any commonalities across the corpus, the compressed description of the concatenated corpus will be less than the sum of compressed descriptions of the individual parts. The early parts of the piece can provide a “dictionary” for later parts, which allows savings. In the first row of Fig. 9 we can see the entire Dastgah, each Gusheh is shown with yellow highlights. The trend of fluctuation in this row shows us the Seyr for the original version.
By comparing MIDI files shapes in Fig. 9 it can be seen how in these five variations the Seyr has been well-preserved compared to the original one. In fact, each piece usually stays in the same modal framework, even after many mutations. When a Gusheh changes the previous modal framework, the introduced framework remains prominent in the new version. As a result, the overall trend (Seyr) remains similar. PAI for these variations respectively are 1555, 1517, 1555, 1557, 1572 which shows around 8 to 11 percent increase in PAI after 100 mutations. Together with our results in the previous subsection, this suggests that our mutation methods have a slight bias towards increasing complexity and/or length.
The parsing method offers potential for educational and (ethno)musicological applications. The traditional process of teaching Radif emphasizes learning by listening, where teachers break down Gushehs into small, manageable pieces for students to easily grasp and replicate. This approach relies heavily on repetitive segments, which, as shown in Figure [fig:transcript], the algorithm in most cases accurately identifies. We highlight only the initial occurrence of each segment, but the tune’s grammar reveals their repetition and structure [fig:Grammar]. Such grammatical analysis could be invaluable in educational settings, aiding students in learning non-metric music through its conventional methods.
A crucial aspect of Radif performance is recognizing significant motifs within the repertoire. Ali Jariteh’s manual analysis of the entire Radif [31] to identify these motifs or phrases is an attempt to find and introduce these motifs and phrases. The algorithm can help with this kind of research, for instance comparing these kind of motifs in two different versions of Radif, as the algorithm is a tool to discover these patterns. While Figure [fig:transcript] displays motifs identified by the algorithm within a single Gusheh, running the algorithm across an entire Dastgah uncovers longer phrases repeated across multiple Gushehs. An example is a cadence phrase in line 8, echoing similar phrases in other tunes, or the sequence in lines 4, 5, and 6, which resembles the first tune in our corpus (Daramad). By detecting these similarities, the algorithm simplifies understanding the connections and relationships between Gushehs.
In this work, we introduced a symbolic corpus for Iranian classical non-metric music. Furthermore, we explored different representations for this style of music and adapted the PA algorithm to the corpus to find its underlying hierarchical structure. We also used the algorithm to generate musical variations based on five different setups. After conducting auditory and statistical analyses, it turns out that pitch-based variations become more musically suitable after about 10 mutations. Future work will focus on developing a system to identify the most suitable mix of setups for a specific tune, aiming to generate variations. We have released all of our data, but in future we will expand our dataset to include the entire Radif book.
This work was conducted with the financial support of the Science Foundation Ireland Centre for Research Training in Digitally-Enhanced Reality (d-real) under Grant No. 18/CRT/6224.
This study centers on the exploration of algorithmic variations in non-metric Iranian classical music. Our work uses publicly available music corpora and does not involve human participants directly. The music collection is part of the Iranian classical music tradition, which is not itself owned or copyrighted by any individual composer. The edition we have used is used by permission of the author.
This musical tradition is culturally important, and we have tried to be respectful and careful in how we talk about it and use it.
The authors declare no conflicts of interest.