top of page
Writer's picturetoinfinityandbeyondmathclub

Musical Analysis Conforming To The Normal Distribution

Introduction

 

In music, note durations and inter-onset intervals (IOIs) are essential parameters that affect the rhythm and the flow of a composition. Understanding their distribution helps in uncovering underlying patterns in numerous musical performances. This paper explores the hypothesis that these parameters conform to a normal distribution, which utilises a

bell-shaped curve and is commonly used in statistics to model data that clusters around a mean value with symmetrical dispersion.


Note durations and IOIs are often expected to follow a normal distribution due to the natural variance in human performance and the general structure of musical compositions. This understanding can help us analyse variations in these parameters and how they reflect performance styles and compositional trends. Our research aims to offer insights into the probabilistic nature of musical expression and structure promoting musical articulation and creativity among artists


Key terms

 

-        Note Durations: The length of time for which a note is played is called its note duration (“Note Duration”)

-        Inter onset intervals(IOIs): Inter-onset intervals (IOIs) are the intervals between onsets of adjacent notes and provide a measure of tempo (“Time point”)


Research Methodology

1.     Data Sampling: To ensure an unbiased sample, a diverse set of 100 musical pieces were chosen from each genre. For each genre, we assigned a unique number from 1-100 to each musical composition. Then using a random number generator, one song was chosen. The process was repeated for each of the five genres we selected including classical, jazz, pop, rock and Hindustani Classical to ensure a broad analysis of rhythmic patterns. This variety allows us to capture a wide range of rhythmic practices and styles.

2.     Data Collection and Formatting: We first obtained a MIDI file of each song, then using MuseScore Studio, an open-source music notation software, we converted the MIDI file to an XML file type. An example of a file converted to the XML file format is displayed below:



3.     Data Cleaning and Formatting: Before using the extracted data for note durations collected from different musical compositions, we ensured that no anomalies were present and the note duration data was accurate. To accomplish this, we generated the note durations from MuseScore several times for each individual composition, scanning for any missing values or gaps present and comparing each outcome obtained from a composition to verify that they are identical.

Then using an algorithm, we obtained accurate note duration data for the song. The algorithm for the same is shown below:




The data was then formatted to fit the syntax of the second algorithm that plotted the histogram, normal distribution curve and q-q plot for the data. This process was repeated for each song. A part of the algorithm for the same can be found below:




4.     Pattern Identification and analysis: Using the graphs generated by the algorithm, we analysed how closely musical rhythms conform to statistical norms, specifically the normal distribution and what certain deviations might indicate about the musical style or composer's intent. Furthermore, we used graphs of commonly found IOIs of each of the genres to study the impact of the same on musical composition and performance.


Mathematical Working

1.     Normal Distribution Theory: the normal distribution, also known as the Gaussian distribution or bell curve, is one of the most important continuous probability distributions in statistics. It is used to describe data that clusters around a central mean value with a symmetrical dispersion. (“Normal Distribution (Definition, Formula, Table, Curve, Properties & Examples)”). Using the functions in the algorithm given above we automatically calculated mean, standard deviation and the probability density function (PDF) of a normal distribution but the following can be calculated through:

Mean can be calculated using the formula:






Where,

 

●              xᵢ is the variable

●              μ the population mean

●              N is the total number of values in the population


 

is the standard deviation

 

Standard deviation can be calculated using the formula:

 







Where,

 

●              xᵢ is the variable

●              μ the mean

●              N is the total number of values in the population

is the sum of the squared deviations of each value from the population mean.





The probability density function (PDF) of a normal distribution is given by:





Where,

 

●              x is the variable

●              μ the mean

●              σ is the standard deviation

 

 

2.     Histograms: A histogram is a visual representation of the distribution of quantitative data. The term was first introduced by Karl Pearson. (“Histogram”). Using the procedure clearly labelled and indicated under the histogram part of the algorithm, the histograms for the data were generated.




3.     Quantile-Quantile (Q-Q) plots: Quantile-Quantile (Q-Q) plots are used to assess whether or not a set of data potentially came from some theoretical distribution. In most cases, this type of plot is used to determine whether or not a set of data follows a normal distribution. (Bobbitt)


Using the procedure clearly labelled and indicated under the histogram part of the algorithm, the Q-Q plots were generated.



Analysis

We Will Rock You(“We Will Rock You”) —> Rock Genre




Raag Bhairav(“Raag Bhairav”) —> Hindustani Classical Genre



 


Songbird(“Songbird”) —> Jazz Genre


 

 

 

Beethoven’s Symphony No. 5 in C Minor, Op. 67, Part I —> Classical Genre




 

Blinding Lights(“Blinding Lights”) —> Pop Genre



 


Conclusion

By analysing various musical performances and compositions, we aimed to understand the underlying patterns and variations in these parameters, shedding light on the probabilistic nature of musical expression. Our findings suggest that while note durations and IOIs generally exhibit a tendency towards a normal distribution, there are noteworthy deviations which could have been influenced by factors such as the performance style of the human, genre, and the musician’s intent. For example, using the graphs, we can determine the median of both note durations and inter-onset intervals for the five selected genres. The highest median for note durations has been displayed by the song “Songbird”, which belongs to the jazz genre, giving insights into the probable note duration of the genre.

The normal distribution serves as a useful model for understanding general trends and variances in musical timing, however real-world data reveals additional complexities that may be due to human variability and artistic expression. This research highlights the importance of considering both statistical models and individual performance characteristics when analysing musical rhythm and flow, contributing to our understanding of musical articulation and creativity.



Implications

 

-        Performance Analysis: Understanding the normal distribution of note durations and IOIs can help musicians and composers refine their techniques and achieve desired expressive effects.

-        Composition: Composers can use insights from statistical analysis to craft rhythmic patterns and structures that align with or deviate from traditional norms.


-        This research may also be used to improve theoretical models on music composition or by AI music generating models to enhance music composition by improving existing machine learning algorithms on music analysis and aid the upcoming AI models in generating music.


Evaluation

After a thorough assessment of our research paper and the approach to it, we arrived at the following judgement, identifying the merits and shortcomings of our research as well as future prospects for research.


Strengths

-        This study uses a number of reliable statistical techniques, such as histograms, Q-Q plots, and normal distribution theory. All of these methods work together to ensure dependable and understandable results.

-        Data cleaning which was a part of the research methodology ensures maximum accuracy of note duration as well as inter-onset intervals (IOIs) values.

-        The use of a fair sampling method using number assignment followed by random number generation aims to reduce bias in the study, providing reliable results.


Limitation

-  The study's reliance on specific musical genres and data sources may limit the generalizability of the findings.

-  Further research could explore additional probability distributions or analyse a broader range of musical styles.

-  A larger sample size could have been used to arrive at more accurate results.


Future Research

-  The use of other probability distributions to musical data can be researched.

-  We could explore the impact of different musical genres and cultures on the distribution of rhythmic parameters, how they are affected by genre, culture, region.



Citations

 

Queen. "We Will Rock You." News of the World, EMI Records, 1977.


The Weeknd. "Blinding Lights." After Hours, XO and Republic Records, 2020. Fleetwood Mac. "Songbird." Rumours, Warner Bros. Records, 1977.

Ustad Rashid Khan. "Raag Bhairav." Morning Ragas, Sony Music India, 1994.


Bobbitt, Zach. “The Complete Guide: How to Interpret Q-Q Plots.” Statology, 19 January 2024, https://www.statology.org/qq-plot-interpretation/. Accessed 4 August 2024.

“Histogram.” Wikipediahttps://en.wikipedia.org/wiki/Histogram. Accessed 4 August 2024. “Normal Distribution (Definition, Formula, Table, Curve, Properties & Examples).” BYJU'S,


By Satpreet Singh Chadha and Dhruv Tahiliani

3 views0 comments

Recent Posts

See All

Comments


bottom of page