Contents
Video presentation at ISMIR 2020
(top)
Example audio from a section in a concert
Audio source: https://musicbrainz.org/recording/178b4cf6-88e6-414d-bfbd-3d90bb368a9a
(top)
Examples of different surface tempo multiples
Each figure below is a spectrogram of an 8-second example (Frequency in Hz on y-axis and time in seconds on x-axis).
A dashed box (in white) of width 2.5 seconds is shown on each spectrogram to highlight the differences in onset densities between the examples. The m.t. in all the examples is between 50 and 60 BPM, so in each case, the number of strokes / syllables we expect to see within the dashed rectangle is roughly equal to twice the s.t.m. value. The inset in figure (d) of width 1 second highlights the distinct nature of vocalisation at s.t.m 8.
(top)
YouTube playlist of some select Dhrupad vocal music
(The last couple of videos are lecture demonstrations which offer a detailed explanation of the music form)
(top)
Some other model architectures that were experimented with
(top)
Musical description of the bandish sections
(top)
References
- M. Clayton, Time in Indian Music: Rhythm, Metre, and Form in North Indian Rāg Performance. Oxford, England: Oxford University Press, 2000.
- M. A. Rohit and P. Rao, “Structure and automatic segmentation of Dhrupad vocal bandish audio,” Unpublished technical report, arXiv:2008.00756 [eess.AS],2020.
(top)