Académique Documents
Professionnel Documents
Culture Documents
RSUM
Cet article propose dexplorer un work in progress
2. STRUCTURAL OVERVIEW
cherchant dvelopper une dmarche de composition The algorithmic structure may be divided into three dis-
assiste par ordinateur (CAO) axe sur llaboration de tinct processing stages. The first one, including the signal
structures musicales au moyen dune mthode de classifi- modelling, the audio features extraction and the temporal
cation audio non supervise et de certains algorithmes de modelling, focuses on data extraction. The second stage,
parcours de graphe. Certaines parties de cette ide ont including the features space analysis, the calculation of a
dj t tudies, notamment dans le cadre de la synthse distance threshold and the audio clustering itself, focuses
concatnative par corpus, de la reconnaissance des genres on data analysis. The third one, including the clusters
musicaux ou de lorchestration assiste par ordinateur space analysis and the graph exploration, focuses on data
pour en nommer quelques-unes, mais le dfi reste de sorting.
trouver une manire dintgrer ces techniques informa-
tiques dans le processus compositionnel, non pas pour
gnrer du matriau sonore mais plutt pour lexplorer,
pour lanalyser et pour mieux le comprendre en prpara-
tion dune uvre musicale. Contrairement aux dmarches
traditionnelles de CAO, principalement axes sur des
mthodes gnratives, la suivante propose une approche
analytique du matriau sonore axe sur llaboration de
diffrents types de structures musicales afin de stimuler le
processus cratif. Cet article propose donc de dissquer la
structure algorithmique afin dexposer la mthodologie,
dclairer certaines problmatiques inhrentes, dexposer
les limites dune telle approche et finalement de prsenter
quelques ides pour dautres applications. Figure 1. Structural overview
, = ( )) ,- ( ) (1)
7
That is because the overlaps allow a single element to belong to multi-
8
ple clusters at once, thus making the clusters more difficult to distin- As for the previous single-layer relational space, this information is
guish. essential to outline the resulting network.
Similar to the temporal modelling, each coefficient of a 5.1.3 Inter-cluster analysis
multidimensional feature is considered as an individual
Similar to the features-space analysis described earlier,
time series and is treated consequently. Thus, each coef-
the inter-cluster analysis consists of measuring the dis-
ficient represents a different data point calculated as the
tance (magnitude) between each clusters barycentre. For
mean of the means in the resulting vectors.
that, the Mahalanobis distance (1) also appears to be the
5.1.2 Audio clustering (Neat clustering) most adequate measurement in order to process this type
of data structure. Then, as for the single-layer relational
At this stage, the user may decide to reformulate the
space, a distance matrix is built from the previous calcu-
clustering in a way to remove the fuzzy components, the
lations in order to outline the underlying network.10
sounds that are assigned to multiple clusters, in order to
obtain a neat clustering where each sound belongs to a Although this could be enough to unravel the network, it
single cluster. That is done by measuring the distance appeared to be important to add a second element in the
(magnitude) between the sounds and the centre of their process with respect to the fuzzy clustering approach.
assigned clusters in order to identify the closest and make That is to use the Jaccard distance [28]. While the Ma-
it the final assignation.9 For that, each sound has to be halanobis distance informs a metric distance between two
modelled similarly to the previous intra-cluster modelling theoretical barycentre, the latter informs a similarity level
method in order to assess the distance to its clusters bary- (distance interpretable) based on the ratio of two given
centre. Since a sound is a single element, the difference is clusters intersection (shared components) and their union
that there is no need to average the means of each audio (combined components).11 In this case, the Mahalanobis
feature. distance [0. +inf.] is simply weighted (multiplied) by the
Jaccard distance [0. 1.].
qs , qs
, = (8)
qs