Content area

Abstract

While audio data play an increasingly central role in computer-based music production, interaction with large sound collections in most available music creation and production environments is very often still limited to scrolling long lists of file names. This paper describes a general framework for devising interactive applications based on the content-based visualization of sound collections. The proposed framework allows for a modular combination of different techniques for sound segmentation, analysis, and dimensionality reduction, using the reduced feature space for interactive applications. We analyze several prototypes presented in the literature and describe their limitations. We propose a more general framework that can be used flexibly to devise music creation interfaces. The proposed approach includes several novel contributions with respect to previously used pipelines, such as using unsupervised feature learning, content-based sound icons, and control of the output space layout. We present an implementation of the framework using the SuperCollider computer music language, and three example prototypes demonstrating its use for data-driven music interfaces. Our results demonstrate the potential of unsupervised machine learning and visualization for creative applications in computer music.

Details

1009240
Business indexing term
Title
A General Framework for Visualization of Sound Collections in Musical Interfaces
Author
Roma, Gerard 1   VIAFID ORCID Logo  ; Xambó, Anna 2   VIAFID ORCID Logo  ; Green, Owen 1   VIAFID ORCID Logo  ; Tremblay, Pierre Alexandre 1   VIAFID ORCID Logo 

 Centre for Research into New Music (CeReNeM), University of Huddersfield, Huddersfield HD1 3DH, UK; [email protected] (O.G.); [email protected] (P.A.T.) 
 Music, Technology and Innovation (MTI2), De Montfort University, Leicester LE1 9BH, UK; [email protected] 
Publication title
Volume
11
Issue
24
First page
11926
Publication year
2021
Publication date
2021
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2021-12-15
Milestone dates
2021-11-12 (Received); 2021-12-08 (Accepted)
Publication history
 
 
   First posting date
15 Dec 2021
ProQuest document ID
2612738872
Document URL
https://www.proquest.com/scholarly-journals/general-framework-visualization-sound-collections/docview/2612738872/se-2?accountid=208611
Copyright
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-05-05
Database
ProQuest One Academic