Content area

Abstract

This paper presents the design and implementation of an automatic music transcription algorithm for piano audio, utilizing an optimized convolutional neural network with optimal parameters. In this study, we adopt the cepstral coefficient derived from cochlear filters, a method commonly used in speech signal processing, for extracting features from transformed musical audio. Conventional convolutional neural networks often rely on a universally shared convolutional kernel when processing piano audio, but this approach fails to account for the variations in information across different frequency bands. To address this, we select 24 Mel filters, each featuring a distinct center frequency ranging from 105 to 19,093 Hz, which aligns with the 44,100 Hz sampling rate of the converted music. This setup enables the system to effectively capture the key characteristics of piano audio signals across a wide frequency range, providing a solid frequency-domain foundation for the subsequent music transcription algorithms.

Details

1009240
Business indexing term
Title
Design and implementation of piano audio automatic music transcription algorithm based on convolutional neural network
Author
Li, Mengshan 1 

 Ningbo Open University, College of Geriatric Education, Ningbo, China 
Volume
2025
Issue
1
Pages
26
Publication year
2025
Publication date
Dec 2025
Publisher
Springer Nature B.V.
Place of publication
New York
Country of publication
Netherlands
ISSN
16874714
e-ISSN
16874722
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-07-01
Milestone dates
2025-06-16 (Registration); 2025-02-26 (Received); 2025-06-12 (Accepted)
Publication history
 
 
   First posting date
01 Jul 2025
ProQuest document ID
3226009607
Document URL
https://www.proquest.com/scholarly-journals/design-implementation-piano-audio-automatic-music/docview/3226009607/se-2?accountid=208611
Copyright
Copyright Springer Nature B.V. Dec 2025
Last updated
2025-07-02
Database
ProQuest One Academic