Content area

Abstract

Advancements in live audio processing, specifically in sound classification and audio captioning technologies, have widespread applications ranging from surveillance to accessibility services. However, traditional methods encounter scalability and energy efficiency challenges. To overcome these, Triboelectric Nanogenerators (TENG) are explored for energy harvesting, particularly in live‐streaming sound monitoring systems. This study introduces a sustainable methodology integrating TENG‐based sensors into live sound monitoring pipelines, enhancing energy‐efficient sound classification and captioning by model selection and fine‐tuning strategies. Our cost‐effective TENG sensor harvests ambient sound vibrations and background noise, producing up to 1.2 µW cm−2 output power and successfully charging capacitors. This shows its capability for sustainable energy harvesting. The system achieves 94.3% classification accuracy using the Hierarchical Token Semantic Audio Transformer (HTS‐AT) model identified as optimal for live sound event monitoring. Additionally, continuous audio captioning using the EnCodec Combining Neural Audio Codec and Audio‐Text Joint Embedding for Automated Audio Captioning model (EnCLAP) showcases rapid and precise processing capabilities that are suitable for live‐streaming environments. The Bidirectional Encoder representation from the Audio Transformers (BEATs) model also demonstrated exceptional performance, achieving an accuracy of 97.25%. These models were fine‐tuned using the TENG‐recorded ESC‐50 dataset, ensuring the system's adaptability to diverse sound conditions. Overall, this research significantly contributes to the development of energy‐efficient sound monitoring systems with wide‐ranging implications across various sectors.

Details

1009240
Business indexing term
Title
Machine Learning‐Enabled Triboelectric Nanogenerator for Continuous Sound Monitoring and Captioning
Author
Bagheri, Majid Haji 1   VIAFID ORCID Logo  ; Gu, Emma 1 ; Khan, Asif Abdullah 1 ; Zhang, Yanguang 2   VIAFID ORCID Logo  ; Xiao, Gaozhi 2   VIAFID ORCID Logo  ; Nankali, Mohammad 3   VIAFID ORCID Logo  ; Peng, Peng 3 ; Xi, Pengcheng 4 ; Ban, Dayan 1   VIAFID ORCID Logo 

 Department of Electrical and Computer Engineering, Waterloo Institute for Nanotechnology, University of Waterloo, Waterloo, Canada 
 Quantum and Nanotechnologies Research Centre, National Research Council of Canada, Ottawa, Ontario, Canada 
 Faculty of Engineering, University of Waterloo, Waterloo, Ontario, Canada 
 Digital Technologies Research Centre, National Research Council of Canada, Ottawa, Ontario, Canada 
Publication title
Volume
4
Issue
2
Number of pages
10
Publication year
2025
Publication date
Feb 1, 2025
Section
Research Article
Publisher
John Wiley & Sons, Inc.
Place of publication
Stanford
Country of publication
United States
Publication subject
ISSN
27511219
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-01-08
Milestone dates
2024-11-25 (manuscriptRevised); 2025-02-10 (publishedOnlineFinalForm); 2024-10-09 (manuscriptReceived); 2025-01-08 (publishedOnlineEarlyUnpaginated)
Publication history
 
 
   First posting date
08 Jan 2025
ProQuest document ID
3276239067
Document URL
https://www.proquest.com/scholarly-journals/machine-learning-enabled-triboelectric/docview/3276239067/se-2?accountid=208611
Copyright
© 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-11-28
Database
ProQuest One Academic