Deep Neural Network-Based Modeling of Multimodal Human–Computer Interaction in Aircraft Cockpits

Abstract

Improving the performance of human–computer interaction systems is an essential indicator of aircraft intelligence. To address the limitations of single-modal interaction methods, a multimodal interaction model based on gaze and EEG target selection is proposed using deep learning technology. This model consists of two parts: target classification and intention recognition. The target classification model based on long short-term memory networks is established and trained by combining the eye movement information of the operator. The intention recognition model based on transformers is constructed and trained by combining the operator’s EEG information. In the application scenario of the aircraft radar page system, the highest accuracy of the target classification model is 98%. The intention recognition rate obtained by training the 32-channel EEG information in the intention recognition model is 98.5%, which is higher than other compared models. In addition, we validated the model on a simulated flight platform, and the experimental results show that the proposed multimodal interaction framework outperforms the single gaze interaction in terms of performance.

Details

Subject

Aircraft;
Software;
Eye movements;
Accuracy;
Classification;
Recognition;
Artificial neural networks;
Aviation;
Aircraft performance;
Interaction models;
Methods;
Machine learning;
Speech

Identifier / keyword

deep learning; multi-modal; long short-term memory network; transformer; human–computer interaction

Title

Deep Neural Network-Based Modeling of Multimodal Human–Computer Interaction in Aircraft Cockpits

Author

Wang, Li¹; Zhang, Heming²; Wang, Changyuan³

¹ School of Electronic & Electrical Engineering, Baoji University of Arts and Sciences, Baoji 721016, China
² School of Optoelectronic Engineering, Xi’an Technological University, Xi’an 710000, China; [email protected]
³ School of Computer Science, Xi’an Technological University, Xi’an 710021, China; [email protected]

Publication title

Future Internet; Basel

Volume

Issue

First page

127

Publication year

2025

Publication date

2025

Publisher

MDPI AG

Place of publication

Basel

Country of publication

Switzerland

Publication subject

Computers--Internet

e-ISSN

19995903

Source type

Scholarly Journal

Language of publication

English

Document type

Journal Article

Publication history

Online publication date

2025-03-13

Milestone dates

2025-01-07 (Received); 2025-03-10 (Accepted)

Publication history

First posting date

13 Mar 2025

DOI

https://doi.org/10.3390/fi17030127

ProQuest document ID

3181453752

Document URL

https://www.proquest.com/scholarly-journals/deep-neural-network-based-modeling-multimodal/docview/3181453752/se-2?accountid=208611

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Last updated

2025-03-27

Database

ProQuest One Academic

Deep Neural Network-Based Modeling of Multimodal Human–Computer Interaction in Aircraft Cockpits

Content area

Abstract

Details