KAN–EEG: towards replacing backbone–MLP for an

Full text

Turn on search term navigation

Headnote

The landscape of artificial intelligence (AI) research is witnessing a transformative shift with the emergence of the Kolmogorov-Arnold network (KAN), presenting a novel architectural paradigm aimed to redefine the structural foundations of AI models, which are based on multilayer perceptron (MLP). Through rigorous experimentation and evaluation, we introduce the KAN-electroencephalogram (EEG) model, a tailored design for efficient seizure detection. Our proposed network is tested and successfully generalized on three different datasets, one from the USA, one from Europe, and one from Oceania, recorded with different front-end hardware. All datasets are scalp EEG in adults and are from patients living with epilepsy. Our empirical findings reveal that while both architectures demonstrate commendable performance in seizure detection, the KAN model exhibits high-level out-of-sample generalization across datasets from diverse geographical regions, underscoring its inherent efficacy and adaptability at the backbone level. Furthermore, we demonstrate the resilience of the KAN architecture to model size reduction and shallow network configurations, highlighting its versatility and efficiency by preventing over-fitting in-sample datasets. This study advances our understanding of innovative neural network architectures and underscores the pioneering potential of KANs in critical domains such as medical diagnostics.

Keywords:

Kolmogorov-Arnold networks, robustness, flexibility, seizure, out-of-distribution

(ProQuest: ... denotes formulae omited.)

1. Introduction

Epilepsy is a neurological disorder causing recurring seizures, affecting millions globally. In about 30-35% of cases, standard anti-epileptic drugs (AEDs) fail to control abnormal brain activities, resulting in drug-resistant epilepsy. Despite advancements in AED development and testing, improvements in their effectiveness have been limited [1]. Unpredictable and unprovoked seizures significantly impact patients' quality of life, employment and overall well-being, posing risks such as falls and sudden unexpected death in epilepsy [2,3]. An accurate system for detecting and counting seizures can greatly enhance decision-making, treatment planning and disease management, leading to better patient outcomes. Electroencephalogram (EEG) signals are commonly used by scientists to diagnose neurological diseases such as seizures. It is a technique that records brain electrical activity by placing electrodes on the scalp to capture electrical signals generated by neuronal activity [4]. These electrical signals reflect the brain's functional state and information processing, making EEG an important tool for studying brain function [5]. Variations in EEG signals in time and frequency can reveal brain activity patterns associated with sleep, cognition, emotion and pathological states such as epilepsy [6]. Artificial intelligence (AI) has undeniably made significant strides in healthcare, becoming a new era of innovation and patient care. Its impact on the healthcare industry has been overwhelmingly positive, with numerous benefits spanning various aspects of medical practice, research and patient outcomes [7-9]. Among the various AI techniques, machine learning models, particularly neural networks, have shown great promise in analysing and interpreting complex medical data. Multilayer perceptron (MLP) networks are the backbones of today's AI architectures [10,11] and have been extensively used for their effectiveness in detecting and classifying abnormalities from biosignals. The MLP, a feed-forward artificial neural network, consists of multiple layers of neurons that process input data through weighted connections and activation functions. It excels in capturing the relationships within the data, making it well-suited for analysing the intricate patterns present in medical signals. Despite its advantages, the MLP has limitations, particularly in model interpretability and efficiency. To address these challenges, a new model known as the Kolmogorov-Arnold network (KAN) has been proposed as a promising alternative to MLPs [12]. Unlike MLPs, which use fixed activation functions on the hidden layers, KANs employ learnable activation functions, replacing linear weights with univariate functions parametrized as splines. This architectural difference allows them to achieve greater accuracy and interoperability.

KANs have demonstrated the ability to outperform MLPs with smaller network sizes, making them more computationally efficient [12]. Despite its advantages, the MLP has limitations, particularly in model interpretability and efficiency. Furthermore, it has demonstrated the ability to outperform MLPs with smaller network sizes, making them more computationally efficient [12]. In the context of EEG signal analysis, KANs offer potential advantages over MLPs. Their learnable activation functions and efficient representation of data structures enable better handling of the complex, high-dimensional data characteristic of EEG signals. This can improve detection accuracy and more insightful interpretations of the underlying neural activity. The subsequent sections will delve deeper into the comparative performance of our proposed architectures and MLPs in detecting epileptic seizures, highlighting the strengths and potential of this critical application. The diagram in figure 1 delineates the structural differences between the KAN and the MLP architectures. The KAN structure demonstrates learnable activation functions, which can constitute interpretable systems and face MLP architecture challenges through enhanced capabilities for continual learning and efficiency at shallow, sparse connections.

1.1. Background

Epileptic seizure detection has seen substantial progress with the rise of machine learning models, particularly those using EEG signals. EEG tests record the electrical activity produced by neurons, typically non-invasively, by placing multiple electrodes on the scalp [4]. EEGs play a crucial role in detecting epileptic foci and categorizing epilepsy types, such as focal, generalized and unknown seizures [13-15]. These recordings are essential for diagnosing epilepsy, monitoring ongoing conditions, making predictions and effectively responsive neurostimulation. Recent advancements in deep learning have introduced various architectures to improve EEG-based seizure detection. Convolutional neural networks (CNNs) are prominently used, transforming EEG signals into different dimensional forms for detailed analysis [16-21]. Recurrent neural networks (RNNs), including their advanced versions like long shortterm memory (LSTM) and gated recurrent units, are adept at capturing temporal dependencies in EEG data, making them suitable for sequential data analysis [22-25]. Unsupervised learning methods, such as deep belief networks [26,27] and auto-encoders (AEs) [28-30], are employed to extract and reconstruct features from raw signals.

Table 1 summarizes various traditional models applied to different bio-signal applications, preprocessing methods, datasets and indicators. In addition, hybrid models that combine CNNs with RNNs or AEs use the spatial feature extraction capabilities of CNNs and the temporal modelling strengths of RNNs or the reconstruction abilities of AEs [31,32,38-41]. For example, convolutional long short-term memory (ConvLSTM) networks integrate the spatial feature extraction power of CNNs with the temporal sequence modelling capabilities of LSTMs [32]. This integration has enhanced the model's performance in detecting seizures.

Transformer-based networks, a more recent innovation, incorporate attention mechanisms to better capture complex patterns in EEG data, further improving detection accuracy [34,42-44]. Despite these significant advancements, traditional seizure detection models still encounter challenges in generalization and real-time implementation. Models trained on large datasets often struggle with low area under the receiver operating curve (AUROC) and high false-positive rates, which limits their clinical applicability. Thus, there is a pressing need for models that can balance sensitivity and specificity and be broadly applicable across diverse patient populations and real-world scenarios [32,45].

Although the traditional MLP-based model has significantly contributed to the development of EEG- based seizure detection systems, the emergence of KAN and other advanced models provides a promising direction for improving the accuracy, efficiency, and applicability of epilepsy detection technology. The following section will provide a detailed comparison between MLP and KAN, emphasizing the advantages and potential of KAN in EEG signal analysis of epilepsy detection.

1.2. Limitations of the previous studies

Previous models used in machine learning-based systems, particularly those relying on MLP architectures, have been constrained by their reliance on fixed activation functions. These functions, such as The rectified linear unit or sigmoid, are pre-defined and static, limiting the model's flexibility and adaptability. Furthermore, traditional MLP architectures lack the capability for continuous learning, meaning that once a model is trained, it cannot adapt based on new incoming data without retraining. This limitation poses a significant challenge for real-time applications or scenarios where the data distribution changes over time. Without the ability to continuously learn and adapt, the models can become less accurate, leading to performance degradation. Therefore, there is growing interest in developing architectures that dynamically adjust their activation functions and support continuous learning, enabling more flexible, robust and efficient models for complex, real-world applications.

1.3. Novelty and significance

In this study, our primary aim is to introduce a novel backbone architecture for advancing the field of seizure detection, focusing on addressing challenges related to generalization and real-time implementation. To the best of our knowledge, this is the first study that uses a novel non-backbone MLP for healthcare applications and the first study to be used in seizure detection. We propose the utilization of KANs as a key component for identifying epileptic seizures from pre-recorded EEG signals. We incorporate the short-time Fourier transform (STFT) technique to process the EEG data effectively. Rather than solely emphasizing the challenges in generalization and implementation, we highlight the innovative potential of efficient-shallow architectures adaptable as a foundational framework for future AI models in seizure detection. Through our experimentation with the three different continental datasets, we aim to showcase the inherent efficacy of KANs and their capacity to serve as an efficient architecture for developing advanced seizure detection systems. We found the following advantages for our proposed method: (higher accuracy) preliminary findings suggest that our model demonstrates comparable AUROC values to traditional MLP models, indicating a similar level of performance in seizure detection. This comparative analysis suggests that while KANs do not necessarily outperform MLP models, they offer a comparable level of reliability, thereby contributing to the advancement of seizure detection methodologies; (efficiency) our proposed architecture requires a smaller network size, thereby making, them computationally efficient and suitable for real-time applications; (less training, high performance) our proposed architecture achieves high accuracy while requiring only a small portion of the training dataset, outperforming models that typically require larger datasets for similar results; (out-of-sample seizure detection) generalization beyond the training dataset was assessed by training our model on the USA dataset and evaluating its performance on independent datasets from Europe and Oceania, using the same trained weights. The outcomes indicate encouraging support for this test.

2. Methods

2.1. Datasets

Three datasets were used in this work: the Temple University Hospital (TUH) EEG Corpus, the scalp- EPILEPSIAE dataset, and the Royal Prince Alfred Hospital (RPAH) dataset. Figure 2 summarizes the TUH dataset, detailing key statistics. This dataset is divided into training and validation sets, offering comprehensive insights into the number of hours used for both patients with seizures and non-seizures and the number of patients that present seizures and non-seizures. The TUH dataset is the primary training dataset, providing a diverse range of seizure signals for various seizure types. With its large volume of files, the TUH dataset from the USA offers extensive data for robust model training. We used 400 h of EEG data for our training process, comprising 120 000 samples with a 12 s window. This dataset included approximately 75% background activity and 25% seizure activity. Notably, this represents a significantly smaller data volume than other models. Despite the reduced dataset size, we successfully validated the KAN-EEG model's efficiency in seizure detection with reduced training data. The validation was conducted using 192 hours of EEG data, corresponding to 57 306 samples with a 12 s window, and maintained the same 75-25% ratio of background to seizure information.

The EPILEPSIAE and RPAH datasets, consisting of adult EEG, are used for inference tests. Both datasets share common characteristics, such as using identical montages and adult patient data, ensuring consistency and comparability in analysis. Integrating these complementary datasets enhances the comprehensiveness and reliability of the AI model for seizure detection. The EPILEPSIAE dataset consists of 30 patients, where 19 and 11 are male and female, respectively. The RPAH dataset from one of Australia's major hospitals reliably maintained one of the largest datasets from adult epilepsy patients nationwide. This work uses nine years (2011-2019) of data, testing nearly 14 590 h of EEG data from 192 patients over 1006 sessions, each averaging around 15 h of recording. Out of 212 patients, 20 were excluded for reasons including excessive seizures (more than 11 seizures per 24 h), missing electrode data, or seizures confirmed only by video.

The Australian dataset is about 16 times larger than the US training dataset, with longer interictal periods and background data, making evaluating false positives highly robust. The distribution of the RPAH dataset across three domains-seizure type and frequency, age, gender and seizure occurrence within a 24 h cycle-highlights essential patterns. Notably, seizure occurrences are derived from intermittent monitoring, providing insights into the likely timing of seizures. Ethical approval was obtained to access this clinical data.

2.2. The Kolmogorov-Arnold network-electroencephalogram structure

2.2.1. Characteristics of Kolmogorov-Arnold networks

KANs are founded on the Kolmogorov-Arnold representation theorem, which posits that any multivariate continuous function can be broken down into a finite composition of continuous univariate functions and addition operations [12]. This foundational principle allows it to substitute traditional linear weights with spline-parametrized univariate functions. Moreover, it employs adaptive univariate activation functions along the network edges, enhancing flexibility and precision. These functions adjust based on the data, leading to more accurate approximations. Spline functions enable dynamic adaptation to the data, providing refined representations that effectively capture smooth transitions. Additionally, KANs require fewer parameters than MLPs, improving computational efficiency and model interpretability. The learnable functions can also be visualized for better understanding. The operation of a KAN layer can be described by equation (2.1):

... (2.1)

where xi represents the activation value at node i, and ??ij denotes the learnable activation function on the edge connecting node j to node i.

2.2.2. Mathematical formulation of Kolmogorov-Arnold networks

KANs use the Kolmogorov-Arnold representation theorem to break down a high-dimensional function into a sum of univariate functions. This decomposition is given by equation (2.2):

... (2.2)

In this formulation, φ_q and ψ_q,p are univariate functions parameterized as splines, while xp represents the input features. The inner functions ψ_q,p(x_p) transform the input features into intermediate representations, which are then aggregated and processed by the outer functions φ_q. This structured approach enables KANs to effectively capture compositional structures and univariate functions, providing a robust framework for function approximation [12]. An overall representation of the KAN structure is seen in figure 1.

2.3. Pre-processing

We used two signal processing techniques, independent component analysis (ICA) and STFT, to address the challenges associated with raw EEG data. Initially, the EEG signals were divided into 12 s segments, and the ICA algorithm was applied to decompose the signals into 19 independent components using blind source separation. ICA separates EEG signals into statistically independent components, as represented in equation (2.3):

... (2.3)

where T contains the EEG data, M contains the time information, and A contains the weights for topographic maps. Pearson correlation was used to identify independent sources strongly associated with eye movement, detected from the 'FP1' and 'FP2' EEG channels. These sources related to eye movement were removed, resulting in EEG signals free from such artefacts. Subsequently, the STFT was applied to the cleaned EEG signals. This involved using a window length of 250 samples (equivalent to 1 s) with a 50% overlap and eliminating the DC component of the transform. As a result, the data dimensions were (N × 23 × 125), where N represents the number of electrodes, 23 represents the time index, and 125 represents the frequencies. Data pre-processing is performed separately from the KAN model, ensuring the data is adequately prepared before input into the KAN model for further training/inference, as depicted in figure 3.

3. Experiments and results

3.1. Training and validation in sample

Our model was trained and validated using the TUH dataset. We achieved an impressive AUROC score of 0.89 in figure 4B. It is important to highlight that our model was trained on 400 h of data, considerably fewer than the 752 h used for the ConvLSTM model and the 910 h for the transformer model. These preliminary results highlight the exceptional performance of our approach, which not only rivals but also exceeds that of contemporary methods by considering a smaller training dataset but a comparable and even greater test dataset. Our model demonstrates superior efficacy in accurately detecting epileptic seizures, underscoring its potential for clinical applications. The structure used to obtain the outlined results is highlighted in table 2, and we use this as our baseline for our following assessment.

The training processing was performed for 100 epochs, where in figure 4A can be seen the trend across different metrics. Among these, the discernible reduction in the training loss is evident, gradually converging towards optimal thresholds. Noteworthy is the observation that metrics such as recall, precision, and AUROC manifest a discernible tendency to reach the convergence phase approximately around the 20th epoch. Given the nature of a small architecture, training losses have an average convergence between 0.2 and 0.3.

3.2. Robustness

The findings presented in table 2 offer compelling insights into the robustness of the KAN-EEG architecture in seizure detection. By systematically reducing the complexity of the model through decreases in both layers and hidden neurons, we sought to evaluate the architecture's adaptability and efficiency under varying structural configurations. Our analysis reveals that despite the reduction in model complexity, the KAN architecture consistently maintains a high level of performance across key evaluation metrics, as reflected by the consistently high AUROC values across all models. For instance, the largest tested model has an AUROC of 0.89, while the smallest model tested is 0.85. This indicates the model's ability to differentiate between seizure and non-seizure states effectively, even when the architectural complexity is significantly reduced.

Moreover, while marginal decreases are observed in metrics such as precision and recall with model simplification, the overall performance, as quantified by the F1-Score, remains robust. This suggests that our architecture not only preserves its ability to detect seizures accurately but also maintains a balanced trade-off between precision and recall. It is crucial for real-world applications where false positives and negatives have significant implications. These results underscore the resilience of the proposed model to structural modifications, highlighting its adaptability and efficiency in resource-constrained environments. The ability of the model to maintain high-performance levels even with reduced computational complexity holds promising implications for practical deployment.

3.3. Compact networks leads to out-of-sample continental generalization

Can the KAN-EEG generalize beyond in-sample testing? Out-of-sample generalization is crucial in seizure detection models, as it assesses the model's ability to perform effectively on unseen data that was not included during training. This is essential for establishing more robustness and reliability of the model in real-world clinical settings, where the variability of patient populations, seizure types and environmental factors can significantly impact performance. To evaluate generalization performance, we tested our initial model with two layers (764 and 256 neurons, respectively) on two distinct out-of-sample datasets. The RPAH dataset results are presented in table 3, and the EPILEPSIAE dataset results are shown in table 4. The results shown demonstrate some out-of-sample testing ability, albeit not efficiently, in RPAH and EPILEPSIAE datasets with AUROC results of 0.60 and 0.55, respectively. The observed results primarily stem from an architectural design showing indications of overfitting on the training dataset. This assertion suggests that the model has become overly specialized in capturing specific information inherent in the training/validation data, consequently compromising its ability to generalize in different datasets. As a result, addressing overfitting is crucial for enhancing the model's robustness and applicability in real-world scenarios, with unseen-world datasets with varying patterns of seizure that can be specific to each patient.

To address the challenges of inefficient generalization, we modified the model to have two layers with 32 and 16 neurons, respectively. Additionally, we provided an extra evaluation to confirm that our findings are assertive by testing in a model with two layers, each with 32 neurons. As these architectures showed robustness in in-sample testing, with an AUROC of 0.87, we applied them to the generalization test using weights from training when the AUROC metric was not increasing.

As discussed previously, the KAN-EEG model with two layers (764 and 256 neurons) had a lower AU- ROC than the LTC-FPTT and ConvLSTM models. Interestingly, the KAN-EEG models with two layers (32 and 16 neurons, and 32 and 32 neurons) demonstrated comparable or superior AUROC values relative to these models. For the RPAH dataset, we demonstrated that the compact proposed networks have an AUROC of 0.85 and 0.83, respectively, outperforming existing models. For the EPILEPSIAE dataset, although we did not surpass previous studies, our results were closer to 0.78 and 0.75, respectively.

This suggests that overfitting may occur in models with more complex neuron structures, adversely affecting their generalization performance. These findings are significant as they indicate the potential for future deployable on-edge KAN algorithms, where memory size compression and robustness are crucial. These findings suggest the potential for replacing MLP-based models for seizure detection with the KAN-EEG algorithm. Our framework remains robust and demonstrates promise for practical implementation in real-world applications. It is essential to highlight that our data was not trained on the full TUH dataset, indicating that there is even further room for improvement in achieving greater results.

3.4. Computation time in training

In this study, we used a Tesla P100 GPU with 16 GiB of memory to compare the training performance of two neural network configurations of the KAN-EEG model: a larger network with 764-256 neurons and a smaller network with 32-16 neurons. Using a batch size of 64, the training time per epoch for the larger network averaged 4 min and 12 s, while the smaller network completed an epoch in 1 min and 1 s. Regarding GPU memory consumption, the more extensive network required approximately 8.8 GiB, whereas the smaller network used only 1.6 GiB. These results underscore the efficiency of this neural network architecture, which is crucial for real-time applications. By contrast, spiking neural networks often require significantly more training time, especially when analysing extended sequences with many timesteps. The fast training performance of the KAN-EEG model makes it well-suited for real-time scenarios.

4. Discussion

In this study, we pioneer the application of shallow KANs to EEG data, using this model's unique structure and capabilities inspired by the Kolmogorov-Arnold representation theorem. The proposed architecture not only achieves a higher accuracy in-sample dataset but can also be generalized across different scalp-EEG, as smaller KANs can outperform much larger MLPs in terms of data fitting and partial differential equation solving, owing to faster neural scaling laws [12]. By applying KANs to EEG data, we expect to unlock new potentials in seizure prediction models, given its ability to learn compositional structures and optimize univariate functions [12]. This novel application could provide more accurate and interpretable predictions, advancing the neural network-based seizure prediction field. The model's resilience against catastrophic forgetting will also ensure stable and continuous learning, a critical requirement for medical applications involving long-term EEG monitoring. To this extent, our next steps will include deploying this architecture on memristor devices. Recent studies have demonstrated the potential of incorporating a KAN network with neuromorphic hardware. Given that memristors are a key component in developing neuromorphic platforms, this approach introduces exciting opportunities for the deployment of such models on non-conventional hardware [50].

4.1. Study and model limitations

This research serves as a proof of concept to evaluate whether the KAN model, which uses a learnable activation function, can form a viable AI-based approach for seizure detection. However, further exploration into the optimal architectural design for improved performance is still needed, although we have demonstrated that shallow networks tend to have better generalization and robustness as described in table 2. This study is limited to seizure detection, but we will incorporate this study's insights from the architectures for seizure prediction. Deployability has not been considered within this study. However, based on the architecture model, its aim can be tested in future studies. One noticeable drawback we found in our study was a linear increase in memory consumption based on the input data. Therefore, future studies should focus on aiming these challenges.

The data used in this model are in the frequency domain, which used a pre-processing step to transform the raw signals into a format suitable for analysis. This additional pre-processing imposes limitations, especially when considering real-time applications as it can introduce latency and increase computational demands, making the model less practical for real-world scenarios where rapid or continuous data processing is required. Consequently, this limitation affects the model's usability in time-sensitive or resource-constrained environments, such as portable or embedded systems.

5. Conclusion

This study has introduced the KANs as a novel approach for epileptic seizure detection using EEG signals. By using its unique architecture and learning properties, which feature learnable activation functions on edges instead of fixed ones on nodes, we have demonstrated the potential for achieving comparable accuracy and higher accuracy in both in-sample and out-of-sample seizure detection compared to traditional MLP models. Our results indicate that KANs should be a solution for replacing MLPs-based models for seizure detection applications, requiring a smaller network size while maintaining robust performance. The significant advancements presented in this work highlight the potential of KANs to improve clinical outcomes for patients with epilepsy by providing a more accurate, efficient and interpretable tool for seizure detection that enhances decision-making, treatment planning and overall disease management.

Ethics. No ethics declaration was needed for the TUH and EPILEPSIAE datasets. For this study, NSW Local Health District (LHD) ethics X19-0323-2019/STE16040 is approved for using the RPAH dataset in collaboration between The University of Sydney and the Comprehensive Epilepsy Services, the Department of Neurology, at the Royal Prince Alfred Hospital, Australia.

Data accessibility. Data and relevant code for this research work are stored in GitHub: [51] and have been archived within the Zenodo repository: [52]. The TUH dataset is publicly available at https://isip.piconepress.com/projects/tuh_eeg/. The EPILEPSIAE dataset is available at cost via https://www.epilepsy-database.eu/. The Department of Neurology at the Royal Prince Alfred Hospital (RPAH) dataset was used under Ethics Review Board approval and is not publicly available. If you have any questions regarding access to the code employed in this research paper, please direct your inquiries to the corresponding author. It is important to note that the author may outline specific terms, conditions or usage restrictions for the code, which will be provided to you as needed.

Declaration of AI use. We have not used AI-assisted technologies in creating this article.

Authors' contributions. L.F.H.C.: conceptualization, data curation, investigation, methodology, software, validation, visualization, writing-original draft, writing-review and editing; J.C.: investigation, methodology, writingoriginal draft, writing-review and editing; L.Y.: writing-original draft, writing-review and editing; Z.H.: writing-original draft, writing-review and editing; A.N.: conceptualization, data curation, funding acquisition, project administration, resources, supervision, validation, visualization, writing-original draft, writing-review and editing; O.K.: conceptualization, data curation, funding acquisition, investigation, methodology, project administration, resources, supervision, validation, visualization, writing-original draft, writing-review and editing.

All authors gave final approval for publication and agreed to be held accountable for the work performed therein.

Conflict of interest declaration. At the time of consideration of this manuscript, O.K. was a Royal Society Open Science Editorial Board member. He had no involvement in handling, peer review arrangements, evaluations or decisions concerning this manuscript.

Funding. L.F.H.C. would like to acknowledge the partial support of the Faculty of Engineering Research Scholarship provided by The University of Sydney.

Acknowledgements. Z.H. would like to acknowledge the support of the Research Training Program (RTP) provided by the Australian Government. O.K. acknowledges the support provided by The University of Sydney through a SOARFellowship and Microsoft's support through a Microsoft AI for Accessibility grant.

Sidebar

Cite this article: Herbozo Contreras LF, Cui J, Yu L, Huang Z, Nikpour A, Kavehei O. 2025 KAN-EEG: towards replacing backbone-MLP for an effective seizure detection system. R. Soc. Open Sci. 12: 240999. https://doi.org/10.1098/rsos.240999

Received: 18 June 2024

Accepted: 24 January 2025

Subject Category:

Engineering

Subject Areas:

biomedical engineering, neuroscience, artificial intelligence

Author for correspondence:

Luis Fernando Herbozo Contreras

e-mail: [email protected]

Footnote

†These authors contributed equally to the study.

References

References

1. Deloitte Access Economics. 2020 The economic burden of epilepsy in Australia, 2019-2020. Epilepsy Australia. See https://tinyurl.com/5ybbpa44.

2. Banerjee PN, Filippi D, Allen Hauser W. 2009 The descriptive epidemiology of epilepsy-a review. Epilepsy Res. 85, 31-45. (doi:10.1016/j.eplep-syres.2009.03.003)

3. Kwan P et al. 2010 Definition of drug resistant epilepsy: consensus proposal by the ad hoc task force of the ILAE commission on therapeutic strategies. Epilepsia 51, 1069-1077. (doi:10.1111/j.1528-1167.2009.02397.x)

4. Niedermeyer E, da Silva FL. 2005 Electroencephalography: basic principles, clinical applications, and related fields. Philadelphia, PA. Lippincott Williams & Wilkins.

5. Buzsáki G. 2006 Rhythms of the brain. Oxford, UK. Oxford University Press.

6. Sanei S, Chambers JA. 2013 EEG signal processing. New Jersey, USA. John Wiley & Sons.

7. Yu KH, Beam AL, Kohane IS. 2018 Artificial intelligence in healthcare. Nat. Biomed. Eng. 2, 719-731. (doi:10.1038/s41551-018-0305-z)

8. Davenport T, Kalakota R. 2019 The potential for artificial intelligence in healthcare. Future Healthc. J. 6, 94-98. (doi:10.7861/futurehosp.6-2-94)

9. Amann J, Blasimme A, Vayena E, Frey D, Madai VI. 2020 Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Med. Inform. Decis. Mak. 20, 1-9. (doi:10.1186/s12911-020-01332-6)

10. Haykin S. 1998 Neural networks: a comprehensive foundation. New Jersey, USA. Prentice Hall PTR.

11. Cybenko G. 1989 Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst. 2, 303-314. (doi:10.1007/bf02551274)

12. Liu Z, Wang Y, Vaidya S, Ruehle F, Halverson J, Soljačić M, Hou TY, Tegmark M. 2024 KAN: Kolmogorov-Arnold networks. arXiv:2404.19756 Preprint. http://arxiv.org/abs/2404.19756

13. Birjandtalab J, Heydarzadeh M, Nourani M. 2017 Automated EEG-based epileptic seizure detection using deep neural networks. In IEEE Int. Conf. on Healthcare Informatics (ICHI), Park City, UT, USA, pp. 552-555. (doi:10.1109/ICHI.2017.55)

14. Truong ND. Epileptic seizure detection and forecasting ecosystems. PhD thesis University of Sydney, Sydney, Australia https://ses.library.usyd.edu.au/handle/2123/21932.

15. Fisher RS et al. 2014 ILAE official report: a practical clinical definition of epilepsy. Epilepsia 55, 475-482. (doi:10.1111/epi.12550)

16. Avcu MT, Zhang Z, S.Chan DW. 2019 Seizure detection using least EEG channels by deep convolutional neural network. In IEEE Int. Con. on Acoustics, Speech and Signal Processing (ICASSP), Brighton, UK, pp. 1120-1124. (doi:10.1109/ICASSP.2019.8683229)

17. Abdelhameed AM, Daoud HG, Bayoumi M. 2018 Epileptic seizure detection using deep convolutional autoencoder. In 2018 IEEE Int. Workshop on Signal Processing Systems (SiPS), Cape Town, pp. 223-228. (doi:10.1109/SiPS.2018.8598447)

18. Hossain MS, Amin SU, Alsulaiman M, Muhammad G. 2019 Applying deep learning for epilepsy seizure detection and brain mapping visualization. ACM Trans. Multimed. Comput. Commun. Appl. 15, 1-17. (doi:10.1145/3241056)

19. Zuo R et al. 2019 Automated detection of high-frequency oscillations in epilepsy based on a convolutional neural network. Front. Comput. Neurosci. 13, 6. (doi:10.3389/fncom.2019.00006)

20. Covert IC, Krishnan B, Najm I, Zhan J, Shore M, Hixson J, Po MJ. 2019 Temporal graph convolutional networks for automatic seizure detection. Michigan, USA. In Machine Learning for Healthcare Conf. pp. 160-180. PMLR.

21. Bouaziz B, Chaari L, Batatia H, Quintero-Rincón A. 2019 Epileptic seizure detection using a convolutional neural network. In Advances in predictive, preventive and personalised medicine digital health approach for predictive, preventive, personalised and participatory medicine, pp. 79-86. New York, NY. Springer International Publishing. (doi:10.1007/978-3-030-11800-6_9)

22. Chen X, Ji J, Ji T, Li P. 2018 Cost-sensitive deep active learning for epileptic seizure detection. In BCB '18, Washington DC USA, pp. 226-235. New York, NY, USA. (doi:10.1145/3233547.3233566). https://dl.acm.org/doi/proceedings/10.1145/3233547.

23. Fukumori K, Thu Nguyen HT, Yoshida N, Tanaka T. Fully Data-driven convolutional filters with deep learning models for epileptic spike detection. In ICASSP 2019 - 2019 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, pp. 2772-2776. (doi:10.1109/ICASSP.2019.8682196)

24. Vidyaratne L, Glandon A, Alam M, Iftekharuddin KM. 2016 Deep recurrent neural network for seizure detection. In 2016 Int. Joint Conf. on Neural Networks (IJCNN), Vancouver, BC, Canada, pp. 1202-1207. (doi:10.1109/IJCNN.2016.7727334)

25. Geng M, Zhou W, Liu G, Li C, Zhang Y. 2020 Epileptic seizure detection based on stockwell transform and bidirectional long short-term memory. IEEE Trans. Neural Syst. Rehabil. Eng. 28, 573-580. (doi:10.1109/tnsre.2020.2966290)

26. Turner J, Page A, Mohsenin T, Oates T. 2014 Deep belief networks used on high resolution multichannel electroencephalography data for seizure detection. In AAAI Spring Symposium Series. Association for the Advancement of Artificial Intelligence (AAAI). California, USA

27. Xuyen LT, Thanh LT, Viet DV, Long TQ, Trung NL, Thuan ND. 2018 Deep learning for epileptic spike detection. JCSCE 33, 1-13. (doi:10.25073/2588-1086/vnucsce.156)

28. Golmohammadi M, Harati Nejad Torbati AH, Lopez de Diego S, Obeid I, Picone J. 2019 Automatic analysis of EEGs using big data and hybrid deep learning architectures. Front. Hum. Neurosci. 13, 76. (doi:10.3389/fnhum.2019.00076)

29. Emami A, Kunii N, Matsuo T, Shinozaki T, Kawai K, Takahashi H. 2019 Autoencoding of long-term scalp electroencephalogram to detect epileptic seizure for diagnosis support system. Comput. Biol. Med. 110, 227-233. (doi:10.1016/j.compbiomed.2019.05.025)

30. Shah V, GolmohammadiM, Ziyabari S, Von Weltin E, Obeid I, Picone J. 2017 Optimizing channel selection for seizure detection. In IEEE Signal Processing in Medicine and Biology Symposium (SPMB), pp. 1-5. (doi:10.1109/SPMB.2017.8257020)

31. Choi G, Park C, Kim J, Cho K, Kim TJ, Bae H, Min K, Jung KY, Chong J. A Novel Multi-scale 3D CNN with deep neural network for epileptic seizure detection. In 2019 IEEE Int. Conf. on Consumer Electronics (ICCE), Las Vegas, NV, USA, pp. 1-2. (doi:10.1109/ICCE.2019.8661969)

32. Yang Y, Truong ND, Maher C, Nikpour A, Kavehei O. 2022 Continental generalization of a human-in-the-loop AI system for clinical seizure recognition. Expert Syst. Appl. 207, 118083. (doi:10.1016/j.eswa.2022.118083)

33. Yang Y, Eshraghian JK, Duy Truong N, Nikpour A, Kavehei O. 2023 Neuromorphic deep spiking neural networks for seizure detection. Neuromorphic Comput. Eng. 3, 014010. (doi:10.1088/2634-4386/acbab8)

34. Ma Y, Liu C, Ma MS, Yang Y, Truong ND, Kothur K, Nikpour A, Kavehei O. 2023 TSD: Transformers for seizure detection. bioRxiv 2023.01.24.525308v2. (doi:10.1101/2023.01.24.525308)

35. Qiu X, Yan F, Liu H. 2023 A difference attention ResNet-LSTM network for epileptic seizure detection using EEG signal. Biomed. Signal Process. Control. 83, 104652. (doi:10.1016/j.bspc.2023.104652)

36. Tang S, Dunnmon JA, Liangqiong Q, Saab KK, Baykaner T, Lee-Messer C, Rubin DL. 2023 Modeling multivariate biosignals with graph neural networks and structured state space models. Proc. of the Conf. on Health, Inference. Learn. 209, 50-71.

37. Majed Alotaibi Sur-RahmanAImran Basheer M, Adnan Khan M. 2021 Ensemble machine learning based identification of pediatric epilepsy. Comput. Mater. Contin. 68, 149-165. (doi:10.32604/cmc.2021.015976)

38. Wen T, Zhang Z. 2018 Deep convolution neural network and autoencoders-based unsupervised feature learning of EEG Signals. IEEE Access 6, 25399-25410. (doi:10.1109/access.2018.2833746)

39. Yuan Y, Xun G, Jia K, Zhang A. 2019 A multi-view deep learning framework for EEG seizure detection. IEEE J. Biomed. Health Inform. 23, 83-94. (doi:10.1109/JBHI.2018.2871678)

40. Fang Z, Leung H, Choy CS. 2018 Spatial temporal GRU convnets for vision-based real time epileptic seizure detection. In IEEE 15th Int. Symp. on Biomedical Imaging (ISBI 2018), Washington, DC, pp. 1026-1029. (doi:10.1109/ISBI.2018.8363746)

41. Roy S, Kiral-Kornek I, Harrer S. 2018 Deep learning enabled automatic abnormal EEG identification. In 2018 40th Annual Int. Conf. of the IEEE Engineering in Medicine and Biology Society (EMBC), pp. 2756-2759. Honolulu, HI, USA: IEEE. (doi:10.1109/EMBC.2018.8512756)

42. Peh WY, Thangavel P, Yao Y, Thomas J, Tan YL, Dauwels J. 2023 Six-center assessment of CNN-transformer with belief matching loss for patient-independent seizure detection in EEG. Int. J. Neural Syst. 33, 2350012. (doi:10.1142/S0129065723500120)

43. Sun Y, Jin W, Si X, Zhang X, Cao J, Wang L, Yin S, Ming D. 2022 Continuous seizure detection based on transformer and long-term iEEG. IEEE J. Biomed. Health Informatics 26, 5418-5427. (doi:10.1109/jbhi.2022.3199206)

44. Pedoeem J, Bar Yosef G, Abittan S, Keene S. 2022 TABS: Transformer based seizure detection. In Biomedical sensing and analysis: signal processing in medicine and biology, pp. 133-160. Cham, Switzerland. Springer International Publishing. (doi:10.1007/978-3-030-99383-2_4)

45. Yang Y, Truong ND, Eshraghian JK, Nikpour A, Kavehei O. 2022 Weak self-supervised learning for seizure forecasting: a feasibility study. R. Soc. Open Sci. 9, 220374. (doi:10.1098/rsos.220374)

46. Tang S, Dunnmon JA, Saab K, Zhang X, Huang Q, Dubost F, Rubin DL, Lee-Messer C. 2021 Self-supervised graph neural networks for improved electroencephalographic seizure analysis. arXiv 2104.08336 Preprint. (doi:10.48550/arXiv.2104.08336)

47. Ahmedt-Aristizabal D, Fernando T, Denman S, Petersson L, J.Aburn M, Fookes C. 2020 Neural memory networks for seizure type classification. In Annual Int. Conf. of the IEEE Engineering in Medicine & Biology Society (EMBC), pp. 569-575. (doi:10.1109/EMBC44109.2020.9175641)

48. Wang F, Jiang M, Qian C, Yang S, Li C, Zhang H, Wang X, Tang X. 2017 Residual attention network for image classification. In 2017 IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, pp. 3156-3164. (doi:10.1109/CVPR.2017.683)

49. Herbozo Contreras LF, Huang Z, Yu L, Nikpour A, Kavehei O. 2024 Biological plausible algorithm for seizure detection: toward AI-enabled electroceuticals at the edge. APL Mach. Learn. 2 026114. (doi:10.1063/5.0192875)

50. Peng Y. 2024 Photonic KAN: a Kolmogorov-Arnold network inspired efficient photonic neuromorphic architecture. arXiv 2408.08407 Preprint. https://arxiv.org/abs/2408.08407

51. NeuroSyd. 2024 KAN-EEG.GitHub. See https://github.com/NeuroSyd/KAN-EEG.

52. LuisHerbozo. 2025 NeuroSyd/KAN-EEG: KAN-EEG (v.1.0.0). Zenodo. (doi:10.5281/zenodo.14630393)

Word count: 6044

Show less

© 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Details

Title

KAN–EEG: towards replacing backbone–MLP for an effective seizure detection system

Author

Contreras, Luis Fernando Herbozo¹; Cui, Jiashuo¹; Yu, Leping¹; Huang, Zhaojing¹; Nikpour, Armin²; Kavehei, Omid

¹ School of Biomedical Engineering, Faculty of Engineering, The University of Sydney, Sydney, NSW 2006, Australia
² Comprehensive Epilepsy Service and Department of Neurology, Royal Prince Alfred Hospital, Sydney, NSW 2050, Australia

Pages

1-13

Section

Research

Publication year

2025

Publication date

2025

Publisher

The Royal Society Publishing

e-ISSN

20545703

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1098/rsos.240999

ProQuest document ID

3187599665

KAN–EEG: towards replacing backbone–MLP for an effective seizure detection system

Jump to:

Full text

Abstract

Details

Suggested sources