Content area

Abstract

Bioinformatics pipelines, which process vast amounts of sensitive biological data, are increasingly targeted by cyberattacks. Traditional security measures often fail to provide adequate protection due to the unique computational and network characteristics of these pipelines. This study proposes a machine learning-based Intrusion Detection System (IDS) tailored specifically for bioinformatics workflows. While the CICIDS2017 dataset serves as the primary benchmark, we augment the study with bioinformatics-specific network traffic to ensure relevance. We compare the performance of four machine learning algorithms Random Forest (RF), Support Vector Machine (SVM), Convolutional Neural Network (CNN), and Gradient Boosting Machine (GBM) and explore hybrid models for enhanced detection. Our findings highlight GBM's superior accuracy (98.3%) while also addressing its computational overhead and susceptibility to adversarial attacks. The study contributes novel insights by integrating real-world bioinformatics traffic data and proposing adaptive security strategies for genomic research environments.

Full text

Turn on search term navigation

Copyright Academic Conferences International Limited 2025