Content area

Abstract

Bioinformatics pipelines, which process vast amounts of sensitive biological data, are increasingly targeted by cyberattacks. Traditional security measures often fail to provide adequate protection due to the unique computational and network characteristics of these pipelines. This study proposes a machine learning-based Intrusion Detection System (IDS) tailored specifically for bioinformatics workflows. While the CICIDS2017 dataset serves as the primary benchmark, we augment the study with bioinformatics-specific network traffic to ensure relevance. We compare the performance of four machine learning algorithms Random Forest (RF), Support Vector Machine (SVM), Convolutional Neural Network (CNN), and Gradient Boosting Machine (GBM) and explore hybrid models for enhanced detection. Our findings highlight GBM's superior accuracy (98.3%) while also addressing its computational overhead and susceptibility to adversarial attacks. The study contributes novel insights by integrating real-world bioinformatics traffic data and proposing adaptive security strategies for genomic research environments.

Details

Business indexing term
Title
A Machine Learning-Based Intrusion Detection Algorithm for Securing Bioinformatics Pipelines
Pages
345-353
Publication year
2025
Publication date
Mar 2025
Publisher
Academic Conferences International Limited
Place of publication
Reading
Country of publication
United Kingdom
Publication subject
Source type
Conference Paper
Language of publication
English
Document type
Conference Proceedings
ProQuest document ID
3202190727
Document URL
https://www.proquest.com/conference-papers-proceedings/machine-learning-based-intrusion-detection/docview/3202190727/se-2?accountid=208611
Copyright
Copyright Academic Conferences International Limited 2025
Last updated
2025-11-14
Database
ProQuest One Academic