Content area

Abstract

Automated machine learning is an approach to automate the creation of machine learning pipelines and models. The ability to automatically create a machine learning pipeline would allow users without machine learning knowledge to create and use machine learning systems. Existing machine learning practitioners can also use these automated approaches to simplify the creation of machine learning systems. As with any tool, effective evaluations of AutoML tools are necessary to ensure users can select the correct tool for their machine learning task.

Current evaluations of automated machine learning are performed on simple general purpose datasets, and these datasets may be unable to provide necessary comparison information depending on the machine learning task. There is also limited work on whether AutoML systems can generate comparable models to domain experts on domain-specific data. With many current AutoML approaches, only a small part of the machine learning pipeline is automated. For AutoML to replace the need for machine learning knowledge for its users, complete automation of the machine learning pipeline will be necessary. Automating the feature engineering process is the next step of automation for the many current AutoML approaches.

In this thesis, we present an empirical analysis of current open-source AutoML tools for tasks within the cybersecurity domain, highlight the current weakness of AutoML tools and evaluate the performance of popular AutoML tools for cybersecurity datasets. In addition, we propose a method of augmenting existing AutoML tools with automated feature engineering and assess the impact of different generation approaches and the effect on total pipeline creation time.


Details

1010268
Business indexing term
Title
An Empirical Analysis of AutoMl Tools and Techniques with Automated Feature Engineering
Author
Number of pages
112
Publication year
2022
Degree date
2022
School code
0115
Source
MAI 84/3(E), Masters Abstracts International
ISBN
9798351477848
Advisor
Committee member
Wu, Dan; Azab, Ahmed
University/institution
University of Windsor (Canada)
Department
COMPUTER SCIENCE
University location
Canada -- Ontario, CA
Degree
M.Sc.
Source type
Dissertation or Thesis
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
29395065
ProQuest document ID
2728319873
Document URL
https://www.proquest.com/dissertations-theses/empirical-analysis-automl-tools-techniques-with/docview/2728319873/se-2?accountid=208611
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Database
ProQuest One Academic