Content area

Abstract

The prevalence of HTTP web traffic on the Internet has long transcended the layer 7 classification, to layers such as layer 5 of the OSI model stack. This coupled with the integration-diversity of other layers and application layer protocols has made identification of user-initiated HTTP web traffic complex, thus increasing user anonymity on the Internet. This study reveals that, with the current complex nature of Internet and HTTP traffic, browser complexity, dynamic web programming structure, the surge in network delay, and unstable user behavior in network interaction, user-initiated requests can be accurately determined. The study utilizes HTTP request method of GET filtering, to develop a heuristic algorithm to identify user-initiated requests. The algorithm was experimentally tested on a group of users, to ascertain the certainty of identifying user-initiated requests. The result demonstrates that user-initiated HTTP requests can be reliably identified with a recall rate at 0.94 and F-measure at 0.969. Additionally, this study extends the paradigm of user identification based on the intrinsic characteristics of users, exhibited in network traffic. The application of these research findings finds relevance in user identification for insider investigation, e-commerce, and e-learning system as well as in network planning and management. Further, the findings from the study are relevant in web usage mining, where user-initiated action comprises the fundamental unit of measurement.

Details

Business indexing term
Title
A heuristics for HTTP traffic identification in measuring user dissimilarity
Author
Ikuesan, Adeyemi R. 1   VIAFID ORCID Logo  ; Salleh, Mazleena 2 ; Venter, Hein S. 3 ; Razak, Shukor Abd 2 ; Furnell, Steven M. 4 

 Community College of Qatar, Department of Cyber Security, Science and Technology Division, Doha, Qatar (GRID:grid.507454.7) (ISNI:0000 0004 4912 2826) 
 Universiti Teknologi Malaysia, School of Computing, Skudai, Malaysia (GRID:grid.410877.d) (ISNI:0000 0001 2296 1505) 
 University of Pretoria, Digital Forensic Research Group, Computer Science Department, Pretoria, South Africa (GRID:grid.49697.35) (ISNI:0000 0001 2107 2298) 
 University of Nottingham, School of Computer Science, Nottingham, UK (GRID:grid.4563.4) (ISNI:0000 0004 1936 8868) 
Publication title
Volume
2
Issue
1-4
Pages
17-28
Publication year
2020
Publication date
Dec 2020
Publisher
Springer Nature B.V.
Place of publication
Orange County
Country of publication
Netherlands
ISSN
25244876
e-ISSN
25244884
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2020-06-02
Milestone dates
2020-05-18 (Registration); 2019-05-14 (Received); 2020-05-18 (Accepted)
Publication history
 
 
   First posting date
02 Jun 2020
ProQuest document ID
2932321390
Document URL
https://www.proquest.com/scholarly-journals/heuristics-http-traffic-identification-measuring/docview/2932321390/se-2?accountid=208611
Copyright
© The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2024-08-27
Database
ProQuest One Academic