Data are presented on the use of autonomous software tools that capture the characteristics of commercial Web information systems, determine their specific importance, and store them in a central data repository. The ultimate goal is to develop a consistent framework for analyzing and evaluating publicly accessible hypertext structures. Based on the preprocessed information, a multi-methodological approach is chosen that comprises statistical clustering, textual analysis, supervised and unsupervised neural networks, and manual classification for validation purposes.
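The capture-and-store step described above can be illustrated with a minimal sketch. It is not the tooling used in the study: the feature set (link, image, form, and word counts), the table name `site_features`, and the function names `extract_features` and `store_features` are all assumptions chosen for illustration, and an in-memory SQLite database stands in for the central data repository.

```python
import sqlite3
from html.parser import HTMLParser


class FeatureParser(HTMLParser):
    """Counts a few structural features of one HTML page (hypothetical feature set)."""

    def __init__(self):
        super().__init__()
        self.counts = {"links": 0, "images": 0, "forms": 0, "words": 0}

    def handle_starttag(self, tag, attrs):
        # Only anchors that carry an href count as hyperlinks.
        if tag == "a" and any(name == "href" for name, _ in attrs):
            self.counts["links"] += 1
        elif tag == "img":
            self.counts["images"] += 1
        elif tag == "form":
            self.counts["forms"] += 1

    def handle_data(self, data):
        # Crude word count over all text nodes (script and style text included).
        self.counts["words"] += len(data.split())


def extract_features(html):
    """Return a dict of feature counts for one page of HTML."""
    parser = FeatureParser()
    parser.feed(html)
    parser.close()
    return parser.counts


def store_features(conn, url, features):
    """Write one page's features to the repository table (assumed schema)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS site_features "
        "(url TEXT PRIMARY KEY, links INTEGER, images INTEGER, "
        "forms INTEGER, words INTEGER)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO site_features VALUES (?, ?, ?, ?, ?)",
        (url, features["links"], features["images"],
         features["forms"], features["words"]),
    )
    conn.commit()


if __name__ == "__main__":
    page = '<h1>Shop now</h1><a href="/cart">Cart</a><img src="logo.png">'
    repository = sqlite3.connect(":memory:")
    store_features(repository, "http://example.com/", extract_features(page))
    print(repository.execute("SELECT * FROM site_features").fetchall())
```

Rows collected this way form a numeric feature matrix, which is the kind of preprocessed input that clustering or neural network classifiers would consume.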
