Computer Science and Engineering Faculty Publications

Classifying Web Robots by K-Means Clustering

Document Type

Conference Proceeding

Publication Date

7-2009

Abstract

Sophisticated Web robots, which offer a variety of functionality and exhibit unique traffic characteristics, account for a significant percentage of the requests and bandwidth serviced by a Web server. To adequately prepare Web servers for the continuing rise in Web robot traffic, it is necessary to gain deeper insight into its properties. In this paper, we propose to classify Web robots according to their workload characteristics, using K-means clustering as the underlying partitioning technique. We demonstrate how this approach allows Web robot traffic to be examined from new perspectives by applying it to Web robots extracted from a year-long server log collected from the University of Connecticut School of Engineering domain.
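
As a rough illustration of the partitioning technique named in the abstract, the sketch below runs plain K-means (Lloyd's algorithm) on a handful of robots. It is not the authors' implementation. The two features (requests per session and mean kilobytes per request), the six data points, and the choice of k = 2 are all invented for illustration, since this record does not state which workload characteristics the paper uses.

```python
import math
import random


def kmeans(points, k, iterations=100, seed=0):
    """Lloyd's algorithm. Returns (centroids, labels) for a list of equal-length tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # use k of the input points as initial centroids
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid (Euclidean distance).
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members))
                )
            else:
                new_centroids.append(centroids[c])  # leave an empty cluster's centroid in place
        if new_labels == labels and new_centroids == centroids:
            break  # nothing changed, so the partition has converged
        labels, centroids = new_labels, new_centroids
    return centroids, labels


# Hypothetical robots: (requests per session, mean kilobytes per request).
# These values are made up for the example and do not come from the paper.
robots = [(5, 2.0), (6, 2.5), (4, 1.8), (120, 40.0), (130, 42.0), (125, 38.0)]
centroids, labels = kmeans(robots, k=2)
print(labels)
```

In practice the features would normally be scaled to comparable ranges before clustering, because Euclidean distance is otherwise dominated by whichever feature has the largest magnitude.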

Comments

Presented at the Twenty-First International Conference on Software Engineering and Knowledge Engineering, Boston, MA, July 1-3, 2009.

Repository Citation

Doran, D., & Gokhale, S. S. (2009). Classifying Web Robots by K-Means Clustering. Proceedings of the Twenty-First International Conference on Software Engineering & Knowledge Engineering, 97-102.
https://corescholar.libraries.wright.edu/cse/258
