Kno.e.sis Publications

CloudVista: Visual Cluster Exploration for Extreme Scale Data in the Could

Document Type

Conference Proceeding

Publication Date

2011

Abstract

The problem of efﬁcient and high-quality clustering of extreme scale datasets with complex clustering structures continues to be one of the most challenging data analysis problems. An innovate use of data cloud would provide unique opportunity to address this challenge. In this paper, we propose the CloudVista framework to address (1) the problems caused by using sampling in the existing approaches and (2) the problems with the latency caused by cloud-side processing on interactive cluster visualization. The CloudVista framework aims to explore the entire large data stored in the cloud with the help of the data structure visual frame and the previously developed VISTA visualization model. The latency of processing large data is addressed by the RandGen algorithm that generates a series of related visual frames in the cloud without user's intervention, and a hierarchical exploration model supported by cloud-side subset processing. Experimental study shows this framework is effective and efﬁcient for visually exploring clustering structures for extreme scale datasets stored in the cloud.

Comments

The featured PDF document is the unpublished, peer-reviewed version of this article.

The featured abstract was published in the final version of this article, which appeared in Lecture Notes in Computer Science, volume 6809, pp. 332-350 and may be found at http://link.springer.com/content/pdf/10.1007%2F978-3-642-22351-8_21.pdf .

Repository Citation

Chen, K., Xi, H., Tian, F., & Guo, S. (2011). CloudVista: Visual Cluster Exploration for Extreme Scale Data in the Could. Lecture Notes in Computer Science, 6809, 332-350.
https://corescholar.libraries.wright.edu/knoesis/43

DOI

10.1007/978-3-642-22351-8_21

Download

Request Accessible Version

Included in

Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons

COinS

Kno.e.sis Publications

CloudVista: Visual Cluster Exploration for Extreme Scale Data in the Could

Document Type

Publication Date

Abstract

Comments

Repository Citation

DOI

Included in

Search

Browse

About

Kno.e.sis Publications

CloudVista: Visual Cluster Exploration for Extreme Scale Data in the Could

Authors

Document Type

Publication Date

Abstract

Comments

Repository Citation

DOI

Included in

Share

Search

Browse

About