Kno.e.sis Publications

Scalable Euclidean Embedding for Big Data

Zohreh S. AlaviFollow
Sagar Sharma, Wright State University - Main CampusFollow
Lu Zhou, Wright State University - Main CampusFollow
Keke Chen, Wright State University - Main CampusFollow

Document Type

Conference Proceeding

Publication Date

7-1-2015

Abstract

Euclidean embedding algorithms transform data defined in an arbitrary metric space to the Euclidean space, which is critical to many visualization techniques. At big-data scale, these algorithms need to be scalable to massive dataparallel infrastructures. Designing such scalable algorithms and understanding the factors affecting the algorithms are important research problems for visually analyzing big data. We propose a framework that extends the existing Euclidean embedding algorithms to scalable ones. Specifically, it decomposes an existing algorithm into naturally parallel components and non-parallelizable components. Then, data parallel implementations such as MapReduce and data reduction techniques are applied to the two categories of components, respectively. We show that this can be possibly done for a collection of embedding algorithms. Extensive experiments are conducted to understand the important factors in these scalable algorithms: scalability, time cost, and the effect of data reduction to result quality. The result on sample algorithms: FastMap-MR and LMDS-MR shows that with the proposed approach the derived algorithms can preserve result quality well, while achieving desirable scalability.

Comments

Presented at the 8th IEEE International Conference on Cloud Computing, New York, NY, June 27-July 2, 2015.

Repository Citation

Alavi, Z. S., Sharma, S., Zhou, L., & Chen, K. (2015). Scalable Euclidean Embedding for Big Data. .
https://corescholar.libraries.wright.edu/knoesis/1095

Download

Included in

Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons

COinS

Kno.e.sis Publications

Scalable Euclidean Embedding for Big Data

Document Type

Publication Date

Abstract

Comments

Repository Citation

Included in

Search

Browse

About

SelectedWorks Sites

Kno.e.sis Publications

Scalable Euclidean Embedding for Big Data

Authors

Document Type

Publication Date

Abstract

Comments

Repository Citation

Included in

Share

Search

Browse

About

SelectedWorks Sites