Abstract
The term Big Data was recently coined as the amount of generated and stored digital data has grown so rapidly that it has become very hard to store, manage and analyze without coming up with new techniques that can cope with such challenges.Finding innovative approaches to support big data analysis has become a priority as both the research community and the industry are trying to make use of these huge amounts of available data. In this paper we introduce a new approach to enhance the overall big data analysis performance. The approach calls for utilizing data set replication, parallel download, and parallel processing over multiple compute nodes. The main concept callsfor simultaneously parallelizing the download of the data (in partitions) from multiple replicated sites to multiple compute nodes that will also perform the analysis in parallel. Then the results are given to the client that requested the analysis.
Original language | English |
---|---|
Pages (from-to) | 98-101 |
Number of pages | 4 |
Journal | Performance Evaluation Review |
Volume | 41 |
Issue number | 4 |
DOIs | |
Publication status | Published - Mar 2014 |
Keywords
- Big Data
- Data Replication
- Dual Direction Processing
- Parallel Processing
ASJC Scopus subject areas
- Software
- Hardware and Architecture
- Computer Networks and Communications