FSBD: A framework for scheduling of big data mining in cloud computing

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

Cloud computing is seen as an emerging technology for big data mining and analytics. Cloud computing can provide data mining results in the form of a Software As a Service (SAS). Both performance and quality of mining are fundamentals criteria for the use of a data mining application provided by a Cloud computing environment. In this paper, we propose a Cloud computing framework, which is responsible to distribute and schedule a Cluster-Based data mining application and its data set. The main goal of our proposed framework for scheduling of Big Data Mining (FSBD) is to decrease the overall execution time of the application with minimum loss in mining quality. We consider the Cluster-based data mining technique as a pilot application for our framework. The results show an important speedup with a minimum loss in quality of mining. We obtained a ratio of 2 of the normalized actual makespan vis-a-vis the ideal makespan. The quality of mining scales well with the number of clusters and the increasing size of the dataset. The results are promising, encouraging the adoption of the framework by Cloud providers.

Original languageEnglish
Title of host publicationProceedings - 2014 IEEE International Congress on Big Data, BigData Congress 2014
EditorsPeter Chen, Peter Chen, Hemant Jain
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages514-521
Number of pages8
ISBN (Electronic)9781479950577
DOIs
Publication statusPublished - Sept 22 2014
Event3rd IEEE International Congress on Big Data, BigData Congress 2014 - Anchorage, United States
Duration: Jun 27 2014Jul 2 2014

Publication series

NameProceedings - 2014 IEEE International Congress on Big Data, BigData Congress 2014

Other

Other3rd IEEE International Congress on Big Data, BigData Congress 2014
Country/TerritoryUnited States
CityAnchorage
Period6/27/147/2/14

Keywords

  • Autonomous Computing
  • Cloud Computing
  • Data Mining
  • Distributed Systems
  • Divisible Load Application
  • High Performance Computing
  • Scheduling

ASJC Scopus subject areas

  • Computer Science Applications

Fingerprint

Dive into the research topics of 'FSBD: A framework for scheduling of big data mining in cloud computing'. Together they form a unique fingerprint.

Cite this