Authors:
Jorge Nereu
1
;
Ana Almeida
1
and
Jorge Bernardino
2
Affiliations:
1
ISEP and Polytechnic of Porto, Portugal
;
2
Polytechnic of Coimbra, Portugal
Keyword(s):
Big Data Analytics, BI, Open Source Big Data Platforms.
Related
Ontology
Subjects/Areas/Topics:
Artificial Intelligence
;
Big Data
;
Business Analytics
;
Cardiovascular Technologies
;
Computing and Telecommunications in Cardiology
;
Data Engineering
;
Data Management and Quality
;
Decision Support Systems
;
Decision Support Systems, Remote Data Analysis
;
Distributed and Mobile Software Systems
;
Health Engineering and Technology Applications
;
Knowledge-Based Systems
;
Parallel and High Performance Computing
;
Software Engineering
;
Symbolic Systems
Abstract:
Nowadays organizations look for Big Data as an opportunity to manage and explore their data with the objective to support decisions within its different operational areas. Therefore, it is necessary to analyse several concepts about Big Data Analytics, including definitions, features, advantages and disadvantages. By investigating today's big data platforms, current industrial practices and related trends in the research world, it is possible to understand the impact of Big Data Analytics on smaller organizations. This paper analyses the following five open source platforms for Big Data Analytics: Apache Hadoop, Cloudera, Spark, Hortonworks, and HPCC.