Big Data

Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

Introduction to Big Data

Analytics
contents
1.What is big data?
2.Characteristic of Big Data
3.Big Data Analytics, tools and its phases
5.Importance of Big Data
6.Big Data in real world 1
7.Risk of Big Data
8.Impact on IT
9.Future of Big Data
What is Big Data ??
➢ Big Data is also data but with a huge size. Big Data is a
term used to describe a collection of data that is huge in
volume and yet growing exponentially with time. In short
such data is so large and complex that none of the
traditional data management tools are able to store it or
process it efficiently

➢ To simply explain it ; Big data refers to the large, diverse sets of


information that grow at ever-increasing rates.
Big Data Characteristics :
➢ Big data is characterized by 5 V’s:
➢ Volume:
Big Data is a vast 'volumes' of data generated from many sources daily, such
as business processes, machines, social media platforms, networks, human
interactions,and many more.

➢Variety:
Variety refers to structured, semi-structured and unstructured data.Data in the form of
emails,photos, videos, audio, PDFs, devices and more poses aserious challenge for storage,
mining and analysis.

➢ Velocity:
Big data velocity deals with the speed at the data flows from sources
like application logs, business processes, networks, and social media sites,
sensors, mobile devices, etc. Velocity means “How fast produce Data”..
➢ Veracity:
Veracity means how much the data is reliable. It has many ways to
filter or translate the data. Veracity is the process of being able to
handle and manage data efficiently. Big Data is also essential in
business development.
For example, Facebook posts with hashtags.

➢ Variability:
Data variability also known as spread or dispersion, refers to how
spread out a set of data is. Variability gives users a way to describe
how much data sets vary and allows users to use statistics to
compare their data to other sets of data.
➢ Big Data
Analytics:

Big Data analytics is the


process of collecting,
organizing and analyzing
large sets of data (called
Big Data) to discover
patterns and other
useful
information.
➢ Big Data analytics can help organizations to better
understand the
information contained within the data and will
also help identify the data
that is most important to the business
and future business decisions.

➢ Analysts working with Big Data typically want the


knowledge that comes from analyzing the data
Big Data Analytics Tools:

Here are some of the key big data analytics tools :


• Hadoop - helps in storing and analyzing data
• MongoDB - used on datasets that change frequently
• Talend - used for data integration and management
• Cassandra - a distributed database used to handle chunks of data
• Spark - used for real-time processing and analyzing large amounts of
data
• STORM - an open-source real-time computational system
• Kafka - a distributed streaming platform that is used for fault-tolerant
storage
The Lifecycle Phases of Big Data Analytics:
Now, let’s review how Big Data analytics works:
• Stage 1 - Business case evaluation - The Big Data analytics lifecycle
begins with a business case, which defines the reason and goal behind
the analysis.
• Stage 2 - Identification of data - Here, a broad variety of data sources are
identified.
• Stage 3 - Data filtering - All of the identified data from the previous
stage is filtered here to remove corrupt data.
• Stage 4 - Data extraction - Data that is not compatible with the tool is
extracted and then transformed into a compatible form.
• Stage 5 - Data aggregation - In this stage, data with the same fields
across different datasets are integrated.
• Stage 6 - Data analysis - Data is evaluated using analytical and
statistical tools to discover useful information.
• Stage 7 - Visualization of data - With tools like Tableau, Power BI,
and QlikView, Big Data analysts can produce graphic visualizations
of the analysis.
• Stage 8 - Final analysis result - This is the last step of the Big Data
analytics lifecycle, where the final results of the analysis are made
available to business stakeholders who will take action.
Importance of Big Data :
Big data analytics helps organisations harness their data and use it to identify new
opportunities. That, in turn, leads to smarter business moves, more efficient operations,
higher profits and happier customers. Businesses that use big data with advanced
analytics gain value in many ways, such as:

• Reducing cost. Big data technologies like cloud-based analytics can significantly reduce costs
when it comes to storing large amounts of data (for example, a data lake). Plus, big data analytics
helps organisations find more efficient ways of doing business.

• Making faster, better decisions. The speed of in-memory analytics – combined with the
ability to analyse new sources of data, such as streaming data from IoT – helps businesses analyse
information immediately and make fast, informed decisions.

• Developing and marketing new products and services. Being able to gauge customer
needs and customer satisfaction through analytics empowers businesses to give customers what
they want, when they want it. With big data analytics, more companies have an opportunity to
develop innovative new products to meet customers’ changing needs.

You might also like