Big Dataf8

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

Data analytical computing P.

Vasanth
Assignmengt -2 1907A04G5
ECE C

Data visualization:

Data visualization is the graphical representation of information and data.


By using visual elements like charts, graphs, and maps, data visualization
tools provide an accessible way to see and understand trends, outliers, and
patterns in data. Additionally, it provides an excellent way for employees
or business owners to prevent data to non-technical audiences without
confusion. In the world of big data, data visualization tools and
technologies are essential to analyze massive amounts of information and
making data-driven decisions Advantages
Our eyes are drawn to colors and patterns. We can quickly identify red
from blue, and squares from circles. Our culture is visual, including
everything from art and advertisements to TV and movies. Data
visualization is another form of visual art that grabs our interest and
keeps our eyes on the message. When we see a chart, we quickly see
trends and outliers. If we can see something, we internalize it quickly.
It’s storytelling with a purpose. If you’ve ever stared at a massive
spreadsheet of data and couldn’t see a trend, you know how much more
effective a visualization can be.

Some other advantages of data visualization include:

• Easily sharing information.


• Interactively explore opportunities.
• Visualize patterns and relationships.
Preparing data for visualization:
Clean and great data is the most important factor for generating great visualizations. Data
comes in many formats and quite often it already contains some presentation and layout
formatting which is not understandable by another software. Therefore, in order to better
analyse the data and create interactive visualisations we need to get the data as clean and raw
as possible so that any other software can process the data.

Identify data table


Sometimes excels sheets contains multiple data tables within the same
sheets. This is not semantically correct, each sheet must have one data
table. Do not upload files/copy and paste data with multiple data tables.
Identify and isolate the data table you want to visualize.

The process is followed by the steps sucha as changing the first row of
the table as headers and adding missing headers,removing multiple
headers and merging them into one
Clean any annotations in the columns values

Numbers should be just numbers. Date must be just dates etc. If you you
need to annotate a value, add it in clear text in the description (or
additional information) field of your visualization
Blank cells and annotations
Sometimes the data contains blank cells just for layout reasons, they
actually have the value of the parent category cell. In this case fill the
value with the parent cell value. In other cases it is legitimate to have
blank cells, they denote an "unknown / not reported" value. In this case
leave them blank. And the next step is to check that all rows must have
same amount of clolumns,matching the columns headers. Clean up
summaries (total, averages, etc) and descriptive rows

Many data table contains summaries rows, like the total or average for
some columns. Sometimes you also find descriptive / headers row, just
for layout reasons. These rows will most probably break your
visualization.

Data Visualization Tools

Some of the best data visualization tools include Google Charts, Tableau,
Grafana, Chartist, FusionCharts, Datawrapper, Infogram, and
ChartBlocks etc. These tools support a variety of visual styles, be simple
and easy to use, and be capable of handling a large volume of data.
Data is becoming increasingly important every day. For any organisation,
you can understand how important data is while making crucial decisions.
For the same reason, data visualization is grabbing people's attention.
Modern data visualization tools and advanced software are on the market.
A data visualization tool is software that is used to visualize data. The
features of each tool vary, but at their most basic, they allow you to input
a dataset and graphically alter it. Most, but not all, come with pre-built
templates for creating simple visualizations.

Tableau

One of the most widely used data visualization tools, Tableau, offers
interactive visualization solutions to more than 57,000 companies.

Providing integration for advanced databases, including Teradata, SAP,


My SQL, Amazon AWS, and Hadoop, Tableau efficiently creates
visualizations and graphics from large, constantly-evolving datasets used
for artificial intelligence, machine learning, and Big Data applications.
The Pros of Tableau:

• Excellent visualization capabilities


• Easy to use
• Top class performance
• Supports connectivity with diverse data sources
• Mobile Responsive
• Has an informative community

The Cons of Tableau:

• The pricing is a bit on the higher side


• Auto-refresh and report scheduling options are not available

Dundas BI

Dundas BI offers highly-customizable data visualizations with interactive


scorecards, maps, gauges, and charts, optimizing the creation of ad-hoc,
multi-page reports. By providing users full control over visual elements,
Dundas BI simplifies the complex operation of cleansing, inspecting,
transforming, and modeling big datasets.

The Pros of Dundas BI:

• Exceptional flexibility
• A large variety of data sources and charts
• Wide range of in-built features for extracting, displaying, and
modifying data
The Cons of Dundas BI:

• No option for predictive analytics


• 3D charts not supported

Google Charts

One of the major players in the data visualization market space, Google
Charts, coded with SVG and HTML5, is famed for its capability to
produce graphical and pictorial data visualizations. Google Charts offers
zoom functionality, and it provides users with unmatched cross-platform
compatibility with iOS, Android, and even the earlier versions of the
Internet Explorer browser.

The Pros of Google Charts:

• User-friendly platform
• Easy to integrate data
• Visually attractive data graphs
• Compatibility with Google products.

The Cons of Google Charts:

• The export feature needs fine-tuning


• Inadequate demos on tools
• Lacks customization abilities
• Network connectivity required for visualization
IBM Watson

Named after IBM founder Thomas J. Watson, this high-caliber data


visualization tool uses analytical components and artificial intelligence
to detect insights and patterns from both unstructured and structured
data. Leveraging NLP (Natural Language Processing), IBM Watson's
intelligent, self-service visualization tool guides users through the entire
insight discovery operation.

The Pros of IBM Watson:

• NLP capabilities
• Offers accessibility from multiple devices
• Predictive analytics
• Self-service dashboards The Cons of IBM Watson:

• Customer support needs improvement


• High-cost maintenance

Power BI
Power BI, Microsoft's easy-to-use data visualization tool, is available for
both on-premise installation and deployment on the cloud infrastructure.
Power BI is one of the most complete data visualization tools that supports
a myriad of backend databases, including Teradata, Salesforce,
PostgreSQL, Oracle, Google Analytics, Github, Adobe Analytics, Azure,
SQL Server, and Excel. The enterprise-level tool creates stunning
visualizations and delivers real-time insights for fast decision-making.

The Pros of Power BI:

• No requirement for specialized tech support


• Easily integrates with existing applications
• Personalized, rich dashboard
• High-grade security
• No speed or memory constraints
• Compatible with Microsoft products

The Cons of Power BI:

• Cannot work with varied, multiple datasets

You might also like