Part 1 - Answer Sheet

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 5

ITS665 – Final Assessment

Part 1: Short Questions


1 Hour (9:00 am – 10:00 am)

Student Name : Puteri Yasmin Syazwani Bt Abdull Aziz

Student ID : 2019701929

Group : M3CS2465B

Please make sure you write the question number clearly


QUESTION 1
a) Briefly explain the following terms:
i) Big Data
Big data is a field that analyse, systematically extract information form, or otherwise deal with data set that
are too large or complex to be dealt by traditional data processing.

ii) Data Mining


Data mining is automated analysis of massive data sets. It is process of discovering interesting patterns and
knowledge from large amount of data
iii) Data Representation
Data representation is a form of data that is stored, processed and transmitted

(12 points)
b) Describe the benefits and challenges of data mining in today’s big data era for the
following fields:
i) Business
Benefit Challenges
1. It helps the business to make decision making 1. incomplete and noisy data to analysed
2.It can help to increase the company revenue 2. additional irrelevant information can hinder the
3. Discover strengths and weaknesses of their decision making.
competitors
3. the data collected can violates the user privacy

ii) Medical
Benefit Challenges
- Helps to identify disease patterns - The data is big and need large data storage to
- Finding the best screening methods for diseases stored it and it can be expensive
and injuries, particularly for patients in critical - Data security is lacking
conditions - Medical data is not static and need updating
- Identifying disease mechanisms and their very time
interactions with the problems of the body

(8 points)
Question 2
A database consists of six words. Let the minimum support = 50% and minimum confidence
= 80%.

a) Calculate the support value for the letter with a support count more than 1 and create the header table based on the
minimum support value.
Hint: A common item in a transaction is considered as noise. The list of items should be
sorted alphabetically.

TID List of items


T1 C, O, V, I, D
T2 C, O, R, O, N, A,
T3 F, A, K, E
T4 V, I, R, U, S,
T5 n, C, O, V
T6 V, I, R, A, a, L

Min support = (50/100) X 6 = 3

K= 1

C1

Item support
A 4
C 3
D 1
E 1
F 1
L 1
I 3
N 1
O 4
R 3
U 1
V 2

L1

ITEM SUPPORT
A 4
C 3
I 3
O 4
R 3
V 2

K2

C2

ITEM SUPPORT
AC 1
AI 1
AO 1
AR 1
AV 1
CI 1
CO 3
CR 1
CV 2
IO 1
IR 2
IV 3
OR 1
OV 2

LI

ITEM SUPPORT
CO 3
CV 2
IR 2
IV 3

K3

C3

ITEM SUPPORT
COV

You might also like