Part 1 - Answer Sheet
Part 1 - Answer Sheet
Part 1 - Answer Sheet
Student ID : 2019701929
Group : M3CS2465B
(12 points)
b) Describe the benefits and challenges of data mining in today’s big data era for the
following fields:
i) Business
Benefit Challenges
1. It helps the business to make decision making 1. incomplete and noisy data to analysed
2.It can help to increase the company revenue 2. additional irrelevant information can hinder the
3. Discover strengths and weaknesses of their decision making.
competitors
3. the data collected can violates the user privacy
ii) Medical
Benefit Challenges
- Helps to identify disease patterns - The data is big and need large data storage to
- Finding the best screening methods for diseases stored it and it can be expensive
and injuries, particularly for patients in critical - Data security is lacking
conditions - Medical data is not static and need updating
- Identifying disease mechanisms and their very time
interactions with the problems of the body
(8 points)
Question 2
A database consists of six words. Let the minimum support = 50% and minimum confidence
= 80%.
a) Calculate the support value for the letter with a support count more than 1 and create the header table based on the
minimum support value.
Hint: A common item in a transaction is considered as noise. The list of items should be
sorted alphabetically.
K= 1
C1
Item support
A 4
C 3
D 1
E 1
F 1
L 1
I 3
N 1
O 4
R 3
U 1
V 2
L1
ITEM SUPPORT
A 4
C 3
I 3
O 4
R 3
V 2
K2
C2
ITEM SUPPORT
AC 1
AI 1
AO 1
AR 1
AV 1
CI 1
CO 3
CR 1
CV 2
IO 1
IR 2
IV 3
OR 1
OV 2
LI
ITEM SUPPORT
CO 3
CV 2
IR 2
IV 3
K3
C3
ITEM SUPPORT
COV