IDA-Group Assignment Question
Learning Outcomes
On completion of this assignment, students should be able to demonstrate appropriate data analytics
approaches for a given problem (A3, PLO6).
This requires students to understand and exhibit the basic concepts in the field of data analytics,
knowledge discovery, data gathering and data mining techniques. Students should be able to
select, analyse and evaluate an appropriate data analytics approach for simple real-world problems.
Assessment
This group case study carries 50% of the total assessment mark, with 40% of the total contributed by
an individual component. The marking criteria are attached to this assignment.
Groups
Your class will be divided into groups. Each group will contain a maximum of 3 or 4 members.
In addition to the workload matrix, each member must also attach a personal reflection
report to their documentation.
Overview
For this assignment, you will choose a business domain that interests you, frame the
analytical problems to be solved, and explore them using the data analysis techniques that we have
discussed in class and explored in the labs.
You will set your own Aim and Objectives and propose an analytical solution that would
hypothetically solve the problems and deliver a positive business impact.
You may choose any current topic that interests you, including but not limited to levels of wealth,
housing, education, transportation, medical, sports, manufacturing, banking, gaming and
agriculture. You shall adopt a methodology and perform all activities within the phases it defines,
including data selection, data integration, data processing (cleaning & transformation) and
analysis steps appropriate to the scope. As a result of these activities, you should present your
entire assignment with a demonstration of the models built and communicate the insights in
terms of business benefits and their impact.
You are required to prepare individual documentation reflecting the effort undertaken in
completing the project.
• Domain Background information: Write a description of the selected dataset and project,
and its importance for your chosen company/domain. Information must be appropriately
referenced.
• If required, transform any variables that you would like to use in a different form (e.g. raw
numbers to percentages).
• Perform the relevant data analysis tasks using data mining techniques such as
classification/association/time series/clustering and identify the BI reporting solution
and/or dashboards you need to develop
• Justify why you chose these BI reporting solutions, dashboards and data mining techniques, and
why the dataset attributes are present and laid out in the fashion you proposed (feel
free to include all other relevant justifications).
• To ensure that you discuss this task properly, you must include visual samples of the
reports you produce (i.e. the screenshots of the BI report/dashboard must be presented
and explained in the written report), and also include any assumptions that you may have
made about the analysis.
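As an illustration of the variable transformation mentioned in the list above, here is a minimal sketch in Python that converts raw counts into percentages of the total. The function name to_percent and the enrolment figures are hypothetical and invented purely for illustration; your own tool (SAS or otherwise) and data will differ.

```python
def to_percent(counts):
    """Convert raw category counts to percentages of the overall total."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("cannot compute percentages when the total is zero")
    return {name: 100.0 * value / total for name, value in counts.items()}


# Hypothetical raw counts, invented purely for illustration.
enrolment = {"science": 120, "arts": 60, "business": 20}
shares = to_percent(enrolment)
print(shares)
```

Percentages make categories comparable across datasets of different sizes, which is often what a dashboard needs.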
Descriptive Analysis:
• Classification model – generate the classifier and use test data to predict the outcome,
and describe the influencing factors and values. Test the model's accuracy and
precision
• Association Rule – use to identify the relationships and similarities among itemsets
• Text Mining – use on unstructured text data for natural language processing,
sentiment analysis or recommendation analysis
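The evaluation measures named in the list above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the "yes"/"no" labels and the market baskets are all hypothetical, and the sketch shows the standard definitions of accuracy and precision for a classifier, and of support and confidence for an association rule, rather than the output of any particular tool.

```python
def accuracy_and_precision(actual, predicted, positive):
    """Return (accuracy, precision) for the given positive class label."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("label lists must be non-empty and of equal length")
    pairs = list(zip(actual, predicted))
    correct = sum(a == p for a, p in pairs)
    true_pos = sum(a == positive and p == positive for a, p in pairs)
    predicted_pos = sum(p == positive for p in predicted)
    accuracy = correct / len(actual)
    # Precision is undefined if nothing was predicted positive; 0.0 is used here by convention.
    precision = true_pos / predicted_pos if predicted_pos else 0.0
    return accuracy, precision


def support_and_confidence(transactions, antecedent, consequent):
    """Return (support, confidence) of the rule antecedent -> consequent."""
    if not transactions:
        raise ValueError("at least one transaction is required")
    antecedent, consequent = set(antecedent), set(consequent)
    n_antecedent = sum(antecedent <= t for t in transactions)
    n_both = sum((antecedent | consequent) <= t for t in transactions)
    support = n_both / len(transactions)
    confidence = n_both / n_antecedent if n_antecedent else 0.0
    return support, confidence


# Hypothetical held-out test labels and model predictions.
actual = ["yes", "no", "yes", "yes", "no", "no", "yes", "no"]
predicted = ["yes", "no", "no", "yes", "yes", "yes", "yes", "no"]
acc, prec = accuracy_and_precision(actual, predicted, positive="yes")

# Hypothetical market baskets, each a set of purchased items.
baskets = [{"bread", "milk"}, {"bread", "butter"}, {"bread", "milk", "butter"}, {"milk"}]
sup, conf = support_and_confidence(baskets, {"bread"}, {"milk"})
print(acc, prec, sup, conf)
```

Reporting accuracy and precision together is useful because a model can score well on one and poorly on the other, especially when the classes are imbalanced.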
Getting datasets
Every project must involve at least one dataset. There are many interesting and freely available
datasets that you can find on the internet, such as social networking data, airline data,
weather forecasting data and much more.
Examples of Open Datasets:
https://www.data.gov.my/
https://data.world/
https://www.kdnuggets.com/datasets/government-local-public.html
https://github.com/awesomedata/awesome-public-datasets
AWS Open Datasets: https://registry.opendata.aws/
Deliverables
Presentation
Students must be able to demonstrate the deliverables using SAS or any other suitable tool(s) for
data preparation and analytics. You will be required to interpret the results according to the
objectives and scope specified in your assignment.
Marking Criteria
(Illustration of each model and technique used + final result. Supporting evidence from software tools used during
presentation)
Accuracy in meeting Objectives/ Business Goal & Completion of Assignment 10%
(overall achievement and effort delivered in solving problem)
Workload matrix & Personal reflection report 10%
Total 100%
Note: If a group cannot be formed due to insufficient student numbers or other reasons approved by the module
lecturer, the marking criteria above will be treated as a 100% individual component (all criteria marked as
individual component).