Quiz 10 - Regression, Cluster Analysis, & Association Analysis
1. What is the main difference between classification and
regression?
In simple linear regression, the input has only categorical variables. In multiple
linear regression, the input can be a mix of categorical and numerical variables.
In simple linear regression, the input has only one variable. In multiple
linear regression, the input has more than one variable.
In simple linear regression, the input has only categorical variables. In multiple
linear regression, the input has only numerical variables.
They are just different terms for linear regression with one input variable.
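As an illustration of the options above, here is a minimal sketch of how the two kinds of linear regression differ only in the number of input variables. The function name `predict_linear` and the weights, bias, and input values are hypothetical, chosen for illustration rather than fitted to any data.

```python
def predict_linear(weights, bias, features):
    """Linear regression prediction: bias + sum of weight_i * feature_i."""
    if len(weights) != len(features):
        raise ValueError("one weight is needed per input variable")
    return bias + sum(w * x for w, x in zip(weights, features))


# Simple linear regression: exactly one input variable.
simple_prediction = predict_linear([2.0], 1.0, [3.0])

# Multiple linear regression: more than one input variable.
multiple_prediction = predict_linear([2.0, 0.5], 1.0, [3.0, 4.0])

print(simple_prediction)    # 1 + 2*3 = 7.0
print(multiple_prediction)  # 1 + 2*3 + 0.5*4 = 9.0
```

The same prediction rule serves both cases; only the length of the input changes.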
5. The goal of cluster analysis is
To segment data so that differences between samples in the same cluster are
maximized and differences between samples of different clusters are minimized.
To segment data so that all samples are evenly divided among the clusters.
To segment data so that all categorical variables are in one cluster, and all
numerical variables are in another cluster.
To segment data so that differences between samples in the same cluster
are minimized and differences between samples of different clusters are
maximized.
7. A cluster centroid is
Assign each sample to the closest centroid, then calculate the new centroid.
Calculate the centroids, then determine the appropriate stopping criterion
depending on the number of centroids.
Calculate the distances between the cluster centroids, then find the two closest
centroids.
Count the number of samples, then determine the initial centroids.
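The step "assign each sample to the closest centroid, then calculate the new centroid" can be sketched as a small k-means loop. This is a rough illustration under stated assumptions, not a reference implementation: the helper names, the sample points, the initial centroids, the Euclidean distance, and the stopping rule (stop when the centroids no longer change) are all choices made for this example.

```python
import math


def assign_to_centroids(samples, centroids):
    """Return, for each sample, the index of its closest centroid."""
    return [
        min(range(len(centroids)), key=lambda k: math.dist(s, centroids[k]))
        for s in samples
    ]


def update_centroids(samples, labels, centroids):
    """Recompute each centroid as the mean of the samples assigned to it."""
    updated = []
    for k, old in enumerate(centroids):
        members = [s for s, label in zip(samples, labels) if label == k]
        if not members:
            updated.append(old)  # assumption: an empty cluster keeps its centroid
            continue
        updated.append(
            tuple(sum(m[d] for m in members) / len(members) for d in range(len(old)))
        )
    return updated


def k_means(samples, initial_centroids, max_iter=100):
    """Repeat the assign/update steps until the centroids stop changing."""
    centroids = [tuple(c) for c in initial_centroids]
    labels = []
    for _ in range(max_iter):
        labels = assign_to_centroids(samples, centroids)
        updated = update_centroids(samples, labels, centroids)
        if updated == centroids:
            break
        centroids = updated
    return centroids, labels


samples = [(1.0, 1.0), (1.0, 2.0), (8.0, 8.0), (9.0, 8.0)]
centroids, labels = k_means(samples, [(0.0, 0.0), (10.0, 10.0)])
print(labels)     # [0, 0, 1, 1]
print(centroids)  # [(1.0, 1.5), (8.5, 8.0)]
```

Each centroid ends up at the mean of the samples in its cluster, which keeps samples in the same cluster close together.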
To find the most complex rules to explain associations between as many items as
possible in the data.
To find the number of outliers in the data.
To find rules to capture associations between items or events.
To find the number of clusters for cluster analysis.
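To make "rules to capture associations between items" concrete, the sketch below scores one candidate rule using support and confidence, two measures commonly used in association analysis. The transactions, item names, and function names are hypothetical illustration data, not taken from the quiz.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Support of antecedent and consequent together, divided by support of antecedent.

    Assumes the antecedent appears in at least one transaction.
    """
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer"},
    {"milk", "diapers", "beer"},
    {"bread", "milk", "diapers"},
]

# Candidate rule: {diapers} -> {beer}
rule_support = support(transactions, {"diapers", "beer"})          # 2 of 4 transactions
rule_confidence = confidence(transactions, {"diapers"}, {"beer"})  # 2 of the 3 with diapers
print(rule_support, rule_confidence)
```

A rule is usually kept only if both values clear thresholds chosen by the analyst.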