0% found this document useful (0 votes)
3 views

QB for AIML (3)

Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

QB for AIML (3)

Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

1.

What is Artificial Intelligence (AI), and how does it relate to Machine Learning
(ML)?
2. List and describe three real-world applications of Machine Learning.
3. Explain the differences between supervised, unsupervised, and reinforcement learning
with examples.
4. What role does data play in Machine Learning, and why is it crucial for building
models?

5. What is clustering in Machine Learning, and how does it differ from classification?
6. Write a Python program using Matplotlib to plot a bar chart for product sales data.
7. How does reinforcement learning work? Provide an example application.
8. Define supervised learning and name two commonly used algorithms.
9. Explain how Machine Learning is applied in the healthcare industry. Provide two
examples.
10. What challenges might arise when implementing Machine Learning in real-world
scenarios?
11. How can Machine Learning improve fraud detection systems in banking?
12. What are the key features of the Pandas library that make it important for data
manipulation?
13. Compare supervised and unsupervised learning. Provide an example use case for
each.
14. Explain the concept of linear regression. What is its primary use case?
15. Define logistic regression. How is it different from linear regression?
16. What is a decision tree, and how does it split data at each node?
17. Explain the working of the Random Forest algorithm. Why is it considered robust?
18. What are ensemble learning techniques? Name and describe two common types.
19. Define loss function in Machine Learning. Why is it important in training models?
20. What is overfitting, and how can it be prevented?
21. Explain underfitting and provide an example of when it might occur.
22. What are imbalanced classes, and why are they problematic in classification tasks?
23. How does the Random Forest algorithm reduce overfitting compared to a single
decision tree?
24. What is the difference between bagging and boosting in ensemble learning?
25. How does logistic regression handle binary classification problems?
26. Why is cross-validation essential when dealing with overfitting and underfitting?
27. What metrics can be used to evaluate models trained on imbalanced classes? Provid
28. Define clustering in the context of unsupervised learning. What is the difference
between hard clustering and soft clustering?
29. Explain the working of the K-Means clustering algorithm. How is the number of
clusters determined?
30. What are hierarchical clustering methods? How do agglomerative and divisive
clustering differ?
31. What is a dendrogram, and how is it used in hierarchical clustering?
32. What is the curse of dimensionality, and why does it affect clustering algorithms?
33. Explain the importance of dimensionality reduction techniques in machine learning.
34. What is feature selection, and why is it important in machine learning?
35. Compare dimensionality reduction with feature selection. Provide an example of each.
36. What are the advantages and limitations of using K-Means for clustering?
37. Explain the difference between feature extraction and feature selection. Provide
examples of algorithms used for each.
38. What is the purpose of model evaluation in Machine Learning?
39. Explain the train-test split method. Why is it important for model evaluation?
40. Define Mean Squared Error (MSE) and explain its significance in regression tasks.
41. What is the difference between MSE and Root Mean Squared Error (RMSE)?
42. Define Mean Absolute Error (MAE). How does it differ from MSE?
43. Explain the R-squared metric. What does an R-squared value of 0.9 indicate?
44. How can you use a confusion matrix to compute precision and recall? Provide a
formula for each.
45. What is one-hot encoding, and why is it used in Machine Learning?
46. Explain the difference between normalization and standardization. When would you
use each?
47. What is stacking in ensemble learning, and how does it work?
48. consider a binary classification issue where we have a dataset of 10 examples with
two classes: "Positive" and "Negative". Out of the 10 examples, 6 have a place with
the "Positive" class and 4 have a place with the "Negative" class. calculate the Gini
index.
49. Create a decision tree using gini index to classify following dataset for profit.
Age Competition Type Profit
Old Yes Software Down
Old No Software Down
Old No Hardware Down
Mid Yes Software Down
Mid Yes Hardware Down
Mid No Hardware Up
Mid No Software Up
New Yes Software Up
New No Hardware Up
New No Software Up

50. Suppose we have a binary classification problem where we are predicting whether an
email is spam or not spam. Let's assume that we have the following results from the
classifier:
50 emails were spam, and the model predicted 40 of them as spam, but 10 were
predicted as not spam.
50 emails were not spam, and the model predicted 45 of them as not spam, but 5 were
predicted as spam.

Predicted Spam (Yes) Predicted Not Spam (No)


Actual 40 10
Spam(Yes)
Actual Not 5 45
Spam (No)

Compute Accuracy, Precision, Recall, F1 Score.


51. apply k-means on the 1-dimensional data set for K=2.Data set {2, 4, 10, 12, 3, 20, 30,
11, 25} with centroids M1=4 and M2=11.
52. Consider the five datapoints P1(1,2,3),P2(0,1,2),P3(3,0,5),P4(4,1,3),P5(5,0,1) and
centroids are C1(0,0,1) C2(0,1,1) apply k-means algorithm on dataset.
53. Solve the example by using agglomerative hierarchical analysis with min distance and
proximity matrix by using datapoints 12,16,20,26,27,39. And draw the dendrogram
54. Solve the example by using agglomerative hierarchical analysis with min distance and
proximity matrix by using datapoints 18,22,25,42,27,43 and draw the dendrogram

55. Solve the example by using


P q r s t divisive hierarchical
p 0 8 6 11 4 clustering analysis
q 8 0 19 14 15
r 6 19 0 5 18
s 11 14 5 0 7
t 4 15 18 7 0

56. Reduce the dimensionality from 2 to 1 dimensional by using principal component


analysis (PAC).
Example 1 Example 2 Example 3 Example 4 Example 5 Example 6
X1 5 9 14 8 7 3
X2 12 5 6 15 9 10

57.

58. What is time series analysis, and what kind of problems can it help solve?
59. What are some common methods used to forecast future values in time series
analysis?
60. What is the advantage of using MongoDB over relational databases like MySQL in
certain applications?

You might also like