Question 1
How does the Pearson correlation coefficient differ from the Spearman rank correlation coefficient?
Pearson is for categorical data; Spearman is for numerical data
Pearson assumes linear relationships; Spearman assesses monotonic relationships
Pearson is non-parametric; Spearman is parametric
Pearson handles outliers better than Spearman
Question 2
How does "Non-Negative Matrix Factorization" (NMF) contribute to dimensionality reduction in data analysis?
By transforming features into a lower-dimensional space
By assigning equal importance to all features
By evaluating the correlation between features
By measuring the entropy of each feature
Question 3
What is the purpose of the term "Hierarchical Clustering" in clustering analysis?
Assessing the correlation between clusters
Creating a hierarchy of clusters based on similarities
Identifying outliers in clustered data
Measuring the similarity within clusters
Question 4
What is the primary purpose of the term "Confusion Matrix" in classification problems?
Assessing multicollinearity in regression models
Evaluating the distribution of residuals
Summarizing the performance of a classification model
Identifying outliers in a dataset
Question 5
In time series analysis, what does the term "Exponential Smoothing" refer to?
Identifying outliers in time series data
Handling missing values in time series data
Forecasting future values by giving more weight to recent observations
Assessing the autocorrelation between time series and lagged values
Question 6
How does the term "Ensemble Learning" improve model performance in machine learning?
Reducing model complexity
Combining predictions from multiple models
Handling outliers by giving less weight to extreme values
Ensuring that features contribute equally to a model
Question 7
What is the purpose of the term "Multicollinearity" in regression analysis?
Identifying outliers in a dataset
Assessing the spread of data
Evaluating the correlation between predictor variables
Handling missing values in regression models
Question 8
What does the term "confidence interval" represent in statistical analysis?
The range of values within which a population parameter is estimated to lie
The average of sample values
The proportion of data falling within a specified range
The standard error of the mean
Question 9
How does the term "Binning" contribute to feature engineering in data analysis?
Converting numerical features into categorical bins
Removing outliers from a dataset
Transforming features into a lower-dimensional space
Filling missing values in a dataset
Question 10
In data analysis, what does the term "Lift" signify in the context of a predictive model?
The ratio of true positives to false positives
The improvement in predictive performance compared to a random model
The increase in model complexity
The impact of outliers on model predictions
There are 27 questions to complete.