This set of 60 MCQs covers the fundamentals of correlation and covariance, including types like Pearson and Spearman, their calculations, interpretations, and applications in data analysis. Ideal for students and professionals exploring statistical relationships between variables.
60 Important Correlation and Covariance MCQs
✅ Correct Answer: b) How they vary together from their means
📝 Explanation:
Covariance indicates the direction of linear relationship: positive if variables move together, negative if oppositely.
✅ Correct Answer: c) Linear
📝 Explanation:
Pearson's r measures the strength and direction of the linear association between two continuous variables.
✅ Correct Answer: b) -1 to 1
📝 Explanation:
Values close to 1 or -1 indicate strong positive or negative correlation; 0 indicates no linear relationship.
✅ Correct Answer: a) Dividing by the product of standard deviations
📝 Explanation:
Pearson's r = Cov(X,Y) / (SD_X * SD_Y), making it unitless and comparable.
✅ Correct Answer: b) Spearman
📝 Explanation:
Spearman's rank correlation assesses monotonic relationships without assuming normality.
✅ Correct Answer: a) Variances of variables
📝 Explanation:
Off-diagonals show pairwise covariances; diagonals are variances (Cov(X,X) = Var(X)).
✅ Correct Answer: b) Strong negative relationship
📝 Explanation:
Values between -0.7 and -1 suggest a strong inverse linear association.
✅ Correct Answer: b) Monotonic association
📝 Explanation:
It counts concordant and discordant pairs, suitable for small samples or ties.
✅ Correct Answer: b) Σ(X_i - x̄)(Y_i - ȳ) / (n-1)
📝 Explanation:
Uses n-1 for unbiased estimate; x̄ and ȳ are sample means.
✅ Correct Answer: b) Causation
📝 Explanation:
Spurious correlations can occur without causal links; further analysis needed.
✅ Correct Answer: b) Covariance
📝 Explanation:
Covariance has units and scales with data; correlation is standardized and more robust.
✅ Correct Answer: b) One or more other variables
📝 Explanation:
It removes the effect of confounders to isolate direct association.
✅ Correct Answer: b) r ≈ 1
📝 Explanation:
Points aligned with positive slope show strong positive linear correlation.
✅ Correct Answer: b) One continuous and one binary variable
📝 Explanation:
It's a special case of Pearson for dichotomous predictors.
✅ Correct Answer: c) Variance
📝 Explanation:
Cov(X,X) = Var(X), the spread of the variable.
✅ Correct Answer: b) Ranked data
📝 Explanation:
It applies Pearson correlation to ranks, handling non-normal data.
✅ Correct Answer: a) 1
📝 Explanation:
Correlation of a variable with itself is perfect (r=1).
✅ Correct Answer: b) Ordinal variables assuming latent continuity
📝 Explanation:
It estimates correlation from underlying normal variables.
✅ Correct Answer: b) 0
📝 Explanation:
No linear co-variation implies uncorrelated variables.
✅ Correct Answer: a) Correlation
📝 Explanation:
Both indicate direction: positive or negative association.
✅ Correct Answer: b) From an underlying continuous distribution
📝 Explanation:
Used when the dichotomy is artificial, like pass/fail.
✅ Correct Answer: a) ρ = 0
📝 Explanation:
Tests if the true population correlation is zero (no association).
✅ Correct Answer: b) Two binary variables
📝 Explanation:
Assumes underlying bivariate normal distribution.
✅ Correct Answer: a) Units of measurement
📝 Explanation:
Changing units (e.g., cm to m) scales covariance, unlike correlation.
✅ Correct Answer: b) Two binary variables
📝 Explanation:
It's Pearson's r applied to 2x2 contingency tables.
✅ Correct Answer: a) Both high or both low
📝 Explanation:
They deviate from means in the same direction.
✅ Correct Answer: b) Relationship between one variable and multiple others
📝 Explanation:
R is the correlation between observed and predicted values in regression.
✅ Correct Answer: b) Within-group reliability
📝 Explanation:
Used for clustered data, like inter-rater agreement.
✅ Correct Answer: c) Zero
📝 Explanation:
Independence implies uncorrelatedness (for linear relationships).
✅ Correct Answer: b) Nominal categorical variables
📝 Explanation:
Derived from chi-square, ranges 0-1 for contingency tables.
✅ Correct Answer: a) n in denominator
📝 Explanation:
For r, it's Σ deviations product divided by (n * SD_X * SD_Y), but consistent with covariance's n-1 adjustment.
✅ Correct Answer: a) Two sets of variables
📝 Explanation:
Finds linear combinations maximizing correlation between groups.
✅ Correct Answer: b) No relationship at all
📝 Explanation:
Non-linear relationships (e.g., quadratic) can exist with r=0.
✅ Correct Answer: a) PCA for dimensionality reduction
📝 Explanation:
Eigen decomposition reveals principal components.
✅ Correct Answer: a) Ties in data
📝 Explanation:
Adjusts for tied ranks in ordinal comparisons.
✅ Correct Answer: b) Variables move in opposite directions
📝 Explanation:
One increases as the other decreases from means.
✅ Correct Answer: a) Categorical prediction uncertainty
📝 Explanation:
Measures how one variable reduces uncertainty in another.
✅ Correct Answer: a) Covariance with lagged self
📝 Explanation:
Normalized to detect patterns like seasonality.
✅ Correct Answer: a) Linearity and homoscedasticity
📝 Explanation:
Assumes bivariate normality for significance tests.
✅ Correct Answer: a) Similarity between two series at lags
📝 Explanation:
Used in signal processing for time shifts.
✅ Correct Answer: b) Little linear co-variation
📝 Explanation:
But magnitude depends on scales; check correlation for standardization.
✅ Correct Answer: a) Ties in Kendall's tau
📝 Explanation:
Adjusts tau-a for tied observations.
✅ Correct Answer: b) Independence assumptions
📝 Explanation:
Non-zero residual correlations violate model assumptions.
✅ Correct Answer: b) Asset co-movement
📝 Explanation:
Low or negative covariance reduces overall portfolio risk.
✅ Correct Answer: a) Ordinal data, ignoring ties
📝 Explanation:
Proportion of concordant minus discordant pairs.
✅ Correct Answer: a) Multicollinearity severity
📝 Explanation:
Near zero suggests singular matrix, high redundancy.
✅ Correct Answer: c) Monotonic
📝 Explanation:
Ranks preserve monotonic relationships exactly.
✅ Correct Answer: c) Quadratic relationship
📝 Explanation:
Curvilinear patterns cancel linear signal.
✅ Correct Answer: c) Pearson
📝 Explanation:
Use method='spearman' for ranks.
✅ Correct Answer: a) Mean and variance over time
📝 Explanation:
Covariances depend only on lag, not time.
✅ Correct Answer: a) 2x2 tables
📝 Explanation:
Based on odds ratio for dichotomous variables.
✅ Correct Answer: b) Pairwise relationship
📝 Explanation:
Focuses on two variables at a time.
✅ Correct Answer: b) Pearson correlation
📝 Explanation:
For two ranges of data.
✅ Correct Answer: a) No impact
📝 Explanation:
Correlation is scale-invariant but assumes homoscedasticity for inference.
✅ Correct Answer: a) Asymmetric nominal association
📝 Explanation:
Reduces prediction error using one variable to predict another.
✅ Correct Answer: a) Variance explained
📝 Explanation:
Proportion of variance in Y predictable from X.
✅ Correct Answer: b) Correlation matrix
📝 Explanation:
Default is Pearson; specify method for others.
✅ Correct Answer: a) Ranks agree in order
📝 Explanation:
Both pairs increase or decrease together.
✅ Correct Answer: b) Correlation
📝 Explanation:
Z-scores have SD=1, so Cov(Zx, Zy) = r.
✅ Correct Answer: b) Ice cream sales and drownings
📝 Explanation:
Both correlated with summer heat, not causally.
✅ Correct Answer: b) Latent structure
📝 Explanation:
High inter-correlations suggest common factors.
✅ Correct Answer: a) Nominal predictor on continuous outcome
📝 Explanation:
Non-linear analog of point-biserial.
✅ Correct Answer: a) Variance
📝 Explanation:
Cov(Y_t, Y_t) = Var(Y).
✅ Correct Answer: a) Stabilize variance of r for inference
📝 Explanation:
z = 0.5 ln((1+r)/(1-r)) for hypothesis tests.
Related Posts
New
New
New
120 Data Cleaning and Preprocessing in Data Analysis - MCQs
120 industry-level multiple-choice questions on data cleaning, handling missing values, outliers, encoding, scaling, and preprocessing pipelines—modeled after real data scientist…
November 8, 2025By MCQs Generator
50 Hypothesis Testing in Data Analysis - MCQs
This set of 50 MCQs explores key concepts in hypothesis testing, including null and alternative hypotheses, p-values, test statistics, error…
November 8, 2025By MCQs Generator
100 Descriptive, Inferential, and Time Series Statistics in Data Analysis - MCQs
100 challenging multiple-choice questions on descriptive statistics, inferential methods, and time series analysis. Inspired by real data science and analytics…
November 8, 2025By MCQs Generator