MCQs cover the fundamentals of Exploratory Data Analysis, covering data summarization, visualization techniques, handling anomalies, and inferring patterns from datasets. Ideal for data analysts and scientists to reinforce EDA practices using statistical and graphical methods.
1. What is the primary goal of Exploratory Data Analysis (EDA)?
2. Which statistic measures the central tendency of a dataset?
3. A histogram is primarily used for visualizing
4. What does the interquartile range (IQR) represent?
5. In EDA, missing values are often identified using
6. Which plot is best for detecting outliers in numerical data?
7. The correlation coefficient ranges from
8. Skewness measures the
9. A scatter plot visualizes
10. What is the purpose of a pair plot in EDA?
11. Kurtosis describes the
12. For categorical data, which visualization is appropriate?
13. The standard deviation measures
14. In EDA, data scaling is often checked for
15. A heatmap is used to visualize
16. What indicates a normal distribution in EDA?
17. Violin plots combine
18. Multicollinearity is detected using
19. For time series EDA, which plot shows trends over time?
20. The mode is the value that
21. In EDA, pivot tables are used for
22. A Q-Q plot assesses
23. Handling outliers in EDA often involves
24. Categorical variables are encoded in EDA using
25. The coefficient of variation (CV) is
26. Seaborn library in Python is popular for EDA because it
27. In bivariate analysis, a low correlation implies
28. Data profiling in EDA includes
29. A kernel density estimate (KDE) plot shows
30. For imbalanced classes in EDA, check
31. The five-number summary includes
32. In EDA, feature engineering starts with
33. A lag plot in time series EDA detects
34. Z-score for outlier detection is
35. Count plots are used for
36. In EDA, dimensionality reduction preview uses
37. Median is robust to
38. Joint plots in Seaborn combine
39. Variance is the average of
40. For text data in EDA, start with
41. Autocorrelation function (ACF) plot identifies
42. In EDA, data types include
43. A facet grid in EDA allows
44. Pearson correlation assumes
45. Box plot whiskers typically extend to
46. In EDA, resampling checks
47. Cramér's V measures
48. Log transformation in EDA is used to
49. A swarm plot displays
50. In multivariate EDA, parallel coordinates plot
51. The range is
52. For geospatial data in EDA, use
53. Chi-square test in EDA checks
54. Strip plots show
55. In EDA, cross-validation previews
56. Quantile-quantile (Q-Q) plot compares
57. For ordinal data, use
58. Partial dependence plots explain
59. In EDA, binning continuous data creates
60. Spearman's rank correlation is
61. A ridgeline plot visualizes
62. EDA documents findings via
63. Theil's U measures
64. In time series, decomposition separates
65. For high-cardinality categoricals in EDA, use
66. A contour plot shows
67. EDA hypothesis generation leads to
68. Mahalanobis distance detects
69. In EDA, groupby operations compute
70. A hexbin plot is a
71. EDA for regression checks
72. Polychoric correlation for
73. In EDA, melting data changes
74. SHAP values in EDA preview
75. For survival data EDA, Kaplan-Meier estimates
76. A sunburst plot visualizes
77. In EDA, Levene's test checks
78. Treemap displays
79. EDA for clustering previews with
80. Biserial correlation for
81. In EDA, pivot_longer in R or melt in Python
82. LIME explains
83. For network data EDA, use
84. Anderson-Darling test assesses
85. Sankey diagram illustrates
86. EDA for classification includes
87. Tetrachoric correlation assumes
88. In EDA, dcast or pivot in tools
89. Counterfactual explanations in EDA show
90. For audio data EDA, spectrograms visualize
91. Kolmogorov-Smirnov test compares
92. Alluvial plot shows
93. In EDA for anomaly detection, isolation forest previews
94. Phi coefficient for
95. Gather function in tidyverse
96. Anchored explanations focus on
97. For image data EDA, use
98. Shapiro-Wilk test for
99. Chord diagram visualizes
100. EDA for recommendation systems includes
101. Contingency coefficient for
102. Spread function in R
103. Prototype-based explanations use
104. For video data EDA, frame sampling and
105. Jarque-Bera test combines
106. Parallel sets plot for
107. In reinforcement learning EDA, check
108. Lambda coefficient for
109. Unpivot in data tools
110. Subgroup explanations in EDA target
111. For graph data EDA beyond basics, centrality measures like
112. D'Agostino's K-squared test for
113. Mosaic plot for
114. EDA in causal inference previews with
115. Uncertainty coefficient for
116. Pivot_table in pandas for
117. Contrastive explanations compare
118. For tabular data EDA, automated tools like
119. Lilliefors test is a
120. Dot plot alternative to
121. In federated learning EDA, focus on
122. Goodman-Kruskal gamma for
123. Reshape in Python for
124. Input gradient explanations use
125. For sensor data EDA, time-frequency analysis like
126. Cramér-von Mises test for
127. Streamgraph for
128. EDA in A/B testing checks
129. Kappa coefficient for
130. Tidy data principles in EDA ensure
131. Guided backpropagation for
132. For financial time series EDA, candlestick charts show
133. Anderson-Rubin test in
134. Waffle chart for
135. EDA for NLP includes
136. Yule's Q for
137. Stack in pandas for


