What is the purpose of a Chi-square test for independence?

  • To compare the means of two groups
  • To compare the variance of two groups
  • To test for a relationship between two categorical variables
  • To test the difference between an observed distribution and a theoretical distribution
The Chi-square test for independence is used to test for a relationship or association between two categorical variables.

How many groups or variables does a two-way ANOVA test involve?

  • 1
  • 2
  • 3 or more
  • Not restricted
A two-way ANOVA involves two independent variables, each with any number of levels/groups. It allows simultaneous analysis of the effects of these variables.

A negative Spearman's rank correlation coefficient indicates a(n) ________ association between two variables.

  • Direct
  • Inverse
  • Positive
  • Strong
A negative Spearman's rank correlation coefficient indicates an inverse association between two variables. That is, as one variable increases, the other tends to decrease.

Why might the confidence interval for a proportion be skewed?

  • Because of a large sample size
  • Because of a small sample size
  • Because the proportion is around 0.5
  • Because the proportion is close to 0 or 1
A confidence interval for a proportion might be skewed when the proportion is very close to 0 or 1. In these cases, the distribution of sample proportions is not symmetrical, leading to skewed intervals.

How does the Kruskal-Wallis Test handle ties between ranks?

  • Assigns them average ranks
  • Discards them
  • Ignores them
  • Treats them as errors
When two or more data points have the same value, they are considered tied. The Kruskal-Wallis Test assigns them the average of the ranks that the tied values would have received had they been different.

How does Bayes' theorem assist in decision making under uncertainty?

  • It eliminates all uncertainty
  • It proves the correctness of an assumption
  • It provides a method for incorporating new data to update our beliefs
  • It reduces the data needed for decision making
Bayes' Theorem provides a mathematical framework for updating probabilities, which can be interpreted as degrees of belief, based on the evidence at hand. Thus, it assists in decision making under uncertainty by allowing for the incorporation of new information.

What does inferential statistics allow you to do?

  • Collect data
  • Describe data
  • Organize data
  • Predict or make inferences about a population
Inferential statistics is a branch of statistics that allows us to use data from a sample to infer or predict trends about the overall population. This technique is immensely useful as it's often impractical or impossible to collect data from an entire population. Inferential statistics makes use of various techniques such as probability, hypothesis testing, correlation, and regression to draw conclusions.

What is the key difference between a t-test and an ANOVA?

  • t-test is for one variable, ANOVA is for two variables
  • t-test is for three groups, ANOVA is for two groups
  • t-test is for two groups, ANOVA is for three or more groups
  • t-test is for two variables, ANOVA is for one variable
The key difference between a t-test and an ANOVA is the number of groups being compared. A t-test is used to compare the means of two groups, while ANOVA is used to compare the means of three or more groups.

What is the difference between a one-tailed and a two-tailed test?

  • The directionality of the hypothesis
  • The number of samples being compared
  • The number of times the test is performed
  • The types of data being used
The main difference between one-tailed and two-tailed tests is the directionality of the hypothesis. One-tailed tests look for an effect in a specific direction, while two-tailed tests look for an effect in either direction.

In hierarchical clustering, a ________ is used to visualize the hierarchy of clusters.

  • bar chart
  • dendrogram
  • histogram
  • pie chart
In hierarchical clustering, a dendrogram is used to visualize the hierarchy of clusters. It is a tree-like diagram that records the sequences of merges or splits.

The measure of dispersion that is the square root of the variance is known as the _______.

  • Mean
  • Median
  • Range
  • Standard Deviation
The standard deviation is the square root of the variance. It measures the average distance between each data point and the mean. Like the variance, it expresses the dispersion of data around the mean, but unlike the variance, its units are the same as the original data, making it easier to interpret.

What is the primary purpose of Principal Component Analysis (PCA)?

  • To calculate the mean of data
  • To classify data
  • To reduce dimensionality of data
  • To visualize data
The primary purpose of PCA is to reduce the dimensionality of data while maintaining as much information as possible. It transforms the data into a new, lower-dimensional set of variables that are uncorrelated and that explain the maximum possible amount of variance in the data.