When two variables increase and decrease together, they are said to have a ________ correlation.

  • negative
  • positive
  • strong
  • zero
When two variables increase and decrease together, they are said to have a positive correlation. This is indicated by a positive Pearson's Correlation Coefficient.

When two or more independent variables in a regression model are highly correlated, it's known as ________.

  • Collinearity
  • Interaction
  • Multicollinearity
  • Overfitting
This is known as multicollinearity. In regression analysis, multicollinearity refers to a situation where two or more independent variables are highly correlated. This can make it difficult to determine the effect of each individual variable on the dependent variable and can lead to unstable and unreliable estimates.

A statistical technique that uses several explanatory variables to predict the outcome of a response variable is called ________.

  • ANOVA
  • correlation
  • multiple linear regression
  • simple linear regression
Multiple linear regression is a statistical technique used to predict the outcome of a response variable based on the value of two or more explanatory variables.

What are the key properties of a Bernoulli distribution?

  • It can only take positive integer values
  • It has a bell-shaped curve
  • It has a single trial with two possible outcomes
  • It models a series of independent trials
A Bernoulli distribution is a discrete probability distribution of a random variable which takes the value 1 with probability p and the value 0 with probability q=1-p. It models a single trial with two possible outcomes, often labelled 'success' and 'failure'.

What is the maximum value that a probability can take?

  • 1
  • 10
  • 100
  • Infinity
The maximum value that a probability can take is 1. This signifies that an event is certain to occur. In probability theory, probabilities range from 0 (implying the event is impossible) to 1 (implying the event is certain).

What does polynomial regression allow you to model?

  • Correlations
  • Data distribution
  • Non-linear relationships
  • Relationships between variables
Polynomial regression allows modeling of non-linear relationships. Unlike linear regression that models relationships between variables as a straight line, polynomial regression models relationships as curves, better capturing relationships that change in direction at different levels of the independent variables.

What is the relationship between variance and the square of the standard deviation?

  • Standard deviation is always larger
  • They are the same
  • Variance is always larger
  • Variance is the square root of the standard deviation
Variance and the square of the standard deviation are the same. The variance is calculated as the mean of the squared deviations from the mean, and the standard deviation is the square root of this variance. Hence, squaring the standard deviation gives us the variance.

In probability, an ________ is the set of possible results of an experiment.

  • Event
  • Outcome
  • Probability Space
  • Sample Space
In probability theory, an "outcome" is a possible result of an experiment or trial. For example, if you toss a coin, the possible outcomes are heads or tails. Each outcome of an experiment corresponds to a unique event.

When adding polynomial terms or interaction effects, what key assumption of regression might be violated?

  • Homoscedasticity
  • Independence of observations
  • Linearity
  • Normality of errors
When adding polynomial terms or interaction effects to a regression model, the assumption of linearity might be violated. The linearity assumption in regression analysis states that the relationship between the independent and dependent variables is linear, i.e., a change in the independent variable will result in a constant change in the dependent variable. When adding polynomial terms or interaction effects, we are essentially modeling a non-linear relationship.

How can you identify the presence of bimodal distribution in data?

  • By looking at the mean and median
  • By looking at the skewness
  • By looking at the standard deviation
  • By looking for two peaks in a histogram
A bimodal distribution is one that has two different modes, or peaks. This can often be identified in a histogram, where two separate areas of the data have higher frequencies. This might indicate that the data is drawn from two different populations.