Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one






2. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






3. A variable whose levels are controlled by the experimenter






4. The entire group of individuals or instances about whom we hope to learn






5. Useful family of models for unimodal - symmetric distributions






6. A list of individuals from whom the sample is drawn






7. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






8. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set






9. Found by substituting the x-value in the regression equation; they're the values on the fitted line






10. When an observed difference is too large for us to believe that is is likely to have occurred naturally






11. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






12. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR






13. Having one mode; this is a useful term for describing the shape of a histogram when it's generally mound-shaped






14. The distribution of a variable restricting the who to consider only a smaller group of individuals






15. Holds information about the same characteristic for many cases






16. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






17. Gives a value in 'y-units per x-unit'; changes of one unit in x are associated with changes of b1 units in predicted values of y






18. The process - intervention - or other controlled circumstance applied to randomly assigned experimental units






19. When both those who could influence and evaluate the results are blinded






20. Individuals on whom an experiment is performed






21. An observational study in which subjects are selected and then their previous conditions or behaviors are determined






22. The middle value with half of the data above and half below it






23. In a statistical display - each data value should be represented by the same amount of area






24. An arrangement of data in which each row represents a case and each column represents a variable






25. Extreme values that don't appear to belong with the rest of the data






26. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum






27. Gives the possible values of the variable and the relative frequency of each value






28. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it






29. Any attempt to force a sample to resemble specified attributes of the population






30. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other






31. To be valid - an experiment must assign experimental units to treatment groups at random






32. A sampling design in which entire groups are chosen at random






33. Shows a bar representing the count of each category in a categorical variable






34. Found by summing all the data values and dividing by the count






35. All experimental units have an equal chance of receiving any treatment






36. Anything in a survey design that influences response






37. The number of individuals in a sample






38. An observational study in which subjects are followed to observe future outcomes






39. A distribution is this if it's not symmetric and one tail stretches out farther than the other






40. The specific values that the experimenter chooses for a factor






41. The sequence of several components representing events that we are pretending will take place






42. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion






43. When the levels of one factor are associated with the levels of another factor so their effects cannot be separated






44. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean






45. Systematically recorded information - whether numbers or labels - together with its context






46. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






47. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






48. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two






49. A variable that names categories (whether with words or numerals)






50. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage