Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. The ith ___ is the number that falls above i% of the data






2. Graphs a dot for each case against a single axis






3. To describe this aspect of a distribution - look for single vs. multiple modes - and symmetry vs. skewness






4. The number of individuals in a sample






5. An observational study in which subjects are selected and then their previous conditions or behaviors are determined






6. Value calculated from data to summarize aspects of the data






7. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value






8. Useful family of models for unimodal - symmetric distributions






9. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median






10. The square root of the variance






11. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






12. When omitting a point from the data results in a very different regression model - the point is an ____






13. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






14. Values of this record the results of each trial with respect to what we were interested in






15. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population






16. Holds information about the same characteristic for many cases






17. Places in order the effects that many re-expressions have on the data






18. A numerical summary of how tightly the values are clustered around the 'center'






19. A quantity or amount adopted as a standard of measurement - such as dollars - hours - or grams






20. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






21. A variable that names categories (whether with words or numerals)






22. A sampling design in which entire groups are chosen at random






23. The sum of squared deviations from the mean - divided by the count minus one






24. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR






25. The sequence of several components representing events that we are pretending will take place






26. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean






27. When the levels of one factor are associated with the levels of another factor so their effects cannot be separated






28. Gives the possible values of the variable and the frequency or relative frequency of each value






29. The ____ we care about most is straight






30. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






31. Consists of the individuals who are conveniently available






32. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values






33. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion






34. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value






35. When both those who could influence and evaluate the results are blinded






36. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other






37. An arrangement of data in which each row represents a case and each column represents a variable






38. Found by substituting the x-value in the regression equation; they're the values on the fitted line






39. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






40. An observational study in which subjects are followed to observe future outcomes






41. Sampling schemes that combine several sampling methods






42. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






43. An equation of the form y-hat = b0 + b1x






44. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes






45. A numerically valued attribute of a model for a population






46. A normal model with a mean of 0 and a standard deviation of 1






47. The specific values that the experimenter chooses for a factor






48. Bias introduced to a sample when individuals can choose on their own whether to participate in the sample






49. In a statistical display - each data value should be represented by the same amount of area






50. Anything in a survey design that influences response