Test your basic knowledge

AP Statistics Vocab

Subjects: statistics, AP
Instructions:
  • Answer 50 questions in 15 minutes.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from the answers to other questions. As a result, the answers may sometimes seem obvious, but each attempt reinforces your understanding.
1. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
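As a small illustration of the formula in this item, the sketch below subtracts predicted values from observed ones. The data and the line `y_hat = 1.0 + 2.0 * x` are made up for the example and do not come from any real data set.

```python
# Hypothetical data and a hypothetical fitted line, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0]
observed = [3.5, 4.5, 7.5, 8.5]


def predict(x):
    """Assumed regression model: y_hat = 1.0 + 2.0 * x."""
    return 1.0 + 2.0 * x


predicted = [predict(x) for x in xs]

# observed value - predicted value, one difference per data point
differences = [obs - pred for obs, pred in zip(observed, predicted)]
print(differences)
```

A positive difference means the model underpredicted that point; a negative one means it overpredicted.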






2. Control, randomize, replicate, block






3. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other






4. If data consist of two or more groups that have been thrown together, it is usually better to fit a separate linear model to each group than to try to fit a single model to all of the data






5. Values of this record the results of each trial with respect to what we were interested in






6. In a statistical display, each data value should be represented by the same amount of area






7. Having one mode; this is a useful term for describing the shape of a histogram when it's generally mound-shaped






8. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean
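The statement in this item can be checked numerically. The sketch below uses a small made-up data set, fits a line with the usual closed-form slope and intercept, and compares each x and its predicted y-hat in standard-deviation units. All names and numbers are illustrative assumptions.

```python
from statistics import mean, stdev

# Hypothetical paired data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 1.0, 4.0, 3.0, 5.0]

n = len(xs)
mean_x, mean_y = mean(xs), mean(ys)
sd_x, sd_y = stdev(xs), stdev(ys)

# Correlation coefficient r between x and y
r = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / ((n - 1) * sd_x * sd_y)

# Fitted line: slope = r * sd_y / sd_x, passing through (mean_x, mean_y)
slope = r * sd_y / sd_x
intercept = mean_y - slope * mean_x

z_pairs = []
for x in xs:
    z_x = (x - mean_x) / sd_x          # x in standard units
    y_hat = intercept + slope * x
    z_y_hat = (y_hat - mean_y) / sd_y  # predicted y in standard units
    z_pairs.append((z_x, z_y_hat))
    print(f"z_x = {z_x:+.3f}   z_y_hat = {z_y_hat:+.3f}")
```

In standard units the prediction is `r * z_x`, and because `r` is never larger than 1 in magnitude, the predicted value is never farther from its mean than x was from its own.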






9. The ____ we care about most is straight






10. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage






11. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed






12. When doing this, consider their shape, center, and spread






13. Distributions with two modes






14. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
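To make the "fraction of variability accounted for" reading concrete, the sketch below computes the squared correlation and, separately, one minus the ratio of residual variation to total variation in y. The data are made up, and the line is fit with the standard closed-form formulas, which is an assumption of this example.

```python
# Hypothetical paired data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 1.0, 4.0, 3.0, 5.0]

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

s_xx = sum((x - mean_x) ** 2 for x in xs)
s_yy = sum((y - mean_y) ** 2 for y in ys)   # total variation in y
s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

# Squared correlation between x and y
corr_squared = (s_xy ** 2) / (s_xx * s_yy)

# Fit the line, then measure the variation left over in the residuals
slope = s_xy / s_xx
intercept = mean_y - slope * mean_x
leftover = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
fraction_accounted_for = 1 - leftover / s_yy

print(corr_squared, fraction_accounted_for)
```

For a line fit this way, the two quantities agree up to floating-point rounding.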






15. An individual result of a component of a simulation






16. Adding a constant to each data value adds the same constant to the mean, the median, and the quartiles, but does not change the standard deviation or IQR
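A quick numerical check of this item, using made-up data and an arbitrary constant of 10. Quartile conventions vary between textbooks and software; this sketch simply uses the default method of Python's `statistics.quantiles`.

```python
from statistics import mean, median, quantiles, stdev


def summary(values):
    """Return measures of position and spread for a list of numbers."""
    q1, _, q3 = quantiles(values, n=4)
    return {
        "mean": mean(values),
        "median": median(values),
        "q1": q1,
        "q3": q3,
        "stdev": stdev(values),
        "iqr": q3 - q1,
    }


data = [2, 4, 4, 5, 7, 9, 10]       # hypothetical values
constant = 10                       # arbitrary amount to add
moved = [value + constant for value in data]

before = summary(data)
after = summary(moved)
for name in before:
    print(f"{name:>6}: {before[name]:.3f} -> {after[name]:.3f}")
```

The mean, median, and quartiles each move by 10, while the standard deviation and IQR print the same value on both sides.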






17. Anything in a survey design that influences response






18. The square root of the variance






19. We do this by applying the logarithm, the square root, the reciprocal, or some other mathematical operation to all values in the data set






20. The number of individuals in a sample






21. Extreme values that don't appear to belong with the rest of the data






22. This criterion specifies the unique line that minimizes the variance of the residuals or, equivalently, the sum of the squared residuals
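The minimizing property in this item can be seen by comparing the fitted line against slightly different lines. The sketch below uses made-up data and the standard closed-form slope and intercept; the 0.1 nudges are arbitrary choices for illustration.

```python
# Hypothetical paired data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 1.0, 4.0, 3.0, 5.0]


def sum_squared_residuals(intercept, slope):
    """Sum of (observed - predicted)^2 for the line intercept + slope * x."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))


n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Closed-form slope and intercept of the line that satisfies the criterion
slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
    (x - mean_x) ** 2 for x in xs
)
intercept = mean_y - slope * mean_x

best = sum_squared_residuals(intercept, slope)

# Nudge the intercept and slope in every direction and recompute the sum
nudged = [
    sum_squared_residuals(intercept + d_int, slope + d_slope)
    for d_int in (-0.1, 0.0, 0.1)
    for d_slope in (-0.1, 0.0, 0.1)
    if (d_int, d_slope) != (0.0, 0.0)
]

print(f"fitted line: {best:.3f}, best nudged line: {min(nudged):.3f}")
```

Every nudged line gives a larger sum of squared residuals than the fitted one.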






23. A sample drawn by selecting individuals systematically from a sampling frame
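One common way to draw such a sample is to pick a random starting point and then take every k-th individual from the list. The sketch below assumes a hypothetical sampling frame of 100 numbered individuals and a step of 10; the seed is fixed only so the example is repeatable.

```python
import random

# Hypothetical sampling frame: individuals numbered 1 to 100.
sampling_frame = list(range(1, 101))
step = 10                      # take every 10th individual

rng = random.Random(0)         # seeded only for repeatability
start = rng.randrange(step)    # random starting position among the first 10

# Every step-th individual, beginning at the random start
sample = sampling_frame[start::step]
print(sample)
```

Only the starting point is random; once it is chosen, the rest of the sample is determined.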






24. Ideally tells who was measured, what was measured, how the data were collected, where the data were collected, and when and why the study was performed






25. Individuals on whom an experiment is performed






26. Displays data that change over time






27. The process, intervention, or other controlled circumstance applied to randomly assigned experimental units






28. The sum of squared deviations from the mean, divided by the count minus one
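The calculation described in this item, worked step by step on a small made-up data set:

```python
# Hypothetical data, for illustration only.
data = [2, 4, 4, 4, 5, 5, 7, 9]

count = len(data)
mean = sum(data) / count                      # 40 / 8 = 5.0

# Squared deviation of each value from the mean
squared_deviations = [(value - mean) ** 2 for value in data]

# Sum of squared deviations, divided by the count minus one
result = sum(squared_deviations) / (count - 1)
print(result)
```

Here the squared deviations sum to 32, so the result is 32 / 7.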






29. A sample that consists of the entire population






30. Useful family of models for unimodal, symmetric distributions






31. A distribution is this if it's not symmetric and one tail stretches out farther than the other






32. Places in order the effects that many re-expressions have on the data






33. A quantity or amount adopted as a standard of measurement, such as dollars, hours, or grams






34. A numerical measure of the direction and strength of a linear association






35. All experimental units have an equal chance of receiving any treatment
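A minimal sketch of one way to give every unit the same chance of landing in either of two treatment groups: shuffle the units, then split the shuffled list in half. The unit names, the two-group design, and the seed are assumptions made for the example.

```python
import random

# Hypothetical experimental units.
units = [f"unit_{i}" for i in range(1, 9)]

rng = random.Random(42)        # seeded only for repeatability
shuffled = units[:]            # copy so the original order is kept
rng.shuffle(shuffled)

# First half receives treatment A, second half receives treatment B
half = len(shuffled) // 2
group_a = shuffled[:half]
group_b = shuffled[half:]

print("A:", group_a)
print("B:", group_b)
```

Because the shuffle treats every ordering as equally likely, no unit is favored for either group.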






36. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant
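A numerical check of this item on made-up data, using an arbitrary multiplier of 2.5. As with any quartile calculation, the exact quartile values depend on the convention used; this sketch relies on the default method of `statistics.quantiles`.

```python
from statistics import mean, median, quantiles, stdev

data = [2, 4, 4, 5, 7, 9, 10]       # hypothetical values
factor = 2.5                        # arbitrary multiplier
scaled = [value * factor for value in data]


def iqr(values):
    """Interquartile range using the default quantile method."""
    q1, _, q3 = quantiles(values, n=4)
    return q3 - q1


measures = {"mean": mean, "median": median, "stdev": stdev, "iqr": iqr}

# Ratio of each measure after multiplying to the same measure before
ratios = {name: func(scaled) / func(data) for name, func in measures.items()}
for name, ratio in ratios.items():
    print(f"{name:>6}: multiplied by {ratio:.3f}")
```

Each ratio comes out as 2.5, for measures of position and measures of spread alike.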






37. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table
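The totals "found in the margins" can be computed directly from a table of counts. The sketch below uses a made-up two-variable table; the category names and counts are hypothetical.

```python
# Hypothetical contingency table: rows are one variable, columns another.
table = {
    "first":  {"yes": 20, "no": 10},
    "second": {"yes": 15, "no": 25},
    "third":  {"yes": 5,  "no": 25},
}

# Totals for the row variable alone (last column of the table)
row_totals = {row: sum(cells.values()) for row, cells in table.items()}

# Totals for the column variable alone (last row of the table)
column_totals = {}
for cells in table.values():
    for column, count in cells.items():
        column_totals[column] = column_totals.get(column, 0) + count

grand_total = sum(row_totals.values())

# The same totals expressed as percentages of all individuals
row_percentages = {row: 100 * total / grand_total for row, total in row_totals.items()}

print(row_totals)
print(column_totals)
print(row_percentages)
```

Each set of totals describes one variable on its own, ignoring the other.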






38. Distributions with more than two modes






39. Numerically valued attribute of a model






40. Displays counts and, sometimes, percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once, to reveal possible patterns in one variable that may be contingent on the cate






41. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






42. An observational study in which subjects are followed to observe future outcomes






43. A sampling design in which entire groups are chosen at random






44. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population






45. An observational study in which subjects are selected and then their previous conditions or behaviors are determined






46. These are hard to generate, but several websites offer an unlimited supply of equally likely random values






47. To describe this aspect of a distribution, look for single vs. multiple modes and symmetry vs. skewness






48. A variable other than x and y that simultaneously affects both variables, accounting for the correlation between the two






49. An arrangement of data in which each row represents a case and each column represents a variable






50. When omitting a point from the data results in a very different regression model, the point is an ____