Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value






2. The ith ___ is the number that falls above i% of the data






3. A quantity or amount adopted as a standard of measurement - such as dollars - hours - or grams






4. Design Randomization occurring within blocks






5. An observational study in which subjects are selected and then their previous conditions or behaviors are determined






6. A variable whose values are compared across different treatments






7. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set






8. The natural tendency of randomly drawn samples to differ






9. A sample that consists of the entire population






10. When an observed difference is too large for us to believe that is is likely to have occurred naturally






11. A point that does not fit the overall pattern seen in the scatterplot






12. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






13. Value found by subtracting the mean and dividing by the standard deviation






14. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






15. A variable that names categories (whether with words or numerals)






16. A study based on data in which no manipulation of factors has been employed






17. In a statistical display - each data value should be represented by the same amount of area






18. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum






19. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____






20. This - b0 - gives a starting value in y-units; it's the y-hat-value when x is 0






21. A sample drawn by selecting individuals systematically from a sampling frame






22. The number of individuals in a sample






23. A variable whose levels are controlled by the experimenter






24. Anything in a survey design that influences response






25. When the levels of one factor are associated with the levels of another factor so their effects cannot be separated






26. The best defense against bias - in which each individual is given a fair - random chance of selection






27. Having one mode; this is a useful term for describing the shape of a histogram when it's generally mound-shaped






28. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






29. Bias introduced to a sample when a large fraction of those sampled fails to respond






30. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






31. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






32. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion






33. Summarized with the standard deviation - interquartile range - and range






34. Values of this record the results of each trial with respect to what we were interested in






35. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values






36. The specific values that the experimenter chooses for a factor






37. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it






38. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median






39. Holds information about the same characteristic for many cases






40. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate






41. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category






42. A normal model with a mean of 0 and a standard deviation of 1






43. Distributions with two modes






44. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values






45. An equation of the form y-hat = b0 + b1x






46. A representative subset of a population - examined in hope of learning about the population






47. Consists of the individuals who are conveniently available






48. Extreme values that don't appear to belong with the rest of the data






49. Numerically valued attribute of a model






50. Bias introduced to a sample when individuals can choose on their own whether to participate in the sample