Test your basic knowledge

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from the answers to other questions, so the correct answer may sometimes seem obvious. Even so, taking the test repeatedly reinforces your understanding.
1. Value calculated from data to summarize aspects of the data






2. The natural tendency of randomly drawn samples to differ






3. Summarized with the standard deviation, interquartile range, and range






4. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






5. A sample that consists of the entire population






6. Doing this is equivalent to changing its units






7. The tendency of many human subjects (often 20% or more of experiment subjects) to show a response even when administered a placebo






8. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion
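
As an illustration of item 8, here is a minimal Python sketch of how b0 and b1 can be computed from paired data so that the sum of squared residuals is as small as possible. The function name `fit_least_squares` and the sample data are assumptions made up for this example; they are not part of the quiz.

```python
from statistics import mean

def fit_least_squares(xs, ys):
    """Return (b0, b1) for y-hat = b0 + b1*x under the least squares criterion."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx            # slope
    b0 = y_bar - b1 * x_bar   # intercept: the fitted line passes through (x_bar, y_bar)
    return b0, b1

# Hypothetical data, chosen only for illustration.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
b0, b1 = fit_least_squares(xs, ys)
print(f"y-hat = {b0:.2f} + {b1:.2f}x")
```

The intercept is derived from the slope because the least squares line always passes through the point of means.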






9. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






10. Tell how many standard deviations a value is from the mean; they have a mean of zero and a standard deviation of one
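
The two properties stated in item 10 can be checked numerically. The sketch below subtracts the mean from each value and divides by the standard deviation, then confirms that the results have mean 0 and standard deviation 1. The function name `rescale` and the data are hypothetical, and the sample standard deviation (n - 1 denominator) is assumed.

```python
from statistics import mean, stdev

def rescale(values):
    """Subtract the mean from each value and divide by the standard deviation."""
    m = mean(values)
    s = stdev(values)  # sample standard deviation (assumption: n - 1 denominator)
    return [(v - m) / s for v in values]

# Hypothetical data, chosen only for illustration.
scores = [2, 4, 4, 4, 5, 5, 7, 9]
rescaled = rescale(scores)

# Up to floating-point rounding, the mean is 0 and the standard deviation is 1.
print(abs(mean(rescaled)) < 1e-9, abs(stdev(rescaled) - 1) < 1e-9)
```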






11. In a retrospective or prospective study, subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest






12. In a statistical display, each data value should be represented by the same amount of area






13. An equation or formula that simplifies and represents reality






14. A variable whose levels are controlled by the experimenter






15. In a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean
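
The percentages in item 15 can be reproduced with Python's standard library. This is a minimal sketch using `statistics.NormalDist`; the helper name `share_within` is an assumption introduced for this example.

```python
from statistics import NormalDist

std_normal = NormalDist(mu=0.0, sigma=1.0)

def share_within(k):
    """Fraction of a normal model lying within k standard deviations of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)

shares = {k: share_within(k) for k in (1, 2, 3)}
for k, share in shares.items():
    print(f"within {k} SD: {100 * share:.1f}%")
```

Because every normal model is a shifted and rescaled copy of the standard one, the same fractions hold for any mean and standard deviation.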






16. Systematically recorded information, whether numbers or labels, together with its context






17. Shows quantitative data values in a way that sketches the distribution of the data






18. Sampling schemes that combine several sampling methods






19. Individuals on whom an experiment is performed






20. The most basic situation in a simulation in which something happens at random






21. Graphs a dot for each case against a single axis






22. A distribution that's roughly flat






23. The sum of squared deviations from the mean, divided by the count minus one
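
The calculation described in item 23 can be sketched directly in Python. The function name `squared_deviation_average` and the data are hypothetical and used only to make the arithmetic concrete.

```python
def squared_deviation_average(values):
    """Sum of squared deviations from the mean, divided by (count - 1)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values")
    m = sum(values) / n
    return sum((v - m) ** 2 for v in values) / (n - 1)

# Hypothetical data: mean is 5, squared deviations sum to 32.
data = [2, 4, 4, 4, 5, 5, 7, 9]
result = squared_deviation_average(data)   # 32 / 7
print(result)
```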






24. The sequence of several components representing events that we are pretending will take place






25. This value, corresponding to a z-score, gives the percentage of values in a standard normal distribution found at or below that z-score
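
The lookup in item 25 is usually done with a table or calculator; as a rough equivalent, the sketch below uses the cumulative distribution function of the standard normal model from Python's standard library. The function name `percent_at_or_below` is an assumption for this example.

```python
from statistics import NormalDist

def percent_at_or_below(z):
    """Percentage of a standard normal distribution at or below the given z-score."""
    return 100.0 * NormalDist().cdf(z)   # NormalDist() defaults to mean 0, SD 1

for z in (-1.0, 0.0, 1.0, 2.0):
    print(f"z = {z:+.1f}: {percent_at_or_below(z):.2f}%")
```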






26. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
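
The two descriptions in item 26 (the squared correlation, and the fraction of the variability of y accounted for by the regression) give the same number. The sketch below computes both on a small made-up data set; all names and data are assumptions for illustration.

```python
from math import sqrt
from statistics import mean

# Hypothetical data, chosen only for illustration.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

x_bar, y_bar = mean(xs), mean(ys)
sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
sxx = sum((x - x_bar) ** 2 for x in xs)
syy = sum((y - y_bar) ** 2 for y in ys)   # total variability of y

# Route 1: square the correlation between x and y.
r = sxy / sqrt(sxx * syy)
squared_correlation = r ** 2

# Route 2: 1 - (residual variability / total variability) for the least squares line.
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar
sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))
fraction_explained = 1 - sse / syy

print(squared_correlation, fraction_explained)
```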






27. A quantity or amount adopted as a standard of measurement, such as dollars, hours, or grams






28. Distributions with more than two modes






29. A sample drawn by selecting individuals systematically from a sampling frame
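
One common way to carry out the selection in item 29 is to pick a random starting point and then take every k-th individual from the sampling frame. The sketch below assumes that approach; the function name `every_kth`, the frame, and the choice k = 10 are hypothetical.

```python
import random

def every_kth(frame, k, rng=None):
    """Pick a random start in the first k positions, then take every k-th individual."""
    if k <= 0:
        raise ValueError("k must be positive")
    rng = rng or random.Random()
    start = rng.randrange(k)   # random start keeps the selection from being fixed in advance
    return frame[start::k]

# Hypothetical sampling frame of 100 individuals.
frame = [f"unit-{i:03d}" for i in range(100)]
sample = every_kth(frame, 10, random.Random(0))
print(sample)
```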






30. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other






31. This, b0, gives a starting value in y-units; it's the y-hat value when x is 0






32. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____, residuals can appear to be deceptively small






33. When groups of experimental units are similar, it is a good idea to gather them together into these






34. The ____ we care about most is straight






35. Values of this record the results of each trial with respect to what we were interested in






36. Distributions with two modes






37. Summarized with the mean or the median






38. The ith ___ is the number that falls above i% of the data






39. Although linear models provide an easy way to predict values of y for a given value of x, it is unsafe to predict for values of x far from the ones used to find the linear model equation; such predictions should not be trusted






40. A variable that names categories (whether with words or numerals)






41. When doing this, consider their shape, center, and spread






42. Value found by subtracting the mean and dividing by the standard deviation






43. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category






44. The experimental units assigned to a baseline treatment level, typically either the default treatment, which is well understood, or a null, placebo treatment






45. When omitting a point from the data results in a very different regression model, the point is an ____
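
To make item 45 concrete, the sketch below fits a least squares slope twice, once with every point and once with a single point omitted, and compares the results. The data are made up so that the last point has an extreme x-value; the helper name `slope` is an assumption for this example.

```python
from statistics import mean

def slope(points):
    """Least squares slope for a list of (x, y) pairs."""
    x_bar = mean(x for x, _ in points)
    y_bar = mean(y for _, y in points)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in points)
    sxx = sum((x - x_bar) ** 2 for x, _ in points)
    return sxy / sxx

# Hypothetical data: five points trending upward, plus one far to the right.
points = [(1, 2), (2, 4), (3, 5), (4, 4), (5, 5), (20, 0)]

slope_all = slope(points)
slope_without_last = slope(points[:-1])
print(f"all points: {slope_all:.3f}, last point omitted: {slope_without_last:.3f}")
```

In this made-up data set, omitting the single point changes the sign of the slope.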






46. A treatment known to have no effect, administered so that all groups experience the same conditions






47. Manipulates factor levels to create treatments, randomly assigns subjects to these treatment levels, and then compares the responses of the subject groups across treatment levels






48. The square root of the variance






49. Bias introduced to a sample when a large fraction of those sampled fails to respond






50. Any systematic failure of a sampling method to represent its population; common errors are voluntary response, undercoverage, nonresponse ____, and response ____