Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






2. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population






3. Consists of the individuals who are conveniently available






4. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted






5. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant






6. The ith ___ is the number that falls above i% of the data






7. A representative subset of a population - examined in hope of learning about the population






8. A quantity or amount adopted as a standard of measurement - such as dollars - hours - or grams






9. Summarized with the standard deviation - interquartile range - and range






10. Design Randomization occurring within blocks






11. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






12. A normal model with a mean of 0 and a standard deviation of 1






13. Shows the relationship between two quantitative variables measured on the same cases






14. In a retrospective or prospective study Subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest






15. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes






16. An individual about whom or which we have data






17. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






18. Gives the possible values of the variable and the frequency or relative frequency of each value






19. When doing this - consider their shape - center - and spread






20. Gives the possible values of the variable and the relative frequency of each value






21. These are hard to generate - but several websites offer an unlimited supply of equally likely random values






22. The specific values that the experimenter chooses for a factor






23. A numerically valued attribute of a model for a population






24. A display to help assess whether a distribution of data is approximately normal; if it is nearly straight - the data satisfy the nearly normal condition






25. A sample drawn by selecting individuals systematically from a sampling frame






26. Shows quantitative data values in a way that sketches the distribution of the data






27. An event is this if we know what outcomes could happen - but not which particular values will happen






28. Systematically recorded information - whether numbers or labels - together with its context






29. Value calculated from data to summarize aspects of the data






30. Shows a bar representing the count of each category in a categorical variable






31. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x






32. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category






33. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion






34. The difference between the first and third quartiles






35. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two






36. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum






37. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






38. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed






39. Doing this is equivalent to changing its units






40. The difference between the lowest and highest values in a data set






41. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






42. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value






43. Found by summing all the data values and dividing by the count






44. In a statistical display - each data value should be represented by the same amount of area






45. An equation or formula that simplifies and represents reality






46. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median






47. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set






48. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






49. This of sample size n is one in which each set of n elements in the population has an equal chance of selection






50. Any attempt to force a sample to resemble specified attributes of the population