Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value






2. An observational study in which subjects are selected and then their previous conditions or behaviors are determined






3. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted






4. Control - randomize - replicate - block






5. Summarized with the mean or the median






6. The sum of squared deviations from the mean - divided by the count minus one






7. Found by substituting the x-value in the regression equation; they're the values on the fitted line






8. Any attempt to force a sample to resemble specified attributes of the population






9. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR






10. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion






11. Holds information about the same characteristic for many cases






12. An observational study in which subjects are followed to observe future outcomes






13. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






14. These are hard to generate - but several websites offer an unlimited supply of equally likely random values






15. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related






16. A distribution that's roughly flat






17. In a retrospective or prospective study Subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest






18. Doing this is equivalent to changing its units






19. A numerical summary of how tightly the values are clustered around the 'center'






20. The most basic situation in a simulation in which something happens at random






21. Useful family of models for unimodal - symmetric distributions






22. Values of this record the results of each trial with respect to what we were interested in






23. An individual about whom or which we have data






24. A distribution is this if it's not symmetric and one tail stretches out farther than the other






25. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups






26. A variable whose levels are controlled by the experimenter






27. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____






28. When omitting a point from the data results in a very different regression model - the point is an ____






29. A display to help assess whether a distribution of data is approximately normal; if it is nearly straight - the data satisfy the nearly normal condition






30. A variable in which the numbers act as numerical values; always has units






31. A list of individuals from whom the sample is drawn






32. Found by summing all the data values and dividing by the count






33. The ____ we care about most is straight






34. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






35. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






36. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






37. An equation or formula that simplifies and represents reality






38. A treatment known to have no effect - administered so that all groups experience the same conditions






39. The ith ___ is the number that falls above i% of the data






40. Places in order the effects that many re-expressions have on the data






41. Distributions with two modes






42. This corresponding to a z-score gives the percentage of values in a standard normal distribution found at that z-score or below






43. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes






44. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






45. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value






46. The difference between the lowest and highest values in a data set






47. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






48. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






49. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters






50. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values