Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage






2. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two






3. A variable whose values are compared across different treatments






4. When either those who could influence or evaluate the results is blinded






5. The distribution of a variable restricting the who to consider only a smaller group of individuals






6. An equation or formula that simplifies and represents reality






7. All experimental units have an equal chance of receiving any treatment






8. Places in order the effects that many re-expressions have on the data






9. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean






10. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it






11. The sum of squared deviations from the mean - divided by the count minus one






12. Sampling schemes that combine several sampling methods






13. The sequence of several components representing events that we are pretending will take place






14. Doing this is equivalent to changing its units






15. The difference between the lowest and highest values in a data set






16. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






17. Value calculated from data to summarize aspects of the data






18. A numerical measure of the direction and strength of a linear association






19. The difference between the first and third quartiles






20. A list of individuals from whom the sample is drawn






21. An equation of the form y-hat = b0 + b1x






22. Systematically recorded information - whether numbers or labels - together with its context






23. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value






24. The number of individuals in a sample






25. Found by summing all the data values and dividing by the count






26. Any attempt to force a sample to resemble specified attributes of the population






27. A variable in which the numbers act as numerical values; always has units






28. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






29. Summarized with the mean or the median






30. A sample that consists of the entire population






31. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






32. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant






33. An individual about whom or which we have data






34. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values






35. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups






36. When doing this - consider their shape - center - and spread






37. A variable whose levels are controlled by the experimenter






38. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed






39. When an observed difference is too large for us to believe that is is likely to have occurred naturally






40. The middle value with half of the data above and half below it






41. Distributions with more than two modes






42. A representative subset of a population - examined in hope of learning about the population






43. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






44. These are hard to generate - but several websites offer an unlimited supply of equally likely random values






45. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related






46. A sample drawn by selecting individuals systematically from a sampling frame






47. Individuals on whom an experiment is performed






48. An individual result of a component of a simulation






49. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small






50. Control - randomize - replicate - block