Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. A variable in which the numbers act as numerical values; always has units






2. Extreme values that don't appear to belong with the rest of the data






3. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






4. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






5. An observational study in which subjects are followed to observe future outcomes






6. The difference between the lowest and highest values in a data set






7. An equation or formula that simplifies and represents reality






8. The best defense against bias - in which each individual is given a fair - random chance of selection






9. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion






10. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






11. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters






12. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other






13. Bias introduced to a sample when individuals can choose on their own whether to participate in the sample






14. The sequence of several components representing events that we are pretending will take place






15. The ____ we care about most is straight






16. Bias introduced to a sample when a large fraction of those sampled fails to respond






17. Gives the possible values of the variable and the frequency or relative frequency of each value






18. A sample drawn by selecting individuals systematically from a sampling frame






19. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it






20. Consists of the individuals who are conveniently available






21. A distribution that's roughly flat






22. The entire group of individuals or instances about whom we hope to learn






23. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate






24. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR






25. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






26. The middle value with half of the data above and half below it






27. An equation of the form y-hat = b0 + b1x






28. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative






29. The most basic situation in a simulation in which something happens at random






30. When omitting a point from the data results in a very different regression model - the point is an ____






31. An event is this if we know what outcomes could happen - but not which particular values will happen






32. Gives the possible values of the variable and the relative frequency of each value






33. Shows a bar representing the count of each category in a categorical variable






34. Found by summing all the data values and dividing by the count






35. Holds information about the same characteristic for many cases






36. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one






37. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






38. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted






39. A list of individuals from whom the sample is drawn






40. A variable whose values are compared across different treatments






41. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






42. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small






43. Values of this record the results of each trial with respect to what we were interested in






44. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other






45. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






46. This of sample size n is one in which each set of n elements in the population has an equal chance of selection






47. Graphs a dot for each case against a single axis






48. These are hard to generate - but several websites offer an unlimited supply of equally likely random values






49. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






50. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two