Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. When groups of experimental units are similar - it is a good idea to gather them together into these






2. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups






3. A distribution that's roughly flat






4. When either those who could influence or evaluate the results is blinded






5. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small






6. Anything in a survey design that influences response






7. Summarized with the standard deviation - interquartile range - and range






8. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






9. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other






10. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data






11. Places in order the effects that many re-expressions have on the data






12. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage






13. Displays data that change over time






14. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






15. The entire group of individuals or instances about whom we hope to learn






16. Distributions with more than two modes






17. The middle value with half of the data above and half below it






18. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






19. When both those who could influence and evaluate the results are blinded






20. Consists of the individuals who are conveniently available






21. The number of individuals in a sample






22. Gives the possible values of the variable and the relative frequency of each value






23. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____






24. A variable that names categories (whether with words or numerals)






25. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x






26. A numerical summary of how tightly the values are clustered around the 'center'






27. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related






28. Doing this is equivalent to changing its units






29. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one






30. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






31. The distribution of a variable restricting the who to consider only a smaller group of individuals






32. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value






33. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






34. Extreme values that don't appear to belong with the rest of the data






35. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






36. The natural tendency of randomly drawn samples to differ






37. Gives the possible values of the variable and the frequency or relative frequency of each value






38. A numerical measure of the direction and strength of a linear association






39. Shows the relationship between two quantitative variables measured on the same cases






40. This corresponding to a z-score gives the percentage of values in a standard normal distribution found at that z-score or below






41. Bias introduced to a sample when a large fraction of those sampled fails to respond






42. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values






43. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






44. The sum of squared deviations from the mean - divided by the count minus one






45. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






46. The ____ we care about most is straight






47. The sequence of several components representing events that we are pretending will take place






48. An equation or formula that simplifies and represents reality






49. A study based on data in which no manipulation of factors has been employed






50. These are hard to generate - but several websites offer an unlimited supply of equally likely random values