Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. An observational study in which subjects are followed to observe future outcomes






2. When doing this - consider their shape - center - and spread






3. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other






4. When both those who could influence and evaluate the results are blinded






5. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related






6. An equation of the form y-hat = b0 + b1x






7. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed






8. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value






9. Numerically valued attribute of a model






10. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x






11. This corresponding to a z-score gives the percentage of values in a standard normal distribution found at that z-score or below






12. The ith ___ is the number that falls above i% of the data






13. The sequence of several components representing events that we are pretending will take place






14. Shows quantitative data values in a way that sketches the distribution of the data






15. Anything in a survey design that influences response






16. A variable whose values are compared across different treatments






17. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






18. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups






19. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage






20. A variable in which the numbers act as numerical values; always has units






21. Doing this is equivalent to changing its units






22. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one






23. To be valid - an experiment must assign experimental units to treatment groups at random






24. When groups of experimental units are similar - it is a good idea to gather them together into these






25. Holds information about the same characteristic for many cases






26. Value found by subtracting the mean and dividing by the standard deviation






27. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative






28. In a retrospective or prospective study Subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest






29. Found by substituting the x-value in the regression equation; they're the values on the fitted line






30. These are hard to generate - but several websites offer an unlimited supply of equally likely random values






31. A study based on data in which no manipulation of factors has been employed






32. A numerically valued attribute of a model for a population






33. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other






34. Places in order the effects that many re-expressions have on the data






35. The difference between the first and third quartiles






36. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






37. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






38. A numerical measure of the direction and strength of a linear association






39. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set






40. A distribution that's roughly flat






41. When either those who could influence or evaluate the results is blinded






42. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____






43. A variable whose levels are controlled by the experimenter






44. The best defense against bias - in which each individual is given a fair - random chance of selection






45. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






46. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






47. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes






48. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value






49. When omitting a point from the data results in a very different regression model - the point is an ____






50. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two