Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






2. The sequence of several components representing events that we are pretending will take place






3. The difference between the lowest and highest values in a data set






4. A numerically valued attribute of a model for a population






5. Systematically recorded information - whether numbers or labels - together with its context






6. An equation or formula that simplifies and represents reality






7. The process - intervention - or other controlled circumstance applied to randomly assigned experimental units






8. In a retrospective or prospective study Subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest






9. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






10. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median






11. An individual about whom or which we have data






12. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population






13. This - b0 - gives a starting value in y-units; it's the y-hat-value when x is 0






14. Bias introduced to a sample when a large fraction of those sampled fails to respond






15. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters






16. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






17. A display to help assess whether a distribution of data is approximately normal; if it is nearly straight - the data satisfy the nearly normal condition






18. When either those who could influence or evaluate the results is blinded






19. Anything in a survey design that influences response






20. When doing this - consider their shape - center - and spread






21. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






22. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values






23. To be valid - an experiment must assign experimental units to treatment groups at random






24. The distribution of a variable restricting the who to consider only a smaller group of individuals






25. An observational study in which subjects are followed to observe future outcomes






26. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






27. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value






28. Gives a value in 'y-units per x-unit'; changes of one unit in x are associated with changes of b1 units in predicted values of y






29. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x






30. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set






31. An observational study in which subjects are selected and then their previous conditions or behaviors are determined






32. A variable in which the numbers act as numerical values; always has units






33. A normal model with a mean of 0 and a standard deviation of 1






34. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted






35. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed






36. A variable whose levels are controlled by the experimenter






37. A point that does not fit the overall pattern seen in the scatterplot






38. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate






39. Shows the relationship between two quantitative variables measured on the same cases






40. A representative subset of a population - examined in hope of learning about the population






41. The ith ___ is the number that falls above i% of the data






42. A study based on data in which no manipulation of factors has been employed






43. Doing this is equivalent to changing its units






44. The square root of the variance






45. The tendency of many human subjects (often 20% or more of experiment subjects) to show a response even when administered a placebo






46. Having one mode; this is a useful term for describing the shape of a histogram when it's generally mound-shaped






47. A sample that consists of the entire population






48. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






49. Found by substituting the x-value in the regression equation; they're the values on the fitted line






50. This of sample size n is one in which each set of n elements in the population has an equal chance of selection