Test your basic knowledge

AP Statistics Vocab

Subjects: statistics, AP
Instructions:
  • Answer 50 questions in 15 minutes.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from the answers to other questions, so the correct answer may sometimes seem obvious, but each attempt reinforces your understanding.
1. Places in order the effects that many re-expressions have on the data






2. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean
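As a study aid, the statement above can be checked numerically. This is a minimal sketch, not part of the original quiz: the function name and the five data points are made up for illustration, and the line is fitted with the usual least squares formulas.

```python
from statistics import mean, stdev

def predicted_in_sd_units(x_value, xs, ys):
    """Return (z_x, z_y_hat) for a least squares prediction at x_value."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Least squares slope and intercept from sums of squares.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    y_hat = b0 + b1 * x_value
    # Express x and the predicted y in standard deviations from their means.
    z_x = (x_value - x_bar) / stdev(xs)
    z_y_hat = (y_hat - y_bar) / stdev(ys)
    return z_x, z_y_hat

# Hypothetical data; the correlation between xs and ys is 0.8.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
z_x, z_y_hat = predicted_in_sd_units(5, xs, ys)
print(z_x, z_y_hat)
```

For these data z_y_hat works out to 0.8 times z_x, so the prediction sits closer to its mean (in standard deviations) than x does whenever the correlation is less than 1 in size.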






3. If data consist of two or more groups that have been thrown together, it is usually better to fit a separate linear model to each group than to try to fit a single model to all of the data






4. A variable in which the numbers act as numerical values; always has units






5. When omitting a point from the data results in a very different regression model, the point is an ____






6. To describe this aspect of a distribution, look for single vs. multiple modes and symmetry vs. skewness






7. Distributions with more than two modes






8. An individual result of a component of a simulation






9. Values of this record the results of each trial with respect to what we were interested in






10. Anything in a survey design that influences response






11. Consists of the individuals who are conveniently available






12. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
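The two descriptions in the statement above (squared correlation, and fraction of variability accounted for) can be compared numerically. This is a rough sketch for study purposes, with a made-up function name and made-up data; it assumes a simple least squares line fitted to one x variable.

```python
from statistics import mean

def fraction_of_variability_explained(xs, ys):
    """Return (squared correlation, 1 - SSE/SST) for a least squares line."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)  # total variability of y (SST)
    squared_correlation = sxy ** 2 / (sxx * syy)
    # Fit the line, then measure the variability left over in the residuals.
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))
    return squared_correlation, 1 - sse / syy

# Hypothetical data for illustration only.
by_correlation, by_variability = fraction_of_variability_explained(
    [1, 2, 3, 4, 5], [2, 1, 4, 3, 5]
)
print(by_correlation, by_variability)
```

Both numbers come out to about 0.64 for these data, which is why the statement can describe the same quantity in two ways.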






13. An individual about whom or which we have data






14. A distribution is this if it's not symmetric and one tail stretches out farther than the other






15. A numerical measure of the direction and strength of a linear association






16. Manipulates factor levels to create treatments, randomly assigns subjects to these treatment levels, and then compares the responses of the subject groups across treatment levels






17. The tendency of many human subjects (often 20% or more of experiment subjects) to show a response even when administered a placebo






18. This of sample size n is one in which each set of n elements in the population has an equal chance of selection
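One way to draw such a sample in practice is sketched below. This is an illustration only: the population of student ID numbers, the function name, and the seed are all hypothetical. Python's `random.sample` selects without replacement, so every set of n distinct elements is equally likely.

```python
import random

def draw_sample(population, n, seed=None):
    """Draw n distinct elements so that every set of n is equally likely."""
    rng = random.Random(seed)  # a seed makes the draw repeatable for checking
    return rng.sample(population, n)

# Hypothetical population: student ID numbers 1 through 100.
population = list(range(1, 101))
chosen = draw_sample(population, 10, seed=1)
print(chosen)
```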






19. A study based on data in which no manipulation of factors has been employed






20. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






21. When groups of experimental units are similar, it is a good idea to gather them together into these






22. The number of individuals in a sample






23. Ideally tells who was measured, what was measured, how the data were collected, where the data were collected, and when and why the study was performed






24. A treatment known to have no effect, administered so that all groups experience the same conditions






25. Any attempt to force a sample to resemble specified attributes of the population






26. The middle value with half of the data above and half below it






27. When doing this, consider their shape, center, and spread






28. Distributions with two modes






29. An arrangement of data in which each row represents a case and each column represents a variable






30. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
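The formula in the statement above translates directly into a short calculation. The sketch below uses a made-up function name and made-up observed and predicted values.

```python
def observed_minus_predicted(observed, predicted):
    """Return one difference per data value: observed value - predicted value."""
    return [obs - pred for obs, pred in zip(observed, predicted)]

# Hypothetical observed values and model predictions.
differences = observed_minus_predicted([2, 1, 4], [1.5, 2, 3])
print(differences)  # [0.5, -1, 1]
```

A positive difference means the model predicted too low for that point; a negative one means it predicted too high.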






31. A variable whose values are compared across different treatments






32. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population






33. The difference between the first and third quartiles
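A small worked example of this calculation follows. It is only a sketch: textbooks and software use slightly different rules for locating quartiles, and the choice of `method="inclusive"` in Python's `statistics.quantiles` is an assumption made here, not something the quiz specifies. The function name and data are hypothetical.

```python
from statistics import quantiles

def spread_between_quartiles(data):
    """Return the third quartile minus the first quartile."""
    # n=4 splits the data into quarters and returns three cut points.
    q1, _median, q3 = quantiles(data, n=4, method="inclusive")
    return q3 - q1

# Hypothetical data: Q1 = 3 and Q3 = 7 under the inclusive method.
spread = spread_between_quartiles([1, 2, 3, 4, 5, 6, 7, 8, 9])
print(spread)  # 4.0
```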






34. A sampling design in which entire groups are chosen at random






35. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion
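To see what satisfying the least squares criterion means, the sketch below fits b0 and b1 with the standard formulas and checks that nudging the line increases the sum of squared differences between observed and predicted y. The function names and data are made up for illustration.

```python
from statistics import mean

def fit_line(xs, ys):
    """Return (b0, b1) for y-hat = b0 + b1*x using the least squares formulas."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    return b0, b1

def sum_of_squares(b0, b1, xs, ys):
    """Sum of squared (observed - predicted) values for a candidate line."""
    return sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))

# Hypothetical data for illustration only.
xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]
b0, b1 = fit_line(xs, ys)
best = sum_of_squares(b0, b1, xs, ys)
shifted = sum_of_squares(b0 + 0.1, b1, xs, ys)  # a slightly different line
print(b0, b1, best, shifted)
```

For these data the fitted values are approximately b0 = 0.6 and b1 = 0.8, and the shifted line has a larger sum of squares.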






36. A normal model with a mean of 0 and a standard deviation of 1






37. The ith ___ is the number that falls above i% of the data






38. In a statistical display, each data value should be represented by the same amount of area






39. A value that attempts the impossible by summarizing the entire distribution with a single number, a 'typical' value






40. A sample that consists of the entire population






41. When either those who could influence the results or those who evaluate the results are blinded






42. Shows a bar representing the count of each category in a categorical variable






43. This, b0, gives a starting value in y-units; it is the value of y-hat when x is 0






44. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it






45. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population






46. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






47. A numerical summary of how tightly the values are clustered around the 'center'






48. Value found by subtracting the mean and dividing by the standard deviation
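The calculation described above is short enough to sketch directly. The function name and the exam numbers below are hypothetical, chosen only to make the arithmetic easy to follow.

```python
def standardize(value, mean, standard_deviation):
    """Subtract the mean, then divide by the standard deviation."""
    return (value - mean) / standard_deviation

# Hypothetical exam: a score of 85 when the mean is 70 and the SD is 10.
result = standardize(85, 70, 10)
print(result)  # 1.5
```

The result has no units: it says the score lies 1.5 standard deviations above the mean.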






49. Systematically recorded information, whether numbers or labels, together with its context






50. A quantity or amount adopted as a standard of measurement, such as dollars, hours, or grams