Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Summarized with the mean or the median






2. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative






3. The sum of squared deviations from the mean - divided by the count minus one






4. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






5. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate






6. The distribution of a variable restricting the who to consider only a smaller group of individuals






7. The ____ we care about most is straight






8. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant






9. All experimental units have an equal chance of receiving any treatment






10. Value calculated from data to summarize aspects of the data






11. Consists of the individuals who are conveniently available






12. When both those who could influence and evaluate the results are blinded






13. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related






14. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum






15. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other






16. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other






17. When groups of experimental units are similar - it is a good idea to gather them together into these






18. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups






19. A variable in which the numbers act as numerical values; always has units






20. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






21. Numerically valued attribute of a model






22. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small






23. The difference between the first and third quartiles






24. Displays data that change over time






25. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






26. Value found by subtracting the mean and dividing by the standard deviation






27. The difference between the lowest and highest values in a data set






28. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted






29. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed






30. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____






31. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median






32. When the levels of one factor are associated with the levels of another factor so their effects cannot be separated






33. An individual about whom or which we have data






34. A point that does not fit the overall pattern seen in the scatterplot






35. Bias introduced to a sample when individuals can choose on their own whether to participate in the sample






36. Sampling schemes that combine several sampling methods






37. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






38. In a statistical display - each data value should be represented by the same amount of area






39. The best defense against bias - in which each individual is given a fair - random chance of selection






40. Gives the possible values of the variable and the relative frequency of each value






41. A sampling design in which entire groups are chosen at random






42. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed






43. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion






44. Control - randomize - replicate - block






45. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value






46. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it






47. Shows the relationship between two quantitative variables measured on the same cases






48. A variable whose levels are controlled by the experimenter






49. A variable whose values are compared across different treatments






50. A sample that consists of the entire population