Test your basic knowledge |

Data Mining

Subject : it-skills
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Which function calculates the number of entries in a table?






2. Twice as likely to identify the important class (compared to avg. prevalence)






3. An analytical-oriented organizational structure is a data warehouse _____________.






4. ___________________ is used to relate one set of outcomes (dependent variable) to a set of predictor (independent) variables (e.g. - in time series analysis). Through this analysis we attempt to predictive future events - as the dependent variables b






5. The _____ operation of two tables results in a single table with the same columns as the first table - and containing all rows that are in the first table merged with all the rows in the second table - minus any duplicate rows.






6. These are considered an alternate storage techniques for data warehousing include.






7. An alternative to the data warehouse concept is a lower-cost - scaled-down version referred to as the _____________.






8. Organizes and analyzes data as an n-dimensional cube. The cube can be thought of as a common spreadsheet with two extensions: (1) support for multiple dimensions and (2) support for multiple concurrent users.






9. The ACCESS feature that tests to see if your tables are normalized properly is the ____.






10. __________ occurs when the initial scope of a project continues to expand as new features are incorporated into the project.






11. Which rule would you be violating - if you tried to delete a sales rep record - who currently has customers on file?






12. In general - ______________ are transformed to relations/tables by defining one relation for the object itself and another relation for each multivalued attribute.






13. The set of activities used to find new - hidden - or unexpected patterns in data is referred to as _____.






14. Models that do ___________: MLR; KNN; Regression and Classification Trees; ANN; SVM






15. Which of the following is at the center of a star schema?






16. ____________ would not normally be associated with ROUTINE data warehouse maintenance.






17. The term "ETL" in data warehousing stands for: Extraction - ________________________ - & Loading.






18. To create the primary key clause for the Customer table on the CustomerNum field - which of the following is the correct statement?






19. A common example of the use of association methods where a retailer can mine the data generated by a point-of-sale system - such as the price scanner you are familiar with at the grocery store is referred to as:






20. A single column that you create for an entity to serve as the primary key - because you otherwise would need many concatenated columns to do so - is called a(n) ____________.






21. The minimum cardinality and m is the maximum cardinality Cardinalities in Semantic Objects are shown as subscripts in the format n-m where _____






22. Gives us an idea of the magnitude of errors. Actual value - estimated value.






23. ___________ determines exactly what level of detail constitutes a fact record.






24. Not the same as goodness-of-fit; We want to know how well the model predicts new data - not how well it fits the data it was trained with; Key component of most measures is difference between actual y and predicted y (error)






25. An economic feasibility measure. So is Internal rate of return.






26. The _______________________ represents the source data for the DW. This layer is comprised - primarily - of operational transaction processing systems and external secondary databases.






27. R- squared(and adjusted r-squared) - A measure of how much of the variability around the target mean is explained by your predictive variables. Doesn't mean you have a good predictive model—only validation will tell you that






28. Which clause would be used to create groups of records?






29. Which statement removes the table Salesrep from a DBMS?






30. Which statement will take away user privileges to the database?






31. Useful for assessing performance in terms of identifying the most important class. Helps such choices as: How many tax records to examine; How many loans to grant; How many customers to mail an offer






32. _________ seeks to ensure that each application under development is fully integrated within its own boundaries and to eliminate any inconsistencies in the final software product.






33. An ___________ relates two other objects.






34. A ___________ combines result sets from more than one fact table.






35. A compound semantic object is an object that contains at least one ____.






36. The SQL built-in functions - which may appear on the same line as the SELECT statement (before the FROM clause) are called _____ functions.






37. The process that records how data from operational data stores and external sources are transformed on the way into the warehouse is referred to as ________________.






38. Are a data mining technology.






39. A powerful trend in IT is known as - which maintains that Computer transmission speed doubles every 18 months.






40. Gives an idea of systematic over- or under-prediction. Magnitude of average absolute error.






41. A Star diagram has two types of tables (objects). They are called the___________________ tables and ; fact tables.






42. The SQL command for deleting the Warehouse field from the Part table is _____.






43. ___________ is not a characteristic of a data warehouse.






44. A synonym for data mining






45. This is not considered one of the four major categories of processing algorithms and rule approaches.






46. When an entity has a minimum cardinality of one it means the entity is required in _______.






47. The process by which numerical data is converted into graphical images is referred to as:






48. On an ER Diagram the number (mark) on relationship line that is farthest away from each entity (rectangle) represents the _______ cardinality.






49. A _____________ is a system-generated primary key.






50. The product of two tables is also called the ________ product.