Test your basic knowledge

Data Mining

Subject : it-skills
Instructions:
  • Answer 50 questions in 15 minutes.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might at times find the answers obvious, but you will see that it reinforces your understanding each time you take the test.
1. The SQL command for deleting the Warehouse field from the Part table is _____.






2. Which statement will take away user privileges to the database?






3. _____ are a data mining technology.






4. A powerful trend in IT is known as _____, which maintains that computer transmission speed doubles every 18 months.






5. A single column that you create for an entity to serve as the primary key, because you otherwise would need many concatenated columns to do so, is called a(n) ____________.






6. A ___________ combines result sets from more than one fact table.






7. Which data mining technique utilizes linkage analysis to search operational transactions for patterns with a high probability of repetition?






8. A common example of the use of association methods, in which a retailer mines the data generated by a point-of-sale system (such as the price scanner you are familiar with at the grocery store), is referred to as:






9. The term _____ has been generally agreed to represent the broadest category of software technology that enables decision makers to conduct multidimensional analysis of consolidated enterprise data.






10. In general, ______________ are transformed to relations/tables by defining one relation for the object itself and another relation for each multivalued attribute.






11. The set of activities used to find new, hidden, or unexpected patterns in data is referred to as _____.






12. Not the same as goodness of fit: we want to know how well the model predicts new data, not how well it fits the data it was trained on. The key component of most measures is the difference between actual y and predicted y (the error).
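
As a rough illustration of the distinction drawn in question 12, the sketch below fits a straight line to a small training set and then measures root-mean-square error on both the training rows and a few held-out rows. The data values and the helper names `fit_line` and `rmse` are assumptions made up for this example; they are not part of the quiz.

```python
from math import sqrt


def fit_line(xs, ys):
    """Ordinary least squares fit of y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def rmse(actual, predicted):
    """Root-mean-square of the errors (actual y minus predicted y)."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return sqrt(sum(squared) / len(squared))


# Hypothetical data: the model only ever sees the training partition.
train_x = [1, 2, 3, 4, 5]
train_y = [2.1, 3.9, 6.2, 7.8, 10.1]
valid_x = [6, 7, 8]
valid_y = [12.5, 13.2, 16.9]

intercept, slope = fit_line(train_x, train_y)

train_rmse = rmse(train_y, [intercept + slope * x for x in train_x])
valid_rmse = rmse(valid_y, [intercept + slope * x for x in valid_x])

print(f"training error:   {train_rmse:.3f}")
print(f"validation error: {valid_rmse:.3f}")
```

With this particular made-up data the error on the held-out rows comes out larger than the error on the training rows, which is why the fit on the training data alone says little about performance on new data.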






13. 'Signatures' are used for intrusion detection by _______?






14. _________ seeks to ensure that each application under development is fully integrated within its own boundaries and to eliminate any inconsistencies in the final software product.






15. A _____________ is a system-generated primary key.






16. __________ occurs when the initial scope of a project continues to expand as new features are incorporated into the project.






17. The _____ operation of two tables results in a single table with the same columns as the first table, containing all rows that are in the first table merged with all the rows in the second table, minus any duplicate rows.






18. Which of the following database design and data warehouse design approaches is viewed as taking a strategic rather than an operational perspective?






19. The _______________________ represents the source data for the DW. This layer consists primarily of operational transaction processing systems and external secondary databases.






20. Useful for assessing performance in terms of identifying the most important class. Helps with choices such as how many tax records to examine, how many loans to grant, or how many customers to mail an offer to.






21. An analytical-oriented organizational structure is a data warehouse _____________.






22. Information about tables in the database is kept in the _____.






23. Which clause would be used to create groups of records?






24. ___________ is not a characteristic of a data warehouse.






25. When an entity has a minimum cardinality of one it means the entity is required in _______.






26. Alternate storage techniques for data warehousing include _____.






27. An alternative to the data warehouse concept is a lower-cost, scaled-down version referred to as the _____________.






28. R-squared (and adjusted R-squared): a measure of how much of the variability around the target mean is explained by your predictor variables. A high value doesn't mean you have a good predictive model; only validation will tell you that.
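
The two quantities named in question 28 can be sketched as follows, using the standard definitions (one minus the ratio of residual to total sum of squares, and the usual degrees-of-freedom correction). The sample values and the function names are hypothetical, chosen only to make the arithmetic easy to follow.

```python
def r_squared(actual, predicted):
    """Share of the variability around the target mean explained by the model."""
    mean_y = sum(actual) / len(actual)
    ss_residual = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_total = sum((a - mean_y) ** 2 for a in actual)
    return 1 - ss_residual / ss_total


def adjusted_r_squared(r2, n_rows, n_predictors):
    """Penalize R-squared for the number of predictors used."""
    return 1 - (1 - r2) * (n_rows - 1) / (n_rows - n_predictors - 1)


# Hypothetical target values and predictions from a one-predictor model.
actual = [3, 5, 7, 9, 11]
predicted = [2.8, 5.3, 6.9, 9.4, 10.6]

r2 = r_squared(actual, predicted)
adj_r2 = adjusted_r_squared(r2, n_rows=len(actual), n_predictors=1)

print(f"R-squared:          {r2:.4f}")
print(f"adjusted R-squared: {adj_r2:.4f}")
```

Both numbers here are computed on the same rows the predictions were made for, so they describe fit only; they say nothing about how the model would do on rows it has not seen.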






29. Gives an idea of systematic over- or under-prediction. Magnitude of average absolute error.
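
To make the difference between signed and absolute errors concrete, here is a minimal sketch that assumes the error is defined as actual value minus predicted value. The two sets of predictions and the function names are invented for illustration.

```python
def average_error(actual, predicted):
    """Mean of the signed errors; far from zero suggests systematic bias."""
    errors = [a - p for a, p in zip(actual, predicted)]
    return sum(errors) / len(errors)


def mean_absolute_error(actual, predicted):
    """Mean of the error magnitudes, ignoring direction."""
    errors = [abs(a - p) for a, p in zip(actual, predicted)]
    return sum(errors) / len(errors)


actual = [10, 20, 30, 40]

# Hypothetical model A: misses in both directions, so the signed errors cancel.
balanced = [12, 18, 33, 37]

# Hypothetical model B: always predicts 2 too high (systematic over-prediction).
biased = [12, 22, 32, 42]

for name, predicted in (("balanced", balanced), ("biased", biased)):
    print(
        name,
        "average error:", average_error(actual, predicted),
        "mean absolute error:", mean_absolute_error(actual, predicted),
    )
```

In this made-up case the balanced model has a signed average of zero yet the larger typical miss, while the biased model has a negative signed average (its predictions run high) with a smaller typical miss, so the two measures answer different questions.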






30. Which of the following is at the center of a star schema?






31. The SQL built-in functions, which may appear on the same line as the SELECT statement (before the FROM clause), are called _____ functions.






32. ___________ determines exactly what level of detail constitutes a fact record.






33. On an ER diagram, the number (mark) on the relationship line that is farthest away from each entity (rectangle) represents the _______ cardinality.






34. Organizes and analyzes data as an n-dimensional cube. The cube can be thought of as a common spreadsheet with two extensions: (1) support for multiple dimensions and (2) support for multiple concurrent users.






35. Within most organizations, the person known as the _____ determines the type of access various users can have to the corporate or enterprise database.






36. Models that do ___________: MLR; KNN; Regression and Classification Trees; ANN; SVM






37. To create the primary key clause for the Customer table on the CustomerNum field, which of the following is the correct statement?






38. Gives us an idea of the magnitude of errors. Actual value - estimated value.






39. Cardinalities in Semantic Objects are shown as subscripts in the format n-m, where _____.






40. An economic feasibility measure. So is Internal rate of return.






41. Which function should be used to calculate the total of all entries in a given column?






42. Twice as likely to identify the important class (compared to avg. prevalence)
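
The comparison in question 42 can be worked through numerically: rank the records by model score, take the top few, and divide the share of important-class records found there by the share in the whole data set. The scores, labels, and the function name `ratio_to_prevalence` below are assumptions made for this sketch.

```python
def ratio_to_prevalence(scores, labels, top_n):
    """Hit rate among the top_n highest-scored records, relative to the overall rate.

    labels use 1 for the important class and 0 otherwise.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top_labels = [label for _, label in ranked[:top_n]]
    hit_rate_top = sum(top_labels) / top_n
    prevalence = sum(labels) / len(labels)
    return hit_rate_top / prevalence


# Hypothetical model scores and true classes for eight records.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2]
labels = [1, 0, 1, 0, 0, 0, 0, 0]

ratio = ratio_to_prevalence(scores, labels, top_n=4)
print(f"top 4 vs. average prevalence: {ratio}")
```

Here 2 of the 8 records belong to the important class (25%), and both fall in the top 4 by score (50%), so picking from the top of the ranking is twice as likely to find one as picking at random.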






43. Which statement removes the table Salesrep from a DBMS?






44. The product of two tables is also called the ________ product.






45. The deletion of a record that also deletes related records is referred to as a(n) _____.






46. An ___________ relates two other objects.






47. Increased affordability of ____________ is a reason for the growth in popularity of data mining.






48. A synonym for data mining






49. To add a new row to a table - use the _____ command.






50. The term "ETL" in data warehousing stands for: Extraction, ________________________, and Loading.