Test your basic knowledge | Data Mining
Subject: it-skills
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can study here.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So you might at times find the answers obvious, but you will see that it reinforces your understanding as you take the test each time.
1. The product of two tables is also called the ________ product.
Revoke
principal component analysis
Regression analysis
Cartesian
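Study note for question 1: a minimal sketch of what the product of two tables looks like, run through Python's built-in sqlite3 module. The Rep and Region tables and their rows are invented for illustration and do not come from the test.

```python
import sqlite3

# Hypothetical two-row and three-row tables, invented for this example.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Rep (RepNum TEXT);
    CREATE TABLE Region (RegionName TEXT);
    INSERT INTO Rep VALUES ('20'), ('35');
    INSERT INTO Region VALUES ('East'), ('West'), ('North');
""")

# CROSS JOIN pairs every row of Rep with every row of Region.
pairs = conn.execute(
    "SELECT RepNum, RegionName FROM Rep CROSS JOIN Region"
).fetchall()

print(len(pairs))  # 2 rows x 3 rows gives 6 combinations
conn.close()
```

The row count of the product is always the row count of the first table multiplied by the row count of the second.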
2. This is not considered one of the four major categories of processing algorithms and rule approaches.
Grove's law
artificial Key
Cartesian
principal component analysis
3. ___________ is not a characteristic of a data warehouse.
Horizontal integration
OLAP
numeric prediction
volatile data
4. An ___________ relates two other objects.
machine learning
association semantic object
Fact or Measurement table
degrees of summarization
5. Generally, Semantic Object Modeling (SOM) is considered more bottom-up oriented than _____________.
Count
ERD Modeling
market basket analysis
Fact or Measurement table
6. You can save the results of a query as a table by including the _____ clause in the query.
ERD Modeling
MOLAP
Insert
Into
7. Gives us an idea of the magnitude of errors. Actual value - estimated value.
ALTER TABLE Part DELETE Warehouse;
MAE (Mean Absolute Error) deviation
near-line secondary storage devices
Count
8. Which data mining technique utilizes linkage analysis to search operational transactions for patterns with a high probability of repetition?
Top-down approach
knowledge data discovery
volatile data
Association
9. Cardinalities in Semantic Objects are shown as subscripts in the format n-m, where _____ is the minimum cardinality and m is the maximum cardinality.
Association
average error
n
Cartesian
10. These are data mining technologies.
PRIMARY KEY (CustomerNum)
neural networks & Decision Trees
ERD Modeling
artificial Key
11. Useful for assessing performance in terms of identifying the most important class. Helps such choices as: How many tax records to examine; How many loans to grant; How many customers to mail an offer
recognizing known patterns
Referential integrity
lift charts
Scope creep
12. _________ seeks to ensure that each application under development is fully integrated within its own boundaries and to eliminate any inconsistencies in the final software product.
knowledge data discovery
Fact or Measurement table
Horizontal integration
Group By
13. Which of the following is at the center of a star schema?
Fact or Measurement table
The degree of granularity
semantic object
neural networks & Decision Trees
14. Alternate storage techniques for data warehousing include _____.
system catalog
cascading delete
near-line secondary storage devices
Revoke
15. A _____________ is a system-generated primary key.
surrogate key
UNION
numeric prediction
Transformation
16. On an ER diagram, the number (mark) on the relationship line that is farthest away from each entity (rectangle) represents the _______ cardinality.
database administrator
maximum
Transformation
The degree of granularity
17. To add a new row to a table, use the _____ command.
changing/UPDATE-ing
Association
Insert
Fact or Measurement table
18. Gives an idea of systematic over- or under-prediction. Magnitude of average absolute error.
system catalog
near-line secondary storage devices
average error
data mining
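Study note for questions 7 and 18: a small worked example of the two error summaries, taking error as actual value minus estimated value as question 7 states. The numbers are made up for illustration.

```python
# Hypothetical actual and predicted values, invented for this example.
actual = [10.0, 12.0, 9.0, 15.0]
predicted = [11.0, 10.0, 9.0, 18.0]

# Error for each record: actual value minus estimated value.
errors = [a - p for a, p in zip(actual, predicted)]

# The signed mean keeps the direction of the errors, so positive and
# negative errors can cancel; a nonzero result points to systematic
# over- or under-prediction.
average_error = sum(errors) / len(errors)

# The mean of the absolute values ignores direction and reports only
# the typical size of an error.
mean_absolute_error = sum(abs(e) for e in errors) / len(errors)

print(average_error, mean_absolute_error)
```

Here the signed mean is negative, so this toy model over-predicts on balance, while the absolute mean is larger because cancellation is not allowed.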
19. A synonym for data mining
The degree of granularity
knowledge data discovery
Breakeven analysis
Group By
20. The term _____ has been generally agreed to represent the broadest category of software technology that enables decision makers to conduct multidimensional analysis of consolidated enterprise data.
aggregate
machine learning
OLAP
volatile data
21. An alternative to the data warehouse concept is a lower-cost, scaled-down version referred to as the _____________.
database administrator
data mart
aggregate
semantic object
22. Which statement removes the table Salesrep from a DBMS?
machine learning
Horizontal integration
Count
DROP TABLE Salesrep;
23. An analytical-oriented organizational structure is a data warehouse _____________.
changing/UPDATE-ing
project readiness assessment factor
principal component analysis
numeric prediction
24. R-squared (and adjusted R-squared): a measure of how much of the variability around the target mean is explained by your predictive variables. A high value doesn't mean you have a good predictive model; only validation will tell you that.
cascading delete
ALTER TABLE Part DELETE Warehouse;
OLAP
performance metrics - Numeric Prediction
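Study note for question 24: a minimal sketch of R-squared computed as one minus the ratio of the residual sum of squares to the total sum of squares around the target mean. The data values are invented for illustration, and the adjusted form is not shown.

```python
# Hypothetical target values and model predictions, invented for this example.
actual = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 1.5, 3.5, 3.5]

mean_actual = sum(actual) / len(actual)

# Total variability of the target around its mean.
ss_total = sum((a - mean_actual) ** 2 for a in actual)

# Variability left unexplained by the predictions.
ss_residual = sum((a - p) ** 2 for a, p in zip(actual, predicted))

r_squared = 1 - ss_residual / ss_total
print(round(r_squared, 3))
```

Because this is computed on the same data the model was fit to, it says nothing about performance on new records, which is the point the question makes about validation.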
25. Which clause would be used to create groups of records?
Grove's law
Group By
changing/UPDATE-ing
ALTER TABLE Part DELETE Warehouse;
26. The SQL built-in functions, which may appear on the same line as the SELECT statement (before the FROM clause), are called _____ functions.
Count
aggregate
near-line secondary storage devices
Breakeven analysis
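Study note for questions 25 and 26: a sketch of grouping records and applying built-in functions to each group, run through Python's sqlite3 module. The Customer table layout and its rows are assumptions made for this example.

```python
import sqlite3

# Hypothetical Customer table, invented for this example.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Customer (
        CustomerNum TEXT PRIMARY KEY,
        RepNum TEXT,
        Balance REAL
    );
    INSERT INTO Customer VALUES
        ('148', '20', 100.0),
        ('282', '35', 250.0),
        ('356', '20', 50.0);
""")

# One output row per RepNum; COUNT and SUM are evaluated within each group.
rows = conn.execute(
    "SELECT RepNum, COUNT(*), SUM(Balance) "
    "FROM Customer GROUP BY RepNum ORDER BY RepNum"
).fetchall()

for rep_num, customer_count, total_balance in rows:
    print(rep_num, customer_count, total_balance)
conn.close()
```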
27. Twice as likely to identify the important class (compared to avg. prevalence)
decile chart
the relationship
Regression analysis
Cartesian
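Study note for question 27: a rough sketch of how the lift of the top decile might be computed, assuming each record carries a model score and a 0/1 flag for the important class. The 20 records are invented so that the top decile comes out at twice the average prevalence.

```python
# Hypothetical (score, actual_class) pairs, invented for this example.
records = [
    (0.95, 1), (0.90, 0), (0.85, 1), (0.80, 0), (0.75, 1),
    (0.70, 0), (0.65, 0), (0.60, 1), (0.55, 0), (0.50, 0),
    (0.45, 0), (0.40, 1), (0.35, 0), (0.30, 0), (0.25, 0),
    (0.20, 0), (0.15, 0), (0.10, 0), (0.05, 0), (0.01, 0),
]

# Rank records from highest to lowest score and take the top 10 percent.
ranked = sorted(records, key=lambda r: r[0], reverse=True)
top_n = len(ranked) // 10
top_decile = ranked[:top_n]

# Average prevalence of the important class across all records.
overall_rate = sum(y for _, y in ranked) / len(ranked)

# Prevalence of the important class within the top decile.
top_rate = sum(y for _, y in top_decile) / top_n

lift = top_rate / overall_rate
print(lift)
```

A lift of 2 for a decile means records in that decile are twice as likely to belong to the important class as a record picked at random.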
28. A powerful trend in IT is known as _____, which maintains that computer transmission speed doubles every 18 months.
UNION
performance metrics - Numeric Prediction
Grove's law
data mining
29. A ___________ combines result sets from more than one fact table.
transformation mapping
drill-across report
machine learning
Sum
30. The term "ETL" in data warehousing stands for: Extraction, ________________________, & Loading.
Transformation
Revoke
Regression analysis
neural networks & Decision Trees
31. The _____ operation of two tables results in a single table with the same columns as the first table, containing all rows that are in the first table merged with all the rows in the second table, minus any duplicate rows.
the relationship
knowledge data discovery
Group By
UNION
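Study note for question 31: a sketch of merging the rows of two tables with and without duplicate removal, run through Python's sqlite3 module. Tables A and B and their contents are invented for illustration.

```python
import sqlite3

# Hypothetical tables that share the value 3, invented for this example.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE A (Num INTEGER);
    CREATE TABLE B (Num INTEGER);
    INSERT INTO A VALUES (1), (2), (3);
    INSERT INTO B VALUES (3), (4);
""")

# UNION merges the rows of both tables and drops duplicate rows.
merged = conn.execute(
    "SELECT Num FROM A UNION SELECT Num FROM B ORDER BY Num"
).fetchall()

# UNION ALL keeps duplicates, shown here only for contrast.
merged_all = conn.execute(
    "SELECT Num FROM A UNION ALL SELECT Num FROM B"
).fetchall()

print(len(merged), len(merged_all))
conn.close()
```

Both SELECT statements must return the same number of columns for the operation to be valid.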
32. Which rule would you be violating if you tried to delete the record of a sales rep who currently has customers on file?
Association
Referential integrity
market basket analysis
Into
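Study note for question 32: a sketch of a DBMS refusing to delete a rep who still has customers, using Python's sqlite3 module. The two-table layout is an assumption made for this example, and SQLite only enforces foreign keys after the PRAGMA shown.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite leaves foreign key enforcement off unless it is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Hypothetical Rep and Customer tables, invented for this example.
conn.executescript("""
    CREATE TABLE Rep (RepNum TEXT PRIMARY KEY);
    CREATE TABLE Customer (
        CustomerNum TEXT PRIMARY KEY,
        RepNum TEXT REFERENCES Rep(RepNum)
    );
    INSERT INTO Rep VALUES ('20');
    INSERT INTO Customer VALUES ('148', '20');
""")

# Customer 148 still points at rep 20, so the delete is rejected.
try:
    conn.execute("DELETE FROM Rep WHERE RepNum = '20'")
    delete_blocked = False
except sqlite3.IntegrityError:
    delete_blocked = True

rep_count = conn.execute("SELECT COUNT(*) FROM Rep").fetchone()[0]
print(delete_blocked, rep_count)
conn.close()
```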
33. Not the same as goodness-of-fit; we want to know how well the model predicts new data, not how well it fits the data it was trained with; key component of most measures is the difference between actual y and predicted y (error)
measuring predictive error
performance metrics - Numeric Prediction
semantic object (SOL) attribute
Regression analysis
34. 'Signatures' are used for intrusion detection by _______?
PRIMARY KEY (CustomerNum)
recognizing known patterns
data visualization
Fact or Measurement table
35. Which function should be used to calculate the total of all entries in a given column?
degrees of summarization
ERD Modeling
Association
Sum
36. In general, ______________ are transformed to relations/tables by defining one relation for the object itself and another relation for each multivalued attribute.
composite semantic objects
changing/UPDATE-ing
The degree of granularity
Top-down approach
37. The _______________________ represents the source data for the DW. This layer is comprised primarily of operational transaction processing systems and external secondary databases.
recognizing known patterns
operational and external layer
project readiness assessment factor
DROP TABLE Salesrep;
38. The process that records how data from operational data stores and external sources are transformed on the way into the warehouse is referred to as ________________.
transformation mapping
operational and external layer
Grove's law
lift charts
39. The set of activities used to find new, hidden, or unexpected patterns in data is referred to as _____.
data mining
average error
semantic object
MAE (Mean Absolute Error) deviation
40. A star diagram has two types of tables (objects). They are called the ___________________ tables and the fact tables.
aggregate
Referential integrity
dimension
ERD Modeling
41. An economic feasibility measure. So is Internal rate of return.
Insert
Breakeven analysis
ERD Modeling
Scope creep
42. Information about tables in the database is kept in the _____.
principal component analysis
performance metrics - Numeric Prediction
semantic object (SOL) attribute
system catalog
43. To create the primary key clause for the Customer table on the CustomerNum field, which of the following is the correct statement?
Fact or Measurement table
system catalog
PRIMARY KEY (CustomerNum)
decile chart
44. The deletion of a record that also deletes related records is referred to as a(n) _____.
cascading delete
lift charts
UNION
Scope creep
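Study note for question 44: a sketch of a delete that also removes related records, using Python's sqlite3 module. The tables and the ON DELETE CASCADE rule are assumptions chosen for this example; other database systems may configure this behavior differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite leaves foreign key enforcement off unless it is switched on.
conn.execute("PRAGMA foreign_keys = ON")

# Hypothetical Rep and Customer tables, invented for this example.
conn.executescript("""
    CREATE TABLE Rep (RepNum TEXT PRIMARY KEY);
    CREATE TABLE Customer (
        CustomerNum TEXT PRIMARY KEY,
        RepNum TEXT REFERENCES Rep(RepNum) ON DELETE CASCADE
    );
    INSERT INTO Rep VALUES ('20'), ('35');
    INSERT INTO Customer VALUES ('148', '20'), ('356', '20'), ('282', '35');
""")

# Deleting rep 20 also deletes the customers that reference rep 20.
conn.execute("DELETE FROM Rep WHERE RepNum = '20'")

remaining = conn.execute(
    "SELECT CustomerNum FROM Customer ORDER BY CustomerNum"
).fetchall()
print(remaining)
conn.close()
```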
45. A compound semantic object is an object that contains at least one ____.
maximum
drill-across report
semantic object (SOL) attribute
Referential integrity
46. ___________ determines exactly what level of detail constitutes a fact record.
market basket analysis
machine learning
lift charts
The degree of granularity
47. Semantic object link (SOL) attributes establish a relationship between one _______ and another.
operational and external layer
semantic object
degrees of summarization
MAE (Mean Absolute Error) deviation
48. Models that do ___________: MLR; KNN; Regression and Classification Trees; ANN; SVM
Horizontal integration
numeric prediction
composite semantic objects
principal component analysis
49. Why are Star Schemas so useful in Financial Planning and Accounting Information Systems?
degrees of summarization
numeric prediction
Document Analyzer
cascading delete
50. ____________ would not normally be associated with ROUTINE data warehouse maintenance.
artificial Key
machine learning
near-line secondary storage devices
changing/UPDATE-ing