SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
Data Mining
Start Test
Study First
Subject
:
it-skills
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. The SQL built-in functions - which may appear on the same line as the SELECT statement (before the FROM clause) are called _____ functions.
Fact or Measurement table
OLAP
machine learning
aggregate
2. You can save the results of a query as a table by including the _____ clause in the query.
decile chart
changing/UPDATE-ing
drill-across report
Into
3. Models that do ___________: MLR; KNN; Regression and Classification Trees; ANN; SVM
neural networks & Decision Trees
numeric prediction
project readiness assessment factor
near-line secondary storage devices
4. To add a new row to a table - use the _____ command.
database administrator
n
Insert
knowledge data discovery
5. An alternative to the data warehouse concept is a lower-cost - scaled-down version referred to as the _____________.
ERD Modeling
ALTER TABLE Part DELETE Warehouse;
data mart
decile chart
6. Useful for assessing performance in terms of identifying the most important class. Helps such choices as: How many tax records to examine; How many loans to grant; How many customers to mail an offer
semantic object
lift charts
Document Analyzer
semantic object (SOL) attribute
7. Which clause would be used to create groups of records?
Fact or Measurement table
operational and external layer
Group By
dimension
8. Which function should be used to calculate the total of all entries in a given column?
Sum
MAE (Mean Absolute Error) deviation
association semantic object
DROP TABLE Salesrep;
9. Organizes and analyzes data as an n-dimensional cube. The cube can be thought of as a common spreadsheet with two extensions: (1) support for multiple dimensions and (2) support for multiple concurrent users.
MOLAP
machine learning
MAE (Mean Absolute Error) deviation
principle component analysis
10. The process that records how data from operational data stores and external sources are transformed on the way into the warehouse is referred to as ________________.
association semantic object
transformation mapping
composite semantic objects
database administrator
11. Gives an idea of systematic over- or under-prediction. Magnitude of average absolute error.
numeric prediction
average error
system catalog
surrogate key
12. Gives us an idea of the magnitude of errors. Actual value - estimated value.
system catalog
MAE (Mean Absolute Error) deviation
Regression analysis
Breakeven analysis
13. __________ occurs when the initial scope of a project continues to expand as new features are incorporated into the project.
DROP TABLE Salesrep;
average error
semantic object
Scope creep
14. Twice as likely to identify the important class (compared to avg. prevalence)
Group By
the relationship
system catalog
decile chart
15. The SQL command for deleting the Warehouse field from the Part table is _____.
Document Analyzer
ALTER TABLE Part DELETE Warehouse;
groves law
Sum
16. On an ER Diagram the number (mark) on relationship line that is farthest away from each entity (rectangle) represents the _______ cardinality.
decile chart
changing/UPDATE-ing
maximum
Cartesian
17. Information about tables in the database is kept in the _____.
PRIMARY KEY (CustomerNum)
maximum
Cartesian
system catalog
18. The _______________________ represents the source data for the DW. This layer is comprised - primarily - of operational transaction processing systems and external secondary databases.
MAE (Mean Absolute Error) deviation
Document Analyzer
operational and external layer
performance metrics - Numeric Prediction
19. _________ seeks to ensure that each application under development is fully integrated within its own boundaries and to eliminate any inconsistencies in the final software product.
Document Analyzer
Count
volatile data
Horizontal integration
20. Which data mining technique utilizes linkage analysis to search operational transactions for patterns with a high probability of repetition?
average error
Association
ERD Modeling
Count
21. Why are Star Schemas so useful in Financial Planning and Accounting Information Systems?
Transformation
database administrator
degrees of summarization
Breakeven analysis
22. When an entity has a minimum cardinality of one it means the entity is required in _______.
degrees of summarization
the relationship
semantic object
volatile data
23. The set of activities used to find new - hidden - or unexpected patterns in data is referred to as _____.
data mining
surrogate key
transformation mapping
near-line secondary storage devices
24. The ACCESS feature that tests to see if your tables are normalized properly is the ____.
lift charts
Cartesian
surrogate key
Document Analyzer
25. Which function calculates the number of entries in a table?
Top-down approach
Count
Revoke
Document Analyzer
26. Within most organizations - the person known as the _____ determines the type of access various users can have to the corporate or enterprise database.
semantic object (SOL) attribute
n
database administrator
Scope creep
27. An economic feasibility measure. So is Internal rate of return.
Breakeven analysis
OLAP
drill-across report
project readiness assessment factor
28. A powerful trend in IT is known as - which maintains that Computer transmission speed doubles every 18 months.
degrees of summarization
decile chart
groves law
system catalog
29. An ___________ relates two other objects.
association semantic object
drill-across report
numeric prediction
principle component analysis
30. In general - ______________ are transformed to relations/tables by defining one relation for the object itself and another relation for each multivalued attribute.
principle component analysis
operational and external layer
composite semantic objects
data mart
31. A single column that you create for an entity to serve as the primary key - because you otherwise would need many concatenated columns to do so - is called a(n) ____________.
recognizing known patterns
cascading delete
surrogate key
artificial Key
32. A common example of the use of association methods where a retailer can mine the data generated by a point-of-sale system - such as the price scanner you are familiar with at the grocery store is referred to as:
semantic object (SOL) attribute
n
market basket analysis
maximum
33. A ___________ combines result sets from more than one fact table.
cascading delete
aggregate
data mart
drill-across report
34. A synonym for data mining
Into
knowledge data discovery
operational and external layer
Transformation
35. This is not considered one of the four major categories of processing algorithms and rule approaches.
database administrator
Group By
Insert
principle component analysis
36. Which rule would you be violating - if you tried to delete a sales rep record - who currently has customers on file?
DROP TABLE Salesrep;
database administrator
Referential integrity
degrees of summarization
37. Are a data mining technology.
changing/UPDATE-ing
decile chart
neural networks & Decision Trees
average error
38. R- squared(and adjusted r-squared) - A measure of how much of the variability around the target mean is explained by your predictive variables. Doesn't mean you have a good predictive model—only validation will tell you that
operational and external layer
performance metrics - Numeric Prediction
average error
Top-down approach
39. A compound semantic object is an object that contains at least one ____.
semantic object (SOL) attribute
data mart
near-line secondary storage devices
operational and external layer
40. Generally Semantic Object Modeling (SOM) is consideredmore bottom-up oriented than _____________.
Regression analysis
Insert
Document Analyzer
ERD Modeling
41. The product of two tables is also called the ________ product.
data mart
data visualization
association semantic object
Cartesian
42. The deletion of a record that also deletes related records is referred to as a(n) _____.
cascading delete
MAE (Mean Absolute Error) deviation
changing/UPDATE-ing
aggregate
43. Which of the following database design and data warehouse design approaches is viewed to take a more strategic rather than operational perspective?
groves law
Count
Top-down approach
maximum
44. Semantic object link (SOL) attributes establish a relationship between one _______ and another.
association semantic object
volatile data
recognizing known patterns
semantic object
45. The term "ETL" in data warehousing stands for: Extraction - ________________________ - & Loading.
Breakeven analysis
semantic object (SOL) attribute
Transformation
performance metrics - Numeric Prediction
46. The minimum cardinality and m is the maximum cardinality Cardinalities in Semantic Objects are shown as subscripts in the format n-m where _____
n
measuring predictive error
Transformation
groves law
47. Not the same as goodness-of-fit; We want to know how well the model predicts new data - not how well it fits the data it was trained with; Key component of most measures is difference between actual y and predicted y (error)
knowledge data discovery
measuring predictive error
market basket analysis
Referential integrity
48. Which of the following is at the center of a star schema?
Fact or Measurement table
maximum
ERD Modeling
volatile data
49. ____________ would not normally be associated with ROUTINE data warehouse maintenance.
maximum
surrogate key
system catalog
changing/UPDATE-ing
50. A _____________ is a system-generated primary key.
association semantic object
Fact or Measurement table
surrogate key
market basket analysis