Select Category
All Categories
Accounting
Agriculture
Astrobiology
Banking & Finance
Business World
Civilizations
Commerce & Economics
Competitive exams
Computer Science & IT
Current affairs
Earth Science
Education
Environment
Explore the Animal World
Food chain
General Knowledge
Geography Map
Green Life
History
Human Health
Journalism
Law
Science
Social Science
TAX
About Us
Contact Us
Login
Data Mining - Part 1
1
of
25
💡
Hints:
3
Q1. Of the following what are the distance based clustering algorithms?
A. K-Means
B. K-Medoids
C. Hierarchical
D. All
Q2. A collection of integrated, subject oriented databases designed to support the decision-support functions
A. Database
B. Data Collection
C. Data Warehouse
D. Data retrieval
Q3. Same person with multiple email addresses is an example of ___
A. Noise
B. Outliers
C. Missing values
D. Duplicate data
Q4. ___ often used for both the prelimi nary investigation of the data and the f inal data analysis
A. Aggregation
B. Feature creation
C. Sampling
D. Attribution transformation
Q5. This is a term that describes the large volume of data; both structured and unstructured.
A. Big Data
B. Big Knowledge
C. Big Information
D. Data at rest
Q6. ___ refers to the mapping or classifi cation of a class with some predefined group or class.
A. Data Discrimination
B. Data Characterization
C. Data Definition
D. Data Visualization
Q7. Is Logistic regression a supervised machine learning algorithm?
A. TRUE
B. FALSE
Q8. a data warehouse can include
A. flat-files
B. database table
C. online data
D. all
Q9. In statistical distribution the outlier is identified using ___
A. working hypothesis
B. discontency test
C. alternative hypothesis
D. none of the above
Q10. Select the tools that can be used for data mining
A. KNIME
B. WEKA
C. RATTLE
D. All
Q11. Data mining is ___
A. A time variant non-Volatile collec tion of data
B. The actual discovery phase of a Knowledge
C. The stage of selecting the right data
D. None of these
Q12. Lift dari rule K ___ E adalah ___
×
A. 0.4
B. 0.6
C. 0.8
D. 1
Q13. ___ refers to the grouping of records, observations, or cases into classes of similar objects
A. Clustering
B. Grouping
C. Classification
D. Gathering
Q14. Of the following which is not a distance based clustering algorithms?
A. K-Means
B. K-Medoids
C. BIRCH
D. DBSCAN
Q15. Find the median of these numbers:4,2,7,4,3
A. 2
B. 5
C. 7
D. 4
Q16. The mass of the beaker was 122 g.
A. Qualitative Data
B. Quantitative Data
Q17. k-means algorithm is sensitive to outliers
A. TRUE
B. FALSE
Q18. ___ is the output of KDD
A. Query
B. Useful Information
C. Data
D. Information
Q19. DBSCAN has a drawback over OPTICS
A. fixed sized radiius
B. fixed sized points in radiius
C. core points
D. none of above
Q20. Suppose a cluster contain the points (1, 3), (3, 3), (2, 1). What is the centroid of the cluster?
A. (2, 2.33)
B. (2.33, 2)
C. (2, 3)
D. None
Q21. Which of the following is not involve in data mining
A. Data archaeology
B. Knowledge extraction
C. Data transformation
D. Data exploration
Q22. Application server and data server are kept separately in.
A. Peer to Peer based Processing
B. Master slave based Processing
C. Host based Processing
D. 3-Tier Client Server model
Q23. Select the skills mainly required as a competent data analyst/scientist/miner
A. SQL
B. R
C. Java
D. All
Q24. The view over an operational data warehouse is known as virtual warehouse
A. TRUE
B. FALSE
Q25. Ordinal is an example of ___
A. Ratio class
B. Interval class
C. Categorical class
D. Range class
Submitting Your Quiz...
Please wait while we process your answers