20 questions
1) Which of the following is the processing of data to isolate patterns and establish relationships between data entities within the data lake.
Database
Machine Learning
Data Science
Data Mining
2) Which of the following is a storage repository for a massive amount of raw data and stores data in native format, in anticipation of future requirements
Data Mart
Database
Data Warehouse
Data Lake
3) Which of the followings are open source software?
Apache Spark
Mesos
Cassandra
All
4) Which of the following software is used for data visualization?
Akka
Kafka
GraphX
Mesos
5) Which of the following support processing of streaming data?
Mesos
Cassandra
Kafka
Akka
6) MQTT stands for
Message Queue Telemetry Transport
Message Queue Transport Telemetry
Message Queue Tele Transportation
Message Queue Travel Transport
7) How many stages are involved in CRISP-DM?
5
6
4
7
8) HORUS Stands for
Homogeneous Ontology for Recursive Uniform Schema
Hetrogeneous Ontology for Recursive Uniform Schema
Homogeneous Ontology for Return Uniform Schema
Hetrogeneous Ontology for Return Uniform Schema
9) How many layers are there in data science framework?
7
5
4
6
10) Six super set involved in function layers are
R-A-P-T-R-O
R-A-P-O-T-R
R-A-P-T-O-R
R-T-A-P-O-R
11) The super step which contains all the processing chains for building the data warehouse is
Retrive
Process
Transform
Organize
12) The super step which contains all the processing chains for building the data vault is
Retrive
Process
Transform
Organize
13) Which layer convert business requirements into data science requirements
Utility Layer
Functional Layer
Business Layer
Operational Management Layer
14) SCD Stand for
Slow Changing Dimension
Slow Constant Dimension
Speed Changing Dimension
Speed Constant Dimension
15) SCD Type 2
Only Update
Fast Growing Dimension
Transition Dimension
Keep Complete History
16) Basic data structure used to build data vault are
Hub
Link
Satellite
All
17)______________is an actor-based message-driven runtime for running concurrency, elasticity, and resilience processes
Kafka
Spark
Mesos
Akka
18) Which of the following Operational Management Layer super step used Drum Buffer Rope Technique
Parameter
Scheduling
Monitoring
Communication
19) which super step contains all the processing chains for building the data marts from the core data warehouse
Report
Retrieve
Transform
Organize
20) Which data storage methodology does not require a schema before you can load the data
Scheme-on-Read
Scheme-on-Write
Scheme-on-ReadWrite
Scheme-on-WriteRead