11. InfoSphere DataStage has . . . . . . . . levels of Parallelism.
12. A . . . . . . . . action can be configured to perform file system cleanup and directory creation before starting the mapreduce job.
13. Kafka is run as a cluster comprised of one or more servers each of which is called . . . . . . . .
14. . . . . . . . . is a distributed real-time computation system for processing large volumes of high-velocity data.
15. Amazon EMR clusters can read and process Amazon . . . . . . . . streams directly.
16. . . . . . . . . is the amount of time to keep a log segment before it is deleted.
17. . . . . . . . . is a fully integrated, state-of-the-art analytic database architected specifically to leverage strengths of Hadoop.
18. Which of the following is java-based tool for tracking, resolving and managing project dependencies?
19. Point out the wrong statement.
20. Impala is an integrated part of a . . . . . . . . enterprise data hub.
Read More Section(Best Practices and Optimization of Hadoop)
Each Section contains maximum 100 MCQs question on Best Practices and Optimization of Hadoop. To get more questions visit other sections.