Point out the wrong statement.
A. Kafka can serve as a kind of external commit-log for a distributed system
B. The log helps replicate data between nodes and acts as a re-syncing mechanism for failed nodes to restore their data
C. Kafka comes with a command-line client that will take input from a file or from standard input and send it out as messages to the Kafka cluster
D. All of the mentioned
Answer: Option D
What is a common optimization technique to improve Hadoop MapReduce performance?
A. Increase block size
B. Decrease block size
C. Maintain the default block size
D. Use fewer mappers
Which compression codec is commonly used for optimizing storage in Hadoop?
A. Gzip
B. Snappy
C. Bzip2
D. LZO
What is the purpose of Hadoop speculative execution?
A. To handle speculative workloads
B. To minimize resource usage
C. To mitigate the impact of slow-running tasks
D. To speed up task completion
How can data skew in a Hadoop job be addressed for optimization?
A. Increase the number of reducers
B. Decrease the number of reducers
C. Use a combiner function
D. Use a custom partitioner
Join The Discussion