Which distributed computing framework is known for its in-memory processing capabilities and is often used for iterative machine learning algorithms?
A. Apache Kafka
B. Apache HBase
C. Apache Spark
D. Apache Hive
Answer: Option C
What is the primary characteristic of "big data"?
A. Small volume of data
B. High velocity of data
C. Variety of data sources
D. Low complexity of data
In the context of big data, what does the "3Vs" represent?
A. Velocity, Value, Variability
B. Volume, Variety, Velocity
C. Volume, Value, Variety
D. Velocity, Veracity, Variety
A. Java
B. Python
C. Hadoop
D. SQL
What is the main purpose of the Hadoop Distributed File System (HDFS) in a Hadoop ecosystem?
A. Real-time data processing
B. Data storage and retrieval
C. Data visualization
D. Data encryption