1.
What is the primary characteristic of "big data"?

5.
In distributed computing, what is the term for a group of computers connected over a network that work together to solve a problem or perform a task?

7.
What is the primary advantage of using distributed computing frameworks like Hadoop and Spark for big data processing?

8.
Which distributed computing framework is known for its in-memory processing capabilities and is often used for iterative machine learning algorithms?

9.
What is the main goal of data partitioning in distributed computing?

10.
Which technology is commonly used for real-time stream processing of big data and is part of the Apache ecosystem?