Javatpoint Logo
Javatpoint Logo

Spark groupByKey Function

In Spark, the groupByKey function is a frequently used transformation operation that performs shuffling of data. It receives key-value pairs (K, V) as an input, group the values based on key and generates a dataset of (K, Iterable) pairs as an output.

Example of groupByKey Function

In this example, we group the values based on the key.

  • To open the Spark in Scala mode, follow the below command.
Spark groupByKey Function
  • Create an RDD using the parallelized collection.

Now, we can read the generated result by using the following command.

Spark groupByKey Function
  • Apply groupByKey() function to group the values.
  • Now, we can read the generated result by using the following command.
Spark groupByKey Function

Here, we got the desired output.







Youtube For Videos Join Our Youtube Channel: Join Now

Feedback


Help Others, Please Share

facebook twitter pinterest

Learn Latest Tutorials


Preparation


Trending Technologies


B.Tech / MCA