pyspark.sql.DataFrame.foreachPartition¶
-
DataFrame.
foreachPartition
(f: Callable[[Iterator[pyspark.sql.types.Row]], None]) → None[source]¶ Applies the
f
function to each partition of thisDataFrame
.This a shorthand for
df.rdd.foreachPartition()
.New in version 1.3.0.
Examples
>>> def f(people): ... for person in people: ... print(person.name) >>> df.foreachPartition(f)