PySpark Native Functions
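In this tutorial, we will explore some common native functions from pyspark.sql.functions, including sum, avg, min, max, concat, and col, and see how they combine with DataFrame methods such as agg, select, filter, and withColumn.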
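from pyspark.sql.functions import *  # note: shadows Python's built-in sum, min, and max

# Aggregate a whole DataFrame: total, mean, minimum, and maximum of column1
df.agg(sum("column1"))
df.agg(avg("column1"))
df.agg(min("column1"))
df.agg(max("column1"))
# Concatenate two columns into one
df.select(concat(col("column1"), col("column2")))
# Name the aggregated column with alias
df.select(sum("column1").alias("sum_column1"))
# Keep only rows where column1 is greater than 10
df.filter(col("column1") > 10)
# Add a column holding column1 + column2
df.withColumn("new_column", col("column1") + col("column2"))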