Apache Spark is a powerful execution engine for large-scale parallel data processing across a cluster of machines, which enables rapid application development and high performance. With Spark 3.0, big improvements make it possible to use the massively parallel architecture of GPUs to further accelerate Spark data processing.