Apache Spark demo example (Python)

This is a starter app for the Apache Spark Python template.

Running the application on a Spark standalone cluster via Docker

To run the application, execute the following steps:

  1. Set up a Spark cluster as described at http://github.com/big-data-europe/docker-spark by running:
    git clone https://github.com/big-data-europe/docker-spark.git
    cd docker-spark
    docker-compose up -d
  2. Build the Docker image:
    bash build.sh python-example examples/python
  3. Run the Docker container:
    docker run --rm --network dockerspark_default --name pyspark-example bde2020/spark-python-example:3.3.0-hadoop3.3
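
The run command in step 3 hard-codes the network name `dockerspark_default`. Docker Compose derives that name from the project directory, and some Compose versions may keep the hyphen (for example `docker-spark_default`); `docker network ls` shows the actual name. The following is a minimal sketch, not part of this repository, of a small Python helper that assembles the same command with the network as a parameter. The function name `build_run_command` and its defaults are hypothetical and chosen only for illustration.

```python
import shlex


def build_run_command(
    network="dockerspark_default",  # assumption: adjust to the name shown by `docker network ls`
    image="bde2020/spark-python-example:3.3.0-hadoop3.3",
    name="pyspark-example",
):
    """Return the `docker run` invocation from step 3 as an argument list."""
    return ["docker", "run", "--rm", "--network", network, "--name", name, image]


if __name__ == "__main__":
    # Print the command so it can be reviewed before it is run by hand.
    print(shlex.join(build_run_command()))
```

With the defaults, the printed command matches step 3 exactly. The returned list can also be passed to `subprocess.run` if the container should be started from Python.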