How To Install Apache Spark on CentOS 7

Install Apache Spark on CentOS 7

In this tutorial we will show you how to install Apache Spark on CentOS 7 server. For those of you who didn’t know, Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala and Python, and also an optimized engine which supports overall execution charts. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured information processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming.

This article assumes you have at least basic knowledge of Linux, know how to use the shell, and most importantly, you host your site on your own VPS. The installation is quite simple and assumes you are running in the root account, if not you may need to add ‘sudo’ to the commands to get root privileges. I will show you through the step by step install Apache Spark on CentOS 7 server.

Install Apache Spark on CentOS 7

Step 1. First let’s start by ensuring your system is up-to-date.

Step 2. Installing Java.

Installing java for requirement install apache spark:

Once installed, check java version:

Step 3. Installing Scala.

Spark installs Scala during the installation process, so we just need to make sure that Java and Python are present:

Once installed, check scala version:

Step 4. Installing Apache Spark.

Install Apache Spark using following command:

Setup some Environment variables before you start spark:

The standalone Spark cluster can be started manually i.e. executing the start script on each node, or simple using the available launch scripts. For testing we can run master and slave daemons on the same machine:

Step 5. Configure Firewall for Apache Spark.
Step 6. Accessing Apache Spark.

Apache Spark will be available on HTTP port 7077 by default. Open your favorite browser and navigate to or http://server-ip:7077 and complete the required the steps to finish the installation.

Install Apache Spark on CentOS 7

Congratulation’s! You have successfully installed Apache Spark on CentOS 7. Thanks for using this tutorial for installing Apache Spark on CentOS 7 systems. For additional help or useful information, we recommend you to check the official Apache Spark web site.

VPS Manage Service Offer
If you don’t have time to do all of this stuff, or if this is not your area of expertise, we offer a service to do “VPS Manage Service Offer”, starting from $10 (Paypal payment). Please contact us to get a best deal!