When you install Spark, the following directories will be created:
/usr/hdp/current/spark-client for submitting Spark jobs
/usr/hdp/current/spark-history for launching Spark master processes, such as the Spark History Server
/usr/hdp/current/spark-thriftserver for the Spark Thrift Server
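After installation, one way to confirm that these directories are in place is a small shell check. The following is only a sketch: the function name check_spark_dirs and its optional prefix argument are hypothetical helpers, not part of HDP or Spark.

```shell
#!/bin/sh
# Hypothetical helper: report which of the three Spark directories listed
# above exist under a prefix (default: /usr/hdp/current).
# The return status is the number of missing directories (0 means all found).
check_spark_dirs() {
    prefix="${1:-/usr/hdp/current}"
    missing=0
    for name in spark-client spark-history spark-thriftserver; do
        if [ -d "$prefix/$name" ]; then
            echo "found:   $prefix/$name"
        else
            echo "missing: $prefix/$name"
            missing=$((missing + 1))
        fi
    done
    return "$missing"
}
```

Calling check_spark_dirs with no argument checks the default location. Passing a different prefix is useful only for trying the function out against a scratch directory.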
To install Spark:
Search for Spark in the HDP repo:
For RHEL or CentOS:
yum search spark
For SLES:
zypper search spark
For Ubuntu and Debian:
apt-cache search spark
This will show all the versions of Spark available. For example:
spark_2_3_4_0_3485-master.noarch : Server for Spark master
spark_2_3_4_0_3485-python.noarch : Python client for Spark
spark_2_3_4_0_3485-worker.noarch : Server for Spark worker
spark_2_3_4_0_3485.noarch : Lightning-Fast Cluster Computing
Install the version corresponding to the HDP version you currently have installed.
For RHEL or CentOS:
yum install spark_<version>-master spark_<version>-python
For SLES:
zypper install spark_<version>-master spark_<version>-python
For Ubuntu and Debian:
apt-get install spark_<version>-master
apt-get install spark_<version>-python
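The <version> placeholder in the package names appears to be the HDP build string with dots and hyphens replaced by underscores. This is an assumption inferred from the example search output, where spark_2_3_4_0_3485 would correspond to a build such as 2.3.4.0-3485. Under that assumption, the package names can be derived as sketched here; hdp_pkg_suffix and spark_packages are hypothetical helper names.

```shell
#!/bin/sh
# Hypothetical helper: convert an HDP build string such as 2.3.4.0-3485
# into the underscore-separated suffix used in the package names.
# Assumes the naming pattern seen in the example search output.
hdp_pkg_suffix() {
    printf '%s\n' "$1" | sed 's/[.-]/_/g'
}

# Hypothetical helper: print the two package names used in the
# install commands above for a given HDP build string.
spark_packages() {
    suffix=$(hdp_pkg_suffix "$1")
    echo "spark_${suffix}-master spark_${suffix}-python"
}
```

The output of spark_packages can be compared against the names returned by the search step before passing them to yum, zypper, or apt-get.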
Before you launch the Spark Shell or Thrift Server, make sure that you set $JAVA_HOME:
export JAVA_HOME=<path to JDK 1.8>
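Before launching, it can help to verify that JAVA_HOME is set and points at a directory containing a java executable. The following is a minimal sketch. The function name check_java_home is hypothetical, and the check does not verify the JDK version.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if JAVA_HOME is set and
# $JAVA_HOME/bin/java is an executable file.
# It does not check which JDK version is installed there.
check_java_home() {
    if [ -z "${JAVA_HOME:-}" ]; then
        echo "JAVA_HOME is not set" >&2
        return 1
    fi
    if [ ! -x "$JAVA_HOME/bin/java" ]; then
        echo "no executable found at $JAVA_HOME/bin/java" >&2
        return 1
    fi
    echo "JAVA_HOME=$JAVA_HOME"
}
```

Running check_java_home in the same shell session that will launch the Spark Shell or Thrift Server confirms that the export took effect there.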

