当前位置:网站首页>Spark3.x源码编译
Spark3.x源码编译
2022-07-17 05:24:00 【顶尖高手养成计划】
简介
源码编译是我们定制化spark的前提,所以想成为一个高手那么就必须经历
开始
查看官网可以知道,编译需要maven 3.6.3,jdk 8 ,scala 2.12
maven下载地址
scala下载地址
All Available Versions | The Scala Programming Language
spark下载地址
安装包都准备好以后那么就开始安装了
tar -zxvf jdk-8u271-linux-x64.tar.gz
配置jdk环境变量
sudo vi /etc/profile.d/my_env.sh
#JAVA_HOME
export JAVA_HOME=/opt/module/jdk/jdk1.8.0_161
export PATH=$PATH:$JAVA_HOME/bin
export PATH=$PATH:$JAVA_HOME/sbin
source /etc/profile.d/my_env.sh
安装scala
tar -zxvf scala-2.12.0.tgz
sudo vi /etc/profile.d/my_env.sh
#SCALA_HOME
export SCALA_HOME=/home/bigdata/buildspark/scala-2.12.0
export PATH=$PATH:$SCALA_HOME/bin
source /etc/profile.d/my_env.sh
安装maven
tar -zxvf apache-maven-3.6.3-bin.tar.gz
sudo vi /etc/profile.d/my_env.sh
#MAVEN_HOME
export MAVEN_HOME=/home/bigdata/buildspark/apache-maven-3.6.3
export PATH=$PATH:$MAVEN_HOME/bin
source /etc/profile.d/my_env.sh
测试安装是否完成
mvn -version
scala -version
java -version
开始编译,参考官网参数
export MAVEN_OPTS="-Xmx2g -XX:ReservedCodeCacheSize=1g"
tar -zxvf spark-3.0.0.tgz
进入spark源码目录
cd spark-3.0.0
hadoop版本要和自己已安装好的hadoop版本相对应。命令表示生成的spark支持hadoop、hive、yarn
mvn -Pyarn -Phadoop-3.2 -Dhadoop.version=3.2.3 -Phive -Phive-thriftserver -DskipTests clean package
maven编译成功后,如果想要官网提供的那种二进制压缩包,则还需要输入以下命令
./dev/make-distribution.sh --name spark-hadoop3.2.3 --tgz -Pyarn -Phadoop-3.2 -Dhadoop.version=3.2.3 -Phive -Phive-thriftserver -DskipTests
等一会 以后
边栏推荐
- SQL skimming summary SQL leetcode review
- 深度学习笔记 Coursera Deep learning Notes
- Review of 4705 NLP
- Sword finger offer question brushing record - offer 06 Print linked list from end to end
- Gentoo installation tutorial (systemd+gnome)
- MySQL解压版安装步骤(Windows)
- Execute shell script under Linux to call SQL file and transfer it to remote server
- 4. Installation and use of idea
- SSM整合
- Configuration and use of cookies and sessions
猜你喜欢
Prevent blackmail attacks through data encryption schemes
网络知识-03 数据链路层-PPP
OpenSUSE install Netease cloud music (tumblefeed) (LEAP)
2021-10-25 浏览器兼容遇到的问题
CAN FD如何应用Vector诊断工具链?
网络知识-05 传输层-UDP
Fundamentals of crawler - basic principles of agent
Pytorch learning diary (4)
网络知识-04 网络层-IPv4协议
Network knowledge-05 transport layer TCP
随机推荐
SQL skimming summary SQL leetcode review
爬虫基础—多线程和多进程的基本原理
M FPGA implementation of chaotic digital secure communication system based on Lorenz chaotic self synchronization, Verilog programming implementation, with MATLAB chaotic program
Solve MySQL (1064) error: 1064 - you have an error in your SQL syntax;
MySql02 函数substr mod 视图view
4.IDEA的安装与使用
Review of 4705 NLP
High concurrency day01 (NiO, concurrent package)
爬虫基础—爬虫的基本原理
Ivew shuttle box transfer component highlights the operation value
Typescript (TS loader, tsconfig.json and lodash)
Solution to the conflict between security automatic login and anti CSRF attack
Review of 4246 Algorithms for Data Science
JS不使用async/await解决数据异步/同步问题
Network knowledge-03 data link layer PPPoE
M simulation of DQPSK modulation and demodulation technology based on MATLAB
Review of 4121 Computer System for Data Science
保姆级一条龙服务——自关联构造父子级关系(@JsonBackReference和@JsonManagedReference解决循环依赖)
Minecraft integration package [gtnh] gray Technology: new vision server building tutorial
MySql