env_hyperv
与虚拟机完全通过xshell/moba等工具交互
- Linux mint
- jdk8
- hdfs3.3.4
- spark3.1.3
- thrift 0.20 不需要任何header
环境变量可选写入/etc/profile或者~/.profile
hadoop伪分布配置
https://hadoop.apache.org/docs/r3.3.0/hadoop-project-dist/hadoop-common/SingleCluster.html
etc/hadoop/core-site.xml
:
1 | <configuration> |
etc/hadoop/hdfs-site.xml
:
1 | <configuration> |
hadoop-env.sh
1 | export JAVA_HOME=/opt/soft/jdk |
start
/opt/soft/hadoop/sbin/start-dfs.sh
/opt/soft/hadoop/sbin/start-yarn.sh
50070 -> 9870
spark
使用derby无需配置,固定启动路径即可
carbon
spark-shell –jars apache-carbondata-2.3.0-bin-spark3.1.1-hadoop2.7.2.jar
spark-sql –conf spark.sql.extensions=org.apache.spark.sql.CarbonExtensions –jars apache-carbondata-2.3.0-bin-spark3.1.1-hadoop2.7.2.jar
spark-submit
–class org.apache.carbondata.spark.thriftserver.CarbonThriftServer
–num-executors 3
–driver-memory 20G
–executor-memory 250G
–executor-cores 32
apache-carbondata-2.3.0-bin-spark3.1.1-hadoop2.7.2.jar
/beeline -u jdbc:hive2://
测试表
1 | cd carbondata |
数据存储路径:/opt/soft/spark-warehouse