aiurtech/hiveSimplifies building and testing applications using Spark 2.3+. This cluster setup focuses primarily on Spark with Hive integration.
The cluster is integrated in such a way that it correctly handles all dependencies and it's expected to work correctly out of the box.
The main benefits of this small cluster is that it's easy to configure to run integration tests with YARN cluster support on your own machine.
| Virgo cluster | Hadoop | Spark | Hive | Postgres | Livy |
|---|---|---|---|---|---|
| 0.8.2 | 2.7.7 | 2.3.0 | 1.2.2 | 11 | Moved |
| 0.7.5 | 2.7.7 | 2.3.0 | 1.2.2 | 9.5 | Moved |
| 0.7.0 | 2.7.7 | 2.3.0 | 1.2.2 | 9.5 | 0.4 |
| 0.6.2 | 2.7.7 | 2.2.3 | 1.2.2 | 9.5 | 0.4 |
| 0.5.7 | 2.7.7 | 2.2.3 | 1.2.2 | 9.5 |
To use, clone this repo, and use any of two forms:
bashdocker-compose up -d
or just Docker:
bash./run-cluster.sh
To stop the cluster:
bashdocker-compose down
or Just docker
bash./stop-cluster.sh
The folder virgo-client contains several useful clients to test the cluster:
Advantages:
Disadvantages
Whilst there are several commercial full distributions which offer a fully managed hadoop cluster, including Spark, they bundle at least another 30 components, several of which are out of date or not relevant in many workflows:
This project started as an attempt to use the images kindly provided by the Big Data Europe 2020 Project. However, we've found the images not suitable since they did not integrate Spark with Hive correctly. Furthermore, those images are no longer supported.
Please contact Aiur Tech [cto @ aiur.co.uk]
The Virgo Cluster is a "neighbouring" star cluster. It has some beatiful members:
Interestingly enough, soon after this project was created, the first ever picture of a black hole emerged, which was no other than M87 :smiley:




manifest unknown 错误
TLS 证书验证失败
DNS 解析超时
410 错误:版本过低
402 错误:流量耗尽
身份认证失败错误
429 限流错误
凭证保存错误
来自真实用户的反馈,见证轩辕镜像的优质服务