源码见:https://github.com/hiszm/hadoop-train
http://hadoop.apache.org/
The Apache? Hadoop? project develops open-source software for reliable, scalable, distributed computing.
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.
Modules
翻译翻译
分布式文件系统:HDFS用于将文件分布式存储载很多的服务器上
分布式计算框架:MapReduce实现在很多机器分布式并行计算
分布式资源调度框架:YARN实现集群资源管理以及作业的调度
起源
特点
起源
特点
特点
Apache社区版本
第三方发行版本(如CDH,HDP,MapR等)
//切换到root
$ sudo -i
# cd /etc/sysconfig/network-scripts/
# ls
//删除
# rm -f ifcfg-lo
csPING baidu.com (220.181.38.148) 56(84) bytes of data.
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=1 ttl=46 time=42.7 ms
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=2 ttl=46 time=42.0 ms
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=3 ttl=46 time=45.0 ms
64 bytes from 220.181.38.148 (220.181.38.148): icmp_seq=4 ttl=46 time=44.4 ms