2012-06-18
原因:DataNode 47系统cpu过高重启后,出现下述问题。 [zhouhh@Hadoop48 ~]$ fs -rm 1.txt rm: org.apache.hadoop.hdfs.server.namenode.SafeModeException: Cannot delete /user/zhouhh/1.txt. Name node is in safe mode. 进入了安全模式? 重启hadoop也不能解决。只好强制离开安全模式。
[zhouhh@Hadoop48 hadoop-1.0.3]$ hadoop dfsadmin -safemode leave
[zhouhh@Hadoop48 ~]$ fs -rm 1.txt
Deleted hdfs://Hadoop48:54310/user/zhouhh/1.txt
删除文件成功。
[zhouhh@Hadoop48 ~]$ hadoop fs -put 1.txt .
12/06/18 18:14:53 INFO hdfs.DFSClient: Exception in createBlockOutputStream 192.168.10.46:50010 java.io.IOException: Bad connect ack with firstBadLink as 192.168.10.47:54313
12/06/18 18:14:53 INFO hdfs.DFSClient: Abandoning block blk_6739152855033658770_1512
12/06/18 18:14:53 INFO hdfs.DFSClient: Excluding datanode 192.168.10.47:54313
到47上看,jps 进程都在。
[zhouhh@Hadoop47 ~]$ jps
4067 HQuorumPeer
3913 TaskTracker
3803 DataNode
9951 Jps
4216 HRegionServer
查看端口,也都在监听:
[zhouhh@Hadoop47 ~]$ netstat -antp |grep 543
(Not all processes could be identified, non-owned process info
will not be shown, you would have to be root to see it all.)
tcp 0 0 0.0.0.0:54313 0.0.0.0:* LISTEN 3803/java
tcp 0 0 0.0.0.0:54314 0.0.0.0:* LISTEN 3803/java
tcp 0 0 0.0.0.0:54315 0.0.0.0:* LISTEN 3803/java
tcp 0 0 192.168.10.47:48636 192.168.10.48:54310 TIME_WAIT -
tcp 0 0 192.168.10.47:48639 192.168.10.48:54310 TIME_WAIT -
tcp 0 0 192.168.10.47:59779 192.168.10.48:54311 ESTABLISHED 3913/java
tcp 0 0 192.168.10.47:55475 192.168.10.48:54310 ESTABLISHED 3803/java
重启hadoop,没有解决问题。困惑了很久,忽然想到了防火墙。 有时系统没有配好防火墙,重启后导致防火墙屏蔽了hadoop的端口。测试时可以将防火墙全部关闭。
[root@Hadoop47 ~]# iptables -L
Chain INPUT (policy ACCEPT)
target prot opt source destination
RH-Firewall-1-INPUT all -- anywhere anywhere
Chain FORWARD (policy ACCEPT)
target prot opt source destination
RH-Firewall-1-INPUT all -- anywhere anywhere
Chain OUTPUT (policy ACCEPT)
target prot opt source destination
Chain RH-Firewall-1-INPUT (2 references)
target prot opt source destination
ACCEPT all -- anywhere anywhere
ACCEPT icmp -- anywhere anywhere icmp any
ACCEPT esp -- anywhere anywhere
ACCEPT ah -- anywhere anywhere
ACCEPT udp -- anywhere 224.0.0.251 udp dpt:mdns
ACCEPT udp -- anywhere anywhere udp dpt:ipp
ACCEPT tcp -- anywhere anywhere tcp dpt:ipp
ACCEPT all -- anywhere anywhere state RELATED,ESTABLISHED
ACCEPT tcp -- anywhere anywhere state NEW tcp dpt:ssh
REJECT all -- anywhere anywhere reject-with icmp-host-prohibited
关闭防火墙:
[root@Hadoop47 ~]# service iptables stop
Flushing firewall rules: [ OK ]
Setting chains to policy ACCEPT: filter [ OK ]
Unloading iptables modules: [ OK ]
[root@Hadoop47 ~]# chkconfig iptables off
在生产系统中再配置域外的防火墙。
[zhouhh@Hadoop48 ~]$ fs -put 1.txt . 成功
如非注明转载, 均为原创. 本站遵循知识共享CC协议,转载请注明来源
FEATURED TAGS
css
vc6
http
automake
linux
make
makefile
voip
乱码
awk
flash
vista
vi
vim
javascript
pietty
putty
ssh
posix
subversion
svn
windows
删除
编译
多线程
wxwidgets
ie
ubuntu
开源
c
python
bash
备份
性能
scp
汉字
log
ruby
中文
bug
msn
nginx
php
shell
wordpress
mqueue
android
eclipse
java
mac
ios
html5
js
mysql
protobuf
apache
hadoop
install
iocp
twisted
centos
mapreduce
hbase
thrift
tutorial
hive
erlang
lucene
hdfs
sqoop
utf8
filter
草原
yarn
ganglia
恢复
scrapy
django
fsimage
flume
tail
flume-ng
mining
scala
go
kafka
gradle
cassandra
baas
spring
postgres
maven
mybatis
mongodb
https
nodejs
镜像
心理学
机器学习
Keras
theano
anaconda
docker
spark
akka-http
json
群论
区块链
加密
抽象代数
离散对数
同余
欧拉函数
扩展欧几里德算法
ES6
node-inspect
debug
win10
vscode
挖矿