以下是在Ubuntu上對Hadoop數據進行備份的幾種方法:
hdfs dfs -mkdir /source/path
和hdfs dfs -mkdir /destination/path
,然后執行hadoop distcp hdfs:///source/path hdfs:///destination/path
進行復制。hdfs dfsadmin -allowSnapshot /path/to/directory
,再創建快照hdfs dfs -createSnapshot /path/to/directory snapshotName
,恢復時用hdfs dfs -revertSnapshot /path/to/directory snapshotName
。sudo ambari - backup backup --target - dir=/backup/location
進行備份,恢復時用sudo ambari - backup restore --target - dir=/backup/location
。#!/bin/bash
,內容為rsync -avz --delete /source/path /backup/path
,再通過crontab -e
設置定時任務,如0 2 * * * /path/to/script.sh
實現定時備份。