quickstart.rst 15 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383
  1. .. include:: global.rst.inc
  2. .. highlight:: bash
  3. .. _quickstart:
  4. Quick Start
  5. ===========
  6. This chapter will get you started with Borg and covers
  7. various use cases.
  8. A step by step example
  9. ----------------------
  10. .. include:: quickstart_example.rst.inc
  11. Archives and repositories
  12. -------------------------
  13. A *Borg archive* is the result of a single backup (``borg create``). An archive
  14. stores a snapshot of the data of the files "inside" it. One can later extract or
  15. mount an archive to restore from a backup.
  16. *Repositories* are filesystem directories acting as self-contained stores of archives.
  17. Repositories can be accessed locally via path or remotely via ssh. Under the hood,
  18. repositories contain data blocks and a manifest tracking which blocks are in each
  19. archive. If some data hasn't changed from one backup to another, Borg can simply
  20. reference an already uploaded data chunk (deduplication).
  21. Important note about free space
  22. -------------------------------
  23. Before you start creating backups, please make sure that there is *always*
  24. a good amount of free space on the filesystem that has your backup repository
  25. (and also on ~/.cache). A few GB should suffice for most hard-drive sized
  26. repositories. See also :ref:`cache-memory-usage`.
  27. Borg doesn't use space reserved for root on repository disks (even when run as root),
  28. on file systems which do not support this mechanism (e.g. XFS) we recommend to reserve
  29. some space in Borg itself just to be safe by adjusting the ``additional_free_space``
  30. setting (a good starting point is ``2G``)::
  31. borg config /path/to/repo additional_free_space 2G
  32. If Borg runs out of disk space, it tries to free as much space as it
  33. can while aborting the current operation safely, which allows the user to free more space
  34. by deleting/pruning archives. This mechanism is not bullet-proof in some
  35. circumstances [1]_.
  36. If you *really* run out of disk space, it can be hard or impossible to free space,
  37. because Borg needs free space to operate - even to delete backup
  38. archives.
  39. You can use some monitoring process or just include the free space information
  40. in your backup log files (you check them regularly anyway, right?).
  41. Also helpful:
  42. - create a big file as a "space reserve", that you can delete to free space
  43. - if you use LVM: use a LV + a filesystem that you can resize later and have
  44. some unallocated PEs you can add to the LV.
  45. - consider using quotas
  46. - use `prune` and `compact` regularly
  47. .. [1] This failsafe can fail in these circumstances:
  48. - The underlying file system doesn't support statvfs(2), or returns incorrect
  49. data, or the repository doesn't reside on a single file system
  50. - Other tasks fill the disk simultaneously
  51. - Hard quotas (which may not be reflected in statvfs(2))
  52. Important note about permissions
  53. --------------------------------
  54. Using root likely will be required if you want to backup files of other users
  55. or the operating system. If you only back up your own files, you neither need
  56. nor want to use root.
  57. Avoid to create a mixup of users and permissions in your repository (or cache).
  58. This can easily happen if you run borg using different user accounts (e.g. your
  59. non-privileged user and root) while accessing the same repo.
  60. Of course, a non-root user will have no permission to work with the files
  61. created by root (or another user) and borg operations will just fail with
  62. `Permission denied`.
  63. The easy way to avoid this is to always access the repo as the same user:
  64. For a local repository just always invoke borg as same user.
  65. For a remote repository: always use e.g. borg@remote_host. You can use this
  66. from different local users, the remote user accessing the repo will always be
  67. borg.
  68. If you need to access a local repository from different users, you can use the
  69. same method by using ssh to borg@localhost.
  70. Important note about files changing during the backup process
  71. -------------------------------------------------------------
  72. Borg does not do anything about the internal consistency of the data
  73. it backs up. It just reads and backs up each file in whatever state
  74. that file is when Borg gets to it. On an active system, this can lead
  75. to two kinds of inconsistency:
  76. - By the time Borg backs up a file, it might have changed since the backup process was initiated
  77. - A file could change while Borg is backing it up, making the file internally inconsistent
  78. If you have a set of files and want to ensure that they are backed up
  79. in a specific or consistent state, you must take steps to prevent
  80. changes to those files during the backup process. There are a few
  81. common techniques to achieve this.
  82. - Avoid running any programs that might change the files.
  83. - Snapshot files, filesystems, container storage volumes, or logical volumes. LVM or ZFS might be useful here.
  84. - Dump databases or stop the database servers.
  85. - Shut down virtual machines before backing up their images.
  86. - Shut down containers before backing up their storage volumes.
  87. For some systems Borg might work well enough without these
  88. precautions. If you are simply backing up the files on a system that
  89. isn't very active (e.g. in a typical home directory), Borg usually
  90. works well enough without further care for consistency. Log files and
  91. caches might not be in a perfect state, but this is rarely a problem.
  92. For databases, virtual machines, and containers, there are specific
  93. techniques for backing them up that do not simply use Borg to backup
  94. the underlying filesystem. For databases, check your database
  95. documentation for techniques that will save the database state between
  96. transactions. For virtual machines, consider running the backup on
  97. the VM itself or mounting the filesystem while the VM is shut down.
  98. For Docker containers, perhaps docker's "save" command can help.
  99. Automating backups
  100. ------------------
  101. The following example script is meant to be run daily by the ``root`` user on
  102. different local machines. It backs up a machine's important files (but not the
  103. complete operating system) to a repository ``~/backup/main`` on a remote server.
  104. Some files which aren't necessarily needed in this backup are excluded. See
  105. :ref:`borg_patterns` on how to add more exclude options.
  106. After the backup this script also uses the :ref:`borg_prune` subcommand to keep
  107. only a certain number of old archives and deletes the others.
  108. Finally, it uses the :ref:`borg_compact` subcommand to remove deleted objects
  109. from the segment files in the repository to preserve disk space.
  110. Before running, make sure that the repository is initialized as documented in
  111. :ref:`remote_repos` and that the script has the correct permissions to be executable
  112. by the root user, but not executable or readable by anyone else, i.e. root:root 0700.
  113. You can use this script as a starting point and modify it where it's necessary to fit
  114. your setup.
  115. Do not forget to test your created backups to make sure everything you need is being
  116. backed up and that the ``prune`` command is keeping and deleting the correct backups.
  117. ::
  118. #!/bin/sh
  119. # Setting this, so the repo does not need to be given on the commandline:
  120. export BORG_REPO=ssh://username@example.com:2022/~/backup/main
  121. # Setting this, so you won't be asked for your repository passphrase:
  122. export BORG_PASSPHRASE='XYZl0ngandsecurepa_55_phrasea&&123'
  123. # or this to ask an external program to supply the passphrase:
  124. export BORG_PASSCOMMAND='pass show backup'
  125. # some helpers and error handling:
  126. info() { printf "\n%s %s\n\n" "$( date )" "$*" >&2; }
  127. trap 'echo $( date ) Backup interrupted >&2; exit 2' INT TERM
  128. info "Starting backup"
  129. # Backup the most important directories into an archive named after
  130. # the machine this script is currently running on:
  131. borg create \
  132. --verbose \
  133. --filter AME \
  134. --list \
  135. --stats \
  136. --show-rc \
  137. --compression lz4 \
  138. --exclude-caches \
  139. --exclude '/home/*/.cache/*' \
  140. --exclude '/var/cache/*' \
  141. --exclude '/var/tmp/*' \
  142. \
  143. ::'{hostname}-{now}' \
  144. /etc \
  145. /home \
  146. /root \
  147. /var \
  148. backup_exit=$?
  149. info "Pruning repository"
  150. # Use the `prune` subcommand to maintain 7 daily, 4 weekly and 6 monthly
  151. # archives of THIS machine. The '{hostname}-' prefix is very important to
  152. # limit prune's operation to this machine's archives and not apply to
  153. # other machines' archives also:
  154. borg prune \
  155. --list \
  156. --prefix '{hostname}-' \
  157. --show-rc \
  158. --keep-daily 7 \
  159. --keep-weekly 4 \
  160. --keep-monthly 6 \
  161. prune_exit=$?
  162. # actually free repo disk space by compacting segments
  163. borg compact
  164. compact_exit=$?
  165. # use highest exit code as global exit code
  166. global_exit=$(( backup_exit > prune_exit ? backup_exit : prune_exit ))
  167. global_exit=$(( compact_exit > global_exit ? compact_exit : global_exit ))
  168. if [ ${global_exit} -eq 1 ];
  169. then
  170. info "Backup, Prune and/or Compact finished with a warning"
  171. fi
  172. if [ ${global_exit} -gt 1 ];
  173. then
  174. info "Backup, Prune and/or Compact finished with an error"
  175. fi
  176. exit ${global_exit}
  177. Pitfalls with shell variables and environment variables
  178. -------------------------------------------------------
  179. This applies to all environment variables you want Borg to see, not just
  180. ``BORG_PASSPHRASE``. The short explanation is: always ``export`` your variable,
  181. and use single quotes if you're unsure of the details of your shell's expansion
  182. behavior. E.g.::
  183. export BORG_PASSPHRASE='complicated & long'
  184. This is because ``export`` exposes variables to subprocesses, which Borg may be
  185. one of. More on ``export`` can be found in the "ENVIRONMENT" section of the
  186. bash(1) man page.
  187. Beware of how ``sudo`` interacts with environment variables. For example, you
  188. may be surprised that the following ``export`` has no effect on your command::
  189. export BORG_PASSPHRASE='complicated & long'
  190. sudo ./yourborgwrapper.sh # still prompts for password
  191. For more information, refer to the sudo(8) man page and ``env_keep`` in
  192. the sudoers(5) man page.
  193. .. Tip::
  194. To debug what your borg process is actually seeing, find its PID
  195. (``ps aux|grep borg``) and then look into ``/proc/<PID>/environ``.
  196. .. backup_compression:
  197. Backup compression
  198. ------------------
  199. The default is lz4 (very fast, but low compression ratio), but other methods are
  200. supported for different situations.
  201. You can use zstd for a wide range from high speed (and relatively low
  202. compression) using N=1 to high compression (and lower speed) using N=22.
  203. zstd is a modern compression algorithm and might be preferable over zlib and
  204. lzma, except if you need compatibility to older borg versions (< 1.1.4) that
  205. did not yet offer zstd.
  206. $ borg create --compression zstd,N /path/to/repo::arch ~
  207. Other options are:
  208. If you have a fast repo storage and you want minimum CPU usage, no compression::
  209. $ borg create --compression none /path/to/repo::arch ~
  210. If you have a less fast repo storage and you want a bit more compression (N=0..9,
  211. 0 means no compression, 9 means high compression): ::
  212. $ borg create --compression zlib,N /path/to/repo::arch ~
  213. If you have a very slow repo storage and you want high compression (N=0..9, 0 means
  214. low compression, 9 means high compression): ::
  215. $ borg create --compression lzma,N /path/to/repo::arch ~
  216. You'll need to experiment a bit to find the best compression for your use case.
  217. Keep an eye on CPU load and throughput.
  218. .. _encrypted_repos:
  219. Repository encryption
  220. ---------------------
  221. You can choose the repository encryption mode at repository creation time::
  222. $ borg init --encryption=MODE PATH
  223. For a list of available encryption MODEs and their descriptions, please refer
  224. to :ref:`borg_init`.
  225. If you use encryption, all data is encrypted on the client before being written
  226. to the repository.
  227. This means that an attacker who manages to compromise the host containing an
  228. encrypted repository will not be able to access any of the data, even while the
  229. backup is being made.
  230. Key material is stored in encrypted form and can be only decrypted by providing
  231. the correct passphrase.
  232. For automated backups the passphrase can be specified using the
  233. `BORG_PASSPHRASE` environment variable.
  234. .. note:: Be careful about how you set that environment, see
  235. :ref:`this note about password environments <password_env>`
  236. for more information.
  237. .. warning:: The repository data is totally inaccessible without the key
  238. and the key passphrase.
  239. Make a backup copy of the key file (``keyfile`` mode) or repo config
  240. file (``repokey`` mode) and keep it at a safe place, so you still have
  241. the key in case it gets corrupted or lost. Also keep your passphrase
  242. at a safe place.
  243. You can make backups using :ref:`borg_key_export` subcommand.
  244. If you want to print a backup of your key to paper use the ``--paper``
  245. option of this command and print the result, or print this `template`_
  246. if you need a version with QR-Code.
  247. A backup inside of the backup that is encrypted with that key/passphrase
  248. won't help you with that, of course.
  249. .. _template: paperkey.html
  250. .. _remote_repos:
  251. Remote repositories
  252. -------------------
  253. Borg can initialize and access repositories on remote hosts if the
  254. host is accessible using SSH. This is fastest and easiest when Borg
  255. is installed on the remote host, in which case the following syntax is used::
  256. $ borg init user@hostname:/path/to/repo
  257. Note: please see the usage chapter for a full documentation of repo URLs.
  258. Remote operations over SSH can be automated with SSH keys. You can restrict the
  259. use of the SSH keypair by prepending a forced command to the SSH public key in
  260. the remote server's `authorized_keys` file. This example will start Borg
  261. in server mode and limit it to a specific filesystem path::
  262. command="borg serve --restrict-to-path /path/to/repo",restrict ssh-rsa AAAAB3[...]
  263. If it is not possible to install Borg on the remote host,
  264. it is still possible to use the remote host to store a repository by
  265. mounting the remote filesystem, for example, using sshfs::
  266. $ sshfs user@hostname:/path/to /path/to
  267. $ borg init /path/to/repo
  268. $ fusermount -u /path/to
  269. You can also use other remote filesystems in a similar way. Just be careful,
  270. not all filesystems out there are really stable and working good enough to
  271. be acceptable for backup usage.