usage_general.rst.inc 15 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344
  1. Repository URLs
  2. ~~~~~~~~~~~~~~~
  3. **Local filesystem** (or locally mounted network filesystem):
  4. ``/path/to/repo`` - filesystem path to repo directory, absolute path
  5. ``path/to/repo`` - filesystem path to repo directory, relative path
  6. Also, stuff like ``~/path/to/repo`` or ``~other/path/to/repo`` works (this is
  7. expanded by your shell).
  8. Note: you may also prepend a ``file://`` to a filesystem path to get URL style.
  9. **Remote repositories** accessed via ssh user@host:
  10. ``user@host:/path/to/repo`` - remote repo, absolute path
  11. ``ssh://user@host:port/path/to/repo`` - same, alternative syntax, port can be given
  12. **Remote repositories with relative pathes** can be given using this syntax:
  13. ``user@host:path/to/repo`` - path relative to current directory
  14. ``user@host:~/path/to/repo`` - path relative to user's home directory
  15. ``user@host:~other/path/to/repo`` - path relative to other's home directory
  16. Note: giving ``user@host:/./path/to/repo`` or ``user@host:/~/path/to/repo`` or
  17. ``user@host:/~other/path/to/repo`` is also supported, but not required here.
  18. **Remote repositories with relative pathes, alternative syntax with port**:
  19. ``ssh://user@host:port/./path/to/repo`` - path relative to current directory
  20. ``ssh://user@host:port/~/path/to/repo`` - path relative to user's home directory
  21. ``ssh://user@host:port/~other/path/to/repo`` - path relative to other's home directory
  22. If you frequently need the same repo URL, it is a good idea to set the
  23. ``BORG_REPO`` environment variable to set a default for the repo URL:
  24. ::
  25. export BORG_REPO='ssh://user@host:port/path/to/repo'
  26. Then just leave away the repo URL if only a repo URL is needed and you want
  27. to use the default - it will be read from BORG_REPO then.
  28. Use ``::`` syntax to give the repo URL when syntax requires giving a positional
  29. argument for the repo (e.g. ``borg mount :: /mnt``).
  30. Repository / Archive Locations
  31. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  32. Many commands want either a repository (just give the repo URL, see above) or
  33. an archive location, which is a repo URL followed by ``::archive_name``.
  34. Archive names must not contain the ``/`` (slash) character. For simplicity,
  35. maybe also avoid blanks or other characters that have special meaning on the
  36. shell or in a filesystem (borg mount will use the archive name as directory
  37. name).
  38. If you have set BORG_REPO (see above) and an archive location is needed, use
  39. ``::archive_name`` - the repo URL part is then read from BORG_REPO.
  40. Type of log output
  41. ~~~~~~~~~~~~~~~~~~
  42. The log level of the builtin logging configuration defaults to WARNING.
  43. This is because we want Borg to be mostly silent and only output
  44. warnings, errors and critical messages, unless output has been requested
  45. by supplying an option that implies output (eg, --list or --progress).
  46. Log levels: DEBUG < INFO < WARNING < ERROR < CRITICAL
  47. Use ``--debug`` to set DEBUG log level -
  48. to get debug, info, warning, error and critical level output.
  49. Use ``--info`` (or ``-v`` or ``--verbose``) to set INFO log level -
  50. to get info, warning, error and critical level output.
  51. Use ``--warning`` (default) to set WARNING log level -
  52. to get warning, error and critical level output.
  53. Use ``--error`` to set ERROR log level -
  54. to get error and critical level output.
  55. Use ``--critical`` to set CRITICAL log level -
  56. to get critical level output.
  57. While you can set misc. log levels, do not expect that every command will
  58. give different output on different log levels - it's just a possibility.
  59. .. warning:: Options --critical and --error are provided for completeness,
  60. their usage is not recommended as you might miss important information.
  61. Return codes
  62. ~~~~~~~~~~~~
  63. Borg can exit with the following return codes (rc):
  64. ::
  65. 0 = success (logged as INFO)
  66. 1 = warning (operation reached its normal end, but there were warnings -
  67. you should check the log, logged as WARNING)
  68. 2 = error (like a fatal error, a local or remote exception, the operation
  69. did not reach its normal end, logged as ERROR)
  70. 128+N = killed by signal N (e.g. 137 == kill -9)
  71. If you use ``--show-rc``, the return code is also logged at the indicated
  72. level as the last log entry.
  73. .. _env_vars:
  74. Environment Variables
  75. ~~~~~~~~~~~~~~~~~~~~~
  76. Borg uses some environment variables for automation:
  77. General:
  78. BORG_REPO
  79. When set, use the value to give the default repository location. If a command needs an archive
  80. parameter, you can abbreviate as `::archive`. If a command needs a repository parameter, you
  81. can either leave it away or abbreviate as `::`, if a positional parameter is required.
  82. BORG_PASSPHRASE
  83. When set, use the value to answer the passphrase question for encrypted repositories.
  84. It is used when a passphrase is needed to access an encrypted repo as well as when a new
  85. passphrase should be initially set when initializing an encrypted repo.
  86. See also BORG_NEW_PASSPHRASE.
  87. BORG_PASSCOMMAND
  88. When set, use the standard output of the command (trailing newlines are stripped) to answer the
  89. passphrase question for encrypted repositories.
  90. It is used when a passphrase is needed to access an encrypted repo as well as when a new
  91. passphrase should be initially set when initializing an encrypted repo.
  92. If BORG_PASSPHRASE is also set, it takes precedence.
  93. See also BORG_NEW_PASSPHRASE.
  94. BORG_NEW_PASSPHRASE
  95. When set, use the value to answer the passphrase question when a **new** passphrase is asked for.
  96. This variable is checked first. If it is not set, BORG_PASSPHRASE and BORG_PASSCOMMAND will also
  97. be checked.
  98. Main usecase for this is to fully automate ``borg change-passphrase``.
  99. BORG_DISPLAY_PASSPHRASE
  100. When set, use the value to answer the "display the passphrase for verification" question when defining a new passphrase for encrypted repositories.
  101. BORG_HOSTNAME_IS_UNIQUE=no
  102. Borg assumes that it can derive a unique hostname / identity (see ``borg debug info``).
  103. If this is not the case or you do not want Borg to automatically remove stale locks,
  104. set this to *no*.
  105. BORG_LOGGING_CONF
  106. When set, use the given filename as INI_-style logging configuration.
  107. BORG_RSH
  108. When set, use this command instead of ``ssh``. This can be used to specify ssh options, such as
  109. a custom identity file ``ssh -i /path/to/private/key``. See ``man ssh`` for other options.
  110. BORG_REMOTE_PATH
  111. When set, use the given path as borg executable on the remote (defaults to "borg" if unset).
  112. Using ``--remote-path PATH`` commandline option overrides the environment variable.
  113. BORG_FILES_CACHE_TTL
  114. When set to a numeric value, this determines the maximum "time to live" for the files cache
  115. entries (default: 20). The files cache is used to quickly determine whether a file is unchanged.
  116. The FAQ explains this more detailed in: :ref:`always_chunking`
  117. TMPDIR
  118. where temporary files are stored (might need a lot of temporary space for some operations)
  119. Some automatic "answerers" (if set, they automatically answer confirmation questions):
  120. BORG_UNKNOWN_UNENCRYPTED_REPO_ACCESS_IS_OK=no (or =yes)
  121. For "Warning: Attempting to access a previously unknown unencrypted repository"
  122. BORG_RELOCATED_REPO_ACCESS_IS_OK=no (or =yes)
  123. For "Warning: The repository at location ... was previously located at ..."
  124. BORG_CHECK_I_KNOW_WHAT_I_AM_DOING=NO (or =YES)
  125. For "Warning: 'check --repair' is an experimental feature that might result in data loss."
  126. BORG_DELETE_I_KNOW_WHAT_I_AM_DOING=NO (or =YES)
  127. For "You requested to completely DELETE the repository *including* all archives it contains:"
  128. BORG_RECREATE_I_KNOW_WHAT_I_AM_DOING=NO (or =YES)
  129. For "recreate is an experimental feature."
  130. Note: answers are case sensitive. setting an invalid answer value might either give the default
  131. answer or ask you interactively, depending on whether retries are allowed (they by default are
  132. allowed). So please test your scripts interactively before making them a non-interactive script.
  133. Directories and files:
  134. BORG_KEYS_DIR
  135. Default to '~/.config/borg/keys'. This directory contains keys for encrypted repositories.
  136. BORG_KEY_FILE
  137. When set, use the given filename as repository key file.
  138. BORG_SECURITY_DIR
  139. Default to '~/.config/borg/security'. This directory contains information borg uses to
  140. track its usage of NONCES ("numbers used once" - usually in encryption context) and other
  141. security relevant data.
  142. BORG_CACHE_DIR
  143. Default to '~/.cache/borg'. This directory contains the local cache and might need a lot
  144. of space for dealing with big repositories).
  145. Building:
  146. BORG_OPENSSL_PREFIX
  147. Adds given OpenSSL header file directory to the default locations (setup.py).
  148. BORG_LZ4_PREFIX
  149. Adds given LZ4 header file directory to the default locations (setup.py).
  150. BORG_LIBB2_PREFIX
  151. Adds given prefix directory to the default locations. If a 'include/blake2.h' is found Borg
  152. will be linked against the system libb2 instead of a bundled implementation. (setup.py)
  153. Please note:
  154. - be very careful when using the "yes" sayers, the warnings with prompt exist for your / your data's security/safety
  155. - also be very careful when putting your passphrase into a script, make sure it has appropriate file permissions
  156. (e.g. mode 600, root:root).
  157. .. _INI: https://docs.python.org/3.4/library/logging.config.html#configuration-file-format
  158. .. _file-systems:
  159. File systems
  160. ~~~~~~~~~~~~
  161. We strongly recommend against using Borg (or any other database-like
  162. software) on non-journaling file systems like FAT, since it is not
  163. possible to assume any consistency in case of power failures (or a
  164. sudden disconnect of an external drive or similar failures).
  165. While Borg uses a data store that is resilient against these failures
  166. when used on journaling file systems, it is not possible to guarantee
  167. this with some hardware -- independent of the software used. We don't
  168. know a list of affected hardware.
  169. If you are suspicious whether your Borg repository is still consistent
  170. and readable after one of the failures mentioned above occured, run
  171. ``borg check --verify-data`` to make sure it is consistent.
  172. Units
  173. ~~~~~
  174. To display quantities, Borg takes care of respecting the
  175. usual conventions of scale. Disk sizes are displayed in `decimal
  176. <https://en.wikipedia.org/wiki/Decimal>`_, using powers of ten (so
  177. ``kB`` means 1000 bytes). For memory usage, `binary prefixes
  178. <https://en.wikipedia.org/wiki/Binary_prefix>`_ are used, and are
  179. indicated using the `IEC binary prefixes
  180. <https://en.wikipedia.org/wiki/IEC_80000-13#Prefixes_for_binary_multiples>`_,
  181. using powers of two (so ``KiB`` means 1024 bytes).
  182. Date and Time
  183. ~~~~~~~~~~~~~
  184. We format date and time conforming to ISO-8601, that is: YYYY-MM-DD and
  185. HH:MM:SS (24h clock).
  186. For more information about that, see: https://xkcd.com/1179/
  187. Unless otherwise noted, we display local date and time.
  188. Internally, we store and process date and time as UTC.
  189. Resource Usage
  190. ~~~~~~~~~~~~~~
  191. Borg might use a lot of resources depending on the size of the data set it is dealing with.
  192. If one uses Borg in a client/server way (with a ssh: repository),
  193. the resource usage occurs in part on the client and in another part on the
  194. server.
  195. If one uses Borg as a single process (with a filesystem repo),
  196. all the resource usage occurs in that one process, so just add up client +
  197. server to get the approximate resource usage.
  198. CPU client:
  199. borg create: does chunking, hashing, compression, crypto (high CPU usage)
  200. chunks cache sync: quite heavy on CPU, doing lots of hashtable operations.
  201. borg extract: crypto, decompression (medium to high CPU usage)
  202. borg check: similar to extract, but depends on options given.
  203. borg prune / borg delete archive: low to medium CPU usage
  204. borg delete repo: done on the server
  205. It won't go beyond 100% of 1 core as the code is currently single-threaded.
  206. Especially higher zlib and lzma compression levels use significant amounts
  207. of CPU cycles. Crypto might be cheap on the CPU (if hardware accelerated) or
  208. expensive (if not).
  209. CPU server:
  210. It usually doesn't need much CPU, it just deals with the key/value store
  211. (repository) and uses the repository index for that.
  212. borg check: the repository check computes the checksums of all chunks
  213. (medium CPU usage)
  214. borg delete repo: low CPU usage
  215. CPU (only for client/server operation):
  216. When using borg in a client/server way with a ssh:-type repo, the ssh
  217. processes used for the transport layer will need some CPU on the client and
  218. on the server due to the crypto they are doing - esp. if you are pumping
  219. big amounts of data.
  220. Memory (RAM) client:
  221. The chunks index and the files index are read into memory for performance
  222. reasons. Might need big amounts of memory (see below).
  223. Compression, esp. lzma compression with high levels might need substantial
  224. amounts of memory.
  225. Memory (RAM) server:
  226. The server process will load the repository index into memory. Might need
  227. considerable amounts of memory, but less than on the client (see below).
  228. Chunks index (client only):
  229. Proportional to the amount of data chunks in your repo. Lots of chunks
  230. in your repo imply a big chunks index.
  231. It is possible to tweak the chunker params (see create options).
  232. Files index (client only):
  233. Proportional to the amount of files in your last backups. Can be switched
  234. off (see create options), but next backup might be much slower if you do.
  235. The speed benefit of using the files cache is proportional to file size.
  236. Repository index (server only):
  237. Proportional to the amount of data chunks in your repo. Lots of chunks
  238. in your repo imply a big repository index.
  239. It is possible to tweak the chunker params (see create options) to
  240. influence the amount of chunks being created.
  241. Temporary files (client):
  242. Reading data and metadata from a FUSE mounted repository will consume up to
  243. the size of all deduplicated, small chunks in the repository. Big chunks
  244. won't be locally cached.
  245. Temporary files (server):
  246. None.
  247. Cache files (client only):
  248. Contains the chunks index and files index (plus a collection of single-
  249. archive chunk indexes which might need huge amounts of disk space,
  250. depending on archive count and size - see FAQ about how to reduce).
  251. Network (only for client/server operation):
  252. If your repository is remote, all deduplicated (and optionally compressed/
  253. encrypted) data of course has to go over the connection (ssh: repo url).
  254. If you use a locally mounted network filesystem, additionally some copy
  255. operations used for transaction support also go over the connection. If
  256. you backup multiple sources to one target repository, additional traffic
  257. happens for cache resynchronization.