2
0

usage.rst 31 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678679680681682683684685686687688689690691692693694695696697698699700701702703704705706707708709710711712713714715716717718719720721722723724725726727728729730731732733734735736737738739740741742743744745746747748749750751752753754755756757758759760761762763764765766767768769770771772773774775776777778779780781782783784785786787788789790791792
  1. .. include:: global.rst.inc
  2. .. _detailed_usage:
  3. Usage
  4. =====
  5. |project_name| consists of a number of commands. Each command accepts
  6. a number of arguments and options. The following sections will describe each
  7. command in detail.
  8. General
  9. -------
  10. Type of log output
  11. ~~~~~~~~~~~~~~~~~~
  12. The log level of the builtin logging configuration defaults to WARNING.
  13. This is because we want |project_name| to be mostly silent and only output
  14. warnings, errors and critical messages.
  15. Log levels: DEBUG < INFO < WARNING < ERROR < CRITICAL
  16. Use ``--debug`` to set DEBUG log level -
  17. to get debug, info, warning, error and critical level output.
  18. Use ``--info`` (or ``-v`` or ``--verbose``) to set INFO log level -
  19. to get info, warning, error and critical level output.
  20. Use ``--warning`` (default) to set WARNING log level -
  21. to get warning, error and critical level output.
  22. Use ``--error`` to set ERROR log level -
  23. to get error and critical level output.
  24. Use ``--critical`` to set CRITICAL log level -
  25. to get critical level output.
  26. While you can set misc. log levels, do not expect that every command will
  27. give different output on different log levels - it's just a possibility.
  28. .. warning:: Options --critical and --error are provided for completeness,
  29. their usage is not recommended as you might miss important information.
  30. .. warning:: While some options (like ``--stats`` or ``--list``) will emit more
  31. informational messages, you have to use INFO (or lower) log level to make
  32. them show up in log output. Use ``-v`` or a logging configuration.
  33. Return codes
  34. ~~~~~~~~~~~~
  35. |project_name| can exit with the following return codes (rc):
  36. ::
  37. 0 = success (logged as INFO)
  38. 1 = warning (operation reached its normal end, but there were warnings -
  39. you should check the log, logged as WARNING)
  40. 2 = error (like a fatal error, a local or remote exception, the operation
  41. did not reach its normal end, logged as ERROR)
  42. 128+N = killed by signal N (e.g. 137 == kill -9)
  43. If you use ``--show-rc``, the return code is also logged at the indicated
  44. level as the last log entry.
  45. Environment Variables
  46. ~~~~~~~~~~~~~~~~~~~~~
  47. |project_name| uses some environment variables for automation:
  48. General:
  49. BORG_REPO
  50. When set, use the value to give the default repository location. If a command needs an archive
  51. parameter, you can abbreviate as `::archive`. If a command needs a repository parameter, you
  52. can either leave it away or abbreviate as `::`, if a positional parameter is required.
  53. BORG_PASSPHRASE
  54. When set, use the value to answer the passphrase question for encrypted repositories.
  55. BORG_DISPLAY_PASSPHRASE
  56. When set, use the value to answer the "display the passphrase for verification" question when defining a new passphrase for encrypted repositories.
  57. BORG_LOGGING_CONF
  58. When set, use the given filename as INI_-style logging configuration.
  59. BORG_RSH
  60. When set, use this command instead of ``ssh``. This can be used to specify ssh options, such as
  61. a custom identity file ``ssh -i /path/to/private/key``. See ``man ssh`` for other options.
  62. TMPDIR
  63. where temporary files are stored (might need a lot of temporary space for some operations)
  64. Some automatic "answerers" (if set, they automatically answer confirmation questions):
  65. BORG_UNKNOWN_UNENCRYPTED_REPO_ACCESS_IS_OK=no (or =yes)
  66. For "Warning: Attempting to access a previously unknown unencrypted repository"
  67. BORG_RELOCATED_REPO_ACCESS_IS_OK=no (or =yes)
  68. For "Warning: The repository at location ... was previously located at ..."
  69. BORG_CHECK_I_KNOW_WHAT_I_AM_DOING=NO (or =YES)
  70. For "Warning: 'check --repair' is an experimental feature that might result in data loss."
  71. BORG_DELETE_I_KNOW_WHAT_I_AM_DOING=NO (or =YES)
  72. For "You requested to completely DELETE the repository *including* all archives it contains:"
  73. Note: answers are case sensitive. setting an invalid answer value might either give the default
  74. answer or ask you interactively, depending on whether retries are allowed (they by default are
  75. allowed). So please test your scripts interactively before making them a non-interactive script.
  76. Directories:
  77. BORG_KEYS_DIR
  78. Default to '~/.config/borg/keys'. This directory contains keys for encrypted repositories.
  79. BORG_CACHE_DIR
  80. Default to '~/.cache/borg'. This directory contains the local cache and might need a lot
  81. of space for dealing with big repositories).
  82. Building:
  83. BORG_OPENSSL_PREFIX
  84. Adds given OpenSSL header file directory to the default locations (setup.py).
  85. BORG_LZ4_PREFIX
  86. Adds given LZ4 header file directory to the default locations (setup.py).
  87. Please note:
  88. - be very careful when using the "yes" sayers, the warnings with prompt exist for your / your data's security/safety
  89. - also be very careful when putting your passphrase into a script, make sure it has appropriate file permissions
  90. (e.g. mode 600, root:root).
  91. .. _INI: https://docs.python.org/3.4/library/logging.config.html#configuration-file-format
  92. Resource Usage
  93. ~~~~~~~~~~~~~~
  94. |project_name| might use a lot of resources depending on the size of the data set it is dealing with.
  95. CPU:
  96. It won't go beyond 100% of 1 core as the code is currently single-threaded.
  97. Especially higher zlib and lzma compression levels use significant amounts
  98. of CPU cycles.
  99. Memory (RAM):
  100. The chunks index and the files index are read into memory for performance
  101. reasons.
  102. Compression, esp. lzma compression with high levels might need substantial
  103. amounts of memory.
  104. Temporary files:
  105. Reading data and metadata from a FUSE mounted repository will consume about
  106. the same space as the deduplicated chunks used to represent them in the
  107. repository.
  108. Cache files:
  109. Contains the chunks index and files index (plus a compressed collection of
  110. single-archive chunk indexes).
  111. Chunks index:
  112. Proportional to the amount of data chunks in your repo. Lots of chunks
  113. in your repo imply a big chunks index.
  114. It is possible to tweak the chunker params (see create options).
  115. Files index:
  116. Proportional to the amount of files in your last backup. Can be switched
  117. off (see create options), but next backup will be much slower if you do.
  118. Network:
  119. If your repository is remote, all deduplicated (and optionally compressed/
  120. encrypted) data of course has to go over the connection (ssh: repo url).
  121. If you use a locally mounted network filesystem, additionally some copy
  122. operations used for transaction support also go over the connection. If
  123. you backup multiple sources to one target repository, additional traffic
  124. happens for cache resynchronization.
  125. In case you are interested in more details, please read the internals documentation.
  126. Units
  127. ~~~~~
  128. To display quantities, |project_name| takes care of respecting the
  129. usual conventions of scale. Disk sizes are displayed in `decimal
  130. <https://en.wikipedia.org/wiki/Decimal>`_, using powers of ten (so
  131. ``kB`` means 1000 bytes). For memory usage, `binary prefixes
  132. <https://en.wikipedia.org/wiki/Binary_prefix>`_ are used, and are
  133. indicated using the `IEC binary prefixes
  134. <https://en.wikipedia.org/wiki/IEC_80000-13#Prefixes_for_binary_multiples>`_,
  135. using powers of two (so ``KiB`` means 1024 bytes).
  136. Date and Time
  137. ~~~~~~~~~~~~~
  138. We format date and time conforming to ISO-8601, that is: YYYY-MM-DD and
  139. HH:MM:SS (24h clock).
  140. For more information about that, see: https://xkcd.com/1179/
  141. Unless otherwise noted, we display local date and time.
  142. Internally, we store and process date and time as UTC.
  143. .. include:: usage/init.rst.inc
  144. Examples
  145. ~~~~~~~~
  146. ::
  147. # Local repository (default is to use encryption in repokey mode)
  148. $ borg init /path/to/repo
  149. # Local repository (no encryption)
  150. $ borg init --encryption=none /path/to/repo
  151. # Remote repository (accesses a remote borg via ssh)
  152. $ borg init user@hostname:backup
  153. # Remote repository (store the key your home dir)
  154. $ borg init --encryption=keyfile user@hostname:backup
  155. Important notes about encryption:
  156. It is not recommended to disable encryption. Repository encryption protects you
  157. e.g. against the case that an attacker has access to your backup repository.
  158. But be careful with the key / the passphrase:
  159. If you want "passphrase-only" security, use the ``repokey`` mode. The key will
  160. be stored inside the repository (in its "config" file). In above mentioned
  161. attack scenario, the attacker will have the key (but not the passphrase).
  162. If you want "passphrase and having-the-key" security, use the ``keyfile`` mode.
  163. The key will be stored in your home directory (in ``.config/borg/keys``). In
  164. the attack scenario, the attacker who has just access to your repo won't have
  165. the key (and also not the passphrase).
  166. Make a backup copy of the key file (``keyfile`` mode) or repo config file
  167. (``repokey`` mode) and keep it at a safe place, so you still have the key in
  168. case it gets corrupted or lost. Also keep the passphrase at a safe place.
  169. The backup that is encrypted with that key won't help you with that, of course.
  170. Make sure you use a good passphrase. Not too short, not too simple. The real
  171. encryption / decryption key is encrypted with / locked by your passphrase.
  172. If an attacker gets your key, he can't unlock and use it without knowing the
  173. passphrase.
  174. Be careful with special or non-ascii characters in your passphrase:
  175. - |project_name| processes the passphrase as unicode (and encodes it as utf-8),
  176. so it does not have problems dealing with even the strangest characters.
  177. - BUT: that does not necessarily apply to your OS / VM / keyboard configuration.
  178. So better use a long passphrase made from simple ascii chars than one that
  179. includes non-ascii stuff or characters that are hard/impossible to enter on
  180. a different keyboard layout.
  181. You can change your passphrase for existing repos at any time, it won't affect
  182. the encryption/decryption key or other secrets.
  183. .. include:: usage/create.rst.inc
  184. Examples
  185. ~~~~~~~~
  186. ::
  187. # Backup ~/Documents into an archive named "my-documents"
  188. $ borg create /path/to/repo::my-documents ~/Documents
  189. # same, but verbosely list all files as we process them
  190. $ borg create -v --list /path/to/repo::my-documents ~/Documents
  191. # Backup ~/Documents and ~/src but exclude pyc files
  192. $ borg create /path/to/repo::my-files \
  193. ~/Documents \
  194. ~/src \
  195. --exclude '*.pyc'
  196. # Backup home directories excluding image thumbnails (i.e. only
  197. # /home/*/.thumbnails is excluded, not /home/*/*/.thumbnails)
  198. $ borg create /path/to/repo::my-files /home \
  199. --exclude 're:^/home/[^/]+/\.thumbnails/'
  200. # Do the same using a shell-style pattern
  201. $ borg create /path/to/repo::my-files /home \
  202. --exclude 'sh:/home/*/.thumbnails'
  203. # Backup the root filesystem into an archive named "root-YYYY-MM-DD"
  204. # use zlib compression (good, but slow) - default is no compression
  205. $ borg create -C zlib,6 /path/to/repo::root-{now:%Y-%m-%d} / --one-file-system
  206. # Make a big effort in fine granular deduplication (big chunk management
  207. # overhead, needs a lot of RAM and disk space, see formula in internals
  208. # docs - same parameters as borg < 1.0 or attic):
  209. $ borg create --chunker-params 10,23,16,4095 /path/to/repo::small /smallstuff
  210. # Backup a raw device (must not be active/in use/mounted at that time)
  211. $ dd if=/dev/sdx bs=10M | borg create /path/to/repo::my-sdx -
  212. # No compression (default)
  213. $ borg create /path/to/repo::arch ~
  214. # Super fast, low compression
  215. $ borg create --compression lz4 /path/to/repo::arch ~
  216. # Less fast, higher compression (N = 0..9)
  217. $ borg create --compression zlib,N /path/to/repo::arch ~
  218. # Even slower, even higher compression (N = 0..9)
  219. $ borg create --compression lzma,N /path/to/repo::arch ~
  220. # Use short hostname, user name and current time in archive name
  221. $ borg create /path/to/repo::{hostname}-{user}-{now} ~
  222. $ borg create /path/to/repo::{hostname}-{user}-{now:%Y-%m-%d_%H:%M:%S} ~
  223. .. include:: usage/extract.rst.inc
  224. Examples
  225. ~~~~~~~~
  226. ::
  227. # Extract entire archive
  228. $ borg extract /path/to/repo::my-files
  229. # Extract entire archive and list files while processing
  230. $ borg extract -v --list /path/to/repo::my-files
  231. # Extract the "src" directory
  232. $ borg extract /path/to/repo::my-files home/USERNAME/src
  233. # Extract the "src" directory but exclude object files
  234. $ borg extract /path/to/repo::my-files home/USERNAME/src --exclude '*.o'
  235. # Restore a raw device (must not be active/in use/mounted at that time)
  236. $ borg extract --stdout /path/to/repo::my-sdx | dd of=/dev/sdx bs=10M
  237. Note: currently, extract always writes into the current working directory ("."),
  238. so make sure you ``cd`` to the right place before calling ``borg extract``.
  239. .. include:: usage/check.rst.inc
  240. .. include:: usage/rename.rst.inc
  241. Examples
  242. ~~~~~~~~
  243. ::
  244. $ borg create /path/to/repo::archivename ~
  245. $ borg list /path/to/repo
  246. archivename Mon, 2016-02-15 19:50:19
  247. $ borg rename /path/to/repo::archivename newname
  248. $ borg list /path/to/repo
  249. newname Mon, 2016-02-15 19:50:19
  250. .. include:: usage/list.rst.inc
  251. Examples
  252. ~~~~~~~~
  253. ::
  254. $ borg list /path/to/repo
  255. Monday Mon, 2016-02-15 19:15:11
  256. repo Mon, 2016-02-15 19:26:54
  257. root-2016-02-15 Mon, 2016-02-15 19:36:29
  258. newname Mon, 2016-02-15 19:50:19
  259. ...
  260. $ borg list /path/to/repo::root-2016-02-15
  261. drwxr-xr-x root root 0 Mon, 2016-02-15 17:44:27 .
  262. drwxrwxr-x root root 0 Mon, 2016-02-15 19:04:49 bin
  263. -rwxr-xr-x root root 1029624 Thu, 2014-11-13 00:08:51 bin/bash
  264. lrwxrwxrwx root root 0 Fri, 2015-03-27 20:24:26 bin/bzcmp -> bzdiff
  265. -rwxr-xr-x root root 2140 Fri, 2015-03-27 20:24:22 bin/bzdiff
  266. ...
  267. $ borg list /path/to/repo::archiveA --list-format="{mode} {user:6} {group:6} {size:8d} {isomtime} {path}{extra}{NEWLINE}"
  268. drwxrwxr-x user user 0 Sun, 2015-02-01 11:00:00 .
  269. drwxrwxr-x user user 0 Sun, 2015-02-01 11:00:00 code
  270. drwxrwxr-x user user 0 Sun, 2015-02-01 11:00:00 code/myproject
  271. -rw-rw-r-- user user 1416192 Sun, 2015-02-01 11:00:00 code/myproject/file.ext
  272. ...
  273. # see what is changed between archives, based on file modification time, size and file path
  274. $ borg list /path/to/repo::archiveA --list-format="{mtime:%s}{TAB}{size}{TAB}{path}{LF}" |sort -n > /tmp/list.archiveA
  275. $ borg list /path/to/repo::archiveB --list-format="{mtime:%s}{TAB}{size}{TAB}{path}{LF}" |sort -n > /tmp/list.archiveB
  276. $ diff -y /tmp/list.archiveA /tmp/list.archiveB
  277. 1422781200 0 . 1422781200 0 .
  278. 1422781200 0 code 1422781200 0 code
  279. 1422781200 0 code/myproject 1422781200 0 code/myproject
  280. 1422781200 1416192 code/myproject/file.ext | 1454664653 1416192 code/myproject/file.ext
  281. ...
  282. .. include:: usage/delete.rst.inc
  283. Examples
  284. ~~~~~~~~
  285. ::
  286. # delete a single backup archive:
  287. $ borg delete /path/to/repo::Monday
  288. # delete the whole repository and the related local cache:
  289. $ borg delete /path/to/repo
  290. You requested to completely DELETE the repository *including* all archives it contains:
  291. repo Mon, 2016-02-15 19:26:54
  292. root-2016-02-15 Mon, 2016-02-15 19:36:29
  293. newname Mon, 2016-02-15 19:50:19
  294. Type 'YES' if you understand this and want to continue: YES
  295. .. include:: usage/prune.rst.inc
  296. Examples
  297. ~~~~~~~~
  298. Be careful, prune is a potentially dangerous command, it will remove backup
  299. archives.
  300. The default of prune is to apply to **all archives in the repository** unless
  301. you restrict its operation to a subset of the archives using ``--prefix``.
  302. When using ``--prefix``, be careful to choose a good prefix - e.g. do not use a
  303. prefix "foo" if you do not also want to match "foobar".
  304. It is strongly recommended to always run ``prune --dry-run ...`` first so you
  305. will see what it would do without it actually doing anything.
  306. There is also a visualized prune example in ``docs/misc/prune-example.txt``.
  307. ::
  308. # Keep 7 end of day and 4 additional end of week archives.
  309. # Do a dry-run without actually deleting anything.
  310. $ borg prune --dry-run --keep-daily=7 --keep-weekly=4 /path/to/repo
  311. # Same as above but only apply to archive names starting with the hostname
  312. # of the machine followed by a "-" character:
  313. $ borg prune --keep-daily=7 --keep-weekly=4 --prefix='{hostname}-' /path/to/repo
  314. # Keep 7 end of day, 4 additional end of week archives,
  315. # and an end of month archive for every month:
  316. $ borg prune --keep-daily=7 --keep-weekly=4 --keep-monthly=-1 /path/to/repo
  317. # Keep all backups in the last 10 days, 4 additional end of week archives,
  318. # and an end of month archive for every month:
  319. $ borg prune --keep-within=10d --keep-weekly=4 --keep-monthly=-1 /path/to/repo
  320. .. include:: usage/info.rst.inc
  321. Examples
  322. ~~~~~~~~
  323. ::
  324. $ borg info /path/to/repo::root-2016-02-15
  325. Name: root-2016-02-15
  326. Fingerprint: 57c827621f21b000a8d363c1e163cc55983822b3afff3a96df595077a660be50
  327. Hostname: myhostname
  328. Username: root
  329. Time (start): Mon, 2016-02-15 19:36:29
  330. Time (end): Mon, 2016-02-15 19:39:26
  331. Command line: /usr/local/bin/borg create -v --list -C zlib,6 /path/to/repo::root-2016-02-15 / --one-file-system
  332. Number of files: 38100
  333. Original size Compressed size Deduplicated size
  334. This archive: 1.33 GB 613.25 MB 571.64 MB
  335. All archives: 1.63 GB 853.66 MB 584.12 MB
  336. Unique chunks Total chunks
  337. Chunk index: 36858 48844
  338. .. include:: usage/mount.rst.inc
  339. Examples
  340. ~~~~~~~~
  341. ::
  342. $ borg mount /path/to/repo::root-2016-02-15 /tmp/mymountpoint
  343. $ ls /tmp/mymountpoint
  344. bin boot etc home lib lib64 lost+found media mnt opt root sbin srv tmp usr var
  345. $ fusermount -u /tmp/mymountpoint
  346. .. include:: usage/change-passphrase.rst.inc
  347. Examples
  348. ~~~~~~~~
  349. ::
  350. # Create a key file protected repository
  351. $ borg init --encryption=keyfile -v /path/to/repo
  352. Initializing repository at "/path/to/repo"
  353. Enter new passphrase:
  354. Enter same passphrase again:
  355. Remember your passphrase. Your data will be inaccessible without it.
  356. Key in "/root/.config/borg/keys/mnt_backup" created.
  357. Keep this key safe. Your data will be inaccessible without it.
  358. Synchronizing chunks cache...
  359. Archives: 0, w/ cached Idx: 0, w/ outdated Idx: 0, w/o cached Idx: 0.
  360. Done.
  361. # Change key file passphrase
  362. $ borg change-passphrase -v /path/to/repo
  363. Enter passphrase for key /root/.config/borg/keys/mnt_backup:
  364. Enter new passphrase:
  365. Enter same passphrase again:
  366. Remember your passphrase. Your data will be inaccessible without it.
  367. Key updated
  368. .. include:: usage/serve.rst.inc
  369. Examples
  370. ~~~~~~~~
  371. borg serve has special support for ssh forced commands (see ``authorized_keys``
  372. example below): it will detect that you use such a forced command and extract
  373. the value of the ``--restrict-to-path`` option(s).
  374. It will then parse the original command that came from the client, makes sure
  375. that it is also ``borg serve`` and enforce path restriction(s) as given by the
  376. forced command. That way, other options given by the client (like ``--info`` or
  377. ``--umask``) are preserved (and are not fixed by the forced command).
  378. ::
  379. # Allow an SSH keypair to only run borg, and only have access to /path/to/repo.
  380. # Use key options to disable unneeded and potentially dangerous SSH functionality.
  381. # This will help to secure an automated remote backup system.
  382. $ cat ~/.ssh/authorized_keys
  383. command="borg serve --restrict-to-path /path/to/repo",no-pty,no-agent-forwarding,no-port-forwarding,no-X11-forwarding,no-user-rc ssh-rsa AAAAB3[...]
  384. .. include:: usage/upgrade.rst.inc
  385. Examples
  386. ~~~~~~~~
  387. ::
  388. # Upgrade the borg repository to the most recent version.
  389. $ borg upgrade -v /path/to/repo
  390. making a hardlink copy in /path/to/repo.upgrade-2016-02-15-20:51:55
  391. opening attic repository with borg and converting
  392. no key file found for repository
  393. converting repo index /path/to/repo/index.0
  394. converting 1 segments...
  395. converting borg 0.xx to borg current
  396. no key file found for repository
  397. .. include:: usage/break-lock.rst.inc
  398. Miscellaneous Help
  399. ------------------
  400. .. include:: usage/help.rst.inc
  401. Debug Commands
  402. --------------
  403. There are some more commands (all starting with "debug-") which are all
  404. **not intended for normal use** and **potentially very dangerous** if used incorrectly.
  405. They exist to improve debugging capabilities without direct system access, e.g.
  406. in case you ever run into some severe malfunction. Use them only if you know
  407. what you are doing or if a trusted |project_name| developer tells you what to do.
  408. Additional Notes
  409. ----------------
  410. Here are misc. notes about topics that are maybe not covered in enough detail in the usage section.
  411. Item flags
  412. ~~~~~~~~~~
  413. ``borg create -v --list`` outputs a verbose list of all files, directories and other
  414. file system items it considered (no matter whether they had content changes
  415. or not). For each item, it prefixes a single-letter flag that indicates type
  416. and/or status of the item.
  417. If you are interested only in a subset of that output, you can give e.g.
  418. ``--filter=AME`` and it will only show regular files with A, M or E status (see
  419. below).
  420. A uppercase character represents the status of a regular file relative to the
  421. "files" cache (not relative to the repo -- this is an issue if the files cache
  422. is not used). Metadata is stored in any case and for 'A' and 'M' also new data
  423. chunks are stored. For 'U' all data chunks refer to already existing chunks.
  424. - 'A' = regular file, added (see also :ref:`a_status_oddity` in the FAQ)
  425. - 'M' = regular file, modified
  426. - 'U' = regular file, unchanged
  427. - 'E' = regular file, an error happened while accessing/reading *this* file
  428. A lowercase character means a file type other than a regular file,
  429. borg usually just stores their metadata:
  430. - 'd' = directory
  431. - 'b' = block device
  432. - 'c' = char device
  433. - 'h' = regular file, hardlink (to already seen inodes)
  434. - 's' = symlink
  435. - 'f' = fifo
  436. Other flags used include:
  437. - 'i' = backup data was read from standard input (stdin)
  438. - '-' = dry run, item was *not* backed up
  439. - '?' = missing status code (if you see this, please file a bug report!)
  440. --chunker-params
  441. ~~~~~~~~~~~~~~~~
  442. The chunker params influence how input files are cut into pieces (chunks)
  443. which are then considered for deduplication. They also have a big impact on
  444. resource usage (RAM and disk space) as the amount of resources needed is
  445. (also) determined by the total amount of chunks in the repository (see
  446. `Indexes / Caches memory usage` for details).
  447. ``--chunker-params=10,23,16,4095`` results in a fine-grained deduplication
  448. and creates a big amount of chunks and thus uses a lot of resources to manage
  449. them. This is good for relatively small data volumes and if the machine has a
  450. good amount of free RAM and disk space.
  451. ``--chunker-params=19,23,21,4095`` (default) results in a coarse-grained
  452. deduplication and creates a much smaller amount of chunks and thus uses less
  453. resources. This is good for relatively big data volumes and if the machine has
  454. a relatively low amount of free RAM and disk space.
  455. If you already have made some archives in a repository and you then change
  456. chunker params, this of course impacts deduplication as the chunks will be
  457. cut differently.
  458. In the worst case (all files are big and were touched in between backups), this
  459. will store all content into the repository again.
  460. Usually, it is not that bad though:
  461. - usually most files are not touched, so it will just re-use the old chunks
  462. it already has in the repo
  463. - files smaller than the (both old and new) minimum chunksize result in only
  464. one chunk anyway, so the resulting chunks are same and deduplication will apply
  465. If you switch chunker params to save resources for an existing repo that
  466. already has some backup archives, you will see an increasing effect over time,
  467. when more and more files have been touched and stored again using the bigger
  468. chunksize **and** all references to the smaller older chunks have been removed
  469. (by deleting / pruning archives).
  470. If you want to see an immediate big effect on resource usage, you better start
  471. a new repository when changing chunker params.
  472. For more details, see :ref:`chunker_details`.
  473. --read-special
  474. ~~~~~~~~~~~~~~
  475. The option ``--read-special`` is not intended for normal, filesystem-level (full or
  476. partly-recursive) backups. You only give this option if you want to do something
  477. rather ... special -- and if you have hand-picked some files that you want to treat
  478. that way.
  479. ``borg create --read-special`` will open all files without doing any special
  480. treatment according to the file type (the only exception here are directories:
  481. they will be recursed into). Just imagine what happens if you do ``cat
  482. filename`` --- the content you will see there is what borg will backup for that
  483. filename.
  484. So, for example, symlinks will be followed, block device content will be read,
  485. named pipes / UNIX domain sockets will be read.
  486. You need to be careful with what you give as filename when using ``--read-special``,
  487. e.g. if you give ``/dev/zero``, your backup will never terminate.
  488. The given files' metadata is saved as it would be saved without
  489. ``--read-special`` (e.g. its name, its size [might be 0], its mode, etc.) -- but
  490. additionally, also the content read from it will be saved for it.
  491. Restoring such files' content is currently only supported one at a time via
  492. ``--stdout`` option (and you have to redirect stdout to where ever it shall go,
  493. maybe directly into an existing device file of your choice or indirectly via
  494. ``dd``).
  495. Example
  496. +++++++
  497. Imagine you have made some snapshots of logical volumes (LVs) you want to backup.
  498. .. note::
  499. For some scenarios, this is a good method to get "crash-like" consistency
  500. (I call it crash-like because it is the same as you would get if you just
  501. hit the reset button or your machine would abrubtly and completely crash).
  502. This is better than no consistency at all and a good method for some use
  503. cases, but likely not good enough if you have databases running.
  504. Then you create a backup archive of all these snapshots. The backup process will
  505. see a "frozen" state of the logical volumes, while the processes working in the
  506. original volumes continue changing the data stored there.
  507. You also add the output of ``lvdisplay`` to your backup, so you can see the LV
  508. sizes in case you ever need to recreate and restore them.
  509. After the backup has completed, you remove the snapshots again. ::
  510. $ # create snapshots here
  511. $ lvdisplay > lvdisplay.txt
  512. $ borg create --read-special /path/to/repo::arch lvdisplay.txt /dev/vg0/*-snapshot
  513. $ # remove snapshots here
  514. Now, let's see how to restore some LVs from such a backup. ::
  515. $ borg extract /path/to/repo::arch lvdisplay.txt
  516. $ # create empty LVs with correct sizes here (look into lvdisplay.txt).
  517. $ # we assume that you created an empty root and home LV and overwrite it now:
  518. $ borg extract --stdout /path/to/repo::arch dev/vg0/root-snapshot > /dev/vg0/root
  519. $ borg extract --stdout /path/to/repo::arch dev/vg0/home-snapshot > /dev/vg0/home
  520. Append-only mode
  521. ~~~~~~~~~~~~~~~~
  522. A repository can be made "append-only", which means that Borg will never overwrite or
  523. delete committed data. This is useful for scenarios where multiple machines back up to
  524. a central backup server using ``borg serve``, since a hacked machine cannot delete
  525. backups permanently.
  526. To activate append-only mode, edit the repository ``config`` file and add a line
  527. ``append_only=1`` to the ``[repository]`` section (or edit the line if it exists).
  528. In append-only mode Borg will create a transaction log in the ``transactions`` file,
  529. where each line is a transaction and a UTC timestamp.
  530. In addition, ``borg serve`` can act as if a repository is in append-only mode with
  531. its option ``--append-only``. This can be very useful for fine-tuning access control
  532. in ``.ssh/authorized_keys`` ::
  533. command="borg serve --append-only ..." ssh-rsa <key used for not-always-trustable backup clients>
  534. command="borg serve ..." ssh-rsa <key used for backup management>
  535. Example
  536. +++++++
  537. Suppose an attacker remotely deleted all backups, but your repository was in append-only
  538. mode. A transaction log in this situation might look like this: ::
  539. transaction 1, UTC time 2016-03-31T15:53:27.383532
  540. transaction 5, UTC time 2016-03-31T15:53:52.588922
  541. transaction 11, UTC time 2016-03-31T15:54:23.887256
  542. transaction 12, UTC time 2016-03-31T15:55:54.022540
  543. transaction 13, UTC time 2016-03-31T15:55:55.472564
  544. From your security logs you conclude the attacker gained access at 15:54:00 and all
  545. the backups where deleted or replaced by compromised backups. From the log you know
  546. that transactions 11 and later are compromised. Note that the transaction ID is the
  547. name of the *last* file in the transaction. For example, transaction 11 spans files 6
  548. to 11.
  549. In a real attack you'll likely want to keep the compromised repository
  550. intact to analyze what the attacker tried to achieve. It's also a good idea to make this
  551. copy just in case something goes wrong during the recovery. Since recovery is done by
  552. deleting some files, a hard link copy (``cp -al``) is sufficient.
  553. The first step to reset the repository to transaction 5, the last uncompromised transaction,
  554. is to remove the ``hints.N`` and ``index.N`` files in the repository (these two files are
  555. always expendable). In this example N is 13.
  556. Then remove or move all segment files from the segment directories in ``data/`` starting
  557. with file 6::
  558. rm data/**/{6..13}
  559. That's all to it.
  560. Drawbacks
  561. +++++++++
  562. As data is only appended, and nothing deleted, commands like ``prune`` or ``delete``
  563. won't free disk space, they merely tag data as deleted in a new transaction.
  564. Note that you can go back-and-forth between normal and append-only operation by editing
  565. the configuration file, it's not a "one way trip".
  566. Further considerations
  567. ++++++++++++++++++++++
  568. Append-only mode is not respected by tools other than Borg. ``rm`` still works on the
  569. repository. Make sure that backup client machines only get to access the repository via
  570. ``borg serve``.
  571. Ensure that no remote access is possible if the repository is temporarily set to normal mode
  572. for e.g. regular pruning.
  573. Further protections can be implemented, but are outside of Borgs scope. For example,
  574. file system snapshots or wrapping ``borg serve`` to set special permissions or ACLs on
  575. new data files.