zrepl

mirror of https://github.com/zrepl/zrepl.git synced 2024-11-24 17:35:01 +01:00

Author	SHA1	Message	Date
Christian Schwarz	b9b9ad10cf	snapshotting: ability to specify timestamp location != UTC (#801) This PR adds a new optional field `timestamp_location` that allows the user to specify a timezone different than the default UTC for use in the snapshot suffix. I took @mjasnik's PR https://github.com/zrepl/zrepl/pull/785 and refactored+extended it as follows: * move all formatting logic into its own package * disallow `dense` and `human` with formats != UTC to protect users from stupidity * document behavior more clearly * regression test for existing users	2024-10-18 15:12:41 +02:00
Christian Schwarz	740ab4b1b2	chore: io/ioutil has been deprecated	2024-09-08 23:19:45 +00:00
Christian Schwarz	9c63736489	treat empty `jobs` & empty YAML as valid & ship empty `jobs` in deb/rpm (#788) fixes https://github.com/zrepl/zrepl/issues/784 obsoletes https://github.com/zrepl/zrepl/pull/787	2024-05-14 19:18:22 +02:00
Christian Schwarz	6be133f55d	remove unused JobDebugSettings along with docs For this kind of debugging, we switched to env vars a while ago. For example, ZREPL_RPC_DEBUG. I don't think we have a substitute for the RPCLog stuff. However, NetConnLogger is still in the codebase. obsoletes https://github.com/zrepl/zrepl/pull/661	2022-12-22 18:13:45 +01:00
Christian Schwarz	3ffb69bfb0	config: support zrepl's day and week units for snapshotting.interval Originally, I had a patch that would replace all usages of time.Duration in package config with the new config.Duration types, but: 1. these are all timeouts/retry intervals that have default values. Most users don't touch them, and if they do, they don't need day or week units. 2. go-yaml's error reporting for yaml.Unmarshaler is inferior to built-in types (line numbers are missing, so the error would not have sufficient context) fixes https://github.com/zrepl/zrepl/issues/486	2022-10-27 00:19:06 +02:00
Yannick Dylla	1da8f848f2	snapper: support custom timestamp format fixes https://github.com/zrepl/zrepl/issues/465 closes https://github.com/zrepl/zrepl/pull/639	2022-10-27 00:19:06 +02:00
Christian Schwarz	c743c7b03f	refactor snapper & support cron-based snapshotting fixes https://github.com/zrepl/zrepl/issues/554 refs https://github.com/zrepl/zrepl/discussions/547#discussioncomment-1936126	2022-09-25 19:23:44 +02:00
Cole Helbling	1df0f8912a	Add `--skip-cert-check` flag to `zrepl configcheck` to prevent checking cert files It may be desirable to check that a config is valid without checking for the existence of certificate files (e.g. when validating a config inside a sandbox without access to the cert files). This will be very useful for NixOS so that we can check the config file at nix-build time (e.g. potentially without proper permissions to read cert files for a TLS connection). fixes https://github.com/zrepl/zrepl/issues/467 closes https://github.com/zrepl/zrepl/pull/587	2022-07-08 20:18:41 +02:00
Christian Schwarz	2642c64303	make initial replication policy configurable (most_recent, all, fail) Config: ``` - type: push ... conflict_resolution: initial_replication: most_recent | all | fail ``` The ``initial_replication`` option determines which snapshots zrepl replicates if the filesystem has not been replicated before. If ``most_recent`` (the default), the initial replication will only transfer the most recent snapshot, while ignoring previous snapshots. If all snapshots should be replicated, specify ``all``. Use ``fail`` to make replication of the filesystem fail in case there is no corresponding filesystem on the receiver. Code-Level Changes, apart from the obvious: - Rework IncrementalPath()'s return signature. Now returns an error for initial replications as well. - Rename & rework its consumer, resolveConflict(). Co-authored-by: Graham Christensen <graham@grahamc.com> Fixes https://github.com/zrepl/zrepl/issues/550 Fixes https://github.com/zrepl/zrepl/issues/187 Closes https://github.com/zrepl/zrepl/pull/592	2022-06-26 14:36:59 +02:00
Christian Schwarz	fb6a9be954	fix encrypt-on-receive with placeholders fixes https://github.com/zrepl/zrepl/issues/504 Problem: plain send + recv with root_fs encrypted + placeholders causes plain recvs whereas user would expect encrypt-on-recv Reason: We create placeholder filesystems with -o encryption=off. Thus, children received below those placeholders won't inherit encryption of root_fs. Fix: We'll have three values for `recv.placeholders.encryption: unspecified (default) | off | inherit`. When we create a placeholder, we will fail the operation if `recv.placeholders.encryption = unspecified`. The exception is if the placeholder filesystem is to encode the client identity ($root_fs/$client_identity) in a pull job. Those are created in `inherit` mode if the config field is `unspecified` so that users who don't need placeholders are not bothered by these details. Future Work: Automatically warn existing users of encrypt-on-recv about the problem if they are affected. The problem that I hit during implementation of this is that the `encryption` prop's `source` doesn't quite behave like other props: `source` is `default` for `encryption=off` and `-` when `encryption=on`. Hence, we can't use `source` to distinguish the following 2x2 cases: (1) placeholder created with explicit -o encryption=off (2) placeholder created without specifying -o encryption with (A) an encrypted parent at creation time (B) an unencrypted parent at creation time	2021-12-18 15:12:47 +01:00
Christian Schwarz	20ff9717bc	fix mis-spelled send option for embedded data fixes https://github.com/zrepl/zrepl/issues/522	2021-11-14 17:34:32 +01:00
Christian Schwarz	f5f269bfd5	send/recv: job-level bandwidth limiting Sponsored-by: Prominic.NET, Inc. fixes #339	2021-09-12 20:08:43 +02:00
Christian Schwarz	0ceea1b792	replication: simplify parallel replication variables & expose them in config closes #140	2021-03-14 17:30:10 +01:00
InsanePrawn	393fc10a69	[#285] support setting zfs send / recv flags in the config (send: -wLcepbS, recv: -ox) Co-authored-by: Christian Schwarz <me@cschwarz.com> Signed-off-by: InsanePrawn <insane.prawny@gmail.com> closes #285 closes #276 closes #24	2021-02-20 17:20:45 +01:00
Christian Schwarz	b1f8cdf385	[#373] pruning: add optional `regex` field to `last_n` rule fixes #373	2020-09-02 22:45:44 +02:00
Christian Schwarz	30cdc1430e	replication + endpoint: replication guarantees: guarantee_{resumability,incremental,nothing} This commit - adds a configuration in which no step holds, replication cursors, etc. are created - removes the send.step_holds.disable_incremental setting - creates a new config option `replication` for active-side jobs - adds the replication.protection.{initial,incremental} settings, each of which can have values - `guarantee_resumability` - `guarantee_incremental` - `guarantee_nothing` (refer to docs/configuration/replication.rst for semantics) The `replication` config from an active side is sent to both endpoint.Sender and endpoint.Receiver for each replication step. Sender and Receiver then act accordingly. For `guarantee_incremental`, we add the new `tentative-replication-cursor` abstraction. The necessity for that abstraction is outlined in https://github.com/zrepl/zrepl/issues/340. fixes https://github.com/zrepl/zrepl/issues/340	2020-07-26 20:32:35 +02:00
Christian Schwarz	1c270b7e39	add option to disable step holds for incremental sends This is a stop-gap solution until we re-write the pruner to support rules for removing step holds. Note that disabling step holds for incremental sends does not affect zrepl's guarantee that incremental replication is always possible: Suppose you yank the external drive during an incremental @from -> @to step: * restarting that step or future incrementals @from -> @to_later will be possible because the replication cursor bookmark points to @from until the step is complete * resuming @from -> @to will work as long as the pruner on your internal pool doesn't come around to destroy @to. * in that case, the replication algorithm should determine that the resumable state on the receiving side is useless because @to no longer exists on the sending side, and consequently clear it, and restart an incremental step @from -> @to_later refs #288	2020-06-14 15:26:05 +02:00
InsanePrawn	44bd354eae	Spellcheck all files Signed-off-by: InsanePrawn <insane.prawny@gmail.com>	2020-02-24 16:06:09 +01:00
Christian Schwarz	58c08c855f	new features: {resumable,encrypted,hold-protected} send-recv, last-received-hold - Resumable Send & Recv Support No knobs required, automatically used where supported. - Hold-Protected Send & Recv Automatic ZFS holds to ensure that we can always resume a replication step. - Encrypted Send & Recv Support for OpenZFS native encryption. Configurable at the job level, i.e., for all filesystems a job is responsible for. - Receive-side hold on last received dataset The counterpart to the replication cursor bookmark on the send-side. Ensures that incremental replication will always be possible between a sender and receiver. Design Doc ---------- `replication/design.md` doc describes how we use ZFS holds and bookmarks to ensure that a single replication step is always resumable. The replication algorithm described in the design doc introduces the notion of job IDs (please read the details on this design doc). We reuse the job names for job IDs and use `JobID` type to ensure that a job name can be embedded into hold tags, bookmark names, etc. This might BREAK CONFIG on upgrade. Protocol Version Bump --------------------- This commit makes backwards-incompatible changes to the replication/pdu protobufs. Thus, bump the version number used in the protocol handshake. Replication Cursor Format Change -------------------------------- The new replication cursor bookmark format is: `#zrepl_CURSOR_G_${this.GUID}_J_${jobid}` Including the GUID enables transaction-safe moving-forward of the cursor. Including the job id enables that multiple sending jobs can send the same filesystem without interfering. The `zrepl migrate replication-cursor:v1-v2` subcommand can be used to safely destroy old-format cursors once zrepl has created new-format cursors. Changes in This Commit ---------------------- - package zfs - infrastructure for holds - infrastructure for resume token decoding - implement a variant of OpenZFS's `entity_namecheck` and use it for validation in new code - ZFSSendArgs to specify a ZFS send operation - validation code protects against malicious resume tokens by checking that the token encodes the same send parameters that the send-side would use if no resume token were available (i.e. same filesystem, `fromguid`, `toguid`) - RecvOptions support for `recv -s` flag - convert a bunch of ZFS operations to be idempotent - achieved through more differentiated error message scraping / additional pre-/post-checks - package replication/pdu - add field for encryption to send request messages - add fields for resume handling to send & recv request messages - receive requests now contain `FilesystemVersion To` in addition to the filesystem into which the stream should be `recv`d into - can use `zfs recv $root_fs/$client_id/path/to/dataset@${To.Name}`, which enables additional validation after recv (i.e. whether `To.Guid` matched what we received in the stream) - used to set `last-received-hold` - package replication/logic - introduce `PlannerPolicy` struct, currently only used to configure whether encrypted sends should be requested from the sender - integrate encryption and resume token support into `Step` struct - package endpoint - move the concepts that endpoint builds on top of ZFS to a single file `endpoint/endpoint_zfs.go` - step-holds + step-bookmarks - last-received-hold - new replication cursor + old replication cursor compat code - adjust `endpoint/endpoint.go` handlers for - encryption - resumability - new replication cursor - last-received-hold - client subcommand `zrepl holds list`: list all holds and hold-like bookmarks that zrepl thinks belong to it - client subcommand `zrepl migrate replication-cursor:v1-v2`	2020-02-14 22:00:13 +01:00
Juergen Hoetzel	d35e2400b2	transport/{TCP,TLS}: optional IP_FREEBIND / IP_BINDANY bind socketops Allows to bind to an address even if it is not actually (yet or ever) configured. Fixes #238 Rationale: https://www.freedesktop.org/wiki/Software/systemd/NetworkTarget/#whatdoesthismeanformeadeveloper	2020-01-04 17:21:48 +01:00
Christian Schwarz	5c95c21727	transport/local: configurable dial_timeout for connect, default 2s	2019-09-29 19:05:54 +02:00
Christian Schwarz	f976212ec9	config: validate presence of port in addresses fixes #213	2019-09-28 14:25:14 +02:00
Ross Williams	729c83ee72	pre- and post-snapshot hooks * stack-based execution model, documented in documentation * circbuf for capturing hook output * built-in hooks for postgres and mysql * refactor docs, too much info on the jobs page, too difficult to discover snapshotting & hooks Co-authored-by: Ross Williams <ross@ross-williams.net> Co-authored-by: Christian Schwarz <me@cschwarz.com> fixes #74	2019-09-27 21:25:59 +02:00
Christian Schwarz	afed762774	format source tree using goimports	2019-03-22 19:41:12 +01:00
Christian Schwarz	17818439a0	Merge branch 'problame/replication_refactor' into InsanePrawn-master	2019-03-17 17:33:51 +01:00
Christian Schwarz	da3ba50a2c	Merge remote-tracking branch 'origin/master' into problame/replication_refactor	2019-03-16 14:48:01 +01:00
Christian Schwarz	4ee00091d6	pull job: support manual-only invocation	2019-03-16 14:24:05 +01:00
Christian Schwarz	aff639e87a	Merge remote-tracking branch 'origin/master' into InsanePrawn-master	2019-03-15 21:05:20 +01:00
Christian Schwarz	a0f301d700	syslog logging: fix priority parsing + add test for default facility	2019-03-15 18:18:16 +01:00
Ximalas	fc311a9fd6	syslog logging: support setting facility in config	2019-03-15 17:55:11 +01:00
Christian Schwarz	796c5ad42d	rpc rewrite: control RPCs using gRPC + separate RPC for data transfer transport/ssh: update go-netssh to new version => supports CloseWrite and Deadlines => build: require Go 1.11 (netssh requires it)	2019-03-13 13:53:48 +01:00
Christian Schwarz	c1aab0bee9	config: update yaml-config and use zeropositive constraint for timeouts	2018-12-11 21:54:36 +01:00
InsanePrawn	3d2688e959	Ugly but working initial snapjob implementation	2018-11-20 19:30:15 +01:00
Christian Schwarz	5e1ea21f85	pruning: add 'Negate' option to KeepRegex and expose it in config	2018-11-16 12:21:54 +01:00
Christian Schwarz	1f072936c5	fix default stdout outlet	2018-10-18 15:48:24 +02:00
Christian Schwarz	63169c51b7	add 'test filesystems' subcommand for testing filesystem filters	2018-10-13 16:22:19 +02:00
Christian Schwarz	125b561df3	rename root_dataset to root_fs for receiving-side jobs	2018-10-11 18:03:18 +02:00
Christian Schwarz	4e16952ad9	snapshotting: support 'periodic' and 'manual' mode 1. Change config format to support multiple types of snapshotting modes. 2. Implement a hacky way to support periodic or completely manual snapshots. In manual mode, the user has to trigger replication using the wakeup mechanism after they took snapshots using their own tooling. As indicated by the comment, a more general solution would be desirable, but we want to get the release out and 'manual' mode is a feature that some people requested...	2018-10-11 15:59:23 +02:00
Christian Schwarz	93c90cd705	pruning: fix YAML representation of PruneKeepRegex	2018-10-11 13:07:52 +02:00
Christian Schwarz	01668a989e	transport local: named listeners + struct renaming	2018-10-11 13:06:47 +02:00
Christian Schwarz	1ce0c69e4f	implement local replication using new local transport The new local transport uses socketpair() and a switchboard based on client identities. The special local job type is gone, which is good since it does not fit into the 'Active/Passive side' + 'mode' concept used to implement the duality of push/sink | pull/source.	2018-09-24 14:43:53 +02:00
Christian Schwarz	e3be120d88	refactor push + source into active + passive 'sides' with push and source 'modes'	2018-09-24 12:36:10 +02:00
Christian Schwarz	4a6160baf3	update to streamrpc 0.4 & adjust config (not breaking)	2018-09-23 20:28:30 +02:00
Christian Schwarz	975fdee217	replication & pruning: ditch replicated-property, use bookmark as cursor instead A bookmark with a well-known name is used to track which version was last successfully received by the receiver. The createtxg that can be retrieved from the bookmark using `zfs get` is used to set the Replicated attribute of each snap on the sender: If the snap's CreateTXG > the cursor's, it is not yet replicated, otherwise it has been. There is an optional config option to change the behavior to `CreateTXG >= the cursor's`, and the implementation defaults to that. The reason: While things work just fine with `CreateTXG > the cursor's`, ZFS does not provide size estimates in a `zfs send` dry run (see `acd2418`). However, to enable the use case of keeping the snapshot only around for the replication, the config flag exists.	2018-09-05 19:51:06 -07:00
Christian Schwarz	adab06405b	make go vet happy	2018-09-04 17:25:10 -07:00
Christian Schwarz	308e5e35fb	Multi-client servers + bring back stdinserver support	2018-09-04 16:43:55 -07:00
Christian Schwarz	754b253043	config: no-field for replication anymore It's closer to the original config and we don't want users to specify 'filesystems' and similar multiple times in a single job definition.	2018-09-04 14:44:45 -07:00
Christian Schwarz	3d8e552c6a	streamrpc 0.3 + config from daemon/config	2018-09-02 15:46:42 -07:00
Christian Schwarz	d55a271ac7	WIP adopt updated yaml-config with 'fromdefaults' struct tag	2018-09-02 15:46:03 -07:00
Christian Schwarz	1690339440	colorized stdout logger if stdout is tty	2018-08-30 13:33:28 +02:00
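
The cursor rule described in commit 975fdee217 can be sketched in Go roughly as follows. This is only an illustration of the comparison stated in the commit message, not zrepl's actual code: the function name `isReplicated` and the parameter `cursorSnapCountsAsUnreplicated` are hypothetical names introduced here, and the `createtxg` values are assumed to have already been read via `zfs get`.

```go
package main

import "fmt"

// isReplicated reports whether a snapshot counts as replicated, given the
// createtxg of the snapshot and the createtxg of the replication cursor
// bookmark.
//
// cursorSnapCountsAsUnreplicated is a hypothetical stand-in for the config
// option mentioned in the commit message: when true, the "not yet
// replicated" test is `CreateTXG >= the cursor's` instead of
// `CreateTXG > the cursor's`.
func isReplicated(snapCreateTXG, cursorCreateTXG uint64, cursorSnapCountsAsUnreplicated bool) bool {
	if cursorSnapCountsAsUnreplicated {
		// ">=" variant: the snapshot at the cursor is still treated as unreplicated.
		return snapCreateTXG < cursorCreateTXG
	}
	// ">" variant: everything up to and including the cursor is replicated.
	return snapCreateTXG <= cursorCreateTXG
}

func main() {
	const cursor = 100
	for _, txg := range []uint64{99, 100, 101} {
		fmt.Println(txg, isReplicated(txg, cursor, false), isReplicated(txg, cursor, true))
	}
}
```

The two variants differ only for the snapshot whose createtxg equals the cursor's, which is the case the commit message singles out.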

52 Commits
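
Commit 58c08c855f gives the new replication cursor bookmark format as `#zrepl_CURSOR_G_${this.GUID}_J_${jobid}`. A minimal Go sketch of building such a name is shown below. The helper name `cursorBookmarkName` is hypothetical, and rendering the GUID as 16 zero-padded hex digits is an assumption made for this example; the commit message does not specify the encoding.

```go
package main

import "fmt"

// cursorBookmarkName builds a bookmark name following the format from the
// commit message: #zrepl_CURSOR_G_${this.GUID}_J_${jobid}.
//
// Assumption: the GUID is printed as 16 zero-padded lowercase hex digits.
// The commit message does not state how the GUID is encoded.
func cursorBookmarkName(guid uint64, jobID string) string {
	return fmt.Sprintf("#zrepl_CURSOR_G_%016x_J_%s", guid, jobID)
}

func main() {
	// "push_job" is a made-up job name used only for illustration.
	fmt.Println(cursorBookmarkName(0xdeadbeef, "push_job"))
}
```

Because the job ID is part of the name, two jobs sending the same filesystem produce distinct bookmark names, which matches the non-interference property the commit message describes.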