rclone

mirror of https://github.com/rclone/rclone.git synced 2024-11-08 09:35:26 +01:00

Author	SHA1	Message	Date
wjielai	22937e8982	docs: add Tencent COS to s3 provider list - fixes #4468 * add Tencent COS to s3 provider list. Co-authored-by: wjielai <wjielai@tencent.com>	2020-09-08 16:34:25 +01:00
Nick Craig-Wood	725ae91387	s3: reduce the default --s3-copy-cutoff to < 5GB The maximum value for the --s3--copy-cutoff should be 5GiB as tested with AWS S3. However b2 have implemented this as 5GB rather than 5GiB so having the default at 5 GiB makes the b2s3 server side copy of a large file by default. This patch sets the default to 4768 MiB which is slightly less than 5GB. This should have very little effect on anything. If in future rclone can lower this limit more if Copy can multithread. See: https://forum.rclone.org/t/copying-files-within-a-b2-bucket/16680/76	2020-09-01 18:53:29 +01:00
Nick Craig-Wood	b7dd3ce608	s3: preserve metadata when doing multipart copy Before this change the s3 multipart server side copy was not preserving the metadata of the object. This was most noticeable because the modtime was not preserved. This change fetches the metadata from the object before starting the copy and overwrites it if requires. It will also mean any other metadata is preserved. See: https://forum.rclone.org/t/copying-files-within-a-b2-bucket/16680/70	2020-09-01 18:39:30 +01:00
Egor Margineanu	921e384c4d	s3: update IBM COS endpoints - fixes #4522	2020-08-30 17:21:11 +01:00
Nick Craig-Wood	801a820c54	s3: fix detection of bucket existing This reverts part of `151f03378f` s3: fix upload of single files into buckets without create permission This erroneously assumed that a HEAD request on a non existent object would return "NotFound" if the bucket was found. In fact it returns "NotFound" when the bucket isn't found also. This will break the fix for #4297 - however that can be made to work using the new --s3-assume-bucket-exists flag	2020-08-21 13:28:08 +01:00
Nick Craig-Wood	d5f4c74697	s3: implement cleanup and backend command to list & remove multipart uploads This implements `rclone cleanup` to remove multipart uploads over 24 hours old. It also implements the backend command `list-multipart-uploads` to see which ones are available and `cleanup` to delete them with a configurable expiry interval. See #4302	2020-07-28 11:37:46 +01:00
Nick Craig-Wood	2288a5c617	s3: implement `profile` and `shared_credentials_file` options It is impossible to use two different profiles at the same time - these config vars enable that. See: https://forum.rclone.org/t/s3-source-destination-named-profile/17417	2020-07-28 11:32:32 +01:00
Nick Craig-Wood	f406dbbb4d	s3: add --s3-no-check-bucket for minimising rclone transactions and perms Fixes #4449	2020-07-27 17:49:40 +01:00
Nick Craig-Wood	80d2f38192	s3: fix bucket Region auto detection when Region unset in config #2915 Previous to this fix if Region was not set and Endpoint was not set then we set the endpoint to "https://s3.amazonaws.com/". This is unecessary because if the Region alone isn't set then we set it to "us-east-1" which has the same endpoint. Having the endpoint set breaks the bucket region auto detection with the error "Failed to update region for bucket: can't set region to "xxx" as endpoint is set". This fix removes that check.	2020-07-10 17:16:59 +01:00
Nick Craig-Wood	c820576329	fs: define SlowModTime and SlowHash features in the relevant backends	2020-06-30 12:01:36 +01:00
David	9058ec32e1	s3: Use regional s3 us-east-1 endpoint	2020-06-26 16:25:52 +01:00
Nick Craig-Wood	fd7c63bc78	s3: add backend restore command to restore objects from GLACIER See: https://forum.rclone.org/t/rclone-settier-fails-with-scaleway-entitytoolarge/17384	2020-06-25 21:33:23 +01:00
Nick Craig-Wood	5f75444ef6	s3: cancel in progress multipart uploads and copies on rclone exit #4300	2020-06-25 12:55:56 +01:00
Nick Craig-Wood	85bcacac90	s3: Cap expiry duration to 1 Week and return error when sharing dir	2020-06-18 17:50:50 +01:00
Vincent Feltz	f4d7e41f24	s3: add Scaleway provider - fixes #4338	2020-06-13 11:55:37 +01:00
Nick Craig-Wood	2ea15a72bc	s3: fix --header-upload - Fixes #4303 Before this change we were setting the headers on the PUT request for normal and multipart uploads. For normal uploads this caused the error 403 Forbidden: There were headers present in the request which were not signed After this fix we set the headers in the object upload request itself as the s3 SDK expects. This means that we only support a limited range of headers - Cache-Control - Content-Disposition - Content-Encoding - Content-Language - Content-Type - X-Amz-Tagging - X-Amz-Meta- Note for the last of those are for setting custom metadata in the form "X-Amz-Meta-Key: value". This now works for multipart uploads and single part uploads See also #59	2020-06-10 12:28:48 +01:00
Kamil Trzciński	7458d37d2a	s3: add `max_upload_parts` support - fixes #4159 * s3: add `max_upload_parts` support This allows to configure a maximum amount of chunks used to upload file: - Support Scaleway which has a limit of 1k chunks currently - Reduce a cost on S3 when each request costs some money at the expense of memory used Co-authored-by: Nick Craig-Wood <nick@craig-wood.com>	2020-06-08 18:22:34 +01:00
Roman Kredentser	c0521791db	s3: implement link sharing with PublicLink	2020-06-05 14:51:05 +01:00
Nick Craig-Wood	151f03378f	s3: fix upload of single files into buckets without create permission Before this change, attempting to upload a single file into an s3 bucket which did not have create permission gave AccessDenied: Access Denied error when it tried to create the bucket. This was masked until `e2bf91452a` was fixed. This fix marks the bucket as OK if a fetch on an object indicates it is OK. This stops rclone thinking it has to create the bucket in the first place. Fixes #4297	2020-06-02 14:33:21 +01:00
Martin Michlmayr	4aee962233	doc: fix typos throughout docs and code	2020-05-20 15:54:51 +01:00
Nick Craig-Wood	8a58e0235d	s3: don't leak memory or tokens in edge cases for multipart upload	2020-05-14 07:48:18 +01:00
Nick Craig-Wood	4e869e03f7	s3: improve docs for --s3-disable-checksum	2020-04-28 17:47:10 +01:00
Tim Gallant	5cb7229a16	s3: add support for HTTPOption	2020-04-23 11:07:21 +01:00
Nick Craig-Wood	f8039deb7c	s3: fix detection of BucketAlreadyOwnedByYou and BucketAlreadyExists error This was being silently ignored until this commit `e2bf91452a` s3: report errors on bucket creation (mkdir) correctly	2020-04-22 18:14:03 +01:00
Nick Craig-Wood	e2bf91452a	s3: report errors on bucket creation (mkdir) correctly Before this fix errors on bucket creation were being silently swallowed. See: https://forum.rclone.org/t/rclone-with-brand-new-aws-account-for-s3/15590	2020-04-15 13:13:13 +01:00
Michał Matczuk	6893ce0bbf	s3: do not resize buf on put to memBuf This is handled by Pool implementation.	2020-04-11 16:35:48 +01:00
Michał Matczuk	399cf18013	s3: use single memory pool Previously we had a map of pools for different chunk sizes. In practice the mapping is not very useful and requires a lock. Pools of size other that ChunkSize can only happen when we have a huge file (over 10k * ChunkSize). We need to have a bunch of identically sized huge files. In such case most likely ChunkSize should be increased. The mapping and its lock is replaced with a single initialised pool for ChunkSize, in other cases pool is allocated and freed on per file basis.	2020-04-11 16:34:05 +01:00
Jack Anderson	815ae7df45	backend/s3: add SSE-C support for AWS, Ceph, and MinIO	2020-03-31 18:16:45 +01:00
Nick Craig-Wood	a5c2f2c138	s3: ignore directory markers at the root also See: https://forum.rclone.org/t/issue-with-lsf-r-files-only-first-line-is-blank/15229/	2020-03-31 11:45:52 +01:00
Nick Craig-Wood	dc06973796	s3: use rclone's low level retries instead of AWS SDK to fix listing retries In `5470d34740` "backend/s3: use low-level-retries as the number of SDK retries" we switched over to using the AWS SDK low level retries instead of rclone's low level retry logic. This had the unfortunate attempt that retrying listings to correct XML Syntax errors failed on non S3 backends such as CEPH. The AWS SDK was also retrying the XML Syntax error request which doesn't make sense. This change turns off the AWS SDK retries in favour of just using rclone's retry logic.	2020-03-14 18:04:24 +00:00
Joachim Brandon LeBlanc	132ce94139	backend/s3: use the provided size parameter when allocating a new memory pool - fixes #4047 (#4049 )	2020-03-09 16:56:21 +00:00
Lars Lehtonen	fef2c6bf7a	backend/s3: replace deprecated session.New() with session.NewSession()	2020-03-05 11:34:10 +00:00
Aleksandar Jankovic	708b967f15	backend/s3: fix multipart abort context S3 couldn't abort multi-part upload when context is canceled because canceled context prevents abort request from being sent.	2020-02-25 12:11:32 +01:00
Aleksandar Janković	5470d34740	backend/s3: use low-level-retries as the number of SDK retries Amazon S3 is built to handle different kinds of workloads. In rare cases where S3 is not able to scale for whatever reason users will face status 500 errors. Main mechanism for handling these errors are retries. Amount of needed retries varies for each different use case. This change is making retries for s3 backend configurable by using --low-level-retries option.	2020-02-24 16:43:44 +01:00
Maciej Zimnoch	ac9cb50fdb	backend/s3: use memory pool for buffer allocations Currently each multipart upload allocated his own buffers, which after file upload was garbaged. Next files couldn't leverage already allocated memory which resulted in inefficent memory management. This change introduces backend memory pool keeping memory chunks which can be used during object operations. Fixes #3967	2020-02-24 13:32:32 +01:00
Michał Matczuk	e75c1f70bb	backend/s3: Added 500 as retryErrorCode The error code 500 Internal Error indicates that Amazon S3 is unable to handle the request at that time. The error code 503 Slow Down typically indicates that the requests to the S3 bucket are very high, exceeding the request rates described in Request Rate and Performance Guidelines. Because Amazon S3 is a distributed service, a very small percentage of 5xx errors are expected during normal use of the service. All requests that return 5xx errors from Amazon S3 can and should be retried, so we recommend that applications making requests to Amazon S3 have a fault-tolerance mechanism to recover from these errors. https://aws.amazon.com/premiumsupport/knowledge-center/http-5xx-errors-s3/	2020-02-12 11:43:18 +00:00
Michał Matczuk	19a4d74ee7	backend/s3: Fail fast multipart upload When a part upload request fails error is returned and gCtx is cancelled. This does not prevent from other parts being tried. They immediately fail due to a canceled context, but are retried by rclone anyway... Example AWS debug output ``` ----------------------------------------------------- 2020/02/11 14:12:17 DEBUG: Retrying Request s3/UploadPart, attempt 4 2020/02/11 14:12:17 DEBUG: Request s3/UploadPart Details: ---[ REQUEST POST-SIGN ]----------------------------- PUT /backuptest-rclone/huge/file.db?partNumber=11&uploadId=190939b4-3c43-4b98-ac11-92303e3f11b0 HTTP/1.1 Host: 192.168.100.99:9000 User-Agent: aws-sdk-go/1.23.8 (go1.13.1; linux; amd64) Content-Length: 5242880 Authorization: AWS4-HMAC-SHA256 Credential=miniouser/20200211/us-east-1/s3/aws4_request, SignedHeaders=content-length;content-md5;expect;host;x-amz-content-sha256;x-amz-date, Signature=3fc03a01f651cec09b05290459e9ceb26db9a8aa00c4e1b16e8cf5617eb81da8 Content-Md5: XzY+DlipXwbL6bvGYsXftg== Expect: 100-Continue X-Amz-Content-Sha256: c036cbb7553a909f8b8877d4461924307f27ecb66cff928eeeafd569c3887e29 X-Amz-Date: 20200211T131217Z Accept-Encoding: gzip ----------------------------------------------------- http://192.168.100.99:9000/backuptest-rclone/huge/file.db?partNumber=11&uploadId=190939b4-3c43-4b98-ac11-92303e3f11b0 2020/02/11 14:12:17 DEBUG: Response s3/UploadPart Details: ---[ RESPONSE ]-------------------------------------- HTTP/1.1 500 InternalServerError Content-Length: 0 ----------------------------------------------------- UploadPartWithContext() error InternalError: We encountered an internal error. Please try again status code: 500, request id: , host id: 2020/02/11 14:12:18 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled 2020/02/11 14:12:20 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled 2020/02/11 14:12:22 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled ``` This adds a fail fast behaviour in case the context was cancelled.	2020-02-12 11:40:34 +00:00
Nick Craig-Wood	90377f5e65	s3: Specify that Minio supports URL encoding in listings Thanks to @harshavardhana for pointing this out See #3934 for background	2020-02-09 12:03:20 +00:00
Dave Koston	9f99c20232	s3: Add StackPath Object Storage Support	2020-01-31 16:05:44 +00:00
Nick Craig-Wood	bafe7d5a73	backends: move encoding definitions from fs/encodings	2020-01-16 14:40:36 +00:00
Nick Craig-Wood	3c620d521d	backend: adjust backends to have encoding parameter Fixes #3761 Fixes #3836 Fixes #3841	2020-01-16 14:40:36 +00:00
Nick Craig-Wood	b6e86b2c7f	s3: fix missing x-amz-meta-md5chksum headers for multipart uploads This reverts "s3: fix DisableChecksum condition" which introduced the problem. This reverts commit `c05bb63f96`. The code was correct as it stands - the comment was incorrect and this commit updates it. See: https://forum.rclone.org/t/s3-upload-md5-check-sum/13706	2020-01-07 19:39:39 +00:00
Tennix	15d19131bd	s3: use aws web identity role provider	2020-01-05 19:49:31 +00:00
Nick Craig-Wood	9d993e584b	s3: force path style bucket access to off for AWS deprecation AWS are deprecating path style bucket access so rclone should stop using it by default for this provider. This change shouldn't break any workflows as all AWS endpoints support virtual hosted style lookups of buckets. It may even improve performance. See: https://aws.amazon.com/blogs/aws/amazon-s3-path-deprecation-plan-the-rest-of-the-story/	2020-01-05 17:53:45 +00:00
Nick Craig-Wood	7242c7ce95	s3: fix multipart upload uploading 0 length files This regression was introduced by the recent re-write of the s3 multipart upload code.	2020-01-05 12:32:55 +00:00
Nick Craig-Wood	7e6fac8b1e	s3: re-implement multipart upload to fix memory issues There have been quite a few reports of problems with the multipart uploader using too much memory and not retrying possible errors. Before this change the multipart uploader used the s3manager abstraction in the AWS SDK. There are numerous bug reports of this using up too much memory. This change re-implements a much simplified version of the s3manager code specialized for rclone's purposes. This should use much less memory and retry chunks properly. See: https://forum.rclone.org/t/memory-usage-s3-alike-to-glacier-without-big-directories/13563 See: https://forum.rclone.org/t/copy-from-local-to-s3-has-high-memory-usage/13405 See: https://forum.rclone.org/t/big-file-upload-to-s3-fails/13575	2020-01-03 22:19:28 +00:00
Thomas Kriechbaumer	584e705c0c	s3: introduce list_chunk option for bucket listing The S3 ListObject API returns paginated bucket listings, with "MaxKeys" items for each GET call. The default value is 1000 entries, but for buckets with millions of objects it might make sense to request more elements per request, if the backend supports it. This commit adds a "list_chunk" option for the user to specify a lower or higher value. This commit does not add safe guards around this value - if a user decides to request a too large list, it might result in connection timeouts (on the server or client). In AWS S3, there is a fixed limit of 1000, some other services might have one too. In Ceph, this can be configured in RadosGW.	2020-01-02 12:15:01 +00:00
Outvi V	db1c7f9ca8	s3: Add new region Asia Patific (Hong Kong)	2020-01-02 11:10:48 +00:00
Nick Craig-Wood	0ecb8bc2f9	s3: fix url decoding of NextMarker - fixes #3799 Before this patch we were failing to URL decode the NextMarker when url encoding was used for the listing. The result of this was duplicated listings entries for directories with >1000 entries where the NextMarker was a file containing a space.	2019-12-12 13:33:30 +00:00
Nick Craig-Wood	0d10640aaa	s3: add --s3-copy-cutoff for size to switch to multipart copy Before this change we used the same (relatively low limits) for server side copy as we did for multipart uploads. It doesn't make sense to use the same limits since no data is being downloaded or uploaded for a server side copy. This change introduces a new parameter --s3-copy-cutoff to control when the switch from single to multipart server size copy happens and defaults it to the maximum 5GB. This makes server side copies much more efficient. It also fixes the erroneous error when trying to set the modification time of a file bigger than 5GB. See #3778	2019-12-03 10:37:55 +00:00

1 2 3

117 Commits