rclone

mirror of https://github.com/rclone/rclone.git synced 2024-11-29 03:45:25 +01:00

Author	SHA1	Message	Date
Nick Craig-Wood	5b5fdc6bc5	s3: add provider quirk --s3-might-gzip to fix corrupted on transfer: sizes differ Before this change, some files were giving this error when downloaded from Cloudflare and other providers. ERROR corrupted on transfer: sizes differ NNN vs MMM This is because these providers auto gzips the object when rclone wasn't expecting it to. (AWS does not gzip objects without their being uploaded gzipped). This patch adds a quirk to for fix the problem and a flag to control it. The quirk `might_gzip` is set to `true` for all providers except AWS. See: https://forum.rclone.org/t/s3-error-corrupted-on-transfer-sizes-differ-nnn-vs-mmm/33694/ Fixes: #6533	2022-11-04 16:53:32 +00:00
Nick Craig-Wood	028832ce73	s3: if bucket or object ACL is empty string then don't add X-Amz-Acl: header - fixes #5730 Before this fix it was impossible to stop rclone generating an X-Amx-Acl: header which is incompatible with GCS with uniform access control and is generally deprecated at AWS.	2022-11-03 17:06:24 +00:00
Philip Harvey	c7c9356af5	s3: stop setting object and bucket ACL to "private" if it is an empty string #5730	2022-11-03 17:06:24 +00:00
Anthony Pessy	10c884552c	s3: use different strategy to resolve s3 region The API endpoint GetBucketLocation requires top level permission. If we do an authenticated head request to a bucket, the bucket location will be returned in the HTTP headers. Fixes #5066	2022-11-02 11:48:08 +00:00
Nick Craig-Wood	fce22c0065	s3: add --s3-no-system-metadata to suppress read and write of system metadata See: https://forum.rclone.org/t/problems-with-content-disposition-and-backblaze-b2-using-s3/33292/	2022-10-14 11:12:04 +01:00
Bachue Zhou	66ed0ca726	s3: add Qiniu KODO to s3 provider list - fixes #6195	2022-10-13 15:49:22 +01:00
Nick Craig-Wood	90d23139f6	s3: drop binary metadata with an ERROR message Before this change, rclone would attempt to upload metadata with binary contents which fail to be uploaded by net/http. This checks the keys and values for validity as http header values before uploading. See: https://forum.rclone.org/t/invalid-metadata-key-names-result-in-a-failure-to-transfer-xattr-results-in-failure-to-upload-net-http-invalid-header-field-value-for-x-amz-meta-samba-pai/33406/	2022-10-13 12:00:45 +01:00
Nick Craig-Wood	cf0bf159ab	s3: try to keep the maximum precision in ModTime with --user-server-modtime Before this change if --user-server-modtime was in use the ModTime could change for an object as we receive it accurate to the nearest ms in listings, but only accurate to the nearest second in HEAD and GET requests. Normally AWS returns the milliseconds as .000 in listings, but if versions are in use it may not. Storj S3 also seems to return milliseconds. This patch tries to keep the maximum precision in the last modified time, so it doesn't update a last modified time with a truncated version if the times were the same to the nearest second. See: https://forum.rclone.org/t/cache-fingerprint-miss-behavior-leading-to-false-positive-stalen-cache/33404/	2022-10-12 09:18:10 +01:00
Richard Bateman	4f374bc264	s3: add --s3-sse-customer-key-base64 to supply keys with binary data Fixes #6400	2022-09-17 17:28:44 +01:00
Dmitry Deniskin	c080b39e47	s3: add support for IONOS Cloud Storage	2022-09-15 16:04:34 +01:00
Josh Soref	ce3b65e6dc	all: fix spelling across the project * abcdefghijklmnopqrstuvwxyz * accounting * additional * allowed * almost * already * appropriately * arise * bandwidth * behave * bidirectional * brackets * cached * characters * cloud * committing * concatenating * configured * constructs * current * cutoff * deferred * different * directory * disposition * dropbox * either way * error * excess * experiments * explicitly * externally * files * github * gzipped * hierarchies * huffman * hyphen * implicitly * independent * insensitive * integrity * libraries * literally * metadata * mimics * missing * modification * multipart * multiple * nightmare * nonexistent * number * obscure * ourselves * overridden * potatoes * preexisting * priority * received * remote * replacement * represents * reproducibility * response * satisfies * sensitive * separately * separator * specifying * string * successful * synchronization * syncing * šenfeld * take * temporarily * testcontents * that * the * themselves * throttling * timeout * transaction * transferred * unnecessary * using * webbrowser * which * with * workspace Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com>	2022-08-30 11:16:26 +02:00
Nick Craig-Wood	0501773db1	azureblob,b2,s3: fix chunksize calculations producing too many parts Before this fix, the chunksize calculator was using the previous size of the object, not the new size of the object to calculate the chunk sizes. This meant that uploading a replacement object which needed a new chunk size would fail, using too many parts. This fix fixes the calculator to take the size explicitly.	2022-08-09 12:57:38 +01:00
Nick Craig-Wood	ebe86c6cec	s3: add --s3-decompress flag to download gzip-encoded files Before this change, if an object compressed with "Content-Encoding: gzip" was downloaded, a length and hash mismatch would occur since the go runtime automatically decompressed the object on download. If --s3-decompress is set, this change erases the length and hash on compressed objects so they can be downloaded successfully, at the cost of not being able to check the length or the hash of the downloaded object. If --s3-decompress is not set the compressed files will be downloaded as-is providing compressed objects with intact size and hash information. See #2658	2022-08-05 16:45:23 +01:00
Nick Craig-Wood	4b981100db	s3: refactor to use generated code instead of reflection to copy structs	2022-08-05 16:45:23 +01:00
Nick Craig-Wood	4344a3e2ea	s3: implement --s3-version-at flag - Fixes #1776	2022-08-05 16:45:23 +01:00
Nick Craig-Wood	1542a979f9	s3: refactor f.list() to take an options struct as it had too many parameters	2022-08-05 16:45:23 +01:00
Nick Craig-Wood	81d242473a	s3: implement Purge to purge versions and `backend cleanup-hidden`	2022-08-05 16:45:23 +01:00
Nick Craig-Wood	0ae171416f	s3: implement --s3-versions flag - See #1776	2022-08-05 16:45:23 +01:00
Nick Craig-Wood	a59fa2977d	s3: factor different listing versions into separate objects	2022-08-05 16:42:30 +01:00
Nick Craig-Wood	7243918069	s3: implement backend versioning command to get/set bucket versioning	2022-08-05 16:42:30 +01:00
Nick Craig-Wood	6fd9e3d717	build: reformat comments to pass go1.19 vet See: https://go.dev/doc/go1.19#go-doc	2022-08-05 16:35:41 +01:00
Nick Craig-Wood	440d0cd179	s3: fix --s3-no-head panic: reflect: Elem of invalid type s3.PutObjectInput In `22abd785eb` s3: implement reading and writing of metadata #111 The reading information of objects was refactored to use the s3.HeadObjectOutput structure. Unfortunately the code branch with `--s3-no-head` was not tested otherwise this panic would have been discovered. This shows that this is path is not integration tested, so this adds a new integration test. Fixes #6322	2022-07-18 23:38:50 +01:00
albertony	986bb17656	staticcheck: awserr.BatchError is deprecated: Replaced with BatchedErrors	2022-07-04 11:24:59 +02:00
Nick Craig-Wood	22abd785eb	s3: implement reading and writing of metadata #111	2022-06-29 14:29:36 +01:00
Nick Craig-Wood	a692bd2cd4	s3: change metadata storage to normal map with lowercase keys	2022-06-29 14:29:36 +01:00
vyloy	326c43ab3f	s3: add IDrive e2 to provider list	2022-06-28 09:12:36 +01:00
Nick Craig-Wood	f7c36ce0f9	s3: unwrap SDK errors to reveal underlying errors on upload The SDK doesn't wrap errors in a Go standard way so they can't be unwrapped and tested for - eg fatal error. The code looks for a Serialization or RequestError and returns the unwrapped underlying error if possible. This fixes the fs/operations integration tests checking for fatal errors being returned.	2022-06-17 16:52:30 +01:00
Nick Craig-Wood	c85fbebce6	s3: simplify PutObject code to use the Request.SetStreamingBody method In this commit `e5974ac4b0` s3: use PutObject from the aws SDK to upload single part objects rclone was made to upload objects to s3 using PUT requests rather than using signed uploads. However this change missed the fact that there is a supported way to do this in the SDK using the SetStreamingBody method on the Request. This therefore reverts a lot of the previous commit to do with making an unsigned connection and other complication and uses the SDK facility.	2022-06-16 23:26:19 +01:00
Nick Craig-Wood	fa48b880c2	s3: retry RequestTimeout errors See: https://forum.rclone.org/t/s3-failed-upload-large-files-bad-request-400/27695	2022-06-16 22:13:50 +01:00
Maciej Radzikowski	2e91287b2e	docs/s3: add note about chunk size decreasing progress accuracy	2022-06-16 22:29:36 +02:00
m00594701	02b4638a22	backend: add Huawei OBS to s3 provider list	2022-06-14 09:21:01 +01:00
albertony	ec117593f1	Fix lint issues reported by staticcheck Used staticcheck 2022.1.2 (v0.3.2) See: staticcheck.io	2022-06-13 21:13:50 +02:00
Alex JOST	a34276e9b3	s3: Add Warsaw location for Scaleway Add new location in Warsaw (Poland) to endpoints for Scaleway. More Information: https://blog.scaleway.com/scaleway-is-now-in-warsaw/ https://www.scaleway.com/en/docs/storage/object/how-to/create-a-bucket/	2022-05-19 14:06:16 +01:00
Nick Craig-Wood	813a5e0931	s3: Remove bucket ACL configuration for Cloudflare R2 Bucket ACLs are not supported by Cloudflare R2. All buckets are private and must be shared using a Cloudflare Worker.	2022-05-17 15:57:09 +01:00
Eng Zer Jun	4f0ddb60e7	refactor: replace strings.Replace with strings.ReplaceAll strings.ReplaceAll(s, old, new) is a wrapper function for strings.Replace(s, old, new, -1). But strings.ReplaceAll is more readable and removes the hardcoded -1. Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2022-05-17 11:08:37 +01:00
Derek Battams	fb4f7555c7	s3: use chunksize lib to determine chunksize dynamically	2022-05-13 09:25:48 +01:00
Vincent Murphy	319ac225e4	s3: backend restore command to skip non-GLACIER objects	2022-05-12 20:42:37 +01:00
Nick Craig-Wood	6f91198b57	s3: Support Cloudflare R2 - fixes #5642	2022-05-12 08:49:20 +01:00
Nick Craig-Wood	e5974ac4b0	s3: use PutObject from the aws SDK to upload single part objects Before this change rclone used presigned requests to upload single part objects. This was because of a limitation in the SDK which didn't allow non seekable io.Readers to be passed in. This is incompatible with some S3 backends, and rclone wasn't adding the `X-Amz-Content-Sha256: UNSIGNED-PAYLOAD` header which was incompatible with other S3 backends. The SDK now allows for this so rclone can use PutObject directly. This sets the `X-Amz-Content-Sha256: UNSIGNED-PAYLOAD` flag on the PUT request. However rclone will add a `Content-Md5` header if at all possible so the body data is still protected. Note that the old behaviour can still be configured if required with the `use_presigned_request` config parameter. Fixes #5422	2022-05-12 08:49:20 +01:00
ehsantdy	a446106041	s3: update Arvancloud default values and correct docs	2022-05-02 16:04:01 +01:00
ehsantdy	e34c543660	s3: Add ArvanCloud AOS to provider list	2022-04-28 10:42:30 +01:00
Nick Craig-Wood	603e51c43f	s3: sync providers in config description with providers	2022-03-31 17:55:54 +01:00
GuoXingbin	c2bfda22ab	s3: Add ChinaMobile EOS to provider list China Mobile Ecloud Elastic Object Storage (EOS) is a cloud object storage service, and is fully compatible with S3. Fixes #6054	2022-03-24 11:57:00 +00:00
Nick Craig-Wood	189cba0fbe	s3: add other regions for Lyve and correct Provider name	2022-03-14 15:43:35 +00:00
Nick Craig-Wood	6a6d254a9f	s3: add support for Seagate Lyve Cloud storage	2022-03-09 11:30:55 +00:00
Nick Craig-Wood	537b62917f	s3: add --s3-use-multipart-etag provider quirk #5993 Before this change the new multipart upload ETag checking code was failing in the integration tests with Alibaba OSS. Apparently Alibaba calculate the ETag in a different way to AWS. This introduces a new provider quirk with a flag to disable the checking of the ETag for multipart uploads. Mulpart Etag checking has been enabled for all providers that we can test for and work, and left disabled for the others.	2022-03-01 16:36:39 +00:00
Nick Craig-Wood	8f164e4df5	s3: Use the ETag on multipart transfers to verify the transfer was OK Before this rclone ignored the ETag on multipart uploads which missed an opportunity for a whole file integrity check. This adds that check which means that we now check even harder that multipart uploads have arrived properly. See #5993	2022-02-25 16:19:03 +00:00
Márton Elek	25ea04f1db	s3: add specific provider for Storj Shared gateways - unsupported features (Copy) are turned off for Storj - enable urlEncodedListing for Storj provider - set chunksize to 64Mb	2022-02-08 11:40:29 +00:00
Nick Craig-Wood	051685baa1	s3: fix multipart upload with --no-head flag - Fixes #5956 Before this change a multipart upload with the --no-head flag returned the MD5SUM as a base64 string rather than a Hex string as the rest of rclone was expecting.	2022-01-29 12:48:51 +00:00
Yunhai Luo	408d9f3e7a	s3: Add GLACIER_IR storage class	2021-12-03 14:46:45 +00:00
Logeshwaran Murugesan	bcf0e15ad7	Simplify content length processing in s3 with download url	2021-11-25 12:03:14 +00:00
lindwurm	b5abbe819f	s3: Add Wasabi AP Northeast 2 endpoint info * Wasabi starts to provide AP Northeast 2 (Osaka) endpoint, so add it to the list * Rename ap-northeast-1 as "AP Northeast 1 (Tokyo)" from "AP Northeast" Signed-off-by: lindwurm <lindwurm.q@gmail.com>	2021-11-22 18:02:57 +00:00
bbabich	b16f603c51	s3: Add RackCorp object storage to providers	2021-11-09 11:46:58 +00:00
Nick Craig-Wood	e43b5ce5e5	Remove github.com/pkg/errors and replace with std library version This is possible now that we no longer support go1.12 and brings rclone into line with standard practices in the Go world. This also removes errors.New and errors.Errorf from lib/errors and prefers the stdlib errors package over lib/errors.	2021-11-07 11:53:30 +00:00
Nick Craig-Wood	454574e2cc	s3: collect the provider quirks into a single function and update This removes the checks against the provider throughout the code and puts them into a single setQuirks function for easy maintenance when adding a new provider. It also updates the quirks with the results of testing against backends we have access to. This also adds a list_url_encode parameter so that quirk can be manually set.	2021-11-03 21:44:09 +00:00
Nick Craig-Wood	8d92f7d697	s3: fallback to ListObject v1 on unsupported providers This implements a quirks system for providers and notes which providers we have tested to support ListObjectsV2. For those providers which don't support ListObjectsV2 we use the original ListObjects call.	2021-11-03 19:13:50 +00:00
Felix Bünemann	fd56abc5f2	s3: Use ListObjectsV2 for faster listings Using ListObjectsV2 with a continuation token is about 5-6x faster than ListObjectsV2 with a marker.	2021-11-03 19:13:50 +00:00
Nick Craig-Wood	cf2c2792e6	s3: fix corrupted on transfer: sizes differ 0 vs xxxx with Ceph In this commit, released in 1.56.0 we started reading the size of the object from the Content-Length header as returned by the GET request to read the object. `4401d180aa` s3: add --s3-no-head-object However some object storage systems, notably Ceph, don't return a Content-Length header. The new code correctly calls the setMetaData function with a nil pointer to the ContentLength. However due to this commit from 2014, released in v1.18, the setMetaData function was not ignoring the size as it should have done. `0da6f24221` s3: use official github.com/aws/aws-sdk-go including multipart upload #101 This commit correctly ignores the content length if not set. Fixes #5732	2021-10-30 12:01:09 +01:00
Nick Craig-Wood	e6e1c49b58	s3: fix shared_credentials_file auth after reverting incorrect fix #5762 Before this change the `shared_credentials_file` config option was being ignored. The correct value is passed into the SDK but it only sets the credentials in the default provider. Unfortunately we wipe the default provider in order to install our own chain if env_auth is true. This patch restores the shared credentials file in the session options, exactly the same as how we restore the profile. Original fix: `1605f9e14d` s3: Fix shared_credentials_file auth	2021-10-30 11:54:17 +01:00
Nick Craig-Wood	712f9c9760	s3: fix IAM Role for Service Account not working and other auth problems This patch reverts this commit `1605f9e14d` s3: Fix shared_credentials_file auth It unfortunately had the side effect of making the s3 SDK ignore the config in our custom chain and use the default provider. This means that advanced auth was being ignored such as --s3-profile with role_arn. Fixes #5468 Fixes #5762	2021-10-30 11:54:17 +01:00
albertony	e2f47ecdeb	docs: punctuation cleanup See #5538	2021-10-20 22:56:19 +02:00
Nick Craig-Wood	f5c7c597ba	s3: Use a combination of SDK retries and rclone retries - fixes #5509 This reverts commit `dc06973796` Revert "s3: use rclone's low level retries instead of AWS SDK to fix listing retries" Which in turn reverted `5470d34740` "backend/s3: use low-level-retries as the number of SDK retries" So we are back where we started. It then modifies it to set the AWS SDK to `--low-level-retries` retries, but set the rclone retries to 2 so that directory listings can be retried.	2021-10-19 20:12:17 +01:00
Logeshwaran	ceaafe6620	s3: add support to use CDN URL to download the file The egress charges while using a CloudFront CDN url is cheaper when compared to accessing the file directly from S3. So added a download URL advanced option, which when set downloads the file using it.	2021-10-14 11:19:38 +01:00
Tatsuya Noyori	05f128868f	azureblob: add --azureblob-no-head-object	2021-09-06 10:41:54 +01:00
hota	839c20bb35	s3: add Wasabi's AP-Northeast endpoint info * Wasabi starts to provide AP Northeast (Tokyo) endpoint for all customers, so add it to the list Signed-off-by: lindwurm <lindwurm.q@gmail.com>	2021-08-01 14:56:52 +01:00
Chuan Zh	ba836d45ff	s3: update Alibaba OSS endpoints	2021-07-08 12:03:04 +01:00
Chris Lu	1f846c18d4	s3: Add SeaweedFS	2021-06-08 09:59:57 +01:00
Nick Craig-Wood	c0cda087a8	s3: don't check to see if remote is object if it ends with / Before this change, rclone would always check the root to see if it was an object. This change doesn't check to see if the root is an object if the path ends with a / This avoids a transaction where rclone HEADs the path to see if it exists. See #4990	2021-05-17 16:43:34 +01:00
Tatsuya Noyori	4401d180aa	s3: add --s3-no-head-object This stops rclone doing any HEAD requests on objects.	2021-04-28 11:05:54 +01:00
albertony	2925e1384c	Use binary prefixes for size and rate units Includes adding support for additional size input suffix Mi and MiB, treated equivalent to M. Extends binary suffix output with letter i, e.g. Ki and Mi. Centralizes creation of bit/byte unit strings.	2021-04-27 02:25:52 +03:00
Nick Craig-Wood	e618ea83dd	s3: remove WebIdentityRoleProvider to fix crash on auth #5255 This code removes the code added in `15d19131bd` s3: use aws web identity role provider This code no longer works because it doesn't initialise the tokenFetcher - leading to a nil pointer crash. The proper way to initialise this is with the NewWebIdentityCredentials but it isn't clear where to get the other parameters: roleARN, roleSessionName, path. In the linked issue a user reports rclone working with EKS anyway, so perhaps this code is no longer needed. If it is needed, hopefully someone who knows AWS better will come along and fix it! See: https://forum.rclone.org/t/add-support-for-aws-sso/23569	2021-04-26 16:55:50 +01:00
Nick Craig-Wood	b9a015e5b9	s3: fix --s3-profile which wasn't working - fixes #4757	2021-03-16 16:25:07 +00:00
Nick Craig-Wood	f2c0f82fc6	backends: Add context checking to remaining backends #4504 This is a follow up to `4013bc4a4c` which missed some backends. It adds a ctx parameter to shouldRetry and checks it.	2021-03-16 16:17:22 +00:00
Nick Craig-Wood	f7e3115955	s3: fix Wasabi HEAD requests returning stale data by using only 1 transport In this commit `fc5b14b620` s3: Added `--s3-disable-http2` to disable http/2 We created our own transport so we could disable http/2. However the added function is called twice meaning that we create two HTTP transports. This didn't happen with the original code because the default transport is cached by fshttp. Rclone normally does a PUT followed by a HEAD request to check an upload has been successful. With the two transports, the PUT and the HEAD were being done on different HTTP transports. This means that it wasn't re-using the same HTTP connection, so the HEAD request showed the previous object value. This caused rclone to declare the upload was corrupted, delete the object and try again. This patch makes sure we only create one transport and use it for both PUT and HEAD requests which fixes the problem with Wasabi. See: https://forum.rclone.org/t/each-time-rclone-is-run-1-3-fails-2-3-succeeds/22545	2021-03-05 15:34:56 +00:00
Nick Craig-Wood	b029fb591f	s3: fix failed to create file system with folder level permissions policy Before this change, if folder level access permissions policy was in use, with trailing `/` marking the folders then rclone would HEAD the path without a trailing `/` to work out if it was a file or a folder. This returned a permission denied error, which rclone returned to the user. Failed to create file system for "s3:bucket/path/": Forbidden: Forbidden status code: 403, request id: XXXX, host id: Previous to this change `53aa03cc44` s3: complete sse-c implementation rclone would assume any errors when HEAD-ing the object implied it didn't exist and this test would not fail. This change reverts the functionality of the test to work as it did before, meaning any errors on HEAD will make rclone assume the object does not exist and the path is referring to a directory. Fixes #4990	2021-02-24 20:35:44 +00:00
Dmitry Chepurovskiy	1605f9e14d	s3: Fix shared_credentials_file auth S3 backend shared_credentials_file option wasn't working neither from config option nor from command line option. This was caused cause shared_credentials_file_provider works as part of chain provider, but in case user haven't specified access_token and access_key we had removed (set nil) to credentials field, that may contain actual credentials got from ChainProvider. AWS_SHARED_CREDENTIALS_FILE env varible as far as i understood worked, cause aws_sdk code handles it as one of default auth options, when there's not configured credentials.	2021-02-17 12:04:26 +00:00
Nick Craig-Wood	bbe791a886	swift: update github.com/ncw/swift to v2.0.0 The update to v2 of the swift library introduces a context parameter to each function. This required a lot of mostly mechanical changes adding context parameters. See: https://github.com/ncw/swift/issues/159 See: https://github.com/ncw/swift/issues/161	2021-02-03 20:23:37 +00:00
Nick Craig-Wood	bcac8fdc83	Use http.NewRequestWithContext where possible after go1.13 minimum version	2021-02-03 17:41:27 +00:00
Nick Craig-Wood	8b41dfa50a	s3: add --s3-no-head parameter to minimise transactions on upload See: https://forum.rclone.org/t/prevent-head-on-amazon-s3-family/21935	2021-02-02 10:07:48 +00:00
Nick Craig-Wood	3877df4e62	s3: update help for --s3-no-check-bucket #4913	2021-01-10 17:54:19 +00:00
kelv	9e87f5090f	s3: add requester pays option - fixes #301	2020-12-27 15:43:44 +00:00
Anagh Kumar Baranwal	8a429d12cf	s3: Added error handling for error code 429 indicating too many requests Signed-off-by: Anagh Kumar Baranwal <6824881+darthShadow@users.noreply.github.com>	2020-12-01 18:13:31 +00:00
Nick Craig-Wood	9d574c0d63	fshttp: read config from ctx not passed in ConfigInfo #4685	2020-11-26 16:40:12 +00:00
Nick Craig-Wood	2e21c58e6a	fs: deglobalise the config #4685 This is done by making fs.Config private and attaching it to the context instead. The Config should be obtained with fs.GetConfig and fs.AddConfig should be used to get a new mutable config that can be changed.	2020-11-26 16:40:12 +00:00
Nick Craig-Wood	76ee3060d1	s3: Add MD5 metadata to objects uploaded with SSE-AWS/SSE-C Before this change, small objects uploaded with SSE-AWS/SSE-C would not have MD5 sums. This change adds metadata for these objects in the same way that the metadata is stored for multipart uploaded objects. See: #1824 #2827	2020-11-25 12:28:02 +00:00
Nick Craig-Wood	4bb241c435	s3: store md5 in the Object rather than the ETag This enables us to set the md5 to cache it. See: #1824 #2827	2020-11-25 12:28:02 +00:00
Nick Craig-Wood	a06f4c2514	s3: fix hashes on small files with aws:kms and sse-c If rclone is configured for server side encryption - either aws:kms or sse-c (but not sse-s3) then don't treat the ETags returned on objects as MD5 hashes. This fixes being able to upload small files. Fixes #1824	2020-11-25 12:28:02 +00:00
Nick Craig-Wood	53aa03cc44	s3: complete sse-c implementation This now can complete all operations with SSE-C enabled. Fixes #2827 See: https://forum.rclone.org/t/issues-with-aws-s3-sse-c-getting-strange-log-entries-and-errors/20553	2020-11-25 12:28:02 +00:00
Nick Craig-Wood	8b96933e58	fs: Add context to fs.Features.Fill & fs.Features.Mask #3257 #4685	2020-11-09 18:05:54 +00:00
Nick Craig-Wood	d846210978	fs: Add context to NewFs #3257 #4685 This adds a context.Context parameter to NewFs and related calls. This is necessary as part of reading config from the context - backends need to be able to read the global config.	2020-11-09 18:05:54 +00:00
Josh Soref	0a6196716c	docs: style: avoid double-nesting parens Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	a15f50254a	docs: grammar: if, then Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	5d4f77a022	docs: grammar: Oxford comma Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	a089de0964	docs: grammar: uncountable: links Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	3068ae8447	docs: grammar: count agreement: files Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	67ff153b0c	docs: grammar: article: a-file Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	e4a87f772f	docs: spelling: e.g. Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	d4f38d45a5	docs: spelling: high-speed Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	bbe7eb35f1	docs: spelling: server-side Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-28 18:16:23 +00:00
Josh Soref	d0888edc0a	Spelling fixes Fix spelling of: above, already, anonymous, associated, authentication, bandwidth, because, between, blocks, calculate, candidates, cautious, changelog, cleaner, clipboard, command, completely, concurrently, considered, constructs, corrupt, current, daemon, dependencies, deprecated, directory, dispatcher, download, eligible, ellipsis, encrypter, endpoint, entrieslist, essentially, existing writers, existing, expires, filesystem, flushing, frequently, hierarchy, however, implementation, implements, inaccurate, individually, insensitive, longer, maximum, metadata, modified, multipart, namedirfirst, nextcloud, obscured, opened, optional, owncloud, pacific, passphrase, password, permanently, persimmon, positive, potato, protocol, quota, receiving, recommends, referring, requires, revisited, satisfied, satisfies, satisfy, semver, serialized, session, storage, strategies, stringlist, successful, supported, surprise, temporarily, temporary, transactions, unneeded, update, uploads, wrapped Signed-off-by: Josh Soref <jsoref@users.noreply.github.com>	2020-10-14 15:21:31 +01:00
Anagh Kumar Baranwal	fc5b14b620	s3: Added `--s3-disable-http2` to disable http/2 Fixes #4673 Signed-off-by: Anagh Kumar Baranwal <6824881+darthShadow@users.noreply.github.com>	2020-10-13 17:11:22 +01:00
Anagh Kumar Baranwal	e3a5bb9b48	s3: Add missing regions for AWS Signed-off-by: Anagh Kumar Baranwal <6824881+darthShadow@users.noreply.github.com>	2020-10-06 16:54:42 +01:00
Christopher Stewart	f3cf6fcdd7	s3: fix spelling mistake Fix spelling mistake "patific" => "pacific"	2020-09-18 12:03:13 +01:00
wjielai	22937e8982	docs: add Tencent COS to s3 provider list - fixes #4468 * add Tencent COS to s3 provider list. Co-authored-by: wjielai <wjielai@tencent.com>	2020-09-08 16:34:25 +01:00
Nick Craig-Wood	725ae91387	s3: reduce the default --s3-copy-cutoff to < 5GB The maximum value for the --s3--copy-cutoff should be 5GiB as tested with AWS S3. However b2 have implemented this as 5GB rather than 5GiB so having the default at 5 GiB makes the b2s3 server side copy of a large file by default. This patch sets the default to 4768 MiB which is slightly less than 5GB. This should have very little effect on anything. If in future rclone can lower this limit more if Copy can multithread. See: https://forum.rclone.org/t/copying-files-within-a-b2-bucket/16680/76	2020-09-01 18:53:29 +01:00
Nick Craig-Wood	b7dd3ce608	s3: preserve metadata when doing multipart copy Before this change the s3 multipart server side copy was not preserving the metadata of the object. This was most noticeable because the modtime was not preserved. This change fetches the metadata from the object before starting the copy and overwrites it if requires. It will also mean any other metadata is preserved. See: https://forum.rclone.org/t/copying-files-within-a-b2-bucket/16680/70	2020-09-01 18:39:30 +01:00
Egor Margineanu	921e384c4d	s3: update IBM COS endpoints - fixes #4522	2020-08-30 17:21:11 +01:00
Nick Craig-Wood	801a820c54	s3: fix detection of bucket existing This reverts part of `151f03378f` s3: fix upload of single files into buckets without create permission This erroneously assumed that a HEAD request on a non existent object would return "NotFound" if the bucket was found. In fact it returns "NotFound" when the bucket isn't found also. This will break the fix for #4297 - however that can be made to work using the new --s3-assume-bucket-exists flag	2020-08-21 13:28:08 +01:00
Nick Craig-Wood	d5f4c74697	s3: implement cleanup and backend command to list & remove multipart uploads This implements `rclone cleanup` to remove multipart uploads over 24 hours old. It also implements the backend command `list-multipart-uploads` to see which ones are available and `cleanup` to delete them with a configurable expiry interval. See #4302	2020-07-28 11:37:46 +01:00
Nick Craig-Wood	2288a5c617	s3: implement `profile` and `shared_credentials_file` options It is impossible to use two different profiles at the same time - these config vars enable that. See: https://forum.rclone.org/t/s3-source-destination-named-profile/17417	2020-07-28 11:32:32 +01:00
Nick Craig-Wood	f406dbbb4d	s3: add --s3-no-check-bucket for minimising rclone transactions and perms Fixes #4449	2020-07-27 17:49:40 +01:00
Nick Craig-Wood	80d2f38192	s3: fix bucket Region auto detection when Region unset in config #2915 Previous to this fix if Region was not set and Endpoint was not set then we set the endpoint to "https://s3.amazonaws.com/". This is unecessary because if the Region alone isn't set then we set it to "us-east-1" which has the same endpoint. Having the endpoint set breaks the bucket region auto detection with the error "Failed to update region for bucket: can't set region to "xxx" as endpoint is set". This fix removes that check.	2020-07-10 17:16:59 +01:00
Nick Craig-Wood	c820576329	fs: define SlowModTime and SlowHash features in the relevant backends	2020-06-30 12:01:36 +01:00
David	9058ec32e1	s3: Use regional s3 us-east-1 endpoint	2020-06-26 16:25:52 +01:00
Nick Craig-Wood	fd7c63bc78	s3: add backend restore command to restore objects from GLACIER See: https://forum.rclone.org/t/rclone-settier-fails-with-scaleway-entitytoolarge/17384	2020-06-25 21:33:23 +01:00
Nick Craig-Wood	5f75444ef6	s3: cancel in progress multipart uploads and copies on rclone exit #4300	2020-06-25 12:55:56 +01:00
Nick Craig-Wood	85bcacac90	s3: Cap expiry duration to 1 Week and return error when sharing dir	2020-06-18 17:50:50 +01:00
Vincent Feltz	f4d7e41f24	s3: add Scaleway provider - fixes #4338	2020-06-13 11:55:37 +01:00
Nick Craig-Wood	2ea15a72bc	s3: fix --header-upload - Fixes #4303 Before this change we were setting the headers on the PUT request for normal and multipart uploads. For normal uploads this caused the error 403 Forbidden: There were headers present in the request which were not signed After this fix we set the headers in the object upload request itself as the s3 SDK expects. This means that we only support a limited range of headers - Cache-Control - Content-Disposition - Content-Encoding - Content-Language - Content-Type - X-Amz-Tagging - X-Amz-Meta- Note for the last of those are for setting custom metadata in the form "X-Amz-Meta-Key: value". This now works for multipart uploads and single part uploads See also #59	2020-06-10 12:28:48 +01:00
Kamil Trzciński	7458d37d2a	s3: add `max_upload_parts` support - fixes #4159 * s3: add `max_upload_parts` support This allows to configure a maximum amount of chunks used to upload file: - Support Scaleway which has a limit of 1k chunks currently - Reduce a cost on S3 when each request costs some money at the expense of memory used Co-authored-by: Nick Craig-Wood <nick@craig-wood.com>	2020-06-08 18:22:34 +01:00
Roman Kredentser	c0521791db	s3: implement link sharing with PublicLink	2020-06-05 14:51:05 +01:00
Nick Craig-Wood	151f03378f	s3: fix upload of single files into buckets without create permission Before this change, attempting to upload a single file into an s3 bucket which did not have create permission gave AccessDenied: Access Denied error when it tried to create the bucket. This was masked until `e2bf91452a` was fixed. This fix marks the bucket as OK if a fetch on an object indicates it is OK. This stops rclone thinking it has to create the bucket in the first place. Fixes #4297	2020-06-02 14:33:21 +01:00
Martin Michlmayr	4aee962233	doc: fix typos throughout docs and code	2020-05-20 15:54:51 +01:00
Nick Craig-Wood	8a58e0235d	s3: don't leak memory or tokens in edge cases for multipart upload	2020-05-14 07:48:18 +01:00
Nick Craig-Wood	4e869e03f7	s3: improve docs for --s3-disable-checksum	2020-04-28 17:47:10 +01:00
Tim Gallant	5cb7229a16	s3: add support for HTTPOption	2020-04-23 11:07:21 +01:00
Nick Craig-Wood	f8039deb7c	s3: fix detection of BucketAlreadyOwnedByYou and BucketAlreadyExists error This was being silently ignored until this commit `e2bf91452a` s3: report errors on bucket creation (mkdir) correctly	2020-04-22 18:14:03 +01:00
Nick Craig-Wood	e2bf91452a	s3: report errors on bucket creation (mkdir) correctly Before this fix errors on bucket creation were being silently swallowed. See: https://forum.rclone.org/t/rclone-with-brand-new-aws-account-for-s3/15590	2020-04-15 13:13:13 +01:00
Michał Matczuk	6893ce0bbf	s3: do not resize buf on put to memBuf This is handled by Pool implementation.	2020-04-11 16:35:48 +01:00
Michał Matczuk	399cf18013	s3: use single memory pool Previously we had a map of pools for different chunk sizes. In practice the mapping is not very useful and requires a lock. Pools of size other that ChunkSize can only happen when we have a huge file (over 10k * ChunkSize). We need to have a bunch of identically sized huge files. In such case most likely ChunkSize should be increased. The mapping and its lock is replaced with a single initialised pool for ChunkSize, in other cases pool is allocated and freed on per file basis.	2020-04-11 16:34:05 +01:00
Jack Anderson	815ae7df45	backend/s3: add SSE-C support for AWS, Ceph, and MinIO	2020-03-31 18:16:45 +01:00
Nick Craig-Wood	a5c2f2c138	s3: ignore directory markers at the root also See: https://forum.rclone.org/t/issue-with-lsf-r-files-only-first-line-is-blank/15229/	2020-03-31 11:45:52 +01:00
Nick Craig-Wood	dc06973796	s3: use rclone's low level retries instead of AWS SDK to fix listing retries In `5470d34740` "backend/s3: use low-level-retries as the number of SDK retries" we switched over to using the AWS SDK low level retries instead of rclone's low level retry logic. This had the unfortunate attempt that retrying listings to correct XML Syntax errors failed on non S3 backends such as CEPH. The AWS SDK was also retrying the XML Syntax error request which doesn't make sense. This change turns off the AWS SDK retries in favour of just using rclone's retry logic.	2020-03-14 18:04:24 +00:00
Joachim Brandon LeBlanc	132ce94139	backend/s3: use the provided size parameter when allocating a new memory pool - fixes #4047 (#4049 )	2020-03-09 16:56:21 +00:00
Lars Lehtonen	fef2c6bf7a	backend/s3: replace deprecated session.New() with session.NewSession()	2020-03-05 11:34:10 +00:00
Aleksandar Jankovic	708b967f15	backend/s3: fix multipart abort context S3 couldn't abort multi-part upload when context is canceled because canceled context prevents abort request from being sent.	2020-02-25 12:11:32 +01:00
Aleksandar Janković	5470d34740	backend/s3: use low-level-retries as the number of SDK retries Amazon S3 is built to handle different kinds of workloads. In rare cases where S3 is not able to scale for whatever reason users will face status 500 errors. Main mechanism for handling these errors are retries. Amount of needed retries varies for each different use case. This change is making retries for s3 backend configurable by using --low-level-retries option.	2020-02-24 16:43:44 +01:00
Maciej Zimnoch	ac9cb50fdb	backend/s3: use memory pool for buffer allocations Currently each multipart upload allocated his own buffers, which after file upload was garbaged. Next files couldn't leverage already allocated memory which resulted in inefficent memory management. This change introduces backend memory pool keeping memory chunks which can be used during object operations. Fixes #3967	2020-02-24 13:32:32 +01:00
Michał Matczuk	e75c1f70bb	backend/s3: Added 500 as retryErrorCode The error code 500 Internal Error indicates that Amazon S3 is unable to handle the request at that time. The error code 503 Slow Down typically indicates that the requests to the S3 bucket are very high, exceeding the request rates described in Request Rate and Performance Guidelines. Because Amazon S3 is a distributed service, a very small percentage of 5xx errors are expected during normal use of the service. All requests that return 5xx errors from Amazon S3 can and should be retried, so we recommend that applications making requests to Amazon S3 have a fault-tolerance mechanism to recover from these errors. https://aws.amazon.com/premiumsupport/knowledge-center/http-5xx-errors-s3/	2020-02-12 11:43:18 +00:00
Michał Matczuk	19a4d74ee7	backend/s3: Fail fast multipart upload When a part upload request fails error is returned and gCtx is cancelled. This does not prevent from other parts being tried. They immediately fail due to a canceled context, but are retried by rclone anyway... Example AWS debug output ``` ----------------------------------------------------- 2020/02/11 14:12:17 DEBUG: Retrying Request s3/UploadPart, attempt 4 2020/02/11 14:12:17 DEBUG: Request s3/UploadPart Details: ---[ REQUEST POST-SIGN ]----------------------------- PUT /backuptest-rclone/huge/file.db?partNumber=11&uploadId=190939b4-3c43-4b98-ac11-92303e3f11b0 HTTP/1.1 Host: 192.168.100.99:9000 User-Agent: aws-sdk-go/1.23.8 (go1.13.1; linux; amd64) Content-Length: 5242880 Authorization: AWS4-HMAC-SHA256 Credential=miniouser/20200211/us-east-1/s3/aws4_request, SignedHeaders=content-length;content-md5;expect;host;x-amz-content-sha256;x-amz-date, Signature=3fc03a01f651cec09b05290459e9ceb26db9a8aa00c4e1b16e8cf5617eb81da8 Content-Md5: XzY+DlipXwbL6bvGYsXftg== Expect: 100-Continue X-Amz-Content-Sha256: c036cbb7553a909f8b8877d4461924307f27ecb66cff928eeeafd569c3887e29 X-Amz-Date: 20200211T131217Z Accept-Encoding: gzip ----------------------------------------------------- http://192.168.100.99:9000/backuptest-rclone/huge/file.db?partNumber=11&uploadId=190939b4-3c43-4b98-ac11-92303e3f11b0 2020/02/11 14:12:17 DEBUG: Response s3/UploadPart Details: ---[ RESPONSE ]-------------------------------------- HTTP/1.1 500 InternalServerError Content-Length: 0 ----------------------------------------------------- UploadPartWithContext() error InternalError: We encountered an internal error. Please try again status code: 500, request id: , host id: 2020/02/11 14:12:18 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled 2020/02/11 14:12:20 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled 2020/02/11 14:12:22 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled ``` This adds a fail fast behaviour in case the context was cancelled.	2020-02-12 11:40:34 +00:00
Nick Craig-Wood	90377f5e65	s3: Specify that Minio supports URL encoding in listings Thanks to @harshavardhana for pointing this out See #3934 for background	2020-02-09 12:03:20 +00:00
Dave Koston	9f99c20232	s3: Add StackPath Object Storage Support	2020-01-31 16:05:44 +00:00
Nick Craig-Wood	bafe7d5a73	backends: move encoding definitions from fs/encodings	2020-01-16 14:40:36 +00:00
Nick Craig-Wood	3c620d521d	backend: adjust backends to have encoding parameter Fixes #3761 Fixes #3836 Fixes #3841	2020-01-16 14:40:36 +00:00
Nick Craig-Wood	b6e86b2c7f	s3: fix missing x-amz-meta-md5chksum headers for multipart uploads This reverts "s3: fix DisableChecksum condition" which introduced the problem. This reverts commit `c05bb63f96`. The code was correct as it stands - the comment was incorrect and this commit updates it. See: https://forum.rclone.org/t/s3-upload-md5-check-sum/13706	2020-01-07 19:39:39 +00:00
Tennix	15d19131bd	s3: use aws web identity role provider	2020-01-05 19:49:31 +00:00
Nick Craig-Wood	9d993e584b	s3: force path style bucket access to off for AWS deprecation AWS are deprecating path style bucket access so rclone should stop using it by default for this provider. This change shouldn't break any workflows as all AWS endpoints support virtual hosted style lookups of buckets. It may even improve performance. See: https://aws.amazon.com/blogs/aws/amazon-s3-path-deprecation-plan-the-rest-of-the-story/	2020-01-05 17:53:45 +00:00
Nick Craig-Wood	7242c7ce95	s3: fix multipart upload uploading 0 length files This regression was introduced by the recent re-write of the s3 multipart upload code.	2020-01-05 12:32:55 +00:00
Nick Craig-Wood	7e6fac8b1e	s3: re-implement multipart upload to fix memory issues There have been quite a few reports of problems with the multipart uploader using too much memory and not retrying possible errors. Before this change the multipart uploader used the s3manager abstraction in the AWS SDK. There are numerous bug reports of this using up too much memory. This change re-implements a much simplified version of the s3manager code specialized for rclone's purposes. This should use much less memory and retry chunks properly. See: https://forum.rclone.org/t/memory-usage-s3-alike-to-glacier-without-big-directories/13563 See: https://forum.rclone.org/t/copy-from-local-to-s3-has-high-memory-usage/13405 See: https://forum.rclone.org/t/big-file-upload-to-s3-fails/13575	2020-01-03 22:19:28 +00:00
Thomas Kriechbaumer	584e705c0c	s3: introduce list_chunk option for bucket listing The S3 ListObject API returns paginated bucket listings, with "MaxKeys" items for each GET call. The default value is 1000 entries, but for buckets with millions of objects it might make sense to request more elements per request, if the backend supports it. This commit adds a "list_chunk" option for the user to specify a lower or higher value. This commit does not add safe guards around this value - if a user decides to request a too large list, it might result in connection timeouts (on the server or client). In AWS S3, there is a fixed limit of 1000, some other services might have one too. In Ceph, this can be configured in RadosGW.	2020-01-02 12:15:01 +00:00

1 2 3 4 5 ...

320 Commits