From 6bd664854559fd4fbafeecff662bc8a8bd58b3dc Mon Sep 17 00:00:00 2001
From: =?UTF-8?q?Micka=C3=ABl=20Schoentgen?= <contact@tiger-222.fr>
Date: Fri, 6 Aug 2021 12:35:38 +0200
Subject: [PATCH] Simplify `get_content_type()` (#1125)

We were using the potential `encoding` returned by `mimetypes.guess_type()`
to expand the `Content-Type` header.
According to the RFC-7231 [1] the `Content-Type` should contain a charset
and nothing more. But as stated in the `mimetypes.guess_type()` doc [2],
the `encoding` would be the name of the program used to encode (e.g. compress
or gzip) the payload. The `encoding` is suitable for use as a `Content-Encoding`
header. See [3] for potential `encoding`s, none is a IANA registered one [4], and
so a valid charset to be used by the `Content-Type` header.

[1] https://httpwg.org/specs/rfc7231.html#header.content-type
[2] https://docs.python.org/3/library/mimetypes.html#guess_type
[3] https://github.com/python/cpython/blob/938e84b4fa410f1a86f5e0708ebc3af6bb8efb0e/Lib/mimetypes.py#L416-L422
[4] https://www.iana.org/assignments/character-sets/character-sets.xhtml
---
 httpie/utils.py | 7 +------
 1 file changed, 1 insertion(+), 6 deletions(-)

diff --git a/httpie/utils.py b/httpie/utils.py
index d828d488..7c8a598c 100644
--- a/httpie/utils.py
+++ b/httpie/utils.py
@@ -80,12 +80,7 @@ def get_content_type(filename):
     to ``mimetypes``.
 
     """
-    mime, encoding = mimetypes.guess_type(filename, strict=False)
-    if mime:
-        content_type = mime
-        if encoding:
-            content_type = f'{mime}; charset={encoding}'
-        return content_type
+    return mimetypes.guess_type(filename, strict=False)[0]
 
 
 def split_cookies(cookies):