소스 검색

[extractor/common] fallback on utf-8 when charset is not found

fixes #2721
Anisse Astier 11 년 전
부모
커밋
ec0fafbb19
1개의 변경된 파일4개의 추가작업 그리고 1개의 파일을 삭제
  1. 4 1
      youtube_dl/extractor/common.py

+ 4 - 1
youtube_dl/extractor/common.py

@@ -251,7 +251,10 @@ class InfoExtractor(object):
             with open(filename, 'wb') as outf:
                 outf.write(webpage_bytes)
 
-        content = webpage_bytes.decode(encoding, 'replace')
+        try:
+            content = webpage_bytes.decode(encoding, 'replace')
+        except LookupError:
+            content = webpage_bytes.decode('utf-8', 'replace')
 
         if (u'<title>Access to this site is blocked</title>' in content and
                 u'Websense' in content[:512]):