loongson/pypi/: webencodings-0.5.1 metadata and description

Homepage Simple index

Character encoding aliases for legacy web content

author Simon Sapin
author_email simon.sapin@exyr.org
classifiers
  • Development Status :: 4 - Beta
  • Intended Audience :: Developers
  • License :: OSI Approved :: BSD License
  • Programming Language :: Python
  • Programming Language :: Python :: 2
  • Programming Language :: Python :: 2.6
  • Programming Language :: Python :: 2.7
  • Programming Language :: Python :: 3
  • Programming Language :: Python :: 3.3
  • Programming Language :: Python :: 3.4
  • Programming Language :: Python :: 3.5
  • Programming Language :: Python :: 3.6
  • Programming Language :: Python :: Implementation :: CPython
  • Programming Language :: Python :: Implementation :: PyPy
  • Topic :: Internet :: WWW/HTTP
license BSD
maintainer Geoffrey Sneddon
maintainer_email me@gsnedders.com
platform
  • UNKNOWN

Because this project isn't in the mirror_whitelist, no releases from root/pypi are included.

File Tox results History
webencodings-0.5.1-py2.py3-none-any.whl
Size
10 KB
Type
Python Wheel
Python
2.7
  • Replaced 2 time(s)
  • Uploaded to loongson/pypi by loongson 2022-08-08 02:48:28

This is a Python implementation of the WHATWG Encoding standard.

In order to be compatible with legacy web content when interpreting something like Content-Type: text/html; charset=latin1, tools need to use a particular set of aliases for encoding labels as well as some overriding rules. For example, US-ASCII and iso-8859-1 on the web are actually aliases for windows-1252, and an UTF-8 or UTF-16 BOM takes precedence over any other encoding declaration. The Encoding standard defines all such details so that implementations do not have to reverse-engineer each other.

This module has encoding labels and BOM detection, but the actual implementation for encoders and decoders is Python’s.