Package encodings

Standard "encodings" Package

    Standard Python encoding modules are stored in this package
    directory.

    Codec modules must have names corresponding to normalized encoding
    names as defined in the normalize_encoding() function below, e.g.
    'utf-8' must be implemented by the module 'utf_8.py'.

    Each codec module must export the following interface:

    * getregentry() -> codecs.CodecInfo object
    The getregentry() API must a CodecInfo object with encoder, decoder,
    incrementalencoder, incrementaldecoder, streamwriter and streamreader
    atttributes which adhere to the Python Codec Interface Standard.

    In addition, a module may optionally also define the following
    APIs which are then used by the package's codec search function:

    * getaliases() -> sequence of encoding name strings to use as aliases

    Alias names returned by getaliases() must be normalized encoding
    names as defined by normalize_encoding().

Written by Marc-Andre Lemburg (mal@lemburg.com).

(c) Copyright CNRI, All Rights Reserved. NO WARRANTY.

Submodules

[hide private]

encodings.aliases: Encoding Aliases Support
encodings.ascii: Python 'ascii' Codec
encodings.base64_codec: Python 'base64_codec' Codec - base64 content transfer encoding
encodings.big5
encodings.big5hkscs
encodings.bz2_codec: Python 'bz2_codec' Codec - bz2 compression encoding
encodings.charmap: Generic Python Character Mapping Codec.
encodings.cp037: Python Character Mapping Codec cp037 generated from 'MAPPINGS/VENDORS/MICSFT/EBCDIC/CP037.TXT' with gencodec.py.
encodings.cp1006: Python Character Mapping Codec cp1006 generated from 'MAPPINGS/VENDORS/MISC/CP1006.TXT' with gencodec.py.
encodings.cp1026: Python Character Mapping Codec cp1026 generated from 'MAPPINGS/VENDORS/MICSFT/EBCDIC/CP1026.TXT' with gencodec.py.
encodings.cp1140: Python Character Mapping Codec cp1140 generated from 'python-mappings/CP1140.TXT' with gencodec.py.
encodings.cp1250: Python Character Mapping Codec cp1250 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1250.TXT' with gencodec.py.
encodings.cp1251: Python Character Mapping Codec cp1251 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1251.TXT' with gencodec.py.
encodings.cp1252: Python Character Mapping Codec cp1252 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT' with gencodec.py.
encodings.cp1253: Python Character Mapping Codec cp1253 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1253.TXT' with gencodec.py.
encodings.cp1254: Python Character Mapping Codec cp1254 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1254.TXT' with gencodec.py.
encodings.cp1255: Python Character Mapping Codec cp1255 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1255.TXT' with gencodec.py.
encodings.cp1256: Python Character Mapping Codec cp1256 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1256.TXT' with gencodec.py.
encodings.cp1257: Python Character Mapping Codec cp1257 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1257.TXT' with gencodec.py.
encodings.cp1258: Python Character Mapping Codec cp1258 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1258.TXT' with gencodec.py.
encodings.cp424: Python Character Mapping Codec cp424 generated from 'MAPPINGS/VENDORS/MISC/CP424.TXT' with gencodec.py.
encodings.cp437: Python Character Mapping Codec cp437 generated from 'VENDORS/MICSFT/PC/CP437.TXT' with gencodec.py.
encodings.cp500: Python Character Mapping Codec cp500 generated from 'MAPPINGS/VENDORS/MICSFT/EBCDIC/CP500.TXT' with gencodec.py.
encodings.cp737: Python Character Mapping Codec cp737 generated from 'VENDORS/MICSFT/PC/CP737.TXT' with gencodec.py.
encodings.cp775: Python Character Mapping Codec cp775 generated from 'VENDORS/MICSFT/PC/CP775.TXT' with gencodec.py.
encodings.cp850: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP850.TXT' with gencodec.py.
encodings.cp852: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP852.TXT' with gencodec.py.
encodings.cp855: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP855.TXT' with gencodec.py.
encodings.cp856: Python Character Mapping Codec cp856 generated from 'MAPPINGS/VENDORS/MISC/CP856.TXT' with gencodec.py.
encodings.cp857: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP857.TXT' with gencodec.py.
encodings.cp860: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP860.TXT' with gencodec.py.
encodings.cp861: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP861.TXT' with gencodec.py.
encodings.cp862: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP862.TXT' with gencodec.py.
encodings.cp863: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP863.TXT' with gencodec.py.
encodings.cp864: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP864.TXT' with gencodec.py.
encodings.cp865: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP865.TXT' with gencodec.py.
encodings.cp866: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP866.TXT' with gencodec.py.
encodings.cp869: Python Character Mapping Codec generated from 'VENDORS/MICSFT/PC/CP869.TXT' with gencodec.py.
encodings.cp874: Python Character Mapping Codec cp874 generated from 'MAPPINGS/VENDORS/MICSFT/WINDOWS/CP874.TXT' with gencodec.py.
encodings.cp875: Python Character Mapping Codec cp875 generated from 'MAPPINGS/VENDORS/MICSFT/EBCDIC/CP875.TXT' with gencodec.py.
encodings.cp932
encodings.cp949
encodings.cp950
encodings.euc_jis_2004
encodings.euc_jisx0213
encodings.euc_jp
encodings.euc_kr
encodings.gb18030
encodings.gb2312
encodings.gbk
encodings.hex_codec: Python 'hex_codec' Codec - 2-digit hex content transfer encoding
encodings.hp_roman8: Python Character Mapping Codec generated from 'hp_roman8.txt' with gencodec.py.
encodings.hz
encodings.idna
encodings.iso2022_jp
encodings.iso2022_jp_1
encodings.iso2022_jp_2
encodings.iso2022_jp_2004
encodings.iso2022_jp_3
encodings.iso2022_jp_ext
encodings.iso2022_kr
encodings.iso8859_1: Python Character Mapping Codec iso8859_1 generated from 'MAPPINGS/ISO8859/8859-1.TXT' with gencodec.py.
encodings.iso8859_10: Python Character Mapping Codec iso8859_10 generated from 'MAPPINGS/ISO8859/8859-10.TXT' with gencodec.py.
encodings.iso8859_11: Python Character Mapping Codec iso8859_11 generated from 'MAPPINGS/ISO8859/8859-11.TXT' with gencodec.py.
encodings.iso8859_13: Python Character Mapping Codec iso8859_13 generated from 'MAPPINGS/ISO8859/8859-13.TXT' with gencodec.py.
encodings.iso8859_14: Python Character Mapping Codec iso8859_14 generated from 'MAPPINGS/ISO8859/8859-14.TXT' with gencodec.py.
encodings.iso8859_15: Python Character Mapping Codec iso8859_15 generated from 'MAPPINGS/ISO8859/8859-15.TXT' with gencodec.py.
encodings.iso8859_16: Python Character Mapping Codec iso8859_16 generated from 'MAPPINGS/ISO8859/8859-16.TXT' with gencodec.py.
encodings.iso8859_2: Python Character Mapping Codec iso8859_2 generated from 'MAPPINGS/ISO8859/8859-2.TXT' with gencodec.py.
encodings.iso8859_3: Python Character Mapping Codec iso8859_3 generated from 'MAPPINGS/ISO8859/8859-3.TXT' with gencodec.py.
encodings.iso8859_4: Python Character Mapping Codec iso8859_4 generated from 'MAPPINGS/ISO8859/8859-4.TXT' with gencodec.py.
encodings.iso8859_5: Python Character Mapping Codec iso8859_5 generated from 'MAPPINGS/ISO8859/8859-5.TXT' with gencodec.py.
encodings.iso8859_6: Python Character Mapping Codec iso8859_6 generated from 'MAPPINGS/ISO8859/8859-6.TXT' with gencodec.py.
encodings.iso8859_7: Python Character Mapping Codec iso8859_7 generated from 'MAPPINGS/ISO8859/8859-7.TXT' with gencodec.py.
encodings.iso8859_8: Python Character Mapping Codec iso8859_8 generated from 'MAPPINGS/ISO8859/8859-8.TXT' with gencodec.py.
encodings.iso8859_9: Python Character Mapping Codec iso8859_9 generated from 'MAPPINGS/ISO8859/8859-9.TXT' with gencodec.py.
encodings.johab
encodings.koi8_r: Python Character Mapping Codec koi8_r generated from 'MAPPINGS/VENDORS/MISC/KOI8-R.TXT' with gencodec.py.
encodings.koi8_u: Python Character Mapping Codec koi8_u generated from 'python-mappings/KOI8-U.TXT' with gencodec.py.
encodings.latin_1: Python 'latin-1' Codec
encodings.mac_arabic: Python Character Mapping Codec generated from 'VENDORS/APPLE/ARABIC.TXT' with gencodec.py.
encodings.mac_centeuro: Python Character Mapping Codec mac_centeuro generated from 'MAPPINGS/VENDORS/APPLE/CENTEURO.TXT' with gencodec.py.
encodings.mac_croatian: Python Character Mapping Codec mac_croatian generated from 'MAPPINGS/VENDORS/APPLE/CROATIAN.TXT' with gencodec.py.
encodings.mac_cyrillic: Python Character Mapping Codec mac_cyrillic generated from 'MAPPINGS/VENDORS/APPLE/CYRILLIC.TXT' with gencodec.py.
encodings.mac_farsi: Python Character Mapping Codec mac_farsi generated from 'MAPPINGS/VENDORS/APPLE/FARSI.TXT' with gencodec.py.
encodings.mac_greek: Python Character Mapping Codec mac_greek generated from 'MAPPINGS/VENDORS/APPLE/GREEK.TXT' with gencodec.py.
encodings.mac_iceland: Python Character Mapping Codec mac_iceland generated from 'MAPPINGS/VENDORS/APPLE/ICELAND.TXT' with gencodec.py.
encodings.mac_latin2: Python Character Mapping Codec generated from 'LATIN2.TXT' with gencodec.py.
encodings.mac_roman: Python Character Mapping Codec mac_roman generated from 'MAPPINGS/VENDORS/APPLE/ROMAN.TXT' with gencodec.py.
encodings.mac_romanian: Python Character Mapping Codec mac_romanian generated from 'MAPPINGS/VENDORS/APPLE/ROMANIAN.TXT' with gencodec.py.
encodings.mac_turkish: Python Character Mapping Codec mac_turkish generated from 'MAPPINGS/VENDORS/APPLE/TURKISH.TXT' with gencodec.py.
encodings.mbcs: Python 'mbcs' Codec for Windows
encodings.palmos: Python Character Mapping Codec for PalmOS 3.5.
encodings.ptcp154: Python Character Mapping Codec generated from 'PTCP154.txt' with gencodec.py.
encodings.punycode: Codec for the Punicode encoding, as specified in RFC 3492
encodings.quopri_codec: Codec for quoted-printable encoding.
encodings.raw_unicode_escape: Python 'raw-unicode-escape' Codec
encodings.rot_13: Python Character Mapping Codec for ROT13.
encodings.shift_jis
encodings.shift_jis_2004
encodings.shift_jisx0213
encodings.string_escape: Python 'escape' Codec
encodings.tis_620: Python Character Mapping Codec tis_620 generated from 'python-mappings/TIS-620.TXT' with gencodec.py.
encodings.undefined: Python 'undefined' Codec
encodings.unicode_escape: Python 'unicode-escape' Codec
encodings.unicode_internal: Python 'unicode-internal' Codec
encodings.utf_16: Python 'utf-16' Codec
encodings.utf_16_be: Python 'utf-16-be' Codec
encodings.utf_16_le: Python 'utf-16-le' Codec
encodings.utf_7: Python 'utf-7' Codec
encodings.utf_8: Python 'utf-8' Codec
encodings.utf_8_sig: Python 'utf-8-sig' Codec This work similar to UTF-8 with the following changes:
encodings.uu_codec: Python 'uu_codec' Codec - UU content transfer encoding
encodings.zlib_codec: Python 'zlib_codec' Codec - zlib compression encoding

Classes

[hide private]

CodecRegistryError

Functions

[hide private]

normalize_encoding(encoding)
Normalize an encoding name.

search_function(encoding)

Variables

[hide private]

_cache = {}

_unknown = '--unknown--'

_import_tail = ['*']

_norm_encoding_map = ' ...

_aliases = {'037': 'cp037', '1026': 'cp1026', '1140': 'cp1140'...

Imports: codecs, types, aliases, ascii, cp869, latin_1, utf_8

Function Details

[hide private]

normalize_encoding(encoding)

Normalize an encoding name.

Normalization works as follows: all non-alphanumeric characters except the dot used for Python package names are collapsed and replaced with a single underscore, e.g. ' -;#' becomes '_'. Leading and trailing underscores are removed.

Note that encoding names should be ASCII only; if they do use non-ASCII characters, these must be Latin-1 compatible.

Variables Details

[hide private]

_norm_encoding_map

Value:

'                                              . 0123456789       ABCD
EFGHIJKLMNOPQRSTUVWXYZ      abcdefghijklmnopqrstuvwxyz                
                                                                      
                                               '

_aliases

Value:

{'037': 'cp037',
 '1026': 'cp1026',
 '1140': 'cp1140',
 '1250': 'cp1250',
 '1251': 'cp1251',
 '1252': 'cp1252',
 '1253': 'cp1253',
 '1254': 'cp1254',
...