/phputf8/utils/bad.php
Return code from utf8_bad_identify() when a five octet sequence is detected.
Note: 5 octets sequences are valid UTF-8 but are not supported by Unicode so do not represent a useful character
Return code from utf8_bad_identify() when a six octet sequence is detected.
Note: 6 octets sequences are valid UTF-8 but are not supported by Unicode so do not represent a useful character
Return code from utf8_bad_identify().
From Unicode 3.1, non-shortest form is illegal
Return code from utf8_bad_identify().
Invalid octet for use as start of multi-byte UTF-8 sequence
Return code from utf8_bad_identify().
Incomplete multi-octet sequence Note: this is kind of a "catch-all"
Return code from utf8_bad_identify().
From Unicode 3.2, surrogate characters are illegal
Return code from utf8_bad_identify().
Codepoints outside the Unicode range are illegal
Takes a return code from utf8_bad_identify() are returns a message (in English) explaining what the problem is.
- int $code: return code from utf8_bad_identify
Locates the first bad byte in a UTF-8 string returning it's
byte index in the string PCRE Pattern to locate bad bytes in a UTF-8 string Comes from W3 FAQ: Multilingual Forms Note: modified to include full ASCII range including control chars
- string $str
Locates all bad bytes in a UTF-8 string and returns a list of their
byte index in the string PCRE Pattern to locate bad bytes in a UTF-8 string Comes from W3 FAQ: Multilingual Forms Note: modified to include full ASCII range including control chars
- string $str
Reports on the type of bad byte found in a UTF-8 string. Returns a
status code on the first bad byte found
- string $str: UTF-8 encoded string
- &$i
Replace bad bytes with an alternative character - ASCII character
recommended is replacement char PCRE Pattern to locate bad bytes in a UTF-8 string Comes from W3 FAQ: Multilingual Forms Note: modified to include full ASCII range including control chars
- string $str: to search
- string $replace: to replace bad bytes with (defaults to '?') - use ASCII
Strips out any bad bytes from a UTF-8 string and returns the rest
PCRE Pattern to locate bad bytes in a UTF-8 string Comes from W3 FAQ: Multilingual Forms Note: modified to include full ASCII range including control chars
- string $str
Documentation generated on Mon, 05 Mar 2007 20:53:03 +0000 by phpDocumentor 1.3.1