MediaWiki  REL1_21
README File Reference

Go to the source code of this file.

Functions

On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports something is going
wrong!You may have to set up
PHPUnit first pear channel
discover pear phpunit de pear
install phpunit the
normalization functions will
automatically use it If you
can 
t (or don't want to) load it in php.ini
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean 
text (the hangul decomposition/composition code is extra slow).

Variables

if there are corrupt
characters this may produce
erroneous results To also
check for illegal 
characters
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports 
failure
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports something is going
wrong!You may have to set up
PHPUnit first pear channel
discover pear phpunit de pear
install phpunit 
PHPUnit
This directory contains some
Unicode normalization routines
These routines are meant to be
reusable in other 
projects
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably 
slow
if there are corrupt
characters this may produce
erroneous results To also
check for illegal use and if
necessary convert to
normalization form C
Performance is kind of stinky
in absolute 
terms
if there are corrupt
characters this may produce
erroneous results To also
check for illegal use and if
necessary convert to
normalization form C
Performance is kind of stinky
in absolute though it should
be speedy on pure ASCII 
text
this will convert a given UTF
string to Normalization Form C
if it s not already such The
function assumes that the
input string is already valid 
UTF

Function Documentation

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports something is going wrong ! You may have to set up PHPUnit first pear channel discover pear phpunit de pear install phpunit the normalization functions will automatically use it If you can t ( or don't want  to)
On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean text ( the hangul decomposition/composition code is extra  slow)

Variable Documentation

if there are corrupt characters this may produce erroneous results To also check for illegal characters
On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports failure

Definition at line 32 of file README.

Referenced by NewParserTest::addDBData().

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports something is going wrong ! You may have to set up PHPUnit first pear channel discover pear phpunit de pear install phpunit PHPUnit
Initial value:
= Benchmarks ==

Run 'make bench' to download some sample texts from Wikipedia and run some
cheap benchmarks of some of the functions. Take all numbers with large
grains of salt.


== PHP module extension ==

There's an experimental PHP extension module which wraps the ICU library's
normalization functions. This is *MUCH* faster than doing this work in pure
PHP code. This is in the 'normal' directory in MediaWiki's CVS extensions
module. It is known to work with PHP 4.3.8 and 5.0.2 on Linux/x86 but hasn't
been thoroughly tested on other configurations.

If the php_normal.so module is loaded in php.ini

Definition at line 40 of file README.

This directory contains some Unicode normalization routines These routines are meant to be reusable in other projects

Definition at line 2 of file README.

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably slow

Definition at line 16 of file README.

Referenced by BackupReader::__construct().

if there are corrupt characters this may produce erroneous results To also check for illegal use and if necessary convert to normalization form C Performance is kind of stinky in absolute terms

Definition at line 14 of file README.

if there are corrupt characters this may produce erroneous results To also check for illegal use and if necessary convert to normalization form C Performance is kind of stinky in absolute though it should be speedy on pure ASCII text

Definition at line 14 of file README.

this will convert a given UTF string to Normalization Form C if it s not already such The function assumes that the input string is already valid UTF

Definition at line 7 of file README.

Referenced by UtfNormal::quickIsNFCVerify().