MediaWiki  REL1_22
README File Reference

Go to the source code of this file.

Functions

On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports something is going
wrong!You may have to set up
PHPUnit first pear channel
discover pear phpunit de pear
install phpunit which
currently runs PHP on Linux It
hasn t been thoroughly tested
on other but may work If the
php_normal so module is loaded
in php the normalization
functions will automatically
use it If you can 
t (or don't want to) load it in php.ini
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean 
text (the hangul decomposition/composition code is extra slow).

Variables

if there are corrupt
characters this may produce
erroneous results To also
check for illegal 
characters
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports something is going
wrong!You may have to set up
PHPUnit first pear channel
discover pear phpunit de pear
install phpunit which
currently runs PHP on Linux It
hasn t been thoroughly tested
on other 
configurations
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports 
failure
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports something is going
wrong!You may have to set up
PHPUnit first pear channel
discover pear phpunit de pear
install phpunit which
currently runs PHP on Linux It
hasn t been thoroughly tested
on other but may work If the
php_normal so module is loaded
in php 
ini
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably
particularly for Korean
fetching the data from from
the net if necessary If it
reports something is going
wrong!You may have to set up
PHPUnit first pear channel
discover pear phpunit de pear
install phpunit 
PHPUnit
This directory contains some
Unicode normalization routines
These routines are meant to be
reusable in other 
projects
On text that can be determined
quickly to already be in NFC
it s not too awful but it can
quickly get uncomfortably 
slow
if there are corrupt
characters this may produce
erroneous results To also
check for illegal use and if
necessary convert to
normalization form C
Performance is kind of stinky
in absolute 
terms
if there are corrupt
characters this may produce
erroneous results To also
check for illegal use and if
necessary convert to
normalization form C
Performance is kind of stinky
in absolute though it should
be speedy on pure ASCII 
text
this will convert a given UTF
string to Normalization Form C
if it s not already such The
function assumes that the
input string is already valid 
UTF

Function Documentation

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports something is going wrong ! You may have to set up PHPUnit first pear channel discover pear phpunit de pear install phpunit which currently runs PHP on Linux It hasn t been thoroughly tested on other but may work If the php_normal so module is loaded in php the normalization functions will automatically use it If you can t ( or don't want  to)
On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean text ( the hangul decomposition/composition code is extra  slow)

Variable Documentation

if there are corrupt characters this may produce erroneous results To also check for illegal characters
On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports something is going wrong ! You may have to set up PHPUnit first pear channel discover pear phpunit de pear install phpunit which currently runs PHP on Linux It hasn t been thoroughly tested on other configurations

Definition at line 40 of file README.

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports failure

Definition at line 32 of file README.

Referenced by NewParserTest::addDBData().

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports something is going wrong ! You may have to set up PHPUnit first pear channel discover pear phpunit de pear install phpunit which currently runs PHP on Linux It hasn t been thoroughly tested on other but may work If the php_normal so module is loaded in php ini

Definition at line 40 of file README.

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably particularly for Korean fetching the data from from the net if necessary If it reports something is going wrong ! You may have to set up PHPUnit first pear channel discover pear phpunit de pear install phpunit PHPUnit
Initial value:
= Benchmarks ==

Run 'make bench' to download some sample texts from Wikipedia and run some
cheap benchmarks of some of the functions. Take all numbers with large
grains of salt.


== PHP module extension ==

There's an experimental PHP extension module which wraps the ICU library's
normalization functions. This is *MUCH* faster than doing this work in pure
PHP code. This is at https:
It is used by the WMF

Definition at line 40 of file README.

This directory contains some Unicode normalization routines These routines are meant to be reusable in other projects

Definition at line 2 of file README.

On text that can be determined quickly to already be in NFC it s not too awful but it can quickly get uncomfortably slow

Definition at line 16 of file README.

Referenced by BackupReader::__construct().

if there are corrupt characters this may produce erroneous results To also check for illegal use and if necessary convert to normalization form C Performance is kind of stinky in absolute terms

Definition at line 14 of file README.

if there are corrupt characters this may produce erroneous results To also check for illegal use and if necessary convert to normalization form C Performance is kind of stinky in absolute though it should be speedy on pure ASCII text

Definition at line 14 of file README.

this will convert a given UTF string to Normalization Form C if it s not already such The function assumes that the input string is already valid UTF

Definition at line 7 of file README.

Referenced by UtfNormal::quickIsNFCVerify().