Search/Lucene/Analysis/Analyzer.php
Zend Framework
LICENSE
This source file is subject to the new BSD license that is bundled with this package in the file LICENSE.txt. It is also available through the world-wide-web at this URL: http://framework.zend.com/license/new-bsd If you did not receive a copy of the license and are unable to obtain it through the world-wide-web, please send an email to [email protected] so we can send you a copy immediately.
- Category
- Zend
- Copyright
- Copyright (c) 2005-2012 Zend Technologies USA Inc. (http://www.zend.com)
- License
- New BSD License
- Package
- Zend_Search_Lucene
- Subpackage
- Analysis
- Version
- $Id: Analyzer.php 24593 2012-01-05 20:35:02Z matthew $
\Zend_Search_Lucene_Analysis_Analyzer
An Analyzer is used to analyze text.
It thus represents a policy for extracting index terms from text.
Note: Lucene Java implementation is oriented to streams. It provides effective work with a huge documents (more then 20Mb). But engine itself is not oriented such documents. Thus Zend_Search_Lucene analysis API works with data strings and sets (arrays).
- Children
- \Zend_Search_Lucene_Analysis_Analyzer_Common
- Category
- Zend
- Copyright
- Copyright (c) 2005-2012 Zend Technologies USA Inc. (http://www.zend.com)
- License
- New BSD License
Properties


\Zend_Search_Lucene_Analysis_Analyzer $_defaultImpl =
The Analyzer implementation used by default.
Methods


getDefault() : \Zend_Search_Lucene_Analysis_Analyzer
Return the default Analyzer implementation used by indexing code.
Type | Description |
---|---|
\Zend_Search_Lucene_Analysis_Analyzer |


nextToken() : \Zend_Search_Lucene_Analysis_Token | null
Tokenization stream API Get next token Returns null at the end of stream
Tokens are returned in UTF-8 (internal Zend_Search_Lucene encoding)
Type | Description |
---|---|
\Zend_Search_Lucene_Analysis_Token | null |


setDefault( $analyzer) : void
Set the default Analyzer implementation used by indexing code.
Name | Type | Description |
---|---|---|
$analyzer |


setInput(string $data, $encoding = '') : void
Tokenization stream API Set input
Name | Type | Description |
---|---|---|
$data | string | |
$encoding |