public class PersianLuceneAnalyzer extends AbstractBookAnalyzer
The TokenStream is built from an ArabicLetterTokenizer filtered with LowerCaseFilter,
ArabicNormalizationFilter, PersianNormalizationFilter and
Persian StopFilter (optional).

See the GNU Lesser General Public License for details.

| Modifier and Type | Field and Description |
|---|---|
| private org.apache.lucene.util.Version | matchVersion |

Fields inherited from AbstractBookAnalyzer: book, doStemming, doStopWords, stopSet

| Constructor and Description |
|---|
| PersianLuceneAnalyzer() |

| Modifier and Type | Method and Description |
|---|---|
| org.apache.lucene.analysis.TokenStream | reusableTokenStream(String fieldName, Reader reader) Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader. |
| org.apache.lucene.analysis.TokenStream | tokenStream(String fieldName, Reader reader) |

Methods inherited from AbstractBookAnalyzer: getBook, getDoStopWords, setBook, setDoStemming, setDoStopWords, setStopWords

public final org.apache.lucene.analysis.TokenStream tokenStream(String fieldName, Reader reader)

Specified by: tokenStream in class org.apache.lucene.analysis.Analyzer

public org.apache.lucene.analysis.TokenStream reusableTokenStream(String fieldName, Reader reader) throws IOException

Returns a (possibly reused) TokenStream which tokenizes all the text in the provided Reader.

Overrides: reusableTokenStream in class org.apache.lucene.analysis.Analyzer

Returns: TokenStream built from an ArabicLetterTokenizer filtered with LowerCaseFilter, ArabicNormalizationFilter, PersianNormalizationFilter and Persian stop words

Throws: IOException
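
The filter order named in the class description (letter tokenizer, lower-casing, Arabic normalization, Persian normalization, optional stop-word removal) can be illustrated with a minimal sketch in plain Java. It deliberately uses only the JDK and none of the Lucene classes. The class name `PersianChainSketch`, its method names, and the four-word stop set are hypothetical and exist only for this illustration. The character mappings are a simplified subset in the spirit of Arabic and Persian normalization, not a copy of what the real filters do.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class PersianChainSketch {

    // Hypothetical stop set (waw, "dar", "beh", "az"), for illustration only.
    static final Set<String> STOP_WORDS = new HashSet<>(Arrays.asList(
            "\u0648", "\u062F\u0631", "\u0628\u0647", "\u0627\u0632"));

    // Step 1: split into runs of letters; combining marks stay inside the token.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c) || Character.getType(c) == Character.NON_SPACING_MARK) {
                current.append(c);
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Step 3: fold alef variants, dotless yeh and teh marbuta; drop tatweel and short-vowel marks.
    static String normalizeArabic(String token) {
        StringBuilder out = new StringBuilder(token.length());
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            switch (c) {
                case '\u0622': // alef with madda
                case '\u0623': // alef with hamza above
                case '\u0625': // alef with hamza below
                    out.append('\u0627'); // bare alef
                    break;
                case '\u0649': // dotless yeh
                    out.append('\u064A'); // Arabic yeh
                    break;
                case '\u0629': // teh marbuta
                    out.append('\u0647'); // heh
                    break;
                case '\u0640': // tatweel (elongation), dropped
                    break;
                default:
                    // Drop the diacritics fathatan through sukun, keep everything else.
                    if (c < '\u064B' || c > '\u0652') {
                        out.append(c);
                    }
            }
        }
        return out.toString();
    }

    // Step 4: map Persian-specific letter forms onto their Arabic counterparts.
    static String normalizePersian(String token) {
        StringBuilder out = new StringBuilder(token.length());
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            switch (c) {
                case '\u06CC': // Farsi yeh
                case '\u06D2': // yeh barree
                    out.append('\u064A'); // Arabic yeh
                    break;
                case '\u06A9': // keheh
                    out.append('\u0643'); // Arabic kaf
                    break;
                case '\u06C0': // heh with yeh above
                case '\u06C1': // heh goal
                    out.append('\u0647'); // heh
                    break;
                case '\u0654': // combining hamza above, dropped
                    break;
                default:
                    out.append(c);
            }
        }
        return out.toString();
    }

    // Full chain; the stop-word step is optional, as in the class description.
    static List<String> analyze(String text, boolean doStopWords) {
        List<String> result = new ArrayList<>();
        for (String raw : tokenize(text)) {
            String token = raw.toLowerCase(Locale.ROOT);       // step 2: lower-case
            token = normalizePersian(normalizeArabic(token));  // steps 3 and 4
            if (token.isEmpty() || (doStopWords && STOP_WORDS.contains(token))) {
                continue;                                      // step 5: stop words
            }
            result.add(token);
        }
        return result;
    }

    public static void main(String[] args) {
        // Persian word for "book", the conjunction waw, and a Latin-script word.
        System.out.println(analyze("\u06A9\u062A\u0627\u0628 \u0648 Book", true));
    }
}
```

Because normalization runs before the stop-word check, entries in the stop set have to be stored in their already-normalized form. That is also why the order of the steps matters.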