org.apache.lucene.analysis.cn

Class ChineseFilter

public final class ChineseFilter extends TokenFilter

Title: ChineseFilter

Description: Filter with a stop word table.

Rules: No digits are allowed. An English word/token must be longer than 1 character. Each single Chinese character is treated as one Chinese word.

TO DO: 1. Add Chinese stop words, such as  2. Dictionary-based Chinese word extraction. 3. Intelligent Chinese word extraction.

Copyright: Copyright (c) 2001

Company:
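The rules above can be illustrated with a minimal stand-alone sketch. It is not the Lucene implementation: the class name `ChineseFilterRulesSketch`, the helpers `keep` and `filter`, and the three placeholder stop words are assumptions made for illustration, and the sketch works on plain strings rather than on `Token` objects. It also assumes that inspecting the first character of a token is enough to classify it.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-alone sketch of the documented rules; not the Lucene class.
class ChineseFilterRulesSketch {

    // Placeholder stop words (assumption); the real STOP_WORDS contents
    // are not listed on this page.
    static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("and", "the", "of"));

    // Returns true if a token would survive the documented rules.
    static boolean keep(String text) {
        if (text.isEmpty() || STOP_WORDS.contains(text)) {
            return false; // stop word table
        }
        // Assumption: the first character decides the token's class.
        switch (Character.getType(text.charAt(0))) {
            case Character.LOWERCASE_LETTER:
            case Character.UPPERCASE_LETTER:
                // English word/token must be longer than 1 character.
                return text.length() > 1;
            case Character.OTHER_LETTER:
                // Chinese characters fall in this category;
                // one character counts as one word.
                return true;
            default:
                // Digits, punctuation, and everything else are dropped.
                return false;
        }
    }

    // Applies keep() to a list of token strings, preserving order.
    static List<String> filter(List<String> tokens) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (keep(token)) {
                kept.add(token);
            }
        }
        return kept;
    }
}
```

Under these assumptions, `filter` applied to the tokens `the`, `a`, `Lucene`, `2008`, `\u4e2d` drops the stop word, the one-letter English token, and the number, and keeps `Lucene` and the single Chinese character.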

Version: 1.0

Author: Yiyi Sun

Field Summary
static String[] STOP_WORDS
Constructor Summary
ChineseFilter(TokenStream in)
Method Summary
Token next()

Field Detail

STOP_WORDS

public static final String[] STOP_WORDS

Constructor Detail

ChineseFilter

public ChineseFilter(TokenStream in)

Method Detail

next

public final Token next()

Copyright © 2000-2008 Apache Software Foundation. All Rights Reserved.