org.apache.lucene.analysis

Class WordlistLoader

public class WordlistLoader extends Object

Loader for text files that represent a list of stopwords.

Version: $Id: WordlistLoader.java 192989 2005-06-22 19:59:03Z dnaber $

Author: Gerhard Schwarz

Method Summary
static HashSetgetWordSet(File wordfile)
Loads a text file and adds every line as an entry to a HashSet (omitting leading and trailing whitespace).
static HashSetgetWordSet(Reader reader)
Reads lines from a Reader and adds every line as an entry to a HashSet (omitting leading and trailing whitespace).
static HashtablegetWordtable(String path, String wordfile)
static HashtablegetWordtable(String wordfile)
static HashtablegetWordtable(File wordfile)

Method Detail

getWordSet

public static HashSet getWordSet(File wordfile)
Loads a text file and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the file should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).

Parameters: wordfile File containing the wordlist

Returns: A HashSet with the file's words

getWordSet

public static HashSet getWordSet(Reader reader)
Reads lines from a Reader and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the Reader should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).

Parameters: reader Reader containing the wordlist

Returns: A HashSet with the reader's words

getWordtable

public static Hashtable getWordtable(String path, String wordfile)

Deprecated: Use {@link #getWordSet(File)} instead

Parameters: path Path to the wordlist wordfile Name of the wordlist

getWordtable

public static Hashtable getWordtable(String wordfile)

Deprecated: Use {@link #getWordSet(File)} instead

Parameters: wordfile Complete path to the wordlist

getWordtable

public static Hashtable getWordtable(File wordfile)

Deprecated: Use {@link #getWordSet(File)} instead

Parameters: wordfile File object that points to the wordlist

Copyright © 2000-2008 Apache Software Foundation. All Rights Reserved.