Lucene.Net
3.0.3
Lucene.Net is a .NET port of the Java Lucene Indexing Library
|
Loader for text files that represent a list of stopwords. More...
Static Public Member Functions | |
static ISet< string > | GetWordSet (System.IO.FileInfo wordfile) |
Loads a text file and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the file should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer). | |
static ISet< string > | GetWordSet (System.IO.FileInfo wordfile, System.String comment) |
Loads a text file and adds every non-comment line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the file should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer). | |
static ISet< string > | GetWordSet (System.IO.TextReader reader) |
Reads lines from a Reader and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the Reader should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer). | |
static ISet< string > | GetWordSet (System.IO.TextReader reader, System.String comment) |
Reads lines from a Reader and adds every non-comment line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the Reader should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer). | |
static Dictionary< string, string > | GetStemDict (System.IO.FileInfo wordstemfile) |
Reads a stem dictionary. Each line contains: wordstem (i.e. two tab seperated words) | |
Loader for text files that represent a list of stopwords.
Definition at line 24 of file WordlistLoader.cs.
|
static |
Reads a stem dictionary. Each line contains: wordstem
(i.e. two tab seperated words)
<throws> IOException </throws>
Definition at line 117 of file WordlistLoader.cs.
|
static |
Loads a text file and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the file should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).
wordfile | File containing the wordlist |
Definition at line 34 of file WordlistLoader.cs.
|
static |
Loads a text file and adds every non-comment line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the file should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).
wordfile | File containing the wordlist |
comment | The comment string to ignore |
Definition at line 50 of file WordlistLoader.cs.
|
static |
Reads lines from a Reader and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the Reader should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).
reader | Reader containing the wordlist |
Definition at line 66 of file WordlistLoader.cs.
|
static |
Reads lines from a Reader and adds every non-comment line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the Reader should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).
reader | Reader containing the wordlist |
comment | The string representing a comment. |
Definition at line 91 of file WordlistLoader.cs.