|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectopennlp.tools.sentdetect.SentenceDetectorME
public class SentenceDetectorME
A sentence detector for splitting up raw text into sentences.
A maximum entropy model is used to evaluate the characters ".", "!", and "?" in a string to determine if they signify the end of a sentence.
Field Summary | |
---|---|
static String |
NO_SPLIT
Constant indicates no sentence split. |
static String |
SPLIT
Constant indicates a sentence split. |
Constructor Summary | |
---|---|
SentenceDetectorME(SentenceModel model)
Initializes the current instance. |
|
SentenceDetectorME(SentenceModel model,
Factory factory)
|
Method Summary | |
---|---|
double[] |
getSentenceProbabilities()
Returns the probabilities associated with the most recent calls to sentDetect(). |
String[] |
sentDetect(String s)
Detect sentences in a String. |
Span[] |
sentPosDetect(String s)
Detect the position of the first words of sentences in a String. |
static SentenceModel |
train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations)
|
static SentenceModel |
train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations,
int cutoff,
int iterations)
Deprecated. use train(String, ObjectStream, boolean, Dictionary, TrainingParameters)
instead and pass in a TrainingParameters object. |
static SentenceModel |
train(String languageCode,
ObjectStream<SentenceSample> samples,
boolean useTokenEnd,
Dictionary abbreviations,
TrainingParameters mlParams)
|
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final String SPLIT
public static final String NO_SPLIT
Constructor Detail |
---|
public SentenceDetectorME(SentenceModel model)
model
- the SentenceModel
public SentenceDetectorME(SentenceModel model, Factory factory)
Method Detail |
---|
public String[] sentDetect(String s)
sentDetect
in interface SentenceDetector
s
- The string to be processed.
public Span[] sentPosDetect(String s)
sentPosDetect
in interface SentenceDetector
s
- The string to be processed.
public double[] getSentenceProbabilities()
public static SentenceModel train(String languageCode, ObjectStream<SentenceSample> samples, boolean useTokenEnd, Dictionary abbreviations, TrainingParameters mlParams) throws IOException
IOException
@Deprecated public static SentenceModel train(String languageCode, ObjectStream<SentenceSample> samples, boolean useTokenEnd, Dictionary abbreviations, int cutoff, int iterations) throws IOException
train(String, ObjectStream, boolean, Dictionary, TrainingParameters)
instead and pass in a TrainingParameters object.
IOException
public static SentenceModel train(String languageCode, ObjectStream<SentenceSample> samples, boolean useTokenEnd, Dictionary abbreviations) throws IOException
IOException
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |