public class DefaultSentenceSplitter extends ICU4JBreakIteratorSentenceSplitter implements SentenceSplitter
Uses the built-in Java BreakIterator class to identify candidate sentences. Several heuristics are used to correct the sentence identification produced by BreakIterator when a sentence potentially ends with an abbreviation or a bracket character (right parenthesis, right bracket, or right brace).
abbreviations, disallowedSentenceStarters, logger, names, partOfSpeechGuesser, sentenceSplitterIterator, wordTokenizer
Constructor and Description |
---|
DefaultSentenceSplitter() |
addSentence, addSentenceBad, extractSentences, extractSentences, findSentenceOffsets, fixUpSentence, getLogger, isClosingPunctuationOnly, isNoun, isPronoun, isProperNoun, isVerb, quoteOnlySentence, setAbbreviations, setLogger, setPartOfSpeechGuesser, setSentenceSplitterIterator, splitSentenceWordList, verbSeen
close
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
extractSentences, extractSentences, findSentenceOffsets, setAbbreviations, setPartOfSpeechGuesser, setSentenceSplitterIterator
close