public class CybozuLabsLanguageRecognizer extends AbstractLanguageRecognizer
logger
Constructor and Description |
---|
CybozuLabsLanguageRecognizer()
Create a language recognizer with the default language profiles.
|
CybozuLabsLanguageRecognizer(java.util.List<java.lang.String> languages)
Create a language recognizer with list of languages to recognize.
|
Modifier and Type | Method and Description |
---|---|
java.util.List<java.lang.String> |
getAvailableLanguageProfileNames()
Get list of available languages for detection.
|
static java.util.List<java.lang.String> |
getDefaultLanguageProfileNames()
Get default list of languages for detection.
|
ScoredString[] |
recognizeLanguage(java.lang.String text)
Returns a scored list of possible languages for a text string.
|
close, getLogger, setLogger
public CybozuLabsLanguageRecognizer() throws LangDetectException
LangDetectException
public CybozuLabsLanguageRecognizer(java.util.List<java.lang.String> languages) throws LangDetectException
languages
- List of names of languages to recognize.
The list of languages references the profile names. These are usually two or three character ISO 696 language codes, e.g., "en" for English and "sco" for Scots dialect.
LangDetectException
public ScoredString[] recognizeLanguage(java.lang.String text)
recognizeLanguage
in interface LanguageRecognizer
recognizeLanguage
in class AbstractLanguageRecognizer
text
- The text for which to determine the language.public java.util.List<java.lang.String> getAvailableLanguageProfileNames()
The profile names are generally the two or three character ISO 639 two or three character language codes.
public static java.util.List<java.lang.String> getDefaultLanguageProfileNames()
The profile names are generally the two or three character ISO 639 two or three character language codes. The current language recognizer may not use all of these profiles, or may add extra custom profiles.