Package org.apache.lucene.analysis.hunspell
A Java implementation of Hunspell stemming and
spell-checking algorithms (
Hunspell
), and a stemming
TokenFilter (HunspellStemFilter
) based on it.
For dictionaries, see e.g. LibreOffice repository or Titus Wormer's collection (UTF)
-
Interface Summary Interface Description AffixCondition Checks the "condition" part of affix definition, as inDictEntries An object representing homonym dictionary entries.GeneratingSuggester.AffixProcessor Stemmer.CaseVariationProcessor Stemmer.RootProcessor -
Class Summary Class Description CheckCompoundPattern CompoundRule ConvTable ICONV or OCONV replacement tableDictionary In-memory structure for the dictionary (.dic) and affix (.aff) data of a hunspell dictionary.Dictionary.Breaks Possible word breaks according to BREAK directivesDictionary.DefaultAsUtf8FlagParsingStrategy Used to read flags as UTF-8 even if the rest of the file is in the default (8-bit) encodingDictionary.DoubleASCIIFlagParsingStrategy Implementation ofDictionary.FlagParsingStrategy
that assumes each flag is encoded as two ASCII characters whose codes must be combined into a single character.Dictionary.FlagParsingStrategy Abstraction of the process of parsing flags taken from the affix and dic filesDictionary.NumFlagParsingStrategy Implementation ofDictionary.FlagParsingStrategy
that assumes each flag is encoded in its numerical form.Dictionary.SimpleFlagParsingStrategy Simple implementation ofDictionary.FlagParsingStrategy
that treats the chars in each String as a individual flags.FlagEnumerator A structure similar toBytesRefHash
, but specialized for sorted char sequences used for Hunspell flags.FlagEnumerator.Lookup GeneratingSuggester A class that traverses the entire dictionary and applies affix rules to check if those yield correct suggestions similar enough to the given misspelled wordGeneratingSuggester.EntryFilter GeneratingSuggester.Weighted<T extends java.lang.Comparable<T>> Hunspell A spell checker based on Hunspell dictionaries.HunspellStemFilter TokenFilter that uses hunspell affix rules and words to stem tokens.HunspellStemFilterFactory TokenFilterFactory that creates instances ofHunspellStemFilter
.ISO8859_14Decoder ModifyingSuggester A class that modifies the given misspelled word in various ways to get correct suggestionsRepEntry Root<T extends java.lang.CharSequence> Stemmer Stemmer uses the affix rules declared in the Dictionary to generate one or more stems for a word.Suggestion TrigramAutomaton An automaton allowing to achieve the same results as non-weightedGeneratingSuggester.ngramScore(int, java.lang.String, java.lang.String, boolean)
, but faster (in O(s2.length) time).WordStorage A data structure for memory-efficient word storage and fast lookup/enumeration.WordStorage.Builder -
Enum Summary Enum Description AffixKind TimeoutPolicy A strategy determining what to do when Hunspell API calls take too much timeWordCase WordCase.CharCase WordContext -
Exception Summary Exception Description SuggestionTimeoutException An exception thrown whenHunspell.suggest(java.lang.String)
call takes too long, ifTimeoutPolicy.THROW_EXCEPTION
is used.