
StopWords
- Namespace
- Rowles.LeanCorpus.Analysis
- Assembly
- Rowles.LeanCorpus.dll
Stop word lists for common languages.
public static class StopWords
Arabic
Gets the built-in Arabic stop word list.
Chinese
Gets the built-in Chinese stop word list.
Dutch
Gets the built-in Dutch stop word list.
English
Gets the classic 33-word English stop word list. This is the default used by StandardAnalyser.
EnglishExtended
Gets the extended English stop word list (~95 words) which covers prepositions,
pronouns, modals, adverbs, and negation fragments in addition to the classic set.
Pass this to IndexWriterConfig.StopWords to opt in to more
aggressive stop word removal.
French
Gets the built-in French stop word list.
German
Gets the built-in German stop word list.
Italian
Gets the built-in Italian stop word list.
Japanese
Gets the built-in Japanese stop word list.
Korean
Gets the built-in Korean stop word list.
Portuguese
Gets the built-in Portuguese stop word list.
Russian
Gets the built-in Russian stop word list.
Spanish
Gets the built-in Spanish stop word list.
SupportedLanguages
Returns all supported BCP 47 language codes.
ForLanguage(string)
Gets the stop word list for a BCP 47 language code, or null if the language is not supported.