Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov
Dictionaries from the SPECIALIST Lexicon
I. Introduction
The SPECIALIST lexicon is a large syntactic lexicon of biomedical and general English. All lexical items are reviewed and verified by linguists. Different dictionaries are generated from the Lexicon for different needs.
II. Generation
lexCheck2016/sources/gov/nih/nlm/nls/lexCheck/Api/ToDicVarsApi.java
Word | POS | Inflection | Source (EUI) | AcrAbb Flag | properNoun Flag | spVar Flag |
Case sensitive |
|
| EUI |
|
|
|
* Unique flag from inflVar is not used. It is set to false if all properties are the same, but the type of inflectional rules are different.
III. Output
The following dictionaries are generated
Dictionary | Description |
---|---|
lexicon.all.dic | All terms, case sensitive |
lexicon.mw.dic | multiwords, case sensitive |
lexicon.sw.dic | single-words, case sensitive |
lexicon.nw.dic | non words (unigram, only in mw, not in sw) |
lexicon.ew.dic | element words (= unigram = sw + nw), case sensitive |
lexicon.aa.dic | abbreviations or acronyms, case sensitive |
lexicon.pn.dic | proper nouns, case sensitive |
lexicon.sv.dic | spelling variants, case sensitive |
lexicon.noAa.dic | en + pn |
lexicon.paa.dic | pure aa, (= aa - en) |
lexicon.en.dic | English word (= all - pn - aa), case sensitive |
lexicon.swEn.dic | English word, also single word only |
lexicon.noAa.dic | English word and proper noun (= all -aa), used in check element words in split |
IV. Notes
DictionaryBasedSpellChecker.java