Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov

CSpell

Dictionary from MEDLINE

I. Introduction

Words from MEDLINE titles and abstracts are used to generate dictionary. They are tested in CSpell.

II. Algorithm

  • MEDLINE N-gram set is used to retrieve
    • Unigram
    • word count >= 30
  • The core term of Unigram from above is used for dictionary
    • lower case
    • combined by core-term

III. Output

  • File name: ${PRE_PROCESS}/data/Medline/${YEAR}/outData/medline.dic
  • Format: lowercase unigrams