The SPECIALIST Lexicon

Spelling Variant Patterns - Test on Lexicon-LRSPL

Norm, MES, and ES are applied in sequential order to retrieve as many spelling variant groups as possible. The model is tested for recall on the Lexicon spelling variants (LRSPL). This is the initial test of the model. The results are shown as follows:

  • Results:

    2014

    Step  Methods       Edit Distance  Combined No.  Total Groups  SpVars Groups  Single Groups  SpVar No.  Group (Recall)
    0     Lexicon.2014  N/A            0             N/A           N/A            0              249,231    100.00%
    1     Norm          N/A            126,708       122,523       92,415         30,108         219,123    87.92%
    2     MES           2              14,571        107,952       104,157        3,795          245,436    98.48%
    3     ES            1              1,418         106,534       105,201        1,333          247,898    99.47%
    4     MES           3              145           106,389       105,308        1,081          248,150    99.57%
    5     ES            2              431           105,958       105,618        340            248,891    99.86%
    6     MES           4              37            105,921       105,647        274            248,957    99.89%

    2015

    Step  Methods       Edit Distance  Combined No.  Total Groups  SpVars Groups  Single Groups  SpVar No.  Group (Recall)
    0     Lexicon.2015  N/A            0             N/A           N/A            0              260,431    100.00%
    1     Norm          N/A            ?             126,675       96,216         30,459         229,972    88.30%
    2     MES           2              14,773        111,902       108,092        3,810          256,621    98.53%
    3     ES            1              1,428         110,474       109,144        1,330          259,101    99.49%
    4     MES           3              144           110,330       109,252        1,078          259,353    99.59%
    5     ES            2              429           109,901       109,563        338            260,093    99.87%
    6     MES           4              37            109,864       109,592        272            260,159    99.90%
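The columns in the 2014 table appear to be related by simple arithmetic. The sketch below checks those relationships using the 2014 figures. The relationships are inferred from the numbers, not stated on this page, and all names (`ROWS_2014`, `recall_percent`, `check_rows`) are hypothetical.

```python
# Figures copied from the 2014 table above.
# Assumption: the relationships checked here are inferred from the numbers,
# not taken from any stated definition.
LEXICON_SPVARS_2014 = 249_231  # SpVar No. at step 0 (Lexicon.2014)

# (step, method, combined, total_groups, spvar_groups, single_groups, spvar_no)
ROWS_2014 = [
    (1, "Norm", 126_708, 122_523, 92_415, 30_108, 219_123),
    (2, "MES", 14_571, 107_952, 104_157, 3_795, 245_436),
    (3, "ES", 1_418, 106_534, 105_201, 1_333, 247_898),
    (4, "MES", 145, 106_389, 105_308, 1_081, 248_150),
    (5, "ES", 431, 105_958, 105_618, 340, 248_891),
    (6, "MES", 37, 105_921, 105_647, 274, 248_957),
]


def recall_percent(spvar_no, lexicon_total=LEXICON_SPVARS_2014):
    """Recall as a percentage: grouped spelling variants over all variants."""
    return round(100.0 * spvar_no / lexicon_total, 2)


def check_rows(rows, lexicon_total=LEXICON_SPVARS_2014):
    """Return a list of messages for rows that break the inferred relationships."""
    problems = []
    prev_total = None
    for step, method, combined, total, spvar_groups, single, spvar_no in rows:
        # Total groups split into groups with variants and single-member groups.
        if spvar_groups + single != total:
            problems.append(f"step {step}: SpVars + Single != Total")
        # Each step merges `combined` groups away from the previous total.
        if prev_total is not None and prev_total - combined != total:
            problems.append(f"step {step}: previous Total - Combined != Total")
        # Variants not yet grouped are exactly the single-member groups.
        if lexicon_total - single != spvar_no:
            problems.append(f"step {step}: Lexicon total - Single != SpVar No.")
        prev_total = total
    return problems


if __name__ == "__main__":
    print(check_rows(ROWS_2014))
    for row in ROWS_2014:
        print(row[0], row[1], recall_percent(row[6]))
```

Read this way, each step's recall is the share of Lexicon spelling variants that no longer sit in a single-member group.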

  • Future Work:
    • Only recall is tested (because the results are used for the SpVar Matcher). Should also test on the full Lexicon (including terms with no spelling variants) to check precision and recall and find the optimum point.
      => A PRF model should be established for:
      • finding the optimal processes of this model
      • providing a measurement index when enhancing the SpVar Matcher and its components
      • Could be a short paper
    • Try different orders of the steps to gain the best results (precision and recall)
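As a starting point for the PRF model, here is a minimal sketch of how precision, recall, and F-measure could be computed for predicted spelling variant groups against a gold standard. It assumes PRF stands for precision, recall, and F-measure, and it scores unordered within-group pairs, which is one possible choice rather than the method used for the tables above. The function names and the sample words are hypothetical.

```python
from itertools import combinations


def pairs(groups):
    """Expand groups of terms into the set of unordered within-group pairs."""
    out = set()
    for group in groups:
        # Sorting makes each pair canonical, so (a, b) and (b, a) coincide.
        for a, b in combinations(sorted(set(group)), 2):
            out.add((a, b))
    return out


def prf(gold_groups, predicted_groups):
    """Return (precision, recall, F1) over within-group pairs.

    Assumption: pairwise scoring; other definitions (e.g. per group) are possible.
    """
    gold = pairs(gold_groups)
    pred = pairs(predicted_groups)
    true_pos = len(gold & pred)
    precision = true_pos / len(pred) if pred else 0.0
    recall = true_pos / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical toy data, not taken from the Lexicon.
    gold = [["color", "colour"], ["center", "centre"]]
    predicted = [["color", "colour", "collar"], ["center"], ["centre"]]
    print(prf(gold, predicted))
```

Running such a function after each step, and for each ordering of the steps, would give one way to compare processes and locate the optimum point.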