Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov.

Text Categorization

JDI: MeSH

  • Description:

    Read in the input MeSH (Medical Subject Headings, MH|SH) and perform JD indexing based on document count. Main Headings and Subheadings are separated by '|'. Also, Subheadings can be represented by two-letter abbreviations or full names.

  • Inputs:
    • MeSH, starred MH and SH are separated by '|'
    • a file, such as 9801.2004.MH.in

  • Algorithm:
    • Pre-Processes (Input Filter):
      • Tokenize all MeSHs (SH/MH) from the input
      • Filter out illegal MeSHs (not in Mh-Jd Table or Sh-Jd Table
      • Assign legal MeSHs
    • Processes:
    • Post-Processes (Output Filter):
      • Print out Input MeSH
      • Print out Legal MeSHs

      • Output filter option details
      • Score entries display number
      • No output message
      • JD candidate
      • Cluster option
      • Use alphabetical order for JDs have same scores

  • Sample commands:
    > jdi -imh -p
    => index input MeSH from standard input with prompt
    
    > jdi -imh -i:9801.2004.MH.in -o:9801.2004.MH.out
    => index MeSH from file, 9801.2004.MH.in, and send results to a file, 9801.2004.MH.out
    

  • Sample Outputs:
    • a file, such as 9801.2004.MH.out