Text Categorization

PreProcess: lsi.xml

  • Description:
    The List of Serials Indexed (lsi) file provides detailed information for all journals cited in MEDLINE.

  • Input:

  • Procedures:
    • Obtain the file from NLM: contact Esther Baldinger (baldinge@mail.nlm.nih.gov) or download it from ftp://ftp.nlm.nih.gov/online/journals/
    • shell> cp lsi2009.xml lsi2009.xml.org
    • Remove the XML header (the first two lines: the XML declaration and the DOCTYPE declaration):
      <?xml version="1.0" encoding="UTF-8" ?>
      <!DOCTYPE SerialsSet PUBLIC "-//NLM//DTDSERIALS, 1st January 2009//EN"
      "http://www.nlm.nih.gov/databases/dtd/nlmserials_090101.dtd">
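
The backup and header-removal steps above could be scripted roughly as follows. This is a minimal sketch, not part of the original procedure: the function name strip_xml_head and its optional line-count argument are hypothetical, and it assumes the header occupies exactly the first two lines of the file, as stated above.

```shell
#!/bin/sh
# strip_xml_head SRC [N]
# Hypothetical helper: back up SRC as SRC.org, then drop its first N lines
# (default 2: the XML declaration and the DOCTYPE declaration).
strip_xml_head() {
  src=$1
  n=${2:-2}
  # Keep an untouched copy of the original, as in the cp step above.
  cp "$src" "$src.org" || return 1
  # tail -n +K prints from line K onward, so K = N + 1 skips the first N lines.
  tail -n +"$((n + 1))" "$src.org" > "$src"
}
```

Usage: strip_xml_head lsi2009.xml. If the DOCTYPE declaration wraps onto a second physical line in the downloaded file, as it does in the listing above, pass 3 as the second argument instead.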
      

  • Output:
    • None

  • Notes:
    • lsi.xml is used to generate the Jid-Ta-Jds table.