All dPairs in orgD.yes.data.final.yesEui.type.P.tbd.data must be tagged. If it is not empty (or 1 known exception from past), send them to linguists to tag.
- In 2015+ release, there is 1 known tbd (invalid) prefixD from past
motor neuron|noun|E0354096|neuron|noun|E0042456|no
=> motor is not a prefix, "motor neuron" is a compound.
- add the linguist's tags to orgD.yes.data.final.yesEui.type.P.tbd.data.tag
- Manually retrieve valid (yes) of above file to orgD.yes.data.final.yesEui.type.P.tbd.data.yes.${YEAR}
- In 2015 release, orgD.yes.data.final.yesEui.type.P.tbd.data.yes.${YEAR} is empty.
- If no [yes] tag is tagged,
=> shell> touch orgD.yes.data.final.yesEui.type.P.tbd.data.yes.${YEAR}
- These valid prefixD will be add to orgD.yes.${YEAR} in step 9
6
| 7 | - Add known tags to type U - unknown type
AppendFieldSeparator.java =>These orgD can't be identified dType by program
| - ${TAR_DIR}:
- orgD.yes.data.final.yesEui.type.U
- ${PREV_TAR_DIR}:
- orgD.yes.data.final.yesEui.type.U.raw
| - orgD.yes.data.final.yesEui.type.U.raw
- orgD.yes.data.final.yesEui.type.U.raw.old
- orgD.yes.data.final.yesEui.type.U.raw.new
- Manually copy and update orgD.yes.data.final.yesEui.type.U.yes.${YEAR}
=> See details in the next column
|
- orgD.yes.data.final.yesEui.type.U.raw.new should be empty (0)
- If not, send it to linguists to tag (yes|no)
=> The difference (new) are the orgDs from new EUIs
- Copy orgD.yes.data.final.yesEui.type.U.yes.${YEAR} from previous year
- Manually update negation|dType|prefix on new valid orgDs to orgD.yes.data.final.yesEui.type.U.yes.${YEAR}
| 7
|
8 | - Add known tags to type S - suffixD
AppendFieldSeparator.java- GetSuffixDMetaFile.java
- SplitSuffixDMetaFile.java
| - ${TAR_DIR}:
- orgD.yes.data.final.yesEui.type.S
- ${SUFFIX_SRC_DIR}:
- ${NOM_TAR_DIR}:
| - orgD.yes.data.final.yesEui.type.S.raw
- orgD.yes.data.final.yesEui.type.S.meta
- orgD.yes.data.final.yesEui.type.S.yes.data
- orgD.yes.data.final.yesEui.type.S.no.data
- orgD.yes.data.final.yesEui.type.S.yesNo.data
=> Already known in suffixD (duplicates)
- orgD.yes.data.final.yesEui.type.S.tbd.data
=> Need to be tagged for these suffixD from new Lexicon updates
| - All dPairs in orgD.yes.data.final.yesEui.type.S.tbd.data must be tagged.
- If not empty:
- Continue with steps 81-83 to complete suffix with TBD tags in orgD
in step 81 , it split into .old and .new
=> old: most of them were tagged (known from the past) from previous years
=> new: new orgD in the updated Lexcion, need to sent to linguist to tag
| 8
|
81 | - Find new suffix TBD orgD and Manually complete tag file
Subset1Way.java
|
- ${PREV_YEAR_ORG_TAR_DIR}:
- orgD.yes.data.final.yesEui.type.S.tbd.data
- ${ORG_TAR_DIR}:
- orgD.yes.data.final.yesEui.type.S.tbd.data
|
- orgD.yes.data.final.yesEui.type.S.tbd.data.old
- orgD.yes.data.final.yesEui.type.S.tbd.data.new
- Manually copy (from previous year) and update orgD.yes.data.final.yesEui.type.S.tag.data.${YEAR}, see detail from the next column
|
- The new TBD suffix orgD (orgD.yes.data.final.yesEui.type.S.tbd.data.new), must be empty (0)
=> these are from updates of new Lexicon, SpVar, or nominalizations
=> even if it is not empty, should be very small
- If not empty, send to linguists to tag (yes|no)
- Manually copy orgD.yes.data.final.yesEui.type.S.tag.data.${YEAR} from previous year
- Manually add tagging results (yes|no) suffixD to orgD.yes.data.final.yesEui.type.S.tag.data.${YEAR}
- Go to Step 82.
| 81
|
82 | - Add tags (yes|no) to suffix TBD orgD file
- Split tagged file (yes|no|tbd|yesNo)
GetSuffixDMetaFile.java- SplitSuffixDMetaFile.java
| - ${NOM_TAR_DIR}:
- ${ORG_TAR_DIR}:
- orgD.yes.data.final.yesEui.type.S.tbd.data
- orgD.yes.data.final.yesEui.type.S.tag.data.${YEAR}
=> Copy from previous year
=> updated from tagged result of Step-81
| - orgD.yes.data.final.yesEui.type.S.tag.data
- orgD.yes.data.final.yesEui.type.S.tag.data.yes
- orgD.yes.data.final.yesEui.type.S.tag.data.no
- orgD.yes.data.final.yesEui.type.S.tag.data.yesNo
- orgD.yes.data.final.yesEui.type.S.tag.data.tbd
|
- make sure no conflict tags from nomD when adding tag (from the log.82)
=> this might happen due to new nomalization
=> sent conflict to linugist to confirm the tag is [yes], or tag [yes|no]
- make sure No. of tbd is 0 (orgD.yes.data.final.yesEui.type.S.tag.data.tbd)
- If not, check orgD.yes.data.final.yesEui.type.S.tag.data.${YEAR} in Step 81 and rerun Steps: 81 ~ 82
| 82
|
83 | - Finalize suffix OrgD: auto add negation (N|O), dType|prefix (S|None)
AddNegationTagToFile.java- GenerateSuffixDTable.java
|
- ${ORG_TAR_DIR}:
- orgD.yes.data.final.yesEui.type.S.tag.data.yes
|
- orgD.yes.data.final.yesEui.type.S.tag.data.yes.negation
- orgD.yes.data.final.yesEui.type.S.tag.data.yes.${YEAR}
|
- orgD.yes.data.final.yesEui.type.S.tbd.data.yes.${YEAR} is valid suffixD from TBD orgD
- This final file is used in Step 9
| 83
|
9 | - Combine Z, S, P, U (Steps 4-7) to orgD.yes.${YEAR}
| - ${TAR_DIR}: Must run 5-8 to get following files.
|