Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov.
Comparison on Optimized Set on 2014 and 2015
I. From 2014 to 2015:
The 2014 optimized set is based on 2013 SD-Rule data. It is used as baseline for 2015. 15 new SD-Rules are then added to the 2014 SD-Rule set for evaluation and used for 2015 release. 11 of them are evaluated as good rules in the optimized set, 2 are bad rules and 2 are duplicated (child rule of existing rules). Also, in the optimized set, 2 child rules are used to replace proposed rules.
SD-Rule | Precision | Instances | Source | Results |
---|---|---|---|---|
Good Rules | ||||
se$|verb|zation$|noun | 100.00% | 1108 | NOM_D | Good SD-Rule |
sation$|noun|ze$|verb | 100.00% | 1071 | NOM_D | Good SD-Rule |
ility$|noun|le$|adj | 99.94% | 1625 | NOM_D | Good SD-Rule |
$|adj|ally$|adv | 99.08% | 2072 | ORG_D | Good SD-Rule |
ce$|noun|t$|adj | 98.82% | 847 | NOM_D | Child rule nce$|noun|nt$|adj is used
|
cy$|noun|t$|adj | 98.77% | 406 | NOM_D | Good SD-Rule |
e$|verb|ion$|noun | 98.76% | 2336 | NOM_D | Good SD-Rule |
c$|adj|s$|noun | 91.46% | 281 | ORG_D | Child rule ic$|adj|is$|noun is used
|
e$|verb|ing$|noun | 91.43% | 210 | Suggestions | Good SD-Rule |
ian$|adj|ia$|noun | 86.31% | 263 | Suggestions | Duplicated, parent rule an$|adj|a$|noun is used
|
al$|adj|us$|noun | 84.35% | 262 | Suggestions | Good SD-Rule |
es$|noun|ic$|adj | 73.91% | 23 | Suggestions | Good SD-Rule |
Bad Rules | ||||
$|noun|ize$|verb | 59.05% | 442 | Suggestions | Bad SD-Rule |
ian$|noun|ia$|noun | 0.36% | 274 | Suggestions | Duplicated, parent rule an$|noun|a$|noun is a bad SD-Rule
|
es$|noun|ic$|noun | 0.00% | 19 | Suggestions | Bad SD-Rule |
II. Comparison of SD-Rule set:
Item | 2014 | 2015 |
---|---|---|
Total Unique Rules | 96 | 101 |
Total Good Rules | 73 | 76 |
Opti. System Precision | 95.30% | 95.22% |
Opti. System Recall | 95.01% | 95.70% |
Opti. System Performance | 1.9031 | 1.9093 |
Cufoff Rule | ar$|adj|e$|noun
| ar$|adj|e$|noun
|
Optimized Set | 2014 Optimized Set | 2015 Optimized Set |
Optimized Diagram |
For the Optimial set:
III. Transaction Details:
The detail transaction of SD-Rules are described as below:
Type | 2014 | 2015 | Details | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
No Change | 65 | 65 | ... | ||||||||||
Parent-1-Child | 4 | 4 |
| ||||||||||
Parent-2-Child | 4 | 2 |
| ||||||||||
New in 2015 | 0 | 5 |
| ||||||||||
Total | 73 | 76 |
Computer Generated SD-Rules | ||||||||
---|---|---|---|---|---|---|---|---|
ID | Proposed New Rule | Source | Results | Rank & Rule 2015 | Rank & Rule 2014 | Type | Count Change | Accu. Count |
01-CG1 | se$|verb|zation$|noun | nomD | Good | 02: se$|verb|zation$|noun | None | New in 2015 | +1 | 74 |
02-CG2 | sation$|noun|ze$|verb | nomD | Good | 03: sation$|noun|ze$|verb | None | New in 2015 | +1 | 75 |
03-CG3 | ility$|noun|le$|adj | nomD | Good | 09: ility$|noun|le$|adj | 02: ability$|noun|able$|adj | Parent-1-Child | +0 | 75 |
04-CG4 | $|adj|ally$|adv | orgD | Good | 15: $|adj|ally$|adv | 08: ic$|adj|ically$|adv | Parent-1-Child | +0 | 75 |
05-CG5 | nce$|noun|nt$|adj | nomD | Good | 18: nce$|noun|nt$|adj
| 16: ance$|noun|ant$|adj
18: ence$|noun|ent$|adj | Parent-2-child | -1 | 74 |
06-CG6 | cy$|noun|t$|adj | nomD | Good | 19: cy$|noun|t$|adj | 21: ency$|noun|ent$|adj | Parent-1-Child | +0 | 74 |
07-CG7 | e$|verb|ion$|noun | nomD | Good | 20: e$|verb|ion$|noun
| 10: ate$|verb|ation$|noun
63: se$|verb|sion$|noun | Parent-2-Child | -1 | 73 |
08-CG8 | c$|adj|s$|noun | orgD | Good | 43: ic$|adj|is$|noun | 41: ic$|adj|is$|noun | Child | +0 | 73 |
Expert-Suggested SD-Rules | ||||||||
09-ES1 | e$|verb|ing$|noun | Experts | Good | 45: e$|verb|ing$|noun | None | New in 2015 | +1 | 74 |
10-ES2 | al$|adj|us$|noun | Experts | Good | 61: al$|adj|us$|noun | None | New in 2015 | +1 | 75 |
11-ES3 | es$|noun|ic$|adj | Experts | Good | 67: es$|noun|ic$|adj | None | New in 2015 | +1 | 76 |
12-ES4 | $|noun|ize$|verb | Experts | Bad | 78: $|noun|ize$|verb | None | New | +0 | 76 |
13-ES5 | es$|noun|ic$|noun | Experts | Bad | 101: es$|noun|ic$|noun | None | New | +0 | 76 |
14-ES6 | ian$|adj|ia$|noun | Experts | Good | 57: a$|noun|an$|adj | 53: a$|noun|an$|adj | Duplicated-Child | +0 | 76 |
15-ES7 | ian$|noun|ia$|noun | Experts | Bad | 99: a$|noun|an$|noun | 93: a$|noun|an$|noun | Duplicated-Child | +0 | 76 |
The conclusion is the optimized set of SD-Rules is very steady as we expected. Does this imply that Lexicon is a good representative subset of general English?