UTC-03182 |
Date | Description |
---|---|
IRG #57 2021-09-16 (Thu) 9:38 am +0800 Recorded by CHEN Zhuang | Not unified to 虷 U+8677, new evidence of non cognate accepted. |
Version | Description |
---|---|
2.0 | For 03505, add Discussion Record "Not unified to 虷 U+8677, new evidence accepted, non-cognate, IRG 57." |
Source Reference | Glyph |
---|---|
UTC-03182 | 1.0 |
group | UTC |
a) Source Reference | UTC-03182 |
b) PUA Code Point | U+F46D |
c) Kangxi Radical Code | 142 |
d) Stroke Count | 3 |
e) First Stroke | 3 |
g) Total Strokes | 9 |
i) IDS | ⿰虫千 |
j) Similar Ideographs | U+8677 虷 |
k) References for Evidence Images | 電碼新編 (1976), #9168 |
Review Comments
unify to 虷 (U+8677)?
Unify to 虷 (U+8677). As telegraph code books should only list relatively common characters that are well-attested elsewhere, any unencoded characters must be variants or mistakes of already-encoded characters. But as the code books do not provide readings or meanings for the characters listed we have to guess what the true identification of the character is. In this case it is almost certainly a mistake or variant for U+8677. Unify according to UCV principle a-1 (differences in stroke initiation direction).
The first stroke of the right-hand component goes top-right to bottom-left. 千 and 干 are different components and thus not unifiable.
If we have to confirm that this is not a wrongly printed glpyh, we can reject it according to IRG PnP 2.2.1.d.(2), the evidence satisfy none of the items under IRG PnP 2.2.1.d.(2).
It is ridiculous to say "“千” and “干” are apparently different characters and they can't be unified" without providing any evidence for the meaning of the ⿰虫千. The telegraph code book is not a dictionary but only lists characters in current use, so ⿰虫千 must be presumed to be a mistake/variant of 虷 unless reliable evidence that it is a separate character can be produced.
What's more, I believe UK said "As telegraph code books should only list relatively common characters that are well-attested elsewhere, any unencoded characters must be variants or mistakes of already-encoded characters" and suggest to unify ⿰虫千 to 虷 (U+8677) based on this, I'd like to know which character should UTC-03171 be unified to.
https://hc.jsecs.org/irg/ws2021/app/?find=UTC-03171
So if the evidece is not clear enough, we'd better reject the evidence but unify the proposed glyph by inference.