Please wait while loading

IRG Working Set 2021v2.0

Source: Henry CHAN
Date: Generated on 2024-06-13

Show Deleted | Show comments from version: 1.0 2.0 3.0 4.0 5.0 6.0 7.0
The Image/Source column is displayed as it was in WS2021 v2.0. The character may have a different status in the latest working set.

Unification

SnImage/SourceComment TypeDescription
04762
04762
鳥 196.12.2
GKJ-00305
TS 23 · IDS
Unification
U+2EB58
Unify to 𮭘 (U+2EB58) after normalization?
03668
03668
虫 142.19.4
GKJ-00444
TS 25 · IDS
Unification
[ WS2017-03912 ]
Unify to WS2017-03912 (KC-07866).
03226
03226
肉 130.6.1
GKJ-00770
TS 10 · IDS
Oppose Unification
From the entry for 脨 in the Kangxi Dictionary (copied from zi.tools):
【未集下】【肉字部】 【集韻】趨玉切,音促。本作𦠁。弗𦠁。炙筯。或从肉。 又秦昔切,音籍。瘦也。 集韻(1039)

It appears that the shape for 朿 affects the pronunication and the meaning, so ⿰月朿 and 脨 (⿰月束) should be considered non-cognate.

Further sources from Jiyun are required.
Oppose Unification



They are two separate head characters, with the first head character in the same series as 促戚趨趣誎 (from phonetic 束) and the second head character in the same series as 籍耤藉莋葃膌 and also 刺洓 (from phonetic 朿).
03246
03246
肉 130.10.3
SAT-05648
TS 14 · IDS
Unification
U+8173
Unify to 腳; change NUCV-343 to UCV-343.

Despite there being a NUCV-343 [卻, 郤], historically ⿰木郤 and ⿰木卻 have been unified at U+3B9D, until Unicode 12.0 where the G glyph was corrected from ⿰木卻 to ⿰木郤.

It is suggested to change NUCV-343 to a UCV rule, as it appears such variation is not rare. The non-cognate rule can be invoked to separately code characters with 郤 and 卻 when actually required.
03113
03113
糸 120.13.5
SAT-06345
TS 19 · IDS &S4-02;
UCV
According to the MOE Dictionary, this kind of variation is also seen in other characters:


We can add a new UCV "bottom part of 雋 to be extended at bottom" and "bottom part of 雋 on bottom right"
UCV
Suggest to add the following UCV instead:
02476
02476
玉 96.7.1
SAT-06673
TS 11 · IDS 𢌜
Unification
U+73FD
To be unified to 珽 (U+73FD) after applying new UCV rule 廷 and 𢌜 in IRG 57.
01439
01439
手 64.5.1
SAT-06941
TS 8 · IDS 𰆰
Unification
U+6280
Unifiable to 技 (U+6280)?

In WS2015 𰆰 (00511, USAT09010, semantic variant of 支) was added, while in WS2017 ⿰虫𰆰 (03847, USAT07003, semantic variant of 蚑) was added. Are there more examples of characters using 𰆰 instead of 支, and if it is systematic, is it better to unify them as a new UCV?
03255
03255
肉 130.11.5
SAT-06958
TS 15 · IDS
Unification
U+81A0
Unify to 膠 (U+81A0); add new UCV ⿱羽尒 and 翏.
04457
04457
食 184.10.2
SAT-07068
TS 19 · IDS
Unification
U+992E
Unifiable with 餮 (U+992E)?

This looks like a strict transcripted form; it may be better to add a UCV 歹/歺.
03136
03136
缶 121.8.3
SAT-07099
TS 15 · IDS 𦈢𰏤
Unification
U+7F3E
Unify to 缾 (U+7F3E); add new UCV 𰏤 = 并
03083
03083
糸 120.10.2
T9-763C
TS 16 · IDS 𧆠
Unification
Suggest to code 14-2725 (⿰糹虖) instead of 9-763C.

It's highly likely that 14-2725 would be unified with 9-763C. There is also 15-5721 which also has the same variant relationship with 2-5E4E. Inevitably TCA will need to register an IVD to support both person's name. TCA should preferably code the character which uses more common components.
01497
01497
手 64.10.4
T9-7A5B
TS 14 · IDS
UCV
Suggest to add 弯, ⿳亠业弓 and ⿱並弓 to UCV.

Additional characters encoded in CNS11643 but unencoded in UCS:
⿱並弓 is encoded by CNS11643 as 11-6924
⿰氵⿱並弓 is encoded by CNS11643 as 11-6064
⿰口⿱並弓 is encoded by CNS11643 as 13-4935
⿰氵⿳亠业弓 is encoded by CNS11643 as 11-7032
04561
04561
鬼 194.8.1
T9-7B7C
TS 18 · IDS
Unification
U+4C2A
Per Ng Hou Man's comment, 䰪 (U+4C2A) is printed in the Kangxi Dictionary as follows:



Another version gives this, where the dots are slightly more clearly separated:



Note this is filed under 10 strokes, so the character definitely has 脊 on the left.

The pronunciation of T9-7B7C in the CNS11643 website is also mèi, which matches the pronunciation of 䰪 (U+4C2A).
01924
01924
歹 78.9.1
TD-4E3E
TS 13 · IDS
Unification
U+6B9B
Possibly Unifiable to 殛 (U+6B9B)
02507
02507
玉 96.12.1
TD-6D41
TS 16 · IDS
Oppose Unification
The given pronunciation is tì, which would make it non-cognate with 㻸 (U+3EF8). The pronunciation is also plausible as a personal name character.
04023
04023
邑 163.15.1
TE-2931
TS 18 · IDS 𬥁
Unification
U+9139
Unify to 鄹 (U+9139).

Excerpt from Hanyu Dazidian shows that 鄹 (U+9139) can also be read as jù when used as a place name:


There is also a similar case in the UCV, with a single stroke difference:
02710
02710
目 109.12.2
UK-20425
TS 17 · IDS 𭦟
Unification
U+25291
Unify to 𥊑 (U+25291); consider new UCV 𭦟 = 曼.


Attributes

SnImage/SourceComment TypeDescription
02823
02823
石 112.19.3
GDM-00268
TS 24 · IDS 𬙔
Residual Stroke Count
SC=9, TS=14, FS=2
04875
04875
鹿 198.10.3
GKJ-00694
TS 21 · IDS 鹿
IDS
IDS=⿸鹿留


Evidence

SnImage/SourceComment TypeDescription
03651
03651
虫 142.14.5
GKJ-00252
TS 20 · IDS
Evidence
Kangxi gives a different form:


Which one is correct?
02993
02993
竹 118.24.3
GKJ-00271
TS 30 · IDS
Evidence
Which character is it equal to? (In Evidence 2).

It should be determined if it should be ⿱竹⿰魚詹, or ⿰魚簷.
04814
04814
鸟 196′.9.5
GKJ-00393
TS 14 · IDS
Evidence
Misprint of 骘? What is the main entry for 陰~ in this source?
00486
00486
口 30.7.5
UTC-00332
TS 10 · IDS 𥁀
Evidence
I also suspect this is a typographical error for 啗. It's not uncommon for handwritten form of 臼 to be written in a form similar to 血; further corruption to 皿 is definitely plausible. I suggest UTC to withdraw this unless there is other evidence of such use.


Glyph Design & Normalization

SnImage/SourceComment TypeDescription
04579
04579
魚 195.5.3
GKJ-00262
TS 16 · IDS
Normalization
Consider updating glyph and IDS to ⿰魚卯.

夘 and 卯 are unifiable per WS2021-04910:
04910
鼠 208.5.3
GKJ-00747
TS 18 · IDS
IRGN2516WS2021v2.0Unified&Withdrawn
Unified to 𪕋 U+2A54B, add a new UCV rule of 夘 and 卯, IRG 57.


The glyph ⿰魚卯 is attested in 《重訂直音篇》 as a variant of 鰡 with pronunication 留, which is also consistent with the provided evidence:
04606
04606
魚 195.8.5
GKJ-00280
TS 19 · IDS
Glyph design
The right hand side should be updated to have a hook in the middle stroke to follow modern PRC conventions:
02888
02888
禾 115.12.5
T9-7C5F
TS 17 · IDS
Glyph design
It is suggested to encode TE-7465 ⿰禾巽 instead, and T9-7C5F be encoded via the IVD.

Entry for TE-7465 in CNS11643 website:
00753
00753
口 30.21.4
UK-20673
TS 24 · IDS
Normalization
I believe the original glyph was OK as it's a reasonable normalization.


Editorial

SnImage/SourceComment TypeDescription
04741
04741
鳥 196.9.5
GKJ-00339
TS 20 · IDS
Editorial issue
The editorial error has been corrected.
04835
04835
鹿 198.4.1
GKJ-00692
TS 15 · IDS 鹿
Editorial issue
The editorial issue has been corrected directly by the ORT manager.
01655
01655
日 72.3.2
SAT-04583
TS 7 · IDS 丿
Editorial issue
The editorial error has been corrected.
00402
00402
厂 27.9.4
TC-7D4A
TS 11 · IDS
Editorial issue
The Status field was left unchanged in error; the issue has been corrected.
04266
04266
阜 170.9.3
UK-20475
TS 12 · IDS 𠚏
Editorial issue
The editorial issue has been corrected.
04238
04238
門 169.11.3
UTC-03180
TS 19 · IDS
Editorial issue
The editorial issue has been fixed.


Other

SnImage/SourceComment TypeDescription
01186
01186
山 46.13.3
GDM-00343
TS 16 · IDS 𤲞
Comment
I think we should try and see if the correct form is ⿱山𤲞 or ⿱山畬 (⿱山畲).

Based on the replacement character for the place names, it seems the pronunciation of the place name should be similar to 畬(畲).
04693
04693
鳥 196.4.4
GKJ-00351
TS 15 · IDS
Comment
To clarify the meeting minutes, in IRG #58, it was mentioned by Wang Yifan that U+9B06 鴆 and U+2B6B2 𫚲 are separately coded.
Comment
This was not discussed in IRG 58, but I believe the disunification of U+9B06 鴆 and U+2B6B2 𫚲 is not sufficient to prevent unification between GKJ-00351 and U+2EB2E, because U+2B6B2 𫚲 is in Extension C and it is a single source character with TD-6727, which has a pronunciation identical with U+9B06 鴆.
02473
02473
玉 96.6.1
GKJ-00751
TS 10 · IDS
Comment
The pronunciation given in the Kangxi Dictionary is 辛律切, so the phonetic 戌 is the correct form.
04108
04108
金 167.9.3
GKJ-00934
TS 17 · IDS
Comment
In the meeting, Tao Yang said that he will try to find further evidence for the original submitted glyph ⿰信金, or withdraw otherwise.
00655
00655
口 30.14.1
GKJ-01024
TS 17 · IDS
Comment
頓 and ⿰击頁 are variants based on the MOE Dictionary:


But I couldn't find the scan nor other characters.
03741
03741
見 147.10.4
SAT-06662
TS 17 · IDS
Comment
A similar case for 竟 can be found on the MOE Dictionary:
03816
03816
貝 154.5.1
SAT-07087
TS 12 · IDS 𠆴
Comment
A point which I mentioned in the meeting:
For IVD registration purposes, it might be better to use U+8CAC as the base character instead of U+27D69, because U+27D69 is sufficiently rare and is also a transliterated form of 責.
00385
00385
卜 25.8.1
TC-447C
TS 10 · IDS
Other
Here are some Unification candidates:
U+2A2FF

U+2A38D

U+2B9EE
00188
00188
人 9.10.4
TD-2261
TS 12 · IDS 𠈌
Other
According to the CNS11643, this character now has two readings: sǎn and chuí. That means this character is likely to be both a variant of 傘 and 垂.
03031
03031
米 119.12.5
TE-2F3A
TS 18 · IDS
Comment
The reading suggests the right hand side should have been 属 instead of ⿸尸禺. Could there be confirmation on the correct shape based on the romanization of the person's name?
04623
04623
魚 195.10.3
UK-20003
TS 21 · IDS
Comment
As someone mentioned about evidence, here are some stone inscriptions where some 余 is written as ⿱人未 (due to time constraint they have not been circled):
01541
01541
手 64.12.4
UK-20385
TS 15 · IDS
Comment
Regarding the correct G shape, please refer to 汉语俗字丛考(修订本) (Source from Huang Junliang)
04547
04547
鬼 194.4.1
UK-20786
TS 14 · IDS
Comment
The character U+29C93 ⿺鬼夭 is also used by Zhuang, where the correct form is indeed 夭:


So it may be better to keep ⿺鬼天 and ⿺鬼夭 separate.


Data for Unihan

SnImage/SourceComment TypeDescription
03651
03651
虫 142.14.5
GKJ-00252
TS 20 · IDS
Semantic variant
𧎾
01642
01642
方 70.9.3
TB-5029
TS 13 · IDS 便
Semantic variant
02774
02774
石 112.9.1
TD-5161
TS 14 · IDS
Semantic variant
U+7164 煤
03881
03881
足 157.7.1
UK-20276
TS 14 · IDS 𧾷
Trad variant
U+2C9C4 𬧄