Please wait while loading

IRG Working Set 2021v4.0

Source: Henry CHAN
Date: Generated on 2024-06-13

Show Deleted | Show comments from version: 1.0 2.0 3.0 4.0 5.0 6.0 7.0
The Image/Source column is displayed as it was in WS2021 v4.0. The character may have a different status in the latest working set.

Unification

SnImage/SourceComment TypeDescription
00901
00901
土 32.13.3
GDM-00293
TS 16 · IDS 𤲞
Unification
U+302AE
Unify to 𰊮 (U+302AE).

In IRG 59, WS2021-01186 has been unified to 𱜍 (U+3170D).

01186
山 46.13.3
GDM-00343
TS 16 · IDS 𤲞
IRGN2581WS2021v4.0Unified&Withdrawn
Unified to 𱜍 U+3170D, IRG 59.
Postponed for unification to WS2017-01129 ⿱山畬, evidence accepted, IRG 58.

U+3170D

02391
02391
犬 94.9.3
GKJ-00559
TS 12 · IDS
Unification
U+2480F
Should be unified to 𤠏 U+2480F and encoded via an IVD collection if the shape is necessary to be encoded.
03432
03432
艸 140.14.1
GKJ-00985
TS 17 · IDS
Unification
.
藂 U+85C2 is also a variant form of 叢, according to the source cited by Huang Junliang.
00655
00655
口 30.14.1
GKJ-01024
TS 17 · IDS
Unification
U+5678
Unify to 噸 U+5678 with 頓~⿰击頁.

The pronunciation for TD-795D in CNS 11643 is also consistent with 頓:

00599
00599
口 30.11.5
SAT-04265
TS 14 · IDS 𠙼
Unification
U+91CF
Potentially unifiable to 量 U+91CF.

How often do these transliterations of Shuowen Guwen into Sungti typeface happen in SAT's repertoire? SAT-04264 was added in WS2021.

If there is still a certain amount of characters pending, we should potentially make a rule to make them all unifiable so SAT can directly register them as variants via an IVD collection.
Unification
U+31531
Potential unification to 𱔱 (U+31531). Suggest it to be coded via IVS.
03517
03517
虫 142.5.2
SAT-04405
TS 11 · IDS
Unification
U+867A
U+27247
IVS to 虺 (U+867A) or 𧉇 (U+27247)?

Consider also 00249 (SAT-04406) ⿺兄貴 = 尵 (U+5C35) / 𫵒 (U+2BD52).
00249
00249
儿 10.15.2
SAT-04406
TS 17 · IDS
Unification
U+5C35
U+2BD52
IVS to 尵 (U+5C35) or 𫵒 (U+2BD52)?

Consider also 03517 (SAT-04405) ⿺兄虫 = 虺 (U+867A) or 𧉇 (U+27247).
04941
04941
齊 210.4.4
SAT-04670
TS 16 · IDS &D10-01;丿
Unification
U+9F4C
Unify to 齌 (U+9F4C)?

Add new UCV:
01610
01610
攴 66.9.2
SAT-05880
TS 13 · IDS 𰏘
Unification
U+655D
Why was this not unified to 敝 (U+655D)?

They are variants without doubt. Suggest to add a new UCV of 𰏘 (Extension G) and 㡀.
01731
01731
月 74.4.1
SAT-06399
TS 8 · IDS
Unification
.
Are there any intermediate forms of 翅 and SAT-06399 found, where the middle two strokes protrude on the left? We should consider IVD to 翅 directly if there are such examples.
02808
02808
石 112.14.1
SAT-06454
TS 19 · IDS
Unification
U+78E7
Unify to 磧 (U+78E7)?

This is a strictly transliterated form of 磧. Suggest to add new UCV rule of 責 and ⿱束貝.
01842
01842
木 75.12.2
SAT-06594
TS 16 · IDS
Oppose Unification
「從木敞聲 」should be sufficient to determine this to be a different abstract shape to 𢿵 U+22FF5.
02936
02936
竹 118.9.1
SAT-06728
TS 15 · IDS
Oppose Unification
Oppose unification to 𢲿 U+22CBF.

Wrapping structures themselves (e.g. 咸、夙) can sometimes take in the bottom component, but in this case 巩 is not a wrapping structure.

If it is not common for 巩 to take in the bottom component in the middle, this should not be unified.
00251
00251
儿 10.18.3
SAT-06900
TS 20 · IDS
Unification
U+4C2B
Possible unification to 䰫 (U+4C2B)
03746
03746
角 148.4.1
SAT-08750
TS 11 · IDS
Unification
U+89DD
Consider unification to 觝 U+89DD.

Another variation of 氐.

Existing UCV #453:
03297
03297
舟 137.3.3
T9-7E3E
TS 9 · IDS
Unification
U+8224
Unify to 舤 (U+8224)?

Pronunication is given as fán which suggests the right hand side 凢 is a variant of 凡.

The following variants are currently coded:
U+51E2 凢 = 凡
U+51E3 凣 = 凡
U+3836 㠶 = 帆
U+225BE 𢖾 = 忛
U+233C6 𣏆 = 杋
U+25425 𥐥 = 矾
U+250F6 𥃶 = 𥃵
U+2AD6C 𪵬 = 汎
U+2D0AB 𭂫 = 凡

U+51E2 凢 and U+51E3 凣 are considered Source Code Separation with each other.

There is one example in Ext A, one in Ext C and one in Ext F. The rest are Ext B. We should consider expanding UCV for 凢 and 凣 to also cover 𭂫 and 凡.
01824
01824
木 75.10.5
TB-4B6E
TS 14 · IDS
UCV
There is another character ⿰木𡿺 (TD-3C7A) which is similar with this character. Would TCA consider encoding TD-3C7A instead?



Consider changing IDS to ⿰木𡿺 and source reference to TD-3C7A, also add new UCV 𡿺~⿱巛囱.
02761
02761
石 112.7.3
TB-696C
TS 12 · IDS
Unification
U+787C
Suggest to Unify to 硼 (U+787C); and add a new UCV rule of 朋 ~ ⿰月习.

Based on the handwritten form, it seems very likely that ⿰月习 is an abbreviated form of 朋.

Another example of 朋 written as ⿰月习:


Sometimes it is completely joined as 用:
01768
01768
木 75.3.1
TC-7739
TS 7 · IDS
Oppose Unification
The reading provided by TCA is xiǔ, because 《廣碑別字》 lists it as a variant of 朽 (U+673D).

However, this character is sourced from 內政部戶政用字 based on the info from the CNS11643 website. As a person's name character, it is more likely to be a variant of 「行」 with the 「彳」 component swapped out to 「木」 for the custom of 五行.
01490
01490
手 64.10.1
TD-3B4D
TS 13 · IDS
Oppose Unification
Oppose unification to 𠺃 (U+20E83).

Based on the provided readings, U+20E83 appears to be taking phonetic 振 while TD-3B4D appears to be using phonetic 唇. If this is the case, they are non-cognate and shouldn't be unified.
02950
02950
竹 118.11.1
TE-2668
TS 17 · IDS 𭅗
Oppose Unification
UCV 307c only applies to 艹 but not 𥫗. If TCA wants to unify, we need a new UCV rule.
02980
02980
竹 118.15.1
TE-415D
TS 21 · IDS
Oppose Unification
UCV 307c only applies to 艹 but not 𥫗. If TCA wants to unify, we need a new UCV rule.
02982
02982
竹 118.16.1
TE-446F
TS 22 · IDS
Oppose Unification
UCV 307c only applies to 艹 but not 𥫗. If TCA wants to unify, we need a new UCV rule.
02041
02041
水 85.9.1
UK-20188
TS 12 · IDS
UCV
Suggest new UCV 松 ~ ⿰木㕣 ~ ⿰木⿱儿口 to cover the most common variations.
02472
02472
玉 96.5.3
UK-20199
TS 9 · IDS
UCV
If we add a new UCV, suggest using 公 ~ 㕣 ~ ⿱儿口 ~ ⿱几口 as level 2, or whole character as level 1.
02073
02073
水 85.11.2
UK-20398
TS 16 · IDS
Unification
U+2ADC2
Unify to 𪷂 (U+2ADC2).

The pronunciation of U+2ADC2 𪷂 is also mu4, so it is also a variant of 慕 without a doubt.

Suggest to update UCV #32a to Level 1 as well.
01157
01157
山 46.9.5
UK-20468
TS 12 · IDS
UCV
IRG should reconsider adding a new unification rule of 眉 ~ 睂 because the 眉 component is extremely productive.

Based on a quick lookup of IDS, there are 38 characters with 眉 component while there are 10 characters encoded with 睂.
02670
02670
皮 107.19.2
UK-20579
TS 24 · IDS 𧁧
UCV
繭 (糸 left, 虫 right) appears to be used in 7 other characters, such as 𣀺 𥀹 𢺃 𨇿 𣠷 𧅆 𥜲, and 𧁧 (虫 left, 糸 right) is only encoded as a standalone character.

It might be suitable to make this a UCV rule for the whole character, so derived variants can be coded in an IVD collection, which would make searching easier.
00578
00578
口 30.11.1
UTC-03216
TS 14 · IDS
Unification
U+2BAD5
Unify to 𫫕 (U+2BAD5).

I prefer switching the source and glyph of U+2BAD5 to UTC-03216 (⿰口梃) instead of the current one, because 廷 is the predominant form in Hong Kong, and 𢌜 is practically no longer used.


Attributes

SnImage/SourceComment TypeDescription
00180
00180
人 9.10.1
KC-04818
TS 12 · IDS
IDS
IDS=⿰亻卨
00767
00767
囗 31.4.2
SAT-06654
TS 7 · IDS 𠔁
FS
FS=3


Evidence

SnImage/SourceComment TypeDescription
04608
04608
魚 195.9.1
頁 181.11.3
GKJ-00268
TS 20 · IDS
Evidence
Can Tao Yang please provide the correct source names for evidence 2, 3, 4, 5.
04739
04739
鳥 196.9.5
GKJ-00309
TS 20 · IDS
Evidence
Suggest to withdraw this character along with 04814, as the evidence for 04739 suggests that it is a misprint.

Otherwise, if the evidence for the simplified character is accepted, keep both.

04739
鳥 196.9.5
GKJ-00309
TS 20 · IDS

04814
鸟 196′.10.5
GKJ-00393
TS 15 · IDS
SC=10, TS=15, IRG 58.
03432
03432
艸 140.14.1
GKJ-00985
TS 17 · IDS
Evidence
Please confirm the evidence source name.
02206
02206
火 86.10.1
GXM-00267
TS 14 · IDS
Evidence
Suggested to be withdrawn based on the comment on Andrew West in #7551.


Glyph Design & Normalization

SnImage/SourceComment TypeDescription
01695
01695
日 72.11.3
TB-4B46
TS 15 · IDS
Glyph design
Should the last stroke of 生 be 挑 instead of 橫?


Other

SnImage/SourceComment TypeDescription
00599
00599
口 30.11.5
SAT-04265
TS 14 · IDS 𠙼
Comment
These are just multiple transliterations of the Shuowen form:

00332
00332
刀 18.19.1
UK-20463
TS 21 · IDS
Comment
Given that this is a mistake, would you prefer to unify with a UCV, or withdraw?