Please wait while loading

IRG Working Set 2024v2.0

Source: Andrew WEST
Date: Generated on 2025-05-15

Show Deleted | Show comments from version: 1.0 2.0 3.0 | Show comments with status: Show All New Only Unresolved Only

Unification

Showing 9 comments.

SnImage/SourceComment TypeDescription
00093
00093
亠 8.2.5
GZHSJ-0024
TS 4 · IDS
Oppose Unification
Agree with #8669
01172
01172
山 46.3.1
SAT-09829
TS 6 · IDS
Unification
U+37A9
U+5D50
Evidence 2 shows U+37A9 㞩 which is a variant form of 嵐. Evidence 1 also looks like a corrupt form of 㞩. Unify to 㞩 (U+37A9).
00112
00112
人 9.4.3
SAT-10237
TS 6 · IDS 𠆢
Oppose Unification
Move back to the main set as it is a component in SAT-10235 (⿰糸⿱𠆢凶), and there have been no objections to encoding SAT-10235
02969
糸 120.6.3
SAT-10235
TS 12 · IDS 𠆢
04656
04656
鼠 208.8.2
UK-30059
TS 21 · IDS
Oppose Unification
In my opinion the difference is too great to unify.
04389
04389
鬼 194.16.1
UK-30106
TS 26 · IDS
UCV
Suggest to change NUCV #294 to a UCV. 曆~暦 is a very common variation, and there are a large number of unencoded characters with 暦 etc. that correspond to encoded characters with 曆 etc.
03701
03701
車 159.3.5
UK-30360
TS 10 · IDS
UCV
Support lv. 2 UCV for 弓 and 𠔃. Note that Evidence 2 transcribes ⿰車𠔃 as ⿰車兮, but 𠔃 here is a variant form of 弓 as proved by Evidence 4 which shows ⿰車弓 for the same text transcribed as ⿰車兮 in Evidence 2.
00920
00920
土 32.11.2
UK-30590
TS 14 · IDS
Unification
U+2D3EB
Unification to 𭏫 (U+2D3EB) seems reasonable. Suggest to withdraw UK-30590, and horizontally extend to U+2D3EB, but as all evidences show ⿰土⿱艹㚏 rather than ⿰土⿱艹𬌪, change UK glyph to ⿰土⿱艹㚏.
01312
01312
工 48.0.0
UTC-03377
TS 2 · IDS 𠃍
UCV
Agree with #5621 by Eiso. Encode UTC-03377 then add a UCV for it as a component in other characters.
01145
01145
尸 44.3.1
VN-F2458
TS 6 · IDS
Unification
U+21C31
U+5C3C
Consider possible unification to 𡰱 (U+21C31) as both are variant forms of U+5C3C 尼

Attributes

Showing 3 comments.

SnImage/SourceComment TypeDescription
03041
03041
缶 121.5.3
GCW-00191
TS 11 · IDS
Radical
Change primary radical to 196.1 (鸟), SC=6, FS=3 (cf. additional evidence in comment #7314 where the character is under 鸟 radical). Add 121.0 缶 as secondary radical for compatibility with U+26253 𦉓.
00941
00941
土 32.12.5
UK-30283
TS 15 · IDS
IDS
IDS is OK as ⿰土逯 because G-source glyph form of U+902F
逯 is ⿺辶录
01144
01144
尸 44.2.4
UTC-03390
TS 5 · IDS
IDS
If we accept kanji-kana hybrids for encoding then allow hiragana and katakana in IDS, but only when the component really is a kana letter (not just when a component looks like a kana letter, e.g. do not allow リ for lhs of 师). In that case, ⿸尸ソ would be a better IDS for this character.

Evidence

Showing 11 comments.

SnImage/SourceComment TypeDescription
03946
03946
钅 167′.12.3
GCCPP-00025
TS 17 · IDS
New evidence
{{中国海毛颚类物种多样性的研究 https://www.biodiversity-science.net/CN/PDF/10.17520/biods.1995011}} in 生物多样性 = Chinese Biodiversity 1995.3(2):71:

04635
04635
麥 199.10.2
GZHSJ-0116
TS 21 · IDS
New evidence
Regarding Comment #8643, PnP 2.2.5b (5) states: "Characters found in quotations from classical or pre-20th-century texts in a modern typeset edition should also provide an image of an original edition of the text in order to be sure that the character form given in the modern edition is not an error form".

I do not think that other experts should be doing the submitter's homework for them, but I did randomly check one 1714 edition of 《隋書》卷25 for the quotation shown in Evidence 3, and it actually shows U+2A32D 𪌭. It might be that 𪌭 in this edition is an error for ⿺麥員, or it might be that ⿺麥員 is an error for 𪌭, but I think it should be the submitter's responsibility to determine this. And it should be the submitter's responsibility to provide evidence from the original woodblock editions to support the encoding.

03487
03487
襾 146.6.5
UK-30620
TS 12 · IDS
Evidence
The UK requests that UK-30620 be moved back to the M-set as the evidence is sufficient.
00061
00061
丿 4.10.5
UK-30621
TS 11 · IDS
Evidence
The UK requests that UK-30621 be moved back to the M-set as the evidence is sufficient.
03096
03096
老 125.3.1
大 37.6.1
UK-30639
TS 9 · IDS
Evidence
The UK requests that UK-30639 be moved back to the M-set as the evidence is sufficient.
01166
01166
尸 44.19.1
UTC-03337
TS 22 · IDS
New evidence
《新鐫笑林廣記》 (寶仁堂重刊本, 清乾隆26 [1761]):



This shows ⿸尸𬋕 which should be a unifiable form of ⿸尸醮 given in the modern editions.
01580
01580
手 64.6.3
UTC-03359
TS 9 · IDS
Misidentified glyph
UTC-03359 may be a misidentification of the cursive form of U+62DC 拜 shown in #5557. Yes, the the shapes are too different for unification, but the shapes are only too different because someone misidentified the cursive form of 拜 and invented a new character. Therefore suggest to postpone for additional evidence.
03810
03810
辵 162.9.4
UTC-03389
TS 13 · IDS 𭕄
New evidence
New evidence
00398
00398
口 30.5.5
VN-F007E
TS 8 · IDS
New evidence
汉语大字典(第二版) p. 658 has ⿰口尻 with no dot:
01161
01161
尸 44.15.4
VN-F0157
TS 18 · IDS
Evidence
Evidence 2 image is incorrect (duplicate of Evidence 1)

Glyph Design & Normalization

Showing 12 comments.

SnImage/SourceComment TypeDescription
01162
01162
尸 44.16.3
GCW-00101
TS 19 · IDS
Glyph design
No evidence for ⿺尾童, and G-source characters do not show an obvious preference for ⿺尾X over ⿰尾X, so changing the glyph to ⿺尾童 cannot really be considered as normalization. I suggest to keep the current glyph, and simply update IDS to ⿰尾童.
04598
04598
鸟 196′.9.2
GCW-00259
TS 14 · IDS
Glyph design
Change glyph to use the ⿱冃目 form of 冒 following China conventions. I did a quick check, and it seems that every single G-source character with 冒 (up to and including GKJ-00319 in Extension J) is written with the ⿱冃目 form.
01169
01169
屮 45.8.2
GZHSJ-0138
TS 11 · IDS
Glyph design
Both evidences show ⿱䶹囷. Consider changing glyph to match the evidence.
03227
03227
舟 137.10.4
KC-10045
TS 16 · IDS
Glyph design
Glyph has not been changed yet.
03767
03767
車 159.12.1
SAT-09344
TS 19 · IDS 𫟛
Glyph design
Based on the evidence shown in #5660 (all showing an extra stroke, and 3rd evidence showing two extra strokes), I think that the rhs should be U+2E5C6 𮗆 (variant of 覃). Cf. rhs of U+2DB1E 𭬞 (⿰木⿱覀卑). Therefore I suggest to change glyph and IDS to ⿰車⿱覀卑.
U+2E5C6

U+2DB1E
01821
01821
日 72.17.2
UK-30028
TS 21 · IDS 𣋓
Glyph design
Yes, the note is confusing. I think that the original intention was to normalize the actual glyph forms ⿰目⿳日⿻𠈌丨丂 (Evidence 1) or ⿰日⿳日⿻𠈌丨亐 (Evidences 2 and 3) which are not used in any encoded character to 𣋓 which is used in the cognate character U+244AB 𤒫. Therefore keep current glyph and IDS.
03200
03200
臣 131.9.2
UK-30045
TS 15 · IDS
Glyph design
I consider Evidence 3 to be the most authoritative source as it uses the type prepared by the Commercial Press for the lost 1932 edition of the book (destroyed when the Commercial Press building was bombed during the Japanese bombardment of Shanghai in 1932). This evidence shows a dot at the top right as expected for the character 尨. The last stroke of 彡 does not cross over 乚 in the early evidence, but we have normalized the glyph to follow the form of 尨 used in PRC.
03536
03536
言 149.12.1
UK-30532
TS 19 · IDS
Glyph design
Disagree with comment #8487. Even in Evidence 1 the 厂 component partially covers 欠, and in Evidence 2 it fully covers 欠. Therefore no need to modify the UK glyph.
02027
02027
比 81.6.3
UK-30682
TS 10 · IDS 𠔿
Glyph design
Change glyph to match IDS and all evidences (⿵冂人 instead of ⿵冂八)
03493
03493
見 147.7.1
UK-30724
TS 14 · IDS
Glyph design
Modify 4th stroke of left component to follow China's glyph conventions. Cf. [WS2021-04709] and Henry's not implemented comment for
03499
03499
见 147′.7.1
UK-30725
TS 11 · IDS
Glyph design
Modify 4th stroke of left component to follow China's glyph conventions.
04549
04549
鳥 196.8.4
VN-F20BC
TS 16 · IDS 𫠓
Normalization
Oppose treating 𫠓 as a separate radical (196.2). Radicals should be used systematically within a particular region, which is not the case for 𫠓 (or 𫠉). There are no existing encoded ideographs with 𫠓, which suggests that VN-F20BC is an idiosyncratic usage. It would be better to normalize the character to ⿰鳥京 which is the description given in the definition for the character in the evidence.

Editorial

Showing 2 comments.

SnImage/SourceComment TypeDescription
01866
01866
水 85.5.2
UK-30787
TS 7 · IDS 𰛄
Editorial issue
A new UK font with updated glyph for UK-30787 was provided to the ORT manager, but the glyph has not yet been updated for WS2024 v.2.0.
00441
00441
口 30.8.1
UK-30815
TS 11 · IDS
Editorial issue
A new UK font with updated glyph for UK-30815 was provided to the ORT manager, but the glyph has not yet been updated for WS2024 v.2.0.

Other

Showing 9 comments.

SnImage/SourceComment TypeDescription
02252
02252
火 86.10.3
GCCPP-00010
TS 14 · IDS
Other
Yes, papers published in Taiwan use 火積:



It would be so useful if we could fast track encoding of traditional forms of proposed simplified Chinese characters. Perhaps TCA could propose ⿰火積 as an UNC?
01372
01372
廴 54.5.2
GCW-00137
TS 8 · IDS
Other
The other unencoded draft simplified characters listed in "Long Story of Short Forms" p. 30 shown in Comment #10575 are: ⿱𰆖心, ⿰虫入 (this is the 2nd stage part 2 simplified form of U+8815 蠕, not 蝶 as given here), and ⿱占一. These can be considered for the next working set.
03308
03308
艸 140.10.5
GZ-1181201
TS 14 · IDS 丿丿
Other
Erjian (Table 2) form of 第 is ⿱丷⿹𰆊丿
02981
02981
糸 120.8.4
SAT-09364
TS 14 · IDS
Other
Right side component (⿱宀丙) is submitted as
01081
宀 40.5.1
UK-30572
TS 8 · IDS
UK-30572
01156
01156
尸 44.11.5
SAT-09426
TS 14 · IDS
Comment
I don't think 古鞾 is the fanqie reading
02382
02382
犬 94.9.5
SAT-09479
TS 12 · IDS
Other
文字情報基盤検索システム gives the reading しゃぐま = 赤熊 Brown bear (Ursus arctos). U+2C33C is very likely to be a variant form of ⿰犭勇, and so unifiable with SAT-09479 even if SAT-09479 is not related in meaning to U+2C33C.
01153
01153
尸 44.10.1
SAT-90030
TS 13 · IDS
Other
I guess the character below (⿸尸⿱水水) is a variant form of U+21C98 𡲘
04423
04423
魚 195.11.2
UK-30061
TS 22 · IDS
Other
I don't have access to other editions of 《蠕範》, but additional evidence is welcome. The 《漢字海》evidence suggests that this form should be encoded.
04412
04412
魚 195.10.1
UK-30486
TS 21 · IDS
Other
Thanks, the correct character in Evidence 4 is the character above the highlighted character.