
IRG Working Set 2024v1.0

Source: SHEN Tianheng (CheonHyeong Sim)
Date: Generated on 2026-02-14

The Image/Source column is displayed as it was in WS2024 v1.0. The character may have a different status in the latest working set.

Labels

Showing 4 comments.

Sn | Image/Source | Comment Type | Description
03257
03257
艸 140.6.3
UTC-03392
TS 10 · IDS 𦘱
Label
Hybrid
01356
01356
广 53.5.3
UTC-03391
TS 8 · IDS 广
Label
Hybrid
01144
01144
尸 44.2.4
UTC-03390
TS 5 · IDS
Label
Hybrid
01351
01351
广 53.2.5
UTC-03385
TS 5 · IDS 广
Label
Hybrid

Unification

Showing 18 comments.

Sn | Image/Source | Comment Type | Description
00374
00374
又 29.9.5
GCW-00020
TS 11 · IDS
Unification
U+2B73C
Since TCA will submit this character as a UNC (mentioned in L2/24-165), it should be unified to that codepoint (probably U+2B73C) as a horizontal extension.
01504
01504
心 61.12.2
TD-5B73
TS 15 · IDS
Unification
U+227E8
Unify to 𢟨 (U+227E8).
U+246D8
U+29A8E
03289
03289
艸 140.9.2
UTC-03369
TS 12 · IDS
Unification

After the horizontal extension by J-source, this would be an exact match.
00815
00815
囗 31.1.4
UTC-03393
TS 4 · IDS
Oppose Unification
Non-cognate with 𠮚 (U+20B9A), even though the shapes look similar. The outer part of this character is 囗, but the outer part of U+20B9A is 口.
Oppose Unification
Unicode encodes *characters*, not *glyphs*.
Oppose Unification
The two 机s are non-cognate, but they share the same *abstract shape*, ⿰{木}{几}. By contrast, ⿴口丶 and ⿴囗丶 do not; even their *absolute shapes* are merely similar, not identical. Additionally, according to one piece of evidence, ⿴囗丶 also has a Q-like shape. Character identity cannot be determined from just one of the possible absolute shapes, which is the same issue as the relationship among {月(月)}, {月(肉)}, {月(舟)}, {月(丹)} and {月(冃)}.
Oppose Unification
I also agree with #3123 that the radical difference is enough to show that they are two different *characters*.
02638
02638
目 109.16.1
UTC-03478
TS 21 · IDS 𰤷
Unification
U+77D1
I also agree to unify to 矑 (U+77D1). As a co-author of the original proposal to add this character to UAX #45, I had already told the first author while preparing the proposal that this character would most likely be unified, and I suggest that the U-source simply do a horizontal extension.
03680
03680
足 157.14.1
UTC-03479
TS 21 · IDS 𧾷𠪨
Unification
U+8E94
I also agree to unify to 躔 (U+8E94). As a co-author of the original proposal to add this character to UAX #45, I had already told the first author while preparing the proposal that this character would most likely be unified, and I suggest that the U-source simply do a horizontal extension.
04659
04659
鼠 208.16.1
UTC-03487
TS 26 · IDS 𣆨丿𰤷
Unification
U+2A58C
I also agree to unify to 𪖌 (U+2A58C). As a co-author of the original proposal to add this character to UAX #45, I had already told the first author while preparing the proposal that this character would most likely be unified, and I suggest that the U-source simply do a horizontal extension.
01489
01489
心 61.10.3
VN-F1FB8
TS 14 · IDS
Unification
U+2BEB1
Unify to 𫺱 (U+2BEB1).
I suggest adding ⿱匕⿺㇉一 and 𪟽 as UCV Level 1 because they are cognate (both are simplified forms of 疑; see also the small character in parentheses next to the character entry). The former seems to be preferred by the Jing nationality (京族) in China, and the latter by people in Vietnam. I do not think we need to encode both shapes separately.
Unification
U+99AC
U+91D1
The two structures correspond strictly, so we could treat them as mere glyph variants. That is a very different situation from 馬/马, 金/钅, etc. See the colors below.
Unification
The question is how to define “quite different”. Both are simplified forms of the same character; this is not a simplified-traditional relationship. Moreover, the correspondence here holds between strokes, not components. 馬 obviously has many more strokes than 马, so no such stroke-level correspondence can be established for that pair.

For example, do you think that
04658
鼠 208.9.1
UTC-03486
TS 19 · IDS 𣆨𮧓
and
04657
鼠 208.9.1
UTC-03485
TS 22 · IDS
look “quite different”?
01730
01730
日 72.4.1
VN-F1FF7
TS 8 · IDS
Unification
01733
日 72.4.2
VN-F0259
TS 8 · IDS 𪟽
Unify to WS2024-01733.
01733
日 72.4.2
VN-F0259
TS 8 · IDS 𪟽

I suggest adding ⿱匕⿺㇉一 and 𪟽 as UCV Level 1 because they are cognate (both are simplified forms of 疑; see also the small character in parentheses next to the character entry). The former seems to be preferred by the Jing nationality (京族) in China, and the latter by people in Vietnam. I do not think we need to encode both shapes separately.
Unification
See #4776.
Unification
https://hc.jsecs.org/irg/ws2024/app/?id=01489
01437
01437
心 61.4.1
VN-F1FFC
TS 7 · IDS
Unification
U+2AAE2
Unify to 𪫢 (U+2AAE2).
I suggest adding ⿱匕⿺㇉一 and 𪟽 as UCV Level 1 because they are cognate (both are simplified forms of 疑; see also the small character in parentheses next to the character entry). The former seems to be preferred by the Jing nationality (京族) in China, and the latter by people in Vietnam. I do not think we need to encode both shapes separately.
Unification
See #4776. https://hc.jsecs.org/irg/ws2024/app/?id=01489

Attributes

Showing 5 comments.

Sn | Image/Source | Comment Type | Description
01827
01827
曰 73.5.5
GZ-0862101
TS 9 · IDS
Radical
Change Radical to 72.0 (日)
IDS
Should be ⿱日召 instead of ⿱曰召.
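Because 日 and 曰 are nearly indistinguishable in many fonts, the IDS correction above is easier to check by code point than by eye. The following is a minimal sketch that lists the code points of the two sequences quoted in the comment. The helper name `describe` and the variable names are hypothetical and only serve the illustration.

```python
import unicodedata


def describe(ids):
    """Return one 'U+XXXX NAME' entry per code point in an IDS string."""
    return [
        f"U+{ord(ch):04X} {unicodedata.name(ch, '<unnamed>')}"
        for ch in ids
    ]


# The two sequences quoted in the comment, written with escapes so that
# the look-alike components cannot be confused while typing.
rejected = "\u2ff1\u66f0\u53ec"   # ⿱曰召 (upper part is U+66F0)
suggested = "\u2ff1\u65e5\u53ec"  # ⿱日召 (upper part is U+65E5)

for label, ids in (("rejected", rejected), ("suggested", suggested)):
    print(label)
    for entry in describe(ids):
        print("  " + entry)

# Pairs of characters that differ at the same position in the two sequences.
differing = [(a, b) for a, b in zip(rejected, suggested) if a != b]
print("differing components:", differing)
```

Only the second code point differs, which matches the accompanying request to move the character from radical 73 (曰) to radical 72 (日).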
01866
01866
木 75.3.2
UK-30787
TS 7 · IDS
Radical
Then the radical should be changed from 木 to 水.
00340
00340
厂 27.8.4
VN-F2002
TS 10 · IDS
IDS
IDS=⿸厂⿻沈丶
02893
02893
竹 118.10.5
VN-F22F4
TS 16 · IDS
Radical
Do not change the radical to 73.0 since 筆 is the semantic part (so-called 形旁) and 曰 is the phonetic part (so-called 声旁).

Evidence

Showing 3 comments.

Sn | Image/Source | Comment Type | Description
04243
04243
馬 187.5.2
UTC-03447
TS 15 · IDS
New evidence

Also KP1-8989. The evidence above is from IRGN1275.
04292
04292
髟 190.12.1
VN-F064F
TS 22 · IDS
Misidentified glyph
The IDS says that the lower part is 提; however, the evidence shows a strange structure that is neither 捉 nor 提. There was a similar case in which ⿱上提 (䶶) was disunified from ⿱上捉 (𫠼), so I hope the V-source experts can double-check whether this character is wrong in the dictionary. If it is right, please change the IDS.
U+4DB6

U+2B83C
02893
02893
竹 118.10.5
VN-F22F4
TS 16 · IDS
Evidence
Evidence 2 does not appear to be "Takeuchi Yonosuke. Jinan Jiten (字喃字典). Tokyo. 1988., p.595". Please check whether the wrong picture was uploaded.

Glyph Design & Normalization

Showing 10 comments.

Sn | Image/Source | Comment Type | Description
01304
01304
山 46.19.1
GCW-00133
TS 22 · IDS
Glyph design
The 3rd stroke of the lower right part (隹) should be 丶 instead of 丿 according to the G-source convention, even though the evidence shows something like 丿 (that is the so-called 旧字形, or old glyph form, which differs from the current G-source convention).
00109
00109
人 9.3.5
子 39.2.3
GDM-00378
TS 5 · IDS
Glyph design
I suggest changing the 2nd stroke from 点 to 捺 to match both the G-source convention and the SJ/T 11239—2001 evidence.
03202
03202
自 132.7.5
GZ-0501401
TS 13 · IDS 𠬶
Glyph design
The 2nd stroke of the right part should not pass through the 1st stroke according to both G-source convention and the evidence.
01827
01827
曰 73.5.5
GZ-0862101
TS 9 · IDS
Glyph design
Semantically, the upper part should be 日 instead of 曰, so please consider modifying the glyph to match both the G-source convention and the evidence.
03077
03077
羊 123.8.4
GZ-1911102
TS 14 · IDS
Glyph design
The 5th stroke should be 横 instead of 提 according to both G-source convention and the evidence.
04148
04148
韋 178.9.4
GZ-4901401
TS 19 · IDS
Glyph design
The bottom part of 韋 should be normalized according to the G-source convention.
03836
03836
邑 163.7.1
GZHSJ-0076
TS 9 · IDS 𦔮
Glyph design
The last stroke should have a hook to match both the G-source convention and the evidence.
02985
02985
糸 120.9.3
KC-10168
TS 15 · IDS
Glyph design
I suggest removing the redundant hook from 糸 according to the K-source convention.
Glyph design
I am not sure whether the component between 彳 and 亍 is 氵 or 冫.
02685
02685
石 112.8.5
VN-F03C5
TS 13 · IDS
Glyph design
I suggest changing the last stroke from 丶 to ㇏.
The last stroke of 啜 and
03618
赤 155.8.5
VN-F056B
TS 15 · IDS
are both ㇏ in the evidence, so I suggest keeping them consistent.

Other

Showing 6 comments.

Sn | Image/Source | Comment Type | Description
04237
04237
馬 187.-4.0
GCW-00243
TS 6 · IDS
Comment
After the CJK component block is encoded, what if someone finds that an encoded “component” is actually a “real Hanzi”? We need a solution for that before encoding the components.
04263
04263
马 187′.-1.0
GCW-00244
TS 2 · IDS
Comment
After the CJK component block is encoded, what if someone finds that an encoded “component” is actually a “real Hanzi”? We need a solution for that before encoding the components.
00817
00817
囗 31.3.2
SAT-04332
TS 4 · IDS
Comment
I think the "note" at the top of this page should be "Resubmitted from WS2021" instead of "Resubmitted from WS2024".
03779
03779
车 159′.0.0
UTC-00792
TS 4 · IDS
Other
I wonder if we could use the reserved codepoints from U+2EF4 to U+2EFF.
03132
03132
聿 129.0.0
UTC-03248
TS 6 · IDS 𦘒
Other
I wonder if we could use the reserved codepoints from U+2EF4 to U+2EFF.
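The range named in the two comments above, U+2EF4 to U+2EFF, spans 12 code points. As a rough check, assuming the Unicode data bundled with the local Python build is recent enough to be representative, the sketch below counts how many of them are still unassigned (general category `Cn`). The function name `unassigned_in_range` is hypothetical.

```python
import unicodedata

FIRST, LAST = 0x2EF4, 0x2EFF  # range quoted in the comments above


def unassigned_in_range(first, last):
    """Code points in [first, last] whose general category is Cn
    (unassigned) in the Unicode data shipped with this Python build."""
    return [
        cp for cp in range(first, last + 1)
        if unicodedata.category(chr(cp)) == "Cn"
    ]


candidates = unassigned_in_range(FIRST, LAST)
print(f"Unicode data version: {unicodedata.unidata_version}")
print(f"{len(candidates)} of {LAST - FIRST + 1} code points are unassigned")
print(" ".join(f"U+{cp:04X}" for cp in candidates))
```

The result reflects only the Unicode version reported on the first output line, so it says nothing about allocations planned for later versions.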
00236
00236
冫 15.4.3
VN-F0046
TS 5 · IDS
Comment
I am just curious whether the character actually appears in ancient literature. The 冫 part on the right side seems a little strange anyway. Maybe I need to broaden my horizons, haha.