Talk:Debate on the use of Korean mixed script

Page contents not supported in other languages.
Source: Wikipedia, the free encyclopedia.

Information theory + partiality

I am by no means an expert of Korean, nor of Asian scripts, however I am reasonably knowledgeable in mathematics and computer science; and it seems to me that the section about information theory, at least, is close to nonsense.

Obviously a larger set of symbols allows for expressing words with less symbols; that doesn’t take into account the complexity of knowing the set. By having more possible symbols, you trade easiness of learning against density of information, in an inefficient deal (because density grows only logarithmically in the number of possible symbols). And what is a symbol, anyway? On this broken metric, Hangul could appear to outperform Hanja, simply by considering that the symbols are the syllables, rather than the individual letters (there are, apparently, 11,172 possible syllables, compared with the 2,000 common Hanja characters). Yet the set of Hangul syllables is much easier to learn, and Hangul syllables are much easier to decode, because they are formed in a principled way from a reduced set of 24 letters, whereas Hanja characters are just arbitrary drawings. The drawing of Hangul syllables conveys phonemic information, whereas the drawing of Hanja characters conveys nothing.

The objection about homophones is of course valid, but the current paragraph lacks the amount of quantification that one would expect from a section titled “Information theory” and, besides, it makes a number of claims that ought to be supported by sources.

Also, −log2(1/24) is approximately 4.58, not 4.75 as written in the article (and −log2(1/2000) is closer to 10.97 than to 10.96).

In any case, no source whatsoever is provided in that section, that would demonstrates this is more than personal work. I second @

strawman fallacy. I failed to find the alleged 2005 study about literacy in the OECD; I don’t doubt it exists, but for lack of the source, I can’t check the related claim in this article (the given source is an archived newspaper article which isn’t very explicit about the discussed study). Maëlan 04:39, 17 February 2024 (UTC)[reply
]

Looking over this article again, I am thinking about AFD'ing this article as a
WP:TNT case: its topic is notable, but it is wholly a net negative on this site in its present state. Remsense 05:58, 17 February 2024 (UTC)[reply
]
Most of the stuff in Information Theory other than the math is really an over-explanation of how Hangul cannot replace Hanja as a whole as it is a phonogram and cannot represent the meaning each logographic meaning a Hanja character has. 00101984hjw (talk) 07:11, 19 February 2024 (UTC)[reply]