Character Encoding Issues


Alex_ShengShen's blog [http://blog.csdn.net/shsalex/article/details/52104898] has three articles.
1. Encoding and garbled text (mojibake)

Words and sentences in text are created from characters.


YouTube video: Characters, Symbols and the Unicode Miracle


Unicode is not an encoding.
There are several ways to encode Unicode code points into bits.

- Unicode is one large standard effort that has catalogued and specified a number ⟷ character relationship for virtually all characters and symbols of every major language in use, which adds up to well over a hundred thousand characters.

- UTF-8, UTF-16 and UTF-32 are different sub-standards for how to encode this ginormous catalog of numbers to bytes, each with different size tradeoffs.
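To make the size tradeoffs concrete, here is a minimal Python sketch that encodes the same characters with each of the three encodings and counts the bytes. The helper name `encoded_sizes` and the sample characters are illustrative choices, not something from the original notes.

```python
# Sample characters from different parts of the Unicode catalog.
# Escapes are used so the exact code points are unambiguous.
samples = [
    "A",           # U+0041, Latin capital letter A
    "\u00e9",      # U+00E9, Latin small letter e with acute
    "\u4e2d",      # U+4E2D, a CJK ideograph
    "\U0001F600",  # U+1F600, an emoji outside the Basic Multilingual Plane
]


def encoded_sizes(text):
    """Return how many bytes `text` occupies in UTF-8, UTF-16 and UTF-32."""
    # The "-le" codec variants are used so that no byte order mark (BOM)
    # is prepended, which would otherwise inflate the counts.
    return {
        "utf-8": len(text.encode("utf-8")),
        "utf-16": len(text.encode("utf-16-le")),
        "utf-32": len(text.encode("utf-32-le")),
    }


for ch in samples:
    print(f"U+{ord(ch):04X}", encoded_sizes(ch))
```

UTF-8 uses between 1 and 4 bytes per code point, UTF-16 uses 2 or 4, and UTF-32 always uses 4, so which one is most compact depends on the text being stored.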

Excusez-moi? = Excuse me?

So Unicode itself is just the character ⟷ number mapping.
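The character ⟷ number mapping can be seen directly in Python, where `ord` turns a character into its number and `chr` goes the other way. This is only a small sketch; the helper names `to_code_points` and `from_code_points` are made up for illustration.

```python
def to_code_points(text):
    """Map each character of `text` to its Unicode number (code point)."""
    return [ord(ch) for ch in text]


def from_code_points(numbers):
    """Map a sequence of Unicode numbers back to a string."""
    return "".join(chr(n) for n in numbers)


text = "Excusez-moi?"
numbers = to_code_points(text)

# Print each character next to its number in the usual U+XXXX notation.
for ch, n in zip(text, numbers):
    print(repr(ch), f"U+{n:04X}")

# The mapping is reversible: numbers -> characters gives the original text.
assert from_code_points(numbers) == text
```

No bytes are involved at this stage. Bytes only appear once one of the encodings (UTF-8, UTF-16, UTF-32) is applied to these numbers.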

Using Unicode, you can write a document containing virtually any language using any character you can type into a computer.

Code point
