ASCII and Unicode
Source: Internet · Published by: 上睑松弛知乎 · Editor: 程序博客网 · Date: 2024/05/29 16:28
《编程导论(Java)》 (Introduction to Programming (Java)), section 1.2.3 "Five Kinds of Java Elements", states:
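Java supports the Unicode character set. A Java char represents a character with 16 bits, and the first 128 characters are the ASCII (American Standard Code for Information Interchange) characters. (Unicode as a whole needs more than 16 bits; a char holds one UTF-16 code unit, so characters above U+FFFF take two chars.)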
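The statement above can be checked with a few lines of Java. This is only a minimal sketch: the class name `CharWidthDemo` and the helper `isAscii` are made up for illustration and do not come from the book; `Character.SIZE` and `Character.charCount` are standard library members.

```java
class CharWidthDemo {
    /** Returns true if the code point lies in the ASCII range [0, 127]. */
    static boolean isAscii(int codePoint) {
        return codePoint >= 0 && codePoint < 128;
    }

    public static void main(String[] args) {
        // A char occupies 16 bits.
        System.out.println(Character.SIZE);                 // 16
        // The first 128 Unicode code points coincide with ASCII.
        System.out.println((int) 'A');                      // 65
        System.out.println((char) 97);                      // a
        System.out.println(isAscii('~'));                   // true
        // A code point above U+FFFF does not fit into a single char.
        System.out.println(Character.charCount(0x1F600));   // 2
    }
}
```

The last line is the reason "16 bits per character" should be read as "16 bits per char": a supplementary character is stored as a pair of chars.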
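Identifiers, comments, character literals, and string literals in Java may use Unicode, whereas keywords, operators, and the other Java elements use (some of the) ASCII characters.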
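A short sketch of that rule follows. The class and method names are hypothetical, and the example assumes the source file is saved and compiled as UTF-8; with another source encoding the non-ASCII identifier would have to be written with Unicode escapes.

```java
class UnicodeElementsDemo {
    /** Uses a non-ASCII identifier; the keywords and operators stay ASCII. */
    static int sum() {
        int 计数 = 2;            // identifier written with two Chinese characters
        int total = 计数 + 40;   // int, =, + and ; are ASCII-only elements
        return total;
    }

    /** Character literals and string literals may contain Unicode as well. */
    static String greeting() {
        char first = '你';
        return first + "好, Unicode";
    }

    public static void main(String[] args) {
        System.out.println(sum());                                   // 42
        System.out.println(greeting().length());                     // 11
        System.out.println(Character.isJavaIdentifierStart('计'));   // true
        System.out.println(Character.isJavaIdentifierStart('+'));    // false
    }
}
```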
Related code from the codes repository:
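package tips;

import static tips.Print.*;

/**
 * ASCII characters.
 *
 * @author yqj2065
 * @version 2011.10
 */
public class Ascii {
    // Prints the ASCII characters; the non-printable ones are, naturally, invisible.
    public static void printAscii() {
        // An int counter avoids relying on byte overflow to end the loop.
        for (int i = 0; i < 128; i++) {
            System.out.println(i + " " + (char) i);
        }
    }

    /**
     * A Java char is 16 bits wide, and the first 128 Unicode characters are ASCII.
     * Test arguments: 48, 65, 97, 130, 258, 20005.
     * Exercise: change the parameter type to byte and test values in [0, 127]!
     */
    public static void testUnicode(int i) {
        // pln comes from the static import of tips.Print
        pln(i + " " + (char) i);
    }
}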
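To see what the suggested test arguments produce without the `tips.Print` helper, here is a self-contained sketch. The class `UnicodeCastDemo` and its methods `show` and `showByte` are illustrative names, not part of the original code base. For reference, 130 is the control character U+0082, 258 is U+0102 (Ă), and 20005 is U+4E25 (严).

```java
class UnicodeCastDemo {
    /** Formats a value the way testUnicode does: number, space, character. */
    static String show(int i) {
        return i + " " + (char) i;
    }

    /** The byte variant suggested by the exercise in the Javadoc comment. */
    static String showByte(byte b) {
        return b + " " + (char) b;
    }

    public static void main(String[] args) {
        int[] data = {48, 65, 97, 130, 258, 20005};
        for (int i : data) {
            System.out.println(show(i));
        }
        // Within [0, 127] a byte behaves exactly like the int version.
        System.out.println(showByte((byte) 65));          // 65 A
        // Outside that range the byte wraps around: (byte) 130 is -126,
        // and casting -126 to char sign-extends to U+FF82, not U+0082.
        System.out.println((byte) 130);                   // -126
        System.out.println((int) (char) (byte) 130);      // 65410
    }
}
```

The last two lines show why the exercise restricts the byte version to [0, 127]: larger values no longer map to the character with the same number.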
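ASCII and Latin-1 Character Table

Note: the glyphs shown for codes 128 to 159 are the Windows-1252 assignments; in ISO 8859-1 (Latin-1) proper, these codes are C1 control characters.

| Char | Dec | Hex | Octal | HTML | Notes |
|---|---|---|---|---|---|
| ^@ | 0 | 0x00 | 0000 | ^@ | NUL nul |
| ^A | 1 | 0x01 | 0001 | ^A | SOH start of header |
| ^B | 2 | 0x02 | 0002 | ^B | STX start of text |
| ^C | 3 | 0x03 | 0003 | ^C | ETX end of text |
| ^D | 4 | 0x04 | 0004 | ^D | EOT end of transmission |
| ^E | 5 | 0x05 | 0005 | ^E | ENQ enquiry |
| ^F | 6 | 0x06 | 0006 | ^F | ACK acknowledge |
| ^G | 7 | 0x07 | 0007 | ^G | BEL bell |
| ^H | 8 | 0x08 | 0010 | ^H | BS backspace [\b] |
| ^I | 9 | 0x09 | 0011 | ^I | HT horizontal tab [\t] |
| ^J | 10 | 0x0a | 0012 | ^J | LF line feed [\n] |
| ^K | 11 | 0x0b | 0013 | ^K | VT vertical tab |
| ^L | 12 | 0x0c | 0014 | ^L | FF form feed [\f] |
| ^M | 13 | 0x0d | 0015 | ^M | CR carriage return [\r] |
| ^N | 14 | 0x0e | 0016 | ^N | SO shift out |
| ^O | 15 | 0x0f | 0017 | ^O | SI shift in |
| ^P | 16 | 0x10 | 0020 | ^P | DLE data link escape |
| ^Q | 17 | 0x11 | 0021 | ^Q | DC1 device control 1, XON resume transmission |
| ^R | 18 | 0x12 | 0022 | ^R | DC2 device control 2 |
| ^S | 19 | 0x13 | 0023 | ^S | DC3 device control 3, XOFF pause transmission |
| ^T | 20 | 0x14 | 0024 | ^T | DC4 device control 4 |
| ^U | 21 | 0x15 | 0025 | ^U | NAK negative acknowledge |
| ^V | 22 | 0x16 | 0026 | ^V | SYN synchronise |
| ^W | 23 | 0x17 | 0027 | ^W | ETB end text block |
| ^X | 24 | 0x18 | 0030 | ^X | CAN cancel |
| ^Y | 25 | 0x19 | 0031 | ^Y | EM end message |
| ^Z | 26 | 0x1a | 0032 | ^Z | SUB substitute |
| ^[ | 27 | 0x1b | 0033 | ^[ | ESC escape |
| ^\ | 28 | 0x1c | 0034 | ^\ | FS file separator, usually used to separate groups of records. |
| ^] | 29 | 0x1d | 0035 | ^] | GS group separator, usually used to separate fields. |
| ^^ | 30 | 0x1e | 0036 | ^^ | RS record separator, usually used to separate records. |
| ^_ | 31 | 0x1f | 0037 | ^_ | US unit separator, usually used to separate subfields. |
|   | 32 | 0x20 | 0040 |   | space |
| ! | 33 | 0x21 | 0041 | ! | bang, exclamation |
| " | 34 | 0x22 | 0042 | " | quote |
| # | 35 | 0x23 | 0043 | # | sharp, number sign |
| $ | 36 | 0x24 | 0044 | $ | dollar sign |
| % | 37 | 0x25 | 0045 | % | percent |
| & | 38 | 0x26 | 0046 | & | ampersand |
| ' | 39 | 0x27 | 0047 | ' | apostrophe |
| ( | 40 | 0x28 | 0050 | ( | left parenthesis |
| ) | 41 | 0x29 | 0051 | ) | right parenthesis |
| * | 42 | 0x2a | 0052 | * | star, asterisk |
| + | 43 | 0x2b | 0053 | + | plus |
| , | 44 | 0x2c | 0054 | , | comma |
| - | 45 | 0x2d | 0055 | - | minus |
| . | 46 | 0x2e | 0056 | . | period |
| / | 47 | 0x2f | 0057 | / | slash, not backslash! |
| 0 | 48 | 0x30 | 0060 | 0 | digit 0 |
| 1 | 49 | 0x31 | 0061 | 1 | digit 1 |
| 2 | 50 | 0x32 | 0062 | 2 | digit 2 |
| 3 | 51 | 0x33 | 0063 | 3 | digit 3 |
| 4 | 52 | 0x34 | 0064 | 4 | digit 4 |
| 5 | 53 | 0x35 | 0065 | 5 | digit 5 |
| 6 | 54 | 0x36 | 0066 | 6 | digit 6 |
| 7 | 55 | 0x37 | 0067 | 7 | digit 7 |
| 8 | 56 | 0x38 | 0070 | 8 | digit 8 |
| 9 | 57 | 0x39 | 0071 | 9 | digit 9 |
| : | 58 | 0x3a | 0072 | : | colon |
| ; | 59 | 0x3b | 0073 | ; | semicolon |
| < | 60 | 0x3c | 0074 | < | less than |
| = | 61 | 0x3d | 0075 | = | equals |
| > | 62 | 0x3e | 0076 | > | greater than |
| ? | 63 | 0x3f | 0077 | ? | question mark |
| @ | 64 | 0x40 | 0100 | @ | at sign |
| A | 65 | 0x41 | 0101 | A | upper case A |
| B | 66 | 0x42 | 0102 | B | upper case B |
| C | 67 | 0x43 | 0103 | C | upper case C |
| D | 68 | 0x44 | 0104 | D | upper case D |
| E | 69 | 0x45 | 0105 | E | upper case E |
| F | 70 | 0x46 | 0106 | F | upper case F |
| G | 71 | 0x47 | 0107 | G | upper case G |
| H | 72 | 0x48 | 0110 | H | upper case H |
| I | 73 | 0x49 | 0111 | I | upper case I |
| J | 74 | 0x4a | 0112 | J | upper case J |
| K | 75 | 0x4b | 0113 | K | upper case K |
| L | 76 | 0x4c | 0114 | L | upper case L |
| M | 77 | 0x4d | 0115 | M | upper case M |
| N | 78 | 0x4e | 0116 | N | upper case N |
| O | 79 | 0x4f | 0117 | O | upper case O |
| P | 80 | 0x50 | 0120 | P | upper case P |
| Q | 81 | 0x51 | 0121 | Q | upper case Q |
| R | 82 | 0x52 | 0122 | R | upper case R |
| S | 83 | 0x53 | 0123 | S | upper case S |
| T | 84 | 0x54 | 0124 | T | upper case T |
| U | 85 | 0x55 | 0125 | U | upper case U |
| V | 86 | 0x56 | 0126 | V | upper case V |
| W | 87 | 0x57 | 0127 | W | upper case W |
| X | 88 | 0x58 | 0130 | X | upper case X |
| Y | 89 | 0x59 | 0131 | Y | upper case Y |
| Z | 90 | 0x5a | 0132 | Z | upper case Z |
| [ | 91 | 0x5b | 0133 | [ | left square bracket |
| \ | 92 | 0x5c | 0134 | \ | backslash, not slash! |
| ] | 93 | 0x5d | 0135 | ] | right square bracket |
| ^ | 94 | 0x5e | 0136 | ^ | hat, circumflex |
| _ | 95 | 0x5f | 0137 | _ | underscore |
| ` | 96 | 0x60 | 0140 | ` | grave, rhymes with have |
| a | 97 | 0x61 | 0141 | a | lower case a |
| b | 98 | 0x62 | 0142 | b | lower case b |
| c | 99 | 0x63 | 0143 | c | lower case c |
| d | 100 | 0x64 | 0144 | d | lower case d |
| e | 101 | 0x65 | 0145 | e | lower case e |
| f | 102 | 0x66 | 0146 | f | lower case f |
| g | 103 | 0x67 | 0147 | g | lower case g |
| h | 104 | 0x68 | 0150 | h | lower case h |
| i | 105 | 0x69 | 0151 | i | lower case i |
| j | 106 | 0x6a | 0152 | j | lower case j |
| k | 107 | 0x6b | 0153 | k | lower case k |
| l | 108 | 0x6c | 0154 | l | lower case l |
| m | 109 | 0x6d | 0155 | m | lower case m |
| n | 110 | 0x6e | 0156 | n | lower case n |
| o | 111 | 0x6f | 0157 | o | lower case o |
| p | 112 | 0x70 | 0160 | p | lower case p |
| q | 113 | 0x71 | 0161 | q | lower case q |
| r | 114 | 0x72 | 0162 | r | lower case r |
| s | 115 | 0x73 | 0163 | s | lower case s |
| t | 116 | 0x74 | 0164 | t | lower case t |
| u | 117 | 0x75 | 0165 | u | lower case u |
| v | 118 | 0x76 | 0166 | v | lower case v |
| w | 119 | 0x77 | 0167 | w | lower case w |
| x | 120 | 0x78 | 0170 | x | lower case x |
| y | 121 | 0x79 | 0171 | y | lower case y |
| z | 122 | 0x7a | 0172 | z | lower case z |
| { | 123 | 0x7b | 0173 | { | left curly brace |
| \| | 124 | 0x7c | 0174 | \| | vertical bar |
| } | 125 | 0x7d | 0175 | } | right curly brace |
| ~ | 126 | 0x7e | 0176 | ~ | tilde |
|   | 127 | 0x7f | 0177 |   | DEL delete |
| € | 128 | 0x80 | 0200 | € |   |
|   | 129 | 0x81 | 0201 |   |   |
| ‚ | 130 | 0x82 | 0202 | ‚ |   |
| ƒ | 131 | 0x83 | 0203 | ƒ |   |
| „ | 132 | 0x84 | 0204 | „ |   |
| … | 133 | 0x85 | 0205 | … |   |
| † | 134 | 0x86 | 0206 | † |   |
| ‡ | 135 | 0x87 | 0207 | ‡ |   |
| ˆ | 136 | 0x88 | 0210 | ˆ |   |
| ‰ | 137 | 0x89 | 0211 | ‰ |   |
| Š | 138 | 0x8a | 0212 | Š |   |
| ‹ | 139 | 0x8b | 0213 | ‹ |   |
| Œ | 140 | 0x8c | 0214 | Œ |   |
|   | 141 | 0x8d | 0215 |   |   |
| Ž | 142 | 0x8e | 0216 | Ž |   |
|   | 143 | 0x8f | 0217 |   |   |
|   | 144 | 0x90 | 0220 |   |   |
| ‘ | 145 | 0x91 | 0221 | ‘ |   |
| ’ | 146 | 0x92 | 0222 | ’ |   |
| “ | 147 | 0x93 | 0223 | “ |   |
| ” | 148 | 0x94 | 0224 | ” |   |
| • | 149 | 0x95 | 0225 | • |   |
| – | 150 | 0x96 | 0226 | – |   |
| — | 151 | 0x97 | 0227 | — |   |
| ˜ | 152 | 0x98 | 0230 | ˜ |   |
| ™ | 153 | 0x99 | 0231 | ™ |   |
| š | 154 | 0x9a | 0232 | š |   |
| › | 155 | 0x9b | 0233 | › |   |
| œ | 156 | 0x9c | 0234 | œ |   |
|   | 157 | 0x9d | 0235 |   |   |
| ž | 158 | 0x9e | 0236 | ž |   |
| Ÿ | 159 | 0x9f | 0237 | Ÿ |   |
|   | 160 | 0xa0 | 0240 |   |   |
| ¡ | 161 | 0xa1 | 0241 | ¡ | PostScript (¡) exclamdown |
| ¢ | 162 | 0xa2 | 0242 | ¢ | PostScript (¢) cent |
| £ | 163 | 0xa3 | 0243 | £ | PostScript (£) sterling |
| ¤ | 164 | 0xa4 | 0244 | ¤ | PostScript (/) fraction |
| ¥ | 165 | 0xa5 | 0245 | ¥ | PostScript (¥) yen |
| ¦ | 166 | 0xa6 | 0246 | ¦ | PostScript (ƒ) florin |
| § | 167 | 0xa7 | 0247 | § | PostScript (§) section |
| ¨ | 168 | 0xa8 | 0250 | ¨ | PostScript (¤) currency |
| © | 169 | 0xa9 | 0251 | © | PostScript (') quotesingle |
| ª | 170 | 0xaa | 0252 | ª | PostScript (“) quotedblleft |
| « | 171 | 0xab | 0253 | « | PostScript («) guillemotleft |
| ¬ | 172 | 0xac | 0254 | ¬ | PostScript (<) guilsinglleft |
|   | 173 | 0xad | 0255 |   | PostScript (>) guilsinglright |
| ® | 174 | 0xae | 0256 | ® | PostScript fi ligature |
| ¯ | 175 | 0xaf | 0257 | ¯ | PostScript fl ligature |
| ° | 176 | 0xb0 | 0260 | ° |   |
| ± | 177 | 0xb1 | 0261 | ± | PostScript (–) endash |
| ² | 178 | 0xb2 | 0262 | ² | PostScript (†) dagger |
| ³ | 179 | 0xb3 | 0263 | ³ | PostScript (·) periodcentered |
| ´ | 180 | 0xb4 | 0264 | ´ |   |
| µ | 181 | 0xb5 | 0265 | µ |   |
| ¶ | 182 | 0xb6 | 0266 | ¶ | PostScript (¶) paragraph |
| · | 183 | 0xb7 | 0267 | · | PostScript (•) bullet |
| ¸ | 184 | 0xb8 | 0270 | ¸ | PostScript (,) quotesinglbase |
| ¹ | 185 | 0xb9 | 0271 | ¹ | PostScript („) quotedblbase |
| º | 186 | 0xba | 0272 | º | PostScript (”) quotedblright |
| » | 187 | 0xbb | 0273 | » | PostScript (») guillemotright |
| ¼ | 188 | 0xbc | 0274 | ¼ | PostScript (…) ellipsis |
| ½ | 189 | 0xbd | 0275 | ½ | PostScript (‰) perthousand |
| ¾ | 190 | 0xbe | 0276 | ¾ |   |
| ¿ | 191 | 0xbf | 0277 | ¿ | PostScript (¿) questiondown |
| À | 192 | 0xc0 | 0300 | À |   |
| Á | 193 | 0xc1 | 0301 | Á | PostScript (`) grave |
| Â | 194 | 0xc2 | 0302 | Â | PostScript (´) acute |
| Ã | 195 | 0xc3 | 0303 | Ã | PostScript (^) circumflex |
| Ä | 196 | 0xc4 | 0304 | Ä | PostScript (~) tilde |
| Å | 197 | 0xc5 | 0305 | Å | PostScript (¯) macron, overbar accent |
| Æ | 198 | 0xc6 | 0306 | Æ | PostScript (u) breve, flattened u-shaped accent |
| Ç | 199 | 0xc7 | 0307 | Ç | PostScript (·) dotaccent |
| È | 200 | 0xc8 | 0310 | È | PostScript (¨) dieresis |
| É | 201 | 0xc9 | 0311 | É |   |
| Ê | 202 | 0xca | 0312 | Ê | PostScript (°) ring |
| Ë | 203 | 0xcb | 0313 | Ë | PostScript (¸) cedilla |
| Ì | 204 | 0xcc | 0314 | Ì |   |
| Í | 205 | 0xcd | 0315 | Í | PostScript (”) hungarumlaut |
| Î | 206 | 0xce | 0316 | Î | PostScript (,) ogonek, reverse comma |
| Ï | 207 | 0xcf | 0317 | Ï | PostScript (v) caron, flattened v-shaped accent |
| Ð | 208 | 0xd0 | 0320 | Ð | PostScript (—) emdash |
| Ñ | 209 | 0xd1 | 0321 | Ñ |   |
| Ò | 210 | 0xd2 | 0322 | Ò |   |
| Ó | 211 | 0xd3 | 0323 | Ó |   |
| Ô | 212 | 0xd4 | 0324 | Ô |   |
| Õ | 213 | 0xd5 | 0325 | Õ |   |
| Ö | 214 | 0xd6 | 0326 | Ö |   |
| × | 215 | 0xd7 | 0327 | × |   |
| Ø | 216 | 0xd8 | 0330 | Ø |   |
| Ù | 217 | 0xd9 | 0331 | Ù |   |
| Ú | 218 | 0xda | 0332 | Ú |   |
| Û | 219 | 0xdb | 0333 | Û |   |
| Ü | 220 | 0xdc | 0334 | Ü |   |
| Ý | 221 | 0xdd | 0335 | Ý |   |
| Þ | 222 | 0xde | 0336 | Þ |   |
| ß | 223 | 0xdf | 0337 | ß |   |
| à | 224 | 0xe0 | 0340 | à |   |
| á | 225 | 0xe1 | 0341 | á | PostScript (Æ) AE |
| â | 226 | 0xe2 | 0342 | â |   |
| ã | 227 | 0xe3 | 0343 | ã | PostScript (ª) ordfeminine |
| ä | 228 | 0xe4 | 0344 | ä |   |
| å | 229 | 0xe5 | 0345 | å |   |
| æ | 230 | 0xe6 | 0346 | æ |   |
| ç | 231 | 0xe7 | 0347 | ç |   |
| è | 232 | 0xe8 | 0350 | è | PostScript (L/) Lslash, L with / overstrike |
| é | 233 | 0xe9 | 0351 | é | PostScript (Ø) Oslash |
| ê | 234 | 0xea | 0352 | ê | PostScript (Œ) OE |
| ë | 235 | 0xeb | 0353 | ë | PostScript (º) ordmasculine |
| ì | 236 | 0xec | 0354 | ì |   |
| í | 237 | 0xed | 0355 | í |   |
| î | 238 | 0xee | 0356 | î |   |
| ï | 239 | 0xef | 0357 | ï |   |
| ð | 240 | 0xf0 | 0360 | ð |   |
| ñ | 241 | 0xf1 | 0361 | ñ | PostScript (æ) ae |
| ò | 242 | 0xf2 | 0362 | ò |   |
| ó | 243 | 0xf3 | 0363 | ó |   |
| ô | 244 | 0xf4 | 0364 | ô |   |
| õ | 245 | 0xf5 | 0365 | õ | PostScript (1) dotlessi, i without dot |
| ö | 246 | 0xf6 | 0366 | ö |   |
| ÷ | 247 | 0xf7 | 0367 | ÷ |   |
| ø | 248 | 0xf8 | 0370 | ø | PostScript (l/) l with / overstrike |
| ù | 249 | 0xf9 | 0371 | ù | PostScript (ø) oslash |
| ú | 250 | 0xfa | 0372 | ú | PostScript (œ) oe |
| û | 251 | 0xfb | 0373 | û | PostScript (ß) germandbls |
| ü | 252 | 0xfc | 0374 | ü |   |
| ý | 253 | 0xfd | 0375 | ý |   |
| þ | 254 | 0xfe | 0376 | þ |   |
| ÿ | 255 | 0xff | 0377 | ÿ |   |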
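The Char, Dec, Hex, and Octal columns of such a table follow mechanically from the character code, so they can be regenerated rather than typed. The sketch below is one possible way to do it; `AsciiTableRow`, `caret`, and `row` are hypothetical names, and the caret notation (`^@`, `^A`, ...) is produced by adding 64 to a control code in the range 0 to 31.

```java
import java.util.Locale;

class AsciiTableRow {
    /** Caret notation for control codes 0..31: 0 -> ^@, 1 -> ^A, ..., 31 -> ^_. */
    static String caret(int code) {
        return "^" + (char) (code + 64);
    }

    /** Builds "char | dec | hex | octal" in the same layout as the table. */
    static String row(int code) {
        String ch = code < 32 ? caret(code) : String.valueOf((char) code);
        // %02x gives two lowercase hex digits, %04o a zero-padded 4-digit octal.
        return String.format(Locale.ROOT, "%s | %d | 0x%02x | %04o", ch, code, code, code);
    }

    public static void main(String[] args) {
        System.out.println(row(0));     // ^@ | 0 | 0x00 | 0000
        System.out.println(row(10));    // ^J | 10 | 0x0a | 0012
        System.out.println(row(65));    // A | 65 | 0x41 | 0101
        System.out.println(row(126));   // ~ | 126 | 0x7e | 0176
    }
}
```

The sketch does not special-case DEL (127) or the codes 128 to 159, which have no printable Latin-1 glyph; a full generator would need to decide how to display them.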