Java 完美判断中文字符

来源:互联网 发布:24周胎儿发育正常数据 编辑:程序博客网 时间:2024/05/22 23:53

转自:http://www.micmiu.com/lang/java/java-check-chinese/


Java判断一个字符串是否有中文一般情况是利用Unicode编码(CJK统一汉字的编码区间:0x4e00–0x9fbb)的正则来做判断,但是其实这个区间来判断中文不是非常精确,因为有些中文的标点符号比如:,。等等是不能识别的。


// 根据Unicode编码完美的判断中文汉字和符号
35    private static boolean isChinese(char c) {
36        Character.UnicodeBlock ub = Character.UnicodeBlock.of(c);
37        if (ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS
38                || ub == Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS
39                || ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A
40                || ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_B
41                || ub == Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION
42                || ub == Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS
43                || ub == Character.UnicodeBlock.GENERAL_PUNCTUATION) {
44            return true;
45        }
46        return false;
47    }

原创粉丝点击