你在用C++编程时遇到的最多的几个汉字是啥？

来源：互联网发布：python程序化交易编辑：程序博客网时间：2024/06/05 00:37

本文转载自http://blog.csdn.net/liygcheng/article/details/16836371

首先，什么都不说，我们来看一段代码，然后试着编译运行：

[cpp] view plaincopyprint?

#include <iostream>
#include <malloc.h>
#include <string>
using namespace std;
char* h = (char*)malloc(2*sizeof(char));
int main()
{
string str;
str.push_back(h[0]);
str.push_back(h[1]);
cout<<str<<endl;
system("pause");
return 0;
}

#include <iostream>#include <malloc.h>#include <string>using namespace std;char* h = (char*)malloc(2*sizeof(char));int main(){string str;str.push_back(h[0]);str.push_back(h[1]);cout<<str<<endl;system("pause");return 0;}

运行结果截图：

问题一：为什么会出现“屯”？

下面，我们稍微改动一下，再来看看：

[cpp] view plaincopyprint?

#include <iostream>
#include <malloc.h>
#include <string>
using namespace std;
int main()
{
string str;
char s[10];
str.push_back(s[0]);
str.push_back(s[1]);
cout<<str<<endl;
system("pause");
return 0;
}

#include <iostream>#include <malloc.h>#include <string>using namespace std;int main(){string str;char s[10];str.push_back(s[0]);str.push_back(s[1]);cout<<str<<endl;system("pause");return 0;}

运行结果截图：(注意：正常情况下编译时会出现以下情况，请点击continue)

生成结果：

问题二：

为什么会出现这个大家很熟悉的“烫”字？

在学习C++的时候，也许我们遇到的最多的两个汉字就是他们了(VC平台)，有人可能会想，这是为什么？换句话说，对于上述二个问题，你有什么看法？下面请跟着我一步步的去探究这里面的原因：

首先对于问题一：

我们要清楚的一点是，这究竟是个例，还是普遍现象？我们再来验证一下：

[cpp] view plaincopyprint?

#include <iostream>
#include <malloc.h>
#include <string>
using namespace std;
char* h1= (char*)malloc(2*sizeof(char));
char* h2= (char*)malloc(2*sizeof(char));
char* h3= (char*)malloc(10*sizeof(char));
char* h4= new char[2];
int main()
{
string str;
str.push_back(h1[0]);
str.push_back(h1[1]);
cout<<"h1: "<<str<<endl;
str.clear();
str.push_back(h2[0]);
str.push_back(h2[1]);
cout<<"h2: "<<str<<endl;
str.clear();
str.push_back(h3[0]);
str.push_back(h3[1]);
str.push_back(h3[2]);
str.push_back(h3[3]);
cout<<"h3: "<<str<<endl;
str.clear();
str.push_back(h4[0]);
str.push_back(h4[1]);
cout<<"h4: "<<str<<endl;
system("pause");
return 0;
}

#include <iostream>#include <malloc.h>#include <string>using namespace std;char* h1= (char*)malloc(2*sizeof(char));char* h2= (char*)malloc(2*sizeof(char));char* h3= (char*)malloc(10*sizeof(char));char* h4= new char[2];int main(){string str;str.push_back(h1[0]);str.push_back(h1[1]);cout<<"h1: "<<str<<endl;str.clear();str.push_back(h2[0]);str.push_back(h2[1]);cout<<"h2: "<<str<<endl;str.clear();str.push_back(h3[0]);str.push_back(h3[1]);str.push_back(h3[2]);str.push_back(h3[3]);cout<<"h3: "<<str<<endl;str.clear();str.push_back(h4[0]);str.push_back(h4[1]);cout<<"h4: "<<str<<endl;system("pause");return 0;}

结果如下：

看到结果之后，我们基本可以肯定，这并不是一个个例，而是普遍存在的现象(VC平台)。我们都清楚，在C++中，一般分成栈区域(Stack),堆区域(Heap),代码区(Code Area),数据区(Data Area)(又分为自由存储区，全局/静态存储区，常亮存储区).此处，我们仅仅考虑堆区域，和栈区域和自由存储区。堆区域一般存放的是动态内存，供程序随机申请使用，但是内存的申请与释放必须程序员亲力亲为，常见的比如New出来的内存块。栈区域则存放程序需要的局部数据，这些数据可以动态的反应程序中对函数的调用状态，一般来说系统自动回收。自由存储区是那些由malloc等分配的内存块，跟堆有点相似。这些是C++程序员应该掌握的一些基本内容。接着来，

一个汉字占两个字节，经过以上验证，我们至少可以肯定的说，堆(还有自由存储区)未初始化时的内容是最多以两个字节(都是汉字"屯"嘛，当然也更可能是以一个字节为单位，因为也可能存在GBK编码重复的嘛(真实情况就是这样啦！后面验证。。))为单位的信息(至于什么信息，后面再来验证)。

为了弄清，这里的"屯"字究竟是什么，请转到我的其中一篇博文http://blog.csdn.net/liygcheng/article/details/11686117，然后在该页面按Ctrl + F, 查找"屯"字的GBK编码，是不是下面这个情况呢？

相信这个大家都能看得懂吧，显然，"屯"字的GBK编码为“CDCD",这是16进制，那么这不就是说，每个字符的GBK编码是CD么？显然这是一个负数-51，于是我们就可以下结论了，堆空间(还有自由存储区)未初始化时的默认字符是-51。不过，至于为什么一定要是-51，这个估计就要问微软了。。

至此，问题一基本上已经回答完毕，那么问题二是不是也很简单呢？

提示：在VC中，栈空间未初始化时的字符默认是-52，补码是0xCC.

补：在文章的结尾，我们再来做个有趣的事情，来证明我们上面的结论都是正确的，参见代码如下：

[cpp] view plaincopyprint?

#include <iostream>
#include <string>
using namespace std;
char* h = new char[4];
int main()
{
memset(h,-52,4);//以字节为单位批量填充，是不是就将原本该显示的"屯"换成了"烫"呢。。
cout<<h<<endl;
system("pause");
return 0;
}

#include <iostream>#include <string>using namespace std;char* h = new char[4];int main(){memset(h,-52,4);//以字节为单位批量填充，是不是就将原本该显示的"屯"换成了"烫"呢。。cout<<h<<endl;system("pause");return 0;}

0 0