UrlRewrite、地址映射技术(生成静态页面)

来源：互联网发布：江宁广电网络客服电话编辑：程序博客网时间：2024/05/21 22:57

<%@ page language="C#" %>
<%@ import namespace=System.IO %>
<script runat="server">
protected override void OnInit (EventArgs e)
{
　 int id;
　 try
　 {
　　　 id = int.Parse (Request.QueryString["id"]);
　 }
　 catch
　 {
　　　 throw (new Exception ("页面没有指定id"));
　 }
　
　方案1：
/// <summary>
/// 传入URL返回网页的html代码
/// </summary>
/// <param name="Url">URL</param>
/// <returns></returns>
public static string getUrltoHtml(string Url)
{
errorMsg = "";
try
{
System.Net.WebRequest wReq = System.Net.WebRequest.Create(Url);
// Get the response instance.
System.Net.WebResponse wResp =wReq.GetResponse();
// Read an HTTP-specific property
//if (wResp.GetType() ==HttpWebResponse)
//{
//DateTime updated =((System.Net.HttpWebResponse)wResp).LastModified;
//}
// Get the response stream.
System.IO.Stream respStream = wResp.GetResponseStream();
// Dim reader As StreamReader = New StreamReader(respStream)
System.IO.StreamReader reader = new System.IO.StreamReader(respStream, System.Text.Encoding.GetEncoding("gb2312"));
return reader.ReadToEnd();

}
catch(System.Exception ex)
{
errorMsg = ex.Message ;
}
return "";
}

　　你可以用这个函数获取网页的客户端的html代码，然后保存到.html文件里就可以了。

　　方案2：

　　生成单个的静态页面不是难点，难的是各个静态页面间的关联和链接如何保持完整；特别是在页面频繁更新、修改、或删除的情况下；

　　像阿里巴巴的页面也全部是html的，估计用的是地址映射的功能关于地址映射可参考：http://www.easewe.com/Article/ShowArticle.aspx?article=131

　　可以看看这个页面，分析一下他的“竞价倒计时”功能http://info.china.alibaba.com/news/subject/v1-s5011580.html?head=top4&Bidding=home5

　　ASP.Net生成静态HTML页
　　在Asp中实现的生成静态页用到的FileSystemObject对象!
　　在.Net中涉及此类操作的是System.IO
　　以下是程序代码注:此代码非原创!参考别人代码

CODE:
//生成HTML页
public static bool WriteFile(string strText,string strContent,string strAuthor)
{
string path = HttpContext.Current.Server.MapPath("/news/");
Encoding code = Encoding.GetEncoding("gb2312");
// 读取模板文件
string temp = HttpContext.Current.Server.MapPath("/news/text.html");
StreamReader sr=null;
StreamWriter sw=null;
string str="";
try
{
sr = new StreamReader(temp, code);
str = sr.ReadToEnd(); // 读取文件
}
catch(Exception exp)
{
HttpContext.Current.Response.Write(exp.Message);
HttpContext.Current.Response.End();
sr.Close();
}

string htmlfilename=DateTime.Now.ToString("yyyyMMddHHmmss")+".html";
// 替换内容
// 这时,模板文件已经读入到名称为str的变量中了
str =str.Replace("ShowArticle",strText); //模板页中的ShowArticle
str = str.Replace("biaoti",strText);
str = str.Replace("content",strContent);
str = str.Replace("author",strAuthor);
// 写文件
try
{
sw = new StreamWriter(path + htmlfilename , false, code);
sw.Write(str);
sw.Flush();
}
catch(Exception ex)
{
HttpContext.Current.Response.Write(ex.Message);
HttpContext.Current.Response.End();
}
finally
{
sw.Close();
}
return true;

　　此函数放在Conn.CS基类中了在添加新闻的代码中引用注：工程名为Hover

if(Hover.Conn.WriteFilethis.Title.Text.ToString),this.Content.Text.ToString),this.Author.Text.ToString)))
{
Response.Write("添加成功");
}
else
{
Response.Write("生成HTML出错!");
}

　　模板页Text.html代码

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" >
<HTML>
<HEAD>
<title>ShowArticle</title>
<body>
biaoti
<br>
content<br>
author
</body>
</HTML>
biaoti
<br>
content<br>
author
</body>
</HTML>

　　提示添加成功后会出以当前时间为文件名的html文件!上面只是把传递过来的几个参数直接写入了HTML文件中,在实际应用中需要先添加数据库，然后再写入HTML文件

　　方案3：给一个客户端参考的例子（SJ）

　　它的作用在于以客户端的方式获取某个页面的代码，然后可以做为其他用途，本例是直接输出

<script>
var oXmlHttp = new ActiveXObject("Microsoft.XMLHTTP");
oXmlHttp.open("GET","http://www.webjx.com", false);
oXmlHttp.send()
var oStream = new ActiveXObject("ADODB.Stream");
if(oStream == null)
alert("您的机器不支持ADODB.Stream.")
else
{
oStream.Type=1;
oStream.Mode=3;
oStream.Open() ;
oStream.Write(oXmlHttp.responseBody);
oStream.Position= 0;
oStream.Type= 2;
oStream.Charset="gb2312";
var result= oStream.ReadText();
oStream.Close();
oStream = null;
var aa = window.open("","")
document.write(result);
aa.document.write(result);
}
</script>

　　方案4：学csdn一样。用xml保存数据，模版XSL也只有一个文件。

　　使用xml来保存数据，使用xsl来定义模板并且生称数据。可以通过xsl来很方便的在客户端或者服务段显示数据。如果要生成静态叶面那更简单了。去查一下.net的xml类包问题解决。

　　优点：可以方便快速转换成你想要的格式和内容。
　　缺点：需要学习更多的内容，不好入门。

　　方案5：

　　思路

　　1. 利用如Dw-Mx这样的工具生成html格式的模板，在需要添加格式的地方加入特殊标记(如$htmlformat$),动态生成文件时利用代码读取此模板，然后获得前台输入的内容，添加到此模板的标记位置中，生成新文件名后写入磁盘，写入后再向数据库中写入相关数据。
2. 使用后台代码硬编码Html文件，可以使用HtmlTextWriter类来写html文件。
优点

　　1. 可以建立非常复杂的页面，利用包含js文件的方法，在js文件内加入document.write()方法可以在所有页面内加入如页面头，广告等内容。

　　2. 静态html文件利用MS Windows2000的Index Server可以建立全文搜索引擎，利用asp.net可以以DataTable的方式得到搜索结果。而Win2000的Index服务无法查找xml文件的内容。如果包括了数据库搜索与Index索引双重查找，那么此搜索功能将非常强大。

　　3. 节省服务器的负荷，请求一个静态的html文件比一个aspx文件服务器资源节省许多。

　　缺点

　　思路二：如果用硬编码的方式，工作量非常大，需要非常多的html代码。调试困难。而且使用硬编码生成的html样式无法修改，如果网站更换样式，那么必须得重新编码，给后期带来巨大的工作量。

　　因此这里采用的是第一种思路

　　示列代码

　　1.定义(template.htm)html模板页面

　　＜html＞

　　＜head＞

　　＜title＞＜/title＞

　　＜meta http-equiv="Content-Type" content="text/html; charset=gb2312"＞

　　＜/head＞

　　＜body ＞

　　＜table $htmlformat[0] height="100%" border="0" width="100%" cellpadding="10" cellspacing="0" bgcolor="#eeeeee" style="border:1px solid #000000"＞

　　＜tr＞

　　＜td width="100%" valign="middle" align="left"＞

　　＜span style="color: $htmlformat[1];font-size: $htmlformat[2]"＞$htmlformat[3]＜/span＞

　　＜/td＞

　　＜/tr＞

　　＜/table＞

　　＜/body＞

　　＜/html＞

　　2.asp.net代码：

　　//---------------------读html模板页面到stringbuilder对象里----

　　string[] format=new string[4];//定义和htmlyem标记数目一致的数组

　　StringBuilder htmltext=new StringBuilder();

　　try

　　{

　　　using (StreamReader sr = new StreamReader("存放模板页面的路径和页面名"))

　　　{

　　String line;

　　while ((line = sr.ReadLine()) != null)

　　{

　　　htmltext.Append(line);

　　}

　　sr.Close();

　　　}

　　}

　　catch

　　{

　　　Response.Write("＜Script＞alert('读取文件错误')＜/Script＞");

　　}

　　//---------------------给标记数组赋值------------

　　format[0]="background="/blog/bg.jpg"";//背景图片

　　format[1]= "#990099";//字体颜色

　　format[2]="150px";//字体大小

　　format[3]= "＜marquee＞生成的模板html页面＜/marquee＞";//文字说明

　　//----------替换htm里的标记为你想加的内容

　　for(int i=0;i＜4;i++)

　　{

　　　htmltext.Replace("$htmlformat["+i+"]",format[i]);

　　}

　　//----------生成htm文件------------------――

　　try

　　{

　　　using(StreamWriter sw=new StreamWriter("存放路径和页面名",false,System.Text.Encoding.GetEncoding("GB2312")))

　　{

　　　sw.WriteLine(htmltext);

　　　sw.Flush();

　　　sw.Close();

　　}

　　}

　　catch

　　{

　　Response.Write ("The file could not be wirte:");

　　}


<%@ page language="C#" %>
<%@ import namespace=System.IO %>
<script runat="server">
protected override void OnInit (EventArgs e)
{
　 int id;
　 try
　 {
　　　 id = int.Parse (Request.QueryString["id"]);
　 }
　 catch
　 {
　　　 throw (new Exception ("页面没有指定id"));
　 }
　
　 string filename=Server.MapPath("statichtml_"+id+".html");
　
　 //尝试读取已有文件
　 Stream s = GetFileStream (filename);
　 if (s != null)//如果文件存在并且读取成功
　 {
　　　 using (s)
　　　 {
　　　　　 Stream2Stream (s, Response.OutputStream);
　　　　　 Response.End ();
　　　 }
　 }
　
　
　 //调用Main_Execute,并且获取其输出
　 StringWriter sw = new StringWriter ();
　 Server.Execute ("Main_Execute.aspx", sw);
　
　 string content = sw.ToString ();
　
　 //输出到客户端
　 Response.Write(content);
　 Response.Flush();
　
　 //写进文件
　
　 try
　 {
　　　 using (FileStream fs = new FileStream (filename, FileMode.Create, FileAccess.Write, FileShare.Write))
　　　 {
　　　　　 using (StreamWriter streamwriter = new StreamWriter (fs, Response.ContentEncoding))
　　　　　 {
　　　　　　　 streamwriter.Write (content);
　　　　　 }
　　　 }
　 }
　 finally
　 {
　　　 //Response.End ();
　 }
}
static public void Stream2Stream (Stream src, Stream dst)
{
　 byte[] buf = new byte[4096];
　 while (true)
　 {
　　　 int c = src.Read (buf, 0, buf.Length);
　　　 if(c==0)
　　　　　 return;
　　　 dst.Write (buf, 0, c);
　 }
}
public Stream GetFileStream(string filename)
{
　 try
　 {
　　　 DateTime dt = File.GetLastWriteTime (filename);
　　　 TimeSpan ts=dt - DateTime.Now;
　　　 if(ts.TotalHours>1)
　　　　　 return null;　　　 //1小时后过期
　　　 return new FileStream (filename, FileMode.Open, FileAccess.Read, FileShare.Read);
　 }
　 catch
　 {
　　　 return null;
　 }
}
</script>


<%@ page language="C#" %>
<html>
<head runat="server">
　 <title>Untitled Page</title>
</head>
<body>

ID:
<%=Request.QueryString["id"]%>

</body>
</html>

其中原理是这样的.
Main_Execute.aspx是生成HTML的页面.

现在用Main.aspx来对它进行缓存.
过程如下:

首先根据页面参数算出文件名.(这个例子只根据Request.QueryString["id"]来算)
尝试读取缓存的文件.如果成功,那么Response.End();
如果不成功:
使用Server.Execute来调用Main_Execute.aspx,并且获取它的结果内容.
得到内容后,立刻输出到客户端.
最后把内容写进文件里,提供给下一次做为缓存度取.

一、首先请按以下地址下载ISAPI_Rewrite组件；
http://cms.joekoe.com/public/download/?rewrite
二

1、满足搜索引擎的要求
某些搜索引擎不能支持动态页面的抓取，大量的信息就不能被潜在用户搜索到。用UrlRewrite技术你可以把 http://server/news.asp?id=111 变成 http://server/news/111.htm 这样他们就会被搜索引擎收录了。google虽然可以抓取动态页面，但是google对动态页面的评分一般低于静态页面。所以，对大量信息发布的网站，把网站地址改变成静态的绝对是值得的。

2、隐藏技术实现，提高网站的移植性
每个页面都挂着鲜明的.asp/.jsp这种开发语言的标记，可以一眼让人看出你的网站使用什么语言做的。而且在改变网站的语言的时候，你需要改动大量的链接。而且，一个页面修改了扩展名，他的pagerank也会随之消失，从头开始。我们可以用UrlRewrite技术隐藏我们的实现细节，这样修改移植都很方便，而且完全不损失pagerank。

3、满足美感的要求
对于追求完美主义的网站设计师，即使是网页的地址也要看起来简洁明快。形如 http://server/news.asp?channel=3&id=111 的网页地址，肯定是上不了完美主义者的法眼的，用UrlRewrite技术，你可以把他变成 http://server/news/3/111.htm 。

IIS 5.0支持UrlRewrite么？

答案很简单，不支持。但是我们可以通过安装服务器扩展让IIS支持。

目前有两种产品支持IIS 5.0的UrlRewrite，isapi_rewrite 和 IIS Rewrite。

isapi_rewrite: http://www.helicontech.com/download/#isapi_rewrite
IIS Rewrite :http://www.qwerksoft.com/products/iisrewrite/download.asp

这里只有ISAPI Rewrite的一个LITE版本是免费的，其它都是trial版本。ISAPI Rewrite Lite的版本功能。

我们采用isapi_rewrite Lite Version(免费版本)。

引用:
This is simplified edition of ISAPI_Rewrite. It does not support per-virtual-site configurations, proxiing, metabase monitoring and automatic cache cleanup but all other features are supported.

所以，lite版本不支持虚拟站点配置，元数据监测和自动缓存清理。

metabase元数据：metabase 元数据库指一个驻留内存的数据存储区域，其中存放着IIS的配置值。/Metabase是储存成System32/Inetsrv
资料夹中的Metabase.bin文件

如何进行UrlRewrite的设置？

isapi_rewrite利用正则表达式进行替换规则的表示。

下面是一个简单的例子，我想让我们的用户输入 http://server/test-12314.html 实际上访问的是 http://server/test.asp?id=12314 。那么我们的匹配表达式应该是 /test-([0-9]*).html 对应的格式化表达式应该为 /test.asp/?id=$1 。

进行正则表达式的编写的时候，可以利用isapi_rewrite提供的正则表达式测试工具（默认安装提供），进行调试。如下图：

做好了匹配表达式和格式化表达式，我们可以按照下面的格式，把它们放到安装目录下的httpd.ini里面。

格式：RewriteRule 匹配表达式格式化表达式
刚才的例子：RewriteRule /test-([0-9]*).html /test.asp/?id=$1

文件保存后，不需重新启动iis即可生效。

参考资料：

面向Google(Search Engine Friendly)的URL设计
http://www.chedong.com/tech/google_url.html

ISAPI REWRITE文档
http://www.isapirewrite.com/docs/

操作实例：

1.下载ISAPI_Rewrite.ISAPI_Rewrite分精简(Lite)和完全(Full)版.精简版不支持对每个虚拟主机站点进行重写,只能进行全局处理.不过对于有服务器的朋友,精简版也就够啦.精简版下载地址:http://www.helicontech.com/download/,就是那Lite Version (free)啦.

2.安装.msi的文件,和装一般程序一样装就可以了,俺就装在D:/ISAPI_Rewrite.

3.接下来一步比较重要哦,看仔细喽.打开Internet 信息服务,右键,web站点属性,电ISAPI筛选器选项卡.添加筛选器,名称自己填,路径自己指定ISAPI_Rewrite.dll,然后确定.

.来测试一下.新建一个1ting.asp,里面写上

CODE: [Copy to clipboard]
<%=request.querystring("inso")%>

效果就是执行的时候1ting.asp?inso=*浏览器显示*.

5.这一步很重要哦,开始添加rewrite规则.正则,好头痛,幸亏这个例子比较简单.
找到ISAPI_Rewrite目录,把httpd.ini的只读属性去掉,打开编辑.我们要把1ting.asp?inso=im286映射成为1ting-im286.html这样的类型,需要在httpd.ini里加上这么一行:

CODE: [Copy to clipboard]
RewriteRule /1ting-([0-9,a-z]*).html /1ting.asp/?inso=$1

,保存.

转自：http://onhigh.blog.hexun.com/10223847_d.html