利用VB提取HTML文件中的EMAIL地址

来源:互联网 发布:农村淘宝现状 编辑:程序博客网 时间:2024/06/03 19:37
<script type="text/javascript"><!--google_ad_client = "pub-2947489232296736";/* 728x15, 创建于 08-4-23MSDN */google_ad_slot = "3624277373";google_ad_width = 728;google_ad_height = 15;//--></script><script type="text/javascript"src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
<script type="text/javascript"><!--google_ad_client = "pub-2947489232296736";/* 160x600, 创建于 08-4-23MSDN */google_ad_slot = "4367022601";google_ad_width = 160;google_ad_height = 600;//--></script><script type="text/javascript"src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>

利用VB提取HTML文件中EMAIL地址

    电子邮件(EMAIL)是INTERNET上应用最广泛的一种服务之一。我们每天都在使用电子邮件,有时为了宣传我们的产品、网站等,更是离不开电子邮件,这就需要收集很多的EMAIL地址。下面我们将向大家介绍用VB自编一个EMAIL地址提取器,用来提取保存在我们硬盘中的HTML文件中所包含的EMAIL地址一 设计界面    进入VB,选择“标准EXE”新建一工程,选择“工程”菜单下的“引用”,选中Microsoft scripting Runtime”,然后再选择“工程”菜单中的“部件”,在弹出的对话框中选择“Microsoft common dialog control 6.0”,在工具箱中加入通用对话框控件。接下来在默认窗体FORM1上添加三个标签控件,一个文本框控件text1,一个列表框控件LIST1,并命名为lstEMAIL,三个命令command1~command3,其Caption属性分别设置为“提取”、“整理”、“保存”,设置完成的界面如下图所示:

二 输入源程序Dim X, Y, St1, St2, tmpY As Integer'提取EMAIL地址子程序Private Sub StripEMAIL(FilePath As String)Dim tmpEMAIL1, tmpEMAIL2 As StringOpen FilePath For Input As #1Do Until EOF(1)On Error Resume NextInput #1, tmpEMAIL1For X = 1 To Len(tmpEMAIL1)tmpEMAIL2 = Mid(tmpEMAIL1, X, 7)'查找EMAIL标志If tmpEMAIL2 = "mailto:" ThenSt1 = XtmpY = X + 1For Y = 1 To Len(tmpEMAIL1)tmpEMAIL2 = Mid(tmpEMAIL1, tmpY, 1)If tmpEMAIL2 = Chr(34) Or tmpEMAIL2 = "?" ThenSt2 = tmpYtmpEMAIL2 = Mid(tmpEMAIL1, St1 + 7, ((St2 - St1) - 7))If (Left(tmpEMAIL2, 2) <> "//") And (Left(tmpEMAIL2, 1) <> " ") ThenlstEMAIL.AddItem tmpEMAIL2Exit ForEnd IfEnd IftmpY = tmpY + 1Next YEnd IfNext XLoopClose #1End SubPrivate Sub Command1_Click()Dim fs As New FileSystemObject ' 建立 FileSystemObjectDim fd As Folder ' 定义 Folder 对象Dim sfd As FolderSet fd = fs.GetFolder(Text1)Command1.Enabled = FalseScreen.MousePointer = VBHourglassFindFile fd, "*.htm" 'Text1.TextCommand1.Enabled = TrueScreen.MousePointer = VBDefaultEnd SubSub FindFile(fd As Folder, FileName As String)Dim sfd As Folder, f As File' Part I查找该文件夹的所有文件For Each f In fd.FilesIf UCase(f.Name) Like UCase(FileName) ThenLabel2 = f.PathStripEMAIL (f.Path)lblEMAIL = "已查找到的地址数为: " & lstEMAIL.ListCountEnd IfDoEventsNext' Part II循环查找所有子文件夹For Each sfd In fd.SubFoldersFindFile sfd, FileName ' 循环查找NextEnd Sub Private Sub Command2_Click()'去掉重复的EMAIL地址For i = 0 To lstEMAIL.ListCount - 1For X = 0 To lstEMAIL.ListCount - 1If i = X Then GoTo NextxIf LCase(lstEMAIL.List(X)) = LCase(lstEMAIL.List(i)) ThenOn Error Resume NextlstEMAIL.RemoveItem XEnd IfNextx:Next XNext ilblEMAIL = "共有" & lstEMAIL.ListCount & "个地址"End Sub'保存Private Sub Command3_Click()'设置文件名Dim strname As Stringcommondialog1.Filter = "文本文件(*.txt)|*.txt"commondialog1.ShowSaveIf commondialog1.FileName <> "" Thenstrname = commondialog1.FileNameElsestrname = App.Path & "EMAILaddress.txt"End If'保存文件Open strname For Output As #1On Error Resume NextFor i = 0 To lstEMAIL.ListCount - 1Print #1, lstEMAIL.List(i)NextClose #1End Sub本程序在WINDOWS ME、VB6.0中文企业版中运行通过。以上程序稍加修改即可实现提取其他类型文件中EMAIL地址。<script type="text/javascript"><!--google_ad_client = "pub-2947489232296736";/* 728x15, 创建于 08-4-23MSDN */google_ad_slot = "3624277373";google_ad_width = 728;google_ad_height = 15;//--></script><script type="text/javascript"src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>
<script type="text/javascript"><!--google_ad_client = "pub-2947489232296736";/* 160x600, 创建于 08-4-23MSDN */google_ad_slot = "4367022601";google_ad_width = 160;google_ad_height = 600;//--></script><script type="text/javascript"src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script>