正则表达式匹配HTML
来源:互联网 发布:网龙网络控股有限公司 编辑:程序博客网 时间:2024/05/22 17:13
正则表达式 | <(\S*?) [^>]*>.*?</\1>|<.*? />匹配 | <html>hello</html>|<a>abcd</a>不匹配 | abc|123|<html>ddd正则表达式 | ^[^<>`~!/@\#}$%:;)(_^{&*=|'+]+$匹配 | This is a test不匹配 | <href = | <br> | That's it正则表达式 | <!--.*?-->匹配 | <!-- <h1>this text has been removed</h1> --> | <!-- yada -->不匹配 | <h1>this text has not been removed</h1>正则表达式 | (\[(\w+)\s*(([\w]*)=('|")?([a-zA-Z0-9|:|\/|=|-|.|\?|&]*)(\5)?)*\])([a-zA-Z0-9|:|\/|=|-|.|\?|&|\s]+)(\[\/\2\])匹配 | [link url="http://www.domain.com/file.extension?getvar=value&secondvar=value"]Link[/li不匹配 | [a]whatever[/b] | [a var1=something var2=somethingelse]whatever[/a] | [a]whatever[a]正则表达式 | href=[\"\'](http:\/\/|\.\/|\/)?\w+(\.\w+)*(\/\w+(\.\w+)?)*(\/|\?\w*=\w*(&\w*=\w*)*)?[\"\']匹配 | href="www.yahoo.com" | href="http://localhost/blah/" | href="eek"不匹配 | href="" | href=eek | href="bad example"正则表达式 | "([^"](?:\\.|[^\\"]*)*)"匹配 | "This is a \"string\"."不匹配 | "This is a \"string\".正则表达式 | (?i:on(blur|c(hange|lick)|dblclick|focus|keypress|(key|mouse)(down|up)|(un)?load|mouse(move|o(ut|ver))|reset|s(elect|ubmit)))匹配 |
| onclick | onsubmit | onmouseover不匹配 | click | onandon | mickeymouse
正则表达式 | (?s)/\*.*\*/匹配 | /* .................... */ | /* imagine lots of lines here */不匹配 | */ malformed opening tag */ | /* malformed closing tag /*正则表达式 | <(\S*?) [^>]*>.*?</\1>|<.*? />匹配 | <html>hello</html>|<a>abcd</a>不匹配 | abc|123|<html>ddd正则表达式 | \xA9匹配 | ©不匹配 | anything正则表达式 | src[^>]*[^/].(?:jpg|bmp|gif)(?:\"|\')匹配 | src="../images/image.jpg" | src="http://domain.com/images/image.jpg" | src='d:\w不匹配 | src="../images/image.tif" | src="cid:value"正则表达式 | /\*[\d\D]*?\*/匹配 | /* my comment */ | /* my multiline comment */ | /* my nested comment */不匹配 | */ anything here /* | anything between 2 seperate comments | \* *\正则表达式 | <[a-zA-Z]+(\s+[a-zA-Z]+\s*=\s*("([^"]*)"|'([^']*)'))*\s*/>匹配 | <img src="test.gif"/>不匹配 | <img src="test.gif"> | <img src="test.gif"a/> 0 0