php中处理大文本数据字符统计

来源:互联网 发布:桌宠软件 编辑:程序博客网 时间:2024/06/05 12:49

10G文本数据 如何从中统计每行每个字符出现次数和 出现的位置等


static function getString() {
$handle = fopen("./file.txt","r");
$rows = "";
$result = array();
while(!feof($handle)) {
$rows = fgets($handle);
$result[] = self::countString($rows);
}
fclose($handle);
print_r($result);
}
//处理数据
static function countString($str) {
if(!$str) return false; 
$string = explode(" ",$str);
if(!is_array($string)) return false;
foreach ($string as $key=>$val) {
if(in_array($val,$string))
$result[$val] +=1;
}
return $result;
}
效果:
Array
(
[0] => Array
(
[lygy] => 2
[is,] => 1
[asjkow] => 1
[baley] => 1
[sji'] => 1
[sji] => 1
[aa] => 2
[bb] => 2
[cc] => 1
[cc
] => 1
)
[1] => Array
(
[my] => 1
[name] => 1
[is] => 2
[james,this] => 1
[book'
] => 1
)
[2] => Array
(
[bn] => 1
[ass] => 1
[sbbs] => 1
[ssa] => 1
[skww] => 1
[www] => 2
)
)

0 0
原创粉丝点击