Data Analysis Study Notes --- Vectorized String Processing for Data Cleaning in Python

Source: Internet   Editor: 程序博客网   Date: 2024/04/29 10:16



# -*- coding=utf-8 -*-
import pandas as pd

# Vectorized string operations on a Series of email addresses
data = pd.Series({'li': '120@qq.com', 'wang': '5632@qq.com', 'chen': '8622@xinlang.com', 'zhao': '9872@163.com', 'sun': '5243@gmail.com'})
print(data)

# contains: boolean mask of the entries that contain '0'
print(data.str.contains('0'))

# Extract the needed substrings with capture groups.
# In current pandas, str.match only returns a boolean mask, so str.extract is
# used here; it returns a DataFrame with one column per capture group.
# [a-z0-9]+ (rather than [a-z]+) lets numeric domains such as 163 match too.
pattern = r'(\d*)@([a-z0-9]+)\.([a-z]{2,4})'
result = data.str.extract(pattern)
print(result)

# Get the part before the @ (email prefix)
print(result[0])

# Get the email domain name
print(result[1])


Result:
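The values produced by the `contains` check and the group extraction can also be verified without relying on printed output. The following is a minimal sketch, assuming a recent pandas version in which `Series.str.extract` returns a DataFrame; the group names `user`, `domain` and `tld` are hypothetical names chosen for illustration, and the domain group is written as `[a-z0-9]+` so that `163` matches.

```python
import pandas as pd

data = pd.Series({
    'li': '120@qq.com',
    'wang': '5632@qq.com',
    'chen': '8622@xinlang.com',
    'zhao': '9872@163.com',
    'sun': '5243@gmail.com',
})

# str.contains returns a boolean Series; only '120@qq.com' contains a '0'
has_zero = data.str.contains('0')

# Named groups (hypothetical names) become the column names of the
# DataFrame returned by str.extract
pattern = r'(?P<user>\d*)@(?P<domain>[a-z0-9]+)\.(?P<tld>[a-z]{2,4})'
parts = data.str.extract(pattern)

print(has_zero)
print(parts)
```

With named groups, the prefix and the domain can be read as `parts['user']` and `parts['domain']` instead of by position.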



