机器学习常用工具
来源:互联网 发布:郑州市招聘网网络 编辑:程序博客网 时间:2024/04/29 23:29
机器学习
Support Vector Machine
- SVMlight
- LIBSVM
Decision Tree
- C4.5
Maximum Entropy
- YASMET
Conditional Random Field
- CRF++
自然语言处理
综合
- OpenNLP
- CMU Statistical Language Modeling Toolkit
- The Dragon ToolKit
- LingPipe
- track mentions of entities (e.g. people or proteins);
- link entity mentions to database entries;
- uncover relations between entities and actions;
- classify text passages by language, character encoding, genre, topic, or sentiment;
- correct spelling with respect to a text collection;
- cluster documents by implicit topic and discover significant trends over time; and
- provide part-of-speech tagging and phrase chunking.
- Natural Language Toolkit
- Antelope
- Advanced Natural Lange Object-oriented Processing Environment.包括一系列工具(特别c#的stanford parser)
分词
- ICTCLAS
- Stanford Chinese Word Segmenter
词性标注
- Brill tagger
- Stanford POS Tagger
- MBT:Memory-based Tagger
- TreeTagger
- SVMTool, a POS Tagger based on SVMs
- QTAG Part of speech tagger
命名实体识别
- Stanford Named Entity Recognizer
- LingPipe
- YamCha
Stemming
- Porter Stemming
- Snowball
句法分析
- Stanford Parser
- Berkeley Parser
文本挖掘
摘要
- Rouge Rouge在Windows下的配置
其他
加密
- OpenSSL
压缩
- zlib
日志
- Apache Logging Services
- log4j for Java,
- log4cxx for C++, and
- log4net for MS .Net framework.
Unicode
- ICU
XML
- Xerces
多字符串匹配
- AC in C#: Aho-Corasick string matching in C#
HTML Parser
- Html Agility Pack, an agile HTML parser that builds a read/write DOM and supports plain XPATH or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
- Majestic-12, an open source high-performance .NET C# module that was created to parse HTML for links, indexing and other purposes. 速度快,但不生成dom树
外部联接
- An annotated list of resources by Stanford NLP Group
- KDnuggets 有一些与KDD相关的软件等
文章来源 : http://fuliang.iteye.com/blog/955023
http://video.sina.com.cn/v/b/107900125-2192582404.html
- 机器学习常用工具
- 机器学习常用工具
- 机器学习常用工具<转>
- NLP常用工具及机器学习各类工具比较
- java 学习常用工具
- 常用工具类学习笔记
- Hibernate学习之(Hibernate 常用工具)
- [学习笔记]Java常用工具类
- 脱壳学习笔记一:常用工具
- [学习笔记]Java常用工具类
- Android学习笔记--常用工具类
- BackTrack5 学习笔记2 常用工具
- 常用工具
- 常用工具
- 常用工具
- 常用工具
- 常用工具
- 常用工具
- hdu 4715 Difference Between Primes (打表 枚举)
- 获取汉字的拼音
- 算法大师资料
- Oracle 默认表空间(default permanent tablespace) 说明
- 面试题20130909
- 机器学习常用工具
- 接口和开发的首发机会降临的萨芬
- #面试题#最长等差数列
- 白话经典算法
- POJ 1936
- confluence+jira+mysql 破解记录
- 深入理解C/C++数组和指针
- 并查集
- netfilter内核态与用户态 通信 之 sockopt