Another free word-frequency list, a must-have for learning English.
Source: Internet  Published by: fm2017数据分析师属性  Editor: 程序博客网  Time: 2024/05/01 12:32
N-GRAMS
from the COCA and COHA corpora of American English
These n-grams are based on the largest publicly-available, genre-balanced corpus of English -- the 450 million word Corpus of Contemporary American English (COCA). With this n-gram data (2-, 3-, 4-, and 5-word sequences, each with its frequency), you can carry out powerful queries offline -- without needing to access the corpus via the web interface.
A few examples (from among an unlimited number of searches) might be:
- NOUN + NOUN sequences
- three word strings with a preposition in the middle position
- VERB + the + NOUN sequences
- two word strings, where the words begin or end with certain letters
- like + word + word
- (potential) phrasal verb: VERB + ADV particle

The data is available in several different formats:
1. Free lists: 1 million most frequent 2, 3, 4, and 5-grams
2. Inexpensive data sets: all n-grams that occur three times or more (6.2 million 2-grams, 11.9 million 3-grams, and 8.3 million 4-grams)
3. All 2, 3, and 4-grams: up to 155 million distinct strings -- searchable by word form and part of speech (as above), and also lemma
If you're interested in the frequency of single words (including frequency by genre and sub-genre), or collocates (all words "near by" a given word), you might look at http://www.wordfrequency.info.
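As a rough illustration of what an offline query over such n-gram data could look like, here is a minimal Python sketch. It rests on assumptions not stated on this page: that each line is tab-separated as frequency, then the n words, then n part-of-speech tags, and that the tags follow a CLAWS-style scheme in which noun tags begin with `nn`, lexical verb tags with `vv`, and preposition tags with `i`. Check the actual files before relying on any of this. The helper names (`read_ngrams`, `search`, `pos`, `word`, `starts_with`, `anything`) are hypothetical, and the sample rows and their frequencies are invented for the demo, not taken from COCA.

```python
import io
from typing import Callable, Iterable, Iterator, List, NamedTuple, Sequence, Tuple

# A slot is a test applied to one (word, tag) position in an n-gram.
Slot = Callable[[str, str], bool]


class NGram(NamedTuple):
    freq: int
    words: Tuple[str, ...]
    tags: Tuple[str, ...]


def read_ngrams(handle: Iterable[str], n: int) -> Iterator[NGram]:
    """Parse lines of the ASSUMED layout: freq <TAB> w1..wn <TAB> tag1..tagn."""
    for line in handle:
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 1 + 2 * n:
            continue  # skip lines that do not fit the assumed layout
        try:
            freq = int(fields[0])
        except ValueError:
            continue  # skip header or malformed rows
        yield NGram(freq, tuple(fields[1:1 + n]), tuple(fields[1 + n:]))


def pos(prefix: str) -> Slot:
    """Match a position whose tag starts with the given prefix."""
    return lambda w, t: t.lower().startswith(prefix)


def word(form: str) -> Slot:
    """Match a position whose word form equals `form`, ignoring case."""
    return lambda w, t: w.lower() == form.lower()


def starts_with(letters: str) -> Slot:
    """Match a position whose word begins with the given letters."""
    return lambda w, t: w.lower().startswith(letters.lower())


def anything(w: str, t: str) -> bool:
    """Match any position."""
    return True


def search(ngrams: Iterable[NGram], slots: Sequence[Slot]) -> List[NGram]:
    """Return n-grams matching every slot, most frequent first."""
    hits = [
        g for g in ngrams
        if len(g.words) == len(slots)
        and all(s(w, t) for s, w, t in zip(slots, g.words, g.tags))
    ]
    return sorted(hits, key=lambda g: g.freq, reverse=True)


# Invented sample rows (NOT real COCA frequencies), in the assumed layout.
SAMPLE_2 = (
    "1200\thealth\tcare\tnn1\tnn1\n"
    "950\tof\tthe\tio\tat\n"
    "400\tpick\tup\tvv0\trp\n"
)
SAMPLE_3 = (
    "700\topen\tthe\tdoor\tvv0\tat\tnn1\n"
    "650\tone\tof\tthe\tmc1\tio\tat\n"
    "300\tend\tof\tday\tnn1\tio\tnn1\n"
)

bigrams = list(read_ngrams(io.StringIO(SAMPLE_2), 2))
trigrams = list(read_ngrams(io.StringIO(SAMPLE_3), 3))

noun_noun = search(bigrams, [pos("nn"), pos("nn")])
verb_the_noun = search(trigrams, [pos("vv"), word("the"), pos("nn")])
prep_in_middle = search(trigrams, [anything, pos("i"), anything])

for label, hits in [("NOUN + NOUN", noun_noun),
                    ("VERB + the + NOUN", verb_the_noun),
                    ("preposition in the middle", prep_in_middle)]:
    print(label, [" ".join(g.words) for g in hits])
```

Building each query from a list of per-position tests keeps the other example searches short to express, for instance `[word("like"), anything, anything]` for "like + word + word". For a real file, `io.StringIO(...)` would be replaced by an opened file handle.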