Spell Checker

Source: Internet  Editor: 程序博客网  Date: 2024/05/31 13:15

1. Problem

Write a program that reads a large list of English words (e.g. from /usr/share/dict/words on a Unix system) into memory, then reads words from stdin and prints either the best spelling suggestion or “NO SUGGESTION” if no suggestion can be found. The program should print “>” as a prompt before reading each word and should loop until killed.

Your solution should be faster than O(n) per word checked, where n is the length of the dictionary. That is to say, you can’t scan the dictionary every time you want to spellcheck a word.

For example:

  >sheeeeep
  sheep
  >peepple
  people
  >sheeple
  NO SUGGESTION
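
A session like the one above could be driven by a small read loop. The following is only a sketch: `load_words`, `repl`, and the `lookup` callable are hypothetical names, and stopping at end of input (instead of running strictly until killed) is an assumption made so the loop also works when input is piped in.

```python
import sys


def load_words(path="/usr/share/dict/words"):
    """Read one word per line into memory, skipping blank lines."""
    with open(path, encoding="utf-8", errors="replace") as f:
        return [line.strip() for line in f if line.strip()]


def repl(lookup, stdin=sys.stdin, stdout=sys.stdout):
    """Print ">" before each read, then print whatever `lookup` returns.

    `lookup` is any function mapping a typed word to a suggestion
    string (or to "NO SUGGESTION"); it is a placeholder here.
    """
    while True:
        stdout.write(">")
        stdout.flush()
        line = stdin.readline()
        if not line:  # end of input: stop instead of spinning forever
            break
        stdout.write(lookup(line.strip()) + "\n")
```

Passing the streams as parameters keeps the loop easy to exercise with in-memory buffers before wiring it to a real dictionary lookup.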

The class of spelling mistakes to be corrected is as follows:

Case (upper/lower) errors:

  "inSIDE" => "inside"

Repeated letters:

  "jjoobbb" => "job"

Incorrect vowels:

  "weke" => "wake"

In addition, any combination of the above types of error in a single word should be corrected (e.g. "CUNsperrICY" => "conspiracy").

If there are many possible corrections of an input word, your program can choose one in any way you like; however, your results must match the examples above (e.g. "sheeeeep" should return "sheep" and not "shap").
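
One way to stay below O(n) per lookup is to index the dictionary by a canonical key that all three error classes leave unchanged. The sketch below is an illustration under stated assumptions, not necessarily the approach of the linked solution: vowels are taken to be a, e, i, o, u only, the names `canonical_key`, `build_index`, `reachable`, and `suggest` are hypothetical, and ranking candidates by `difflib.SequenceMatcher` similarity is just one possible tie-breaker.

```python
import re
from collections import defaultdict
from difflib import SequenceMatcher

VOWELS = "aeiou"  # assumption: "y" is treated as a consonant


def canonical_key(word):
    """Lowercase, mask every vowel as '*', then collapse repeated symbols."""
    masked = "".join("*" if c in VOWELS else c for c in word.lower())
    return re.sub(r"(.)\1+", r"\1", masked)


def build_index(words):
    """Map each canonical key to the dictionary words that share it."""
    index = defaultdict(list)
    for w in words:
        index[canonical_key(w)].append(w)
    return index


def reachable(candidate, typed):
    """True if `typed` can arise from `candidate` by case changes,
    repeated letters, and vowel substitutions only."""
    pattern = "".join(
        "[aeiou]+" if c in VOWELS else re.escape(c) + "+"
        for c in candidate.lower()
    )
    return re.fullmatch(pattern, typed.lower()) is not None


def suggest(index, typed):
    """Look up one bucket, filter it, and return the most similar word."""
    bucket = index.get(canonical_key(typed), [])
    candidates = [w for w in bucket if reachable(w, typed)]
    if not candidates:
        return "NO SUGGESTION"
    low = typed.lower()
    return max(
        candidates,
        key=lambda w: SequenceMatcher(None, low, w.lower()).ratio(),
    )
```

The key alone over-matches (for instance "wek" and "weak" share the key `w*k`, although dropping a letter is not one of the listed errors), which is why `reachable` filters the bucket afterwards. Each lookup touches a single bucket, so its cost depends on the word length and the bucket size rather than on the size of the whole dictionary.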

Final step: Write a second program that generates words with spelling mistakes of the above form, starting with correctly spelled English words. Pipe its output into the first program and verify that there are no occurrences of “NO SUGGESTION” in the output.
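
The generator for this final step could look roughly like the sketch below. The function name `misspell`, the 0.3 probabilities, and the limit of three repetitions per letter are arbitrary assumptions chosen for illustration; vowels are again assumed to be a, e, i, o, u.

```python
import random

VOWELS = "aeiou"  # assumption: "y" is treated as a consonant


def misspell(word, rng=random):
    """Apply the three error classes at random to a correctly spelled word."""
    out = []
    for c in word:
        # Incorrect vowels: sometimes swap a vowel for a random vowel.
        if c.lower() in VOWELS and rng.random() < 0.3:
            c = rng.choice(VOWELS)
        # Case errors: pick upper or lower case at random.
        c = c.upper() if rng.random() < 0.3 else c.lower()
        # Repeated letters: emit the letter one to three times.
        out.append(c * rng.randint(1, 3))
    return "".join(out)
```

Printing `misspell(word)` for each dictionary word, one per line, gives a stream that can be piped into the spell checker. Passing a seeded `random.Random` instance as `rng` makes a failing run reproducible.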

2. Solution

http://bo-yang.github.io/2014/09/13/spell-checker
