Project author: mfournial

Project description:
Fast spell checking of words against an English dictionary
Language: Java
Repository: git://github.com/mfournial/SpellChecker.git
Created: 2017-04-21T11:59:40Z
Project page: https://github.com/mfournial/SpellChecker

License: MIT License


SpellChecker

Spell checking made fast and easy.
SpellChecker is a fast, scalable way to check the spelling of your documents.
For now, it takes a document (e.g. a .txt file) as an argument and outputs the misspelled words.

PROJECT STILL IN BETA. Upcoming versions will separate working code from beta code.

Example:

  Reading from: src/example.txt
  Numbers of words checked: 48
  Numbers of words in dictionary: 354975
  Misspelled words:
  writtten
  'text
  deected
  theee
  'dont'
  'dont
  onkey
  onkey

Acknowledgements

SpellChecker was originally written in C in 2015 as coursework for CS50.
English dictionary from dwyl.

Known bugs and Issues

Numbers are not checked (for example, 1st versus the invalid 1nd), and special characters are largely ignored. There may be a better correction to offer that takes them into account.
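
One possible way to handle stray punctuation before lookup is sketched below. This is an assumption for illustration, not how SpellChecker currently tokenizes: the class TokenSketch and its methods normalize and tokens are hypothetical names. The sketch trims non-letter characters from both ends of a token, keeps inner apostrophes, and skips any token containing a digit, which mirrors the current limitation that numbers are not checked.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

/** Hypothetical tokenizer sketch; not part of the SpellChecker code base. */
final class TokenSketch {

    /** Returns the token lowercased with edge punctuation removed, or "" if it should be skipped. */
    static String normalize(String token) {
        // Skip anything containing a digit (numbers are not checked).
        for (int i = 0; i < token.length(); i++) {
            if (Character.isDigit(token.charAt(i))) {
                return "";
            }
        }
        int start = 0;
        int end = token.length();
        // Trim non-letters from the left and right; inner apostrophes survive.
        while (start < end && !Character.isLetter(token.charAt(start))) {
            start++;
        }
        while (end > start && !Character.isLetter(token.charAt(end - 1))) {
            end--;
        }
        return token.substring(start, end).toLowerCase(Locale.ROOT);
    }

    /** Splits text on whitespace and returns the non-empty normalized tokens. */
    static List<String> tokens(String text) {
        List<String> out = new ArrayList<>();
        for (String raw : text.split("\\s+")) {
            String t = normalize(raw);
            if (!t.isEmpty()) {
                out.add(t);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Prints each normalized token on its own line.
        for (String t : tokens("It's 'text, 'dont' 1st!")) {
            System.out.println(t);
        }
    }
}
```

With this approach, a token such as 'dont' would be looked up as dont, so the misspelling would be reported once rather than in several quoted variants.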

What’s next for Spellchecker

  • Integrate it into an application or a GUI for continuous spell checking, which is where it performs best
    (load time >> check time).
  • Improve load time by saving the Dictree state.
  • Support other languages. Why not also mix languages?
  • Suggest similar words as corrections.
  • Compare it with a radix tree implementation, or switch to a radix tree after about 3 or 4 letters to avoid wasting space on empty arrays.
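
To make the "empty arrays" trade-off concrete, here is a minimal sketch of an array-based trie of the kind the list above alludes to. The internals of the project's Dictree are not shown here, so everything in this block is an assumption: the class name TrieSketch, the 27-slot alphabet (a to z plus the apostrophe), and the methods insert and contains are all hypothetical.

```java
/** Hypothetical array-based trie; not the project's actual Dictree class. */
final class TrieSketch {
    // Assumed alphabet: 26 letters plus the apostrophe.
    private static final int ALPHABET = 27;

    private static final class Node {
        // One slot per character; deep nodes leave most slots null,
        // which is the wasted space a radix tree would avoid.
        final Node[] children = new Node[ALPHABET];
        boolean isWord;
    }

    private final Node root = new Node();

    /** Maps a character to a child slot, or -1 if it is outside the assumed alphabet. */
    private static int slot(char c) {
        if (c >= 'a' && c <= 'z') return c - 'a';
        if (c >= 'A' && c <= 'Z') return c - 'A'; // case-insensitive
        if (c == '\'') return 26;
        return -1;
    }

    /** Adds a word; returns false if it is empty or contains an unsupported character. */
    boolean insert(String word) {
        if (word.isEmpty()) return false;
        // Validate first so a rejected word leaves no partial path behind.
        for (int i = 0; i < word.length(); i++) {
            if (slot(word.charAt(i)) < 0) return false;
        }
        Node node = root;
        for (int i = 0; i < word.length(); i++) {
            int s = slot(word.charAt(i));
            if (node.children[s] == null) {
                node.children[s] = new Node();
            }
            node = node.children[s];
        }
        node.isWord = true;
        return true;
    }

    /** Looks a word up in time proportional to its length, independent of dictionary size. */
    boolean contains(String word) {
        Node node = root;
        for (int i = 0; i < word.length(); i++) {
            int s = slot(word.charAt(i));
            if (s < 0) return false;
            node = node.children[s];
            if (node == null) return false;
        }
        return node.isWord;
    }

    public static void main(String[] args) {
        TrieSketch trie = new TrieSketch();
        String[] dictionary = {"written", "text", "detected", "the", "don't"};
        for (String w : dictionary) {
            trie.insert(w);
        }
        String[] toCheck = {"written", "writtten", "Text", "dont"};
        for (String w : toCheck) {
            System.out.println(w + " -> " + (trie.contains(w) ? "ok" : "misspelled"));
        }
    }
}
```

In a structure like this, each lookup walks one node per character, which would explain why checking is cheap compared with loading: building the tree allocates a 27-slot array for every node, while a check only follows existing references.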