项目作者: piravp

项目描述 :
Summarize a text using a simple ranking algorithm.
高级语言: Java
项目地址: git://github.com/piravp/auto-summarizer.git
创建时间: 2017-03-11T04:10:49Z
项目社区:https://github.com/piravp/auto-summarizer

开源协议:MIT License

下载


Summarizr

Application returns a summary of a text. It’s currently implemented with a simple ranking algorithm based on word occurrences.

Example

See example for example dump.

Files

  • file_x.txt is a file with dummy text. x is the language code according to the ISO 639-1 standard.
  • stopwords-x.txt is a list of stop-words for respective language.

Documentation

Documentation can be found in docs/ with entry point index.html at the root level.

To-Do

Since the core functionality is implemented, further improvements can be made on top. This includes (listed after most likely to be done first)…

  • Generate Javadocs
  • Generate executable
  • Analyze text reduction
  • Write final summary to a file
  • Import text from other file formats (currently only .txt)
  • Create GUI
  • Generate UML diagram