项目作者: GeorgeDittmar

项目描述 :
Markov chains for funnies
高级语言: Python
项目地址: git://github.com/GeorgeDittmar/Mimic.git
创建时间: 2016-12-17T05:29:48Z
项目社区:https://github.com/GeorgeDittmar/Mimic

开源协议:Apache License 2.0

下载


Mimic Text Generator

Building some base markov chain based text generators in python and spark. Plan to use this to compare against deep learning approaches.

Currently has support for an ngram based model built in python and one built in pyspark. The API takes in a path of text files and will do basic tokenization on the documents and then feed that into the markov model.

Blog associated with this work
https://towardsdatascience.com/natural-language-generation-part-1-back-to-basics-2f0b2654624f