Project author: m3hrdadfi

Project description:
First Semantic Search for Persian based on Transformers
Primary language:
Project URL: git://github.com/m3hrdadfi/semantic-search.git
Created: 2020-10-16T08:33:30Z
Project community: https://github.com/m3hrdadfi/semantic-search

License:



Persian Semantic Search

First Semantic Search for Persian based on Transformers

This project is built on Transformer models. Semantic search was tested on three different scopes:

  • Jobinja: a well-known job search platform in Iran serving job seekers and recruiters. The dataset consists of 9,952 job offers.
  • Taaghche: meta-information for the books available on Taaghche as of 2019 (around 4,505 books).
  • Universal: a wide range of topics from many sources, including DigiMag, Chetor, Wikipedia, Ninisite, 1Pezeshk, and others. For this particular example, we used only 44,000 records (out of 807,185 documents).
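
The project's code is not published yet, so the following is only a minimal sketch of how embedding-based semantic search is commonly done, not the author's implementation. It assumes each document and the query have already been turned into fixed-size vectors by a Transformer encoder; the 3-dimensional vectors, the document ids, and the `cosine` and `search` helpers are all hypothetical stand-ins for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def search(query_vec, doc_vecs, top_k=3):
    """Rank documents by cosine similarity to the query, highest first."""
    scored = [(doc_id, cosine(query_vec, vec)) for doc_id, vec in doc_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy vectors standing in for Transformer sentence embeddings (made-up values).
docs = {
    "job_offer": [0.9, 0.1, 0.0],
    "book": [0.1, 0.9, 0.1],
    "article": [0.2, 0.2, 0.9],
}
query = [0.8, 0.2, 0.1]

results = search(query, docs, top_k=2)
for doc_id, score in results:
    print(doc_id, round(score, 3))
```

In a real system the vectors would come from an encoder model and the ranking would usually run over an approximate nearest-neighbor index rather than a full scan, but the scoring idea is the same.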

Notes:

  • You can see the results in the video. The full dataset and code will be published soon.
  • To mimic a real-world scenario, the demo includes some misspellings and grammatical errors.
