A web tool that helps biomedical researchers understand how their work is being used by others, by analyzing the content in papers that cite them
Python3
and Flask
using Biopython
, py-processors
, NLTK
, Scikit-Learn
, Numpy
, Gensim
, Fasttext
, Plotly
, D3
, and more.py-processors
.See our wiki for installation instructions!
Where are your citers publishing their work? Could you be influencing authors with publications in Nature?
Visualizations the key words in your papers with either a wordcloud, heatmap, or clustermap! Sort by categories of keywords: Bioprocess, Cell-lines, Cellular components, Family, Gene or gene products, Organs, Simple chemicals, Sites, Species, Tissue-types.
See the latent themes in the documents that site you! Check out Embeddings
, which uses word vectors to cluster important words into topics. If you have a large number of citations, LSA
(Latent Semantic Analysis) and LDA
(Latend Dirichlet Analysis) might work well for you too.
Project citing documents into 3D space! The most similar documents will be close together in the graph or the same color. Zoom and click to explore.
A list of your citations
Get the big picture about your citers — How many (unique) papers cite you? How many of them are unique? How many does SCKE take into account? Which years do you have the most citers for your paper(s)?
See how similar your citing publications are to famous works like “On the Origin of Species” or “Sherlock Holmes”. If your publication(s) are available, see how similar your citers are to your work!
Note: The yellow bar in this visualization is one of the input papers (Lyons, 2008). Of course, Lyons, 2008 should have 100% similarity for itself. The next highest green bar for this paper also happens to be Lyons, 2008. The similarity score is a bit lower (but sitll very high) because stopwords (e.g. the, a, an, them) have been removed.