HTML Text mining for YOGA

HTML-TM Yoga is related to the article of Ieger Raittz, et al., 2022

We’ve provided two HTML’s:

Texts - each line corresponds to an article
Words - each line corresponds to a term (sort by term id)
Words - each line corresponds to a term (sort by tree order - tree created using the ‘complete’ algorithm)
Tree - phylogenetic tree of 1500 words (HTML of tree produced with Phy2HTML)

Aditional material at Zenodo - DOI: 10.5281/zenodo.14763945.

How to use the HTMLs?

TEXTS

TEXTS contains a list of all the articles analysed, as shown in the image below. Each article has a link to a list of articles (titles+abstract) that most closely match the query article.

WORDS

WORDS_id and WORDS_tree contains: 1) Cod: word id , 2) WORD: the query term, 3) Related words: a list of the 10 closest words, 3) Link title: a link to the articles (title+abstract) most related to the term, 4) link abstracts: a link to the tree of terms rooted in the query term, and 5) link graphic: a graph of the frequency of the term in the literature.

The trees are all rooted in the query term. The graph has two curves, one in blue with the frequency of the term yoga in the literature against time, and in purple the frequency that the query term is found.

TREE

TREE presents a phylogenetic tree of 1,500 words. The number before the word (label of branch) corresponds to the word id.

Citation

Please cite this study as follows:

Ieger Raittz, R., De Pierri, C. R., Perico, C. P., Machado, D. J., Marchaukoski, J. N., & Raittz, R. T. (2022). What are we learning with Yoga: a text mining approach to literature. medRxiv, 2022-12. https://doi.org/10.1101/2022.12.05.22282979