Using semantic analysis of texts for the identification of drugs with similar therapeutic effects
- Autores: Tutubalina E.V.1, Miftahutdinov Z.S.1, Nugmanov R.I.1, Madzhidov T.I.1, Nikolenko S.I.1,2, Alimova I.S.1, Tropsha A.E.1,3
-
Afiliações:
- Kazan Federal University
- St. Petersburg Department of V. A. Steklov Institute of Mathematics, Russian Academy of Sciences
- University of North Carolina at Chapel Hill
- Edição: Volume 66, Nº 11 (2017)
- Páginas: 2180-2189
- Seção: Full Article
- URL: https://bakhtiniada.ru/1066-5285/article/view/241709
- DOI: https://doi.org/10.1007/s11172-017-2000-8
- ID: 241709
Citar
Resumo
Semantic analysis of text collections was used to identify drugs with similar therapeutic activity. Natural language processing methods were applied to analyse > 2.5 mln texts from drug reviews (in English) found on patient forums and discussion boards. In order to obtain distributed word representations form the input data, a continuous bag-of-words type model was used. Such model is one of the word2vec models intended to analyse the natural language semantics. This allowed the assignment of a numeric vector to each drug name. A list of pairs of drugs with similar vectors was formed. An analysis of this list confirmed that similar word vectors correspond to either drugs with the same active compound or to drugs with close therapeutic effects that belong to the same therapeutic group. The chemical similarity in such drug pairs was found to be low. The suggested procedure was used to visualize the chemical drug space and in the search for compounds with potentially similar biological effects among drugs of different therapeutic groups.
Sobre autores
E. Tutubalina
Kazan Federal University
Autor responsável pela correspondência
Email: elvtutubalina@kpfu.ru
Rússia, 18 ul. Kremlyovskaya, Kazan, 420008
Z. Miftahutdinov
Kazan Federal University
Email: elvtutubalina@kpfu.ru
Rússia, 18 ul. Kremlyovskaya, Kazan, 420008
R. Nugmanov
Kazan Federal University
Email: elvtutubalina@kpfu.ru
Rússia, 18 ul. Kremlyovskaya, Kazan, 420008
T. Madzhidov
Kazan Federal University
Email: elvtutubalina@kpfu.ru
Rússia, 18 ul. Kremlyovskaya, Kazan, 420008
S. Nikolenko
Kazan Federal University; St. Petersburg Department of V. A. Steklov Institute of Mathematics, Russian Academy of Sciences
Email: elvtutubalina@kpfu.ru
Rússia, 18 ul. Kremlyovskaya, Kazan, 420008; 27 nab. Reki Fontanki, St. Petersburg, 191011
I. Alimova
Kazan Federal University
Email: elvtutubalina@kpfu.ru
Rússia, 18 ul. Kremlyovskaya, Kazan, 420008
A. Tropsha
Kazan Federal University; University of North Carolina at Chapel Hill
Email: elvtutubalina@kpfu.ru
Rússia, 18 ul. Kremlyovskaya, Kazan, 420008; 153A Country club Road, Jackson Hall, North Carolina, NC 27514
Arquivos suplementares
