A Semi-Supervised Pattern-Learning Approach to Extract Pharmacogenomics-Specific Drug-Gene Pairs from Biomedical Literature

Rong Xu; Quanqiu Wang

doi:10.4172/2153-0645.1000117

Awards Nomination 20+ Million Readerbase

Google Scholar citation report

Citations : 613

Journal of Pharmacogenomics & Pharmacoproteomics received 613 citations as per Google Scholar report

Journal of Pharmacogenomics & Pharmacoproteomics peer review process verified at publons

25+ Million Website Visitors

Indexed In

Open J Gate
Genamics JournalSeek
Academic Keys
JournalTOCs
ResearchBible
Electronic Journals Library
RefSeek
Hamdard University
EBSCO A-Z
OCLC- WorldCat
Proquest Summons
SWB online catalog
Virtual Library of Biology (vifabio)
Publons
MIAR
Euro Pub
Google Scholar

Useful Links

Share This Page

Journal Flyer

Open Access Journals

Abstract

A Semi-Supervised Pattern-Learning Approach to Extract Pharmacogenomics-Specific Drug-Gene Pairs from Biomedical Literature

Rong Xu and Quanqiu Wang

Personalized medicine is to deliver the right drug to the right patient in the right dose. Pharmacogenomics (PGx), the studies in identifying genetic variants that may affect drug response, is important for personalized medicine. Computational approaches in studying the relationships between genes and drug response are emerging as an active area of research for personalized medicine. Currently, systematic study of drug-gene relationships is limited because a large-scale machine understandable drug-gene relationship knowledge base is difficult to build and to keep update. Scientific literature contains rich information of drug-gene relationships, therefore is the ultimate knowledge source for PGx studies and for personalized medicine. However, this information is largely buried in free text with limited machine understandability. There is a need to develop automatic approaches to extract structured drug-gene relationships from biomedical literature. In this study, we present a semi-supervised approach to extracting drug-gene relationships from MEDLINE. The technique uses one seed pattern and iteratively learns various ways the relationship may be expressed in 20 million MEDLINE abstracts. Our approach has achieved high precisions (0.961-1.00) in extracting drug-gene relationships from MEDLINE and found many drug-gene pairs that are not available in PharmGKB, a large-scale manually curated PGx knowledge base.