Mining officially unrecognized side effects of drugs by combining web search and machine learning

Carlo Carino*, Yuanyuan Jia, Bruce Lambert, Patricia M. West, Clement Yu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

We consider the problem of finding officially unrecognized side effects of drugs. By submitting queries to the Web involving a given drug name, it is possible to retrieve pages concerning the drug. However, many retrieved pages are irrelevant and some relevant pages are not retrieved. More relevant pages can be obtained by adding the active ingredient of the drug to the query. In order to eliminate irrelevant pages, we propose a machine learning process to filter out the undesirable pages. The process is shown experimentally to be very effective. Since obtaining training data for the machine learning process can be time consuming and expensive, we provide an automatic method to generate the training data. The method is also shown to be very accurate. The side effects of three drugs which are not recognized by FDA are validated by an expert. We believe that the same approach can be applied to many real life problems and will yield high precision. Thus, this could lead a new way to perform retrieval with high accuracy.

Original languageEnglish (US)
Title of host publicationCIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management
Pages365-372
Number of pages8
StatePublished - 2005
EventCIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management - Bremen, Germany
Duration: Oct 31 2005Nov 5 2005

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Other

OtherCIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management
Country/TerritoryGermany
CityBremen
Period10/31/0511/5/05

Keywords

  • Accurate retrieval
  • Machine learning
  • Mining side effects of drugs
  • Precision

ASJC Scopus subject areas

  • General Business, Management and Accounting

Fingerprint

Dive into the research topics of 'Mining officially unrecognized side effects of drugs by combining web search and machine learning'. Together they form a unique fingerprint.

Cite this