Skip to content

Source code and datasets for PresRecRF published on Phytomedicine.

Notifications You must be signed in to change notification settings

2020MEAI/PresRecRF

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PresRecRF: Herbal prescription recommendation via the representation fusion of large TCM semantics and molecular knowledge

1. Introduction

This repository contains source code and datasets for "PresRecRF: Herbal prescription recommendation via the representation fusion of large TCM semantics and molecular knowledge".

In this study, we proposed a novel herbal Prescription Recommendation model with the Representation Fusion of large TCM semantics and molecular knowledge (termed PresRecRF, See graphical abstract figure below). The entire process of the proposed PresRecRF model closely mirrors the actual diagnosis and treatment procedures carried out by doctors, which are better applied in real clinical scenarios.

2. Overview

PresRecRF_GA.jpg

Fig1. Framework of PresRecRF. The model integrates large-scale TCM semantics and molecular knowledge for enhanced representation. PresRecRF comprises three key modules, namely knowledge representation learning, representation fusion, and prescription recommendation with precise herbal dosages. This approach seeks to enhance patient characterization and achieve precise herbal prescription recommendations.

3. Install Python libraries needed

$ conda create -n presrecrf_env python=3.9
$ conda activate presrecrf_env
$ pip install -r requirements.txt

4. Basic Usage

(1) dataset

The relevant data required by the model are uniformly placed in the "data" folder. This folder contains the following data files:

  • Lung_case.xlsx: Example dataset. This dataset originates from the work of Dong et al. JAMIA 2024 , and has been augmented with herb dosage information to form the dataset used in this study. Due to data privacy concerns, we cannot publicly release the complete dataset. For research purposes, please contact the corresponding author Xin Dong at x_dong@bjtu.edu.cn .
  • Semantic_BERT and Semantic_LLM: Initial vectors for semantic features. Semantic_BERT contains vector representations of symptoms and herbs trained through TCM BERT, while Semantic_LLM stores vector representations of symptoms and herbs generated by ChatGPT-3.5.
  • Structural_Network: Initial vectors for structural features. Contains vector representations of symptoms and herbs obtained through HSPKG training in this study.
  • (2) main code

    The python script file of the model is shown in this project, including the following files:

  • main.py: Run the entire program (based on the Lung_case dataset)
  • model.py: Model-related programs
  • utils.py: Some related utility functions
  • parameter.py: Parameter-related programs

    (3) result

    After running the "main.py" file, the model's Top@K performance metrics on the test set will be generated. The results are saved as log files in the "./log/" directory.

    5. Citation and Contact

    If you find PresRecRF useful for your research, please consider citing the following paper:

    Yang K, Dong X, Zhang S, Yu H, Zhong L, Zhang L, Zhao H, Hou Y, Song X, Zhou X. PresRecRF: Herbal prescription recommendation via the representation fusion of large TCM semantics and molecular knowledge. Phytomedicine. 2024 Oct 1; 135: 156116.
    
    @article{yang2024presrecrf,
      title={PresRecRF: Herbal prescription recommendation via the representation fusion of large TCM semantics and molecular knowledge},
      author={Yang, Kuo and Dong, Xin and Zhang, Shuhan and Yu, Haibin and Zhong, Liqun and Zhang, Lei and Zhao, He and Hou, Yutong and Song, Xinpeng and Zhou, Xuezhong},
      journal={Phytomedicine},
      volume={135},
      pages={156116},
      year={2024},
      publisher={Elsevier}
    }
    

    If you have better suggestions or questions about our work, please contact us: x_dong@bjtu.edu.cn and kuoyang@bjtu.edu.cn.

    Welcome to follow our project on GitHub: https://github.com/2020MEAI and https://github.com/xdong97 .

  • About

    Source code and datasets for PresRecRF published on Phytomedicine.

    Topics

    Resources

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published

    Contributors 3

    •  
    •  
    •  

    Languages