MedicalGPT training medical GPT model with ChatGPT training pipeline, implementation of Pretraining, Supervised Finetuning, Reward Modeling and Reinforcement Learning. MedicalGPT trains large medical models, including secondary pre-training, supervised fine-tuning, reward modeling, and reinforcement learning training.

Features

  • The first stage: PT (Continue PreTraining) incremental pre-training, pre-training the GPT model twice on massive domain document data to inject domain knowledge
  • The second stage: SFT (Supervised Fine-tuning) has supervised fine-tuning, constructs an instruction fine-tuning data set, and performs instruction fine-tuning on the basis of the pre-training model to align instruction intentions
  • The third stage: RM (Reward Model) reward model modeling, constructing a human preference ranking data set, training the reward model to align human preferences, mainly the "HHH" principle, specifically "helpful, honest, harmless"
  • The fourth stage: RL (Reinforcement Learning) is based on human feedback reinforcement learning (RLHF), using the reward model to train the SFT model, and the generation model uses rewards or penalties to update its strategy in order to generate higher quality, more in line with human preferences
  • We provide a simple Gradio-based interactive web interface
  • After the training is complete, now we load the trained model to verify the effect of the model generating text

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow MedicalGPT

MedicalGPT Web Site

Other Useful Business Software
SoftCo: Enterprise Invoice and P2P Automation Software Icon
SoftCo: Enterprise Invoice and P2P Automation Software

For companies that process over 20,000 invoices per year

SoftCo Accounts Payable Automation processes all PO and non-PO supplier invoices electronically from capture and matching through to invoice approval and query management. SoftCoAP delivers unparalleled touchless automation by embedding AI across matching, coding, routing, and exception handling to minimize the number of supplier invoices requiring manual intervention. The result is 89% processing savings, supported by a context-aware AI Assistant that helps users understand exceptions, answer questions, and take the right action faster.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of MedicalGPT!