Tuning & Hyperparameters
The model used for the project was T5 Large for Medical Text Summarization from Falcons AI. This model was chosen after an initial test of the T5-small model. The T5-small produced and average ROUGE score of 0.42 but after inspection it appeared to just be repeating the primary sentence or two of the findings as the impression. A couple of changes in hyperparameters showed amended this but reduced the ROUGE to 0.12. The change to the Falcons AI model was undertaken as the model had been pretrained on large amounts of medical data which would make it more suitable for the summarization task. A learning rate of 3e-5 was used after an initial run with 2e-5 and an increased number of epochs from 3 to 5.