Early testing of GPT-3 suggested that it could be useful for basic administrative tasks such as appointment booking. On closer inspection, however, we found that the model had no clear understanding of time and no real logic behind its decisions. We also observed memory gaps: in our example below, the patient initially set a 6pm deadline, yet several messages later GPT-3 suggested a 7pm appointment.

