Recent advances in deep learning have significantly improved performance on question answering (QA) tasks, in which a model must produce an appropriate response to a given question. The well-known SQuAD is one such task. Combining multiple task-specific models into a single model remains difficult, however, because each task expects a different input shape and question-answer format. AllenAI’s UnifiedQA tackles this issue by training a single model that covers 20 different QA datasets, including SQuAD, NarrativeQA, and ARC-challenge. Notably, these tasks can demand different things of the provided context, such as extracting a fact or summarizing a situation, and UnifiedQA handles all of them with one model.
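
To make the format problem concrete, here is a minimal sketch of how examples from differently shaped QA tasks could be flattened into one plain-text input that a single text-to-text model can consume. The helper name `encode_example`, the literal `\n` separator, and the `(A)`-style option labels are assumptions chosen for illustration, not a confirmed description of UnifiedQA's preprocessing.

```python
from typing import Optional, Sequence

# Assumed separator between fields: a literal backslash followed by "n",
# padded with spaces. This is an illustrative choice for the sketch.
SEP = " \\n "


def encode_example(question: str,
                   context: Optional[str] = None,
                   choices: Optional[Sequence[str]] = None) -> str:
    """Flatten one QA example into a single lowercase string (hypothetical helper)."""
    parts = [question.strip()]
    if choices:
        # Label options (A), (B), ... so multiple-choice tasks share the
        # same flat text shape as extractive or abstractive tasks.
        labeled = " ".join(
            f"({chr(ord('A') + i)}) {choice.strip()}"
            for i, choice in enumerate(choices)
        )
        parts.append(labeled)
    if context:
        parts.append(context.strip())
    return SEP.join(parts).lower()


# An extractive-style example (question plus context paragraph).
extractive = encode_example("Who wrote Hamlet?",
                            context="Hamlet was written by Shakespeare.")

# A multiple-choice example (question plus options, no context).
multiple_choice = encode_example("Which conducts best?",
                                 choices=["iron", "feather"])
```

Under these assumptions both examples become ordinary strings, so the same model and tokenizer can process them without any task-specific input layer.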

