
NLU: A search for a generalizable model that consistently performs well across multiple tasks
Below I attempt to paraphrase the following paper in plain English: Better Fine-Tuning by Reducing Representational Collapse by Armen Aghajanyan, Akshat Shrivastava, Anchit