XLNet Archives - ETCentric

Microsoft and Nvidia Debut World’s Largest Language Model

By Paula Parisi
October 14, 2021

Microsoft and Nvidia have trained what they describe as the most powerful AI-driven language model to date, the Megatron-Turing Natural Language Generation model (MT-NLG), which has “set the new standard for large-scale language models in both model scale and quality,” the firms say. As the successor to the companies’ Turing NLG 17B and Megatron-LM, the new MT-NLG has 530 billion parameters, or “3x the number of parameters compared to the existing largest model of this type,” and the companies say it demonstrates unmatched accuracy in a broad set of natural language tasks.

SuperGLUE Is Benchmark For Language-Understanding AI

By Rob Scott
August 22, 2019

Researchers recently introduced a series of rigorous benchmark tasks that measure the performance of sophisticated language-understanding AI. Facebook AI Research, together with Google’s DeepMind, the University of Washington and New York University, introduced SuperGLUE last week, based on the idea that deep learning models for today’s conversational AI require greater challenges. SuperGLUE, which uses Google’s BERT representational model as a performance baseline, follows the 2018 introduction of GLUE (General Language Understanding Evaluation) and encourages the creation of models that can understand more nuanced, complex language.