Nvidia and Microsoft have developed an incredible 530-billion-parameter AI model, but it still suffers from bias.
The pair claim their Megatron-Turing Natural Language Generation (MT-NLG) model is the “most powerful monolithic transformer language model trained to date”.
For comparison, OpenAI’s much-lauded GPT-3 has 175 billion parameters.
The duo trained their impressive model on 15 datasets with a total of 339 billion tokens. Each dataset was given a sampling weight to emphasise higher-quality data.
The OpenWebText2 dataset – consisting of 14.8 billion tokens – was given the highest sampling weight of 19.3 percent. This was followed by CC-2021-04 – consisting of 82.6 billion tokens, the largest amount of all the datasets – with a weight of 15.7 percent. Rounding out the top three is Books3 – a dataset with 25.7 billion tokens – which was given a weight of 14.3 percent.
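In practice, sampling weights like these typically control how often examples are drawn from each corpus during training, rather than how many tokens each corpus contains. The sketch below illustrates that general idea using the three datasets and weights cited above; it is a simplified assumption, not the actual MT-NLG data pipeline, and the `sample_source` helper is hypothetical.

```python
# Illustrative sketch of weighted dataset blending (assumption, not the
# Nvidia/Microsoft training code). Names and weights come from the article;
# the full MT-NLG blend used 15 datasets.
import random

# (dataset name, tokens in billions, sampling weight) for the top three datasets
datasets = [
    ("OpenWebText2", 14.8, 0.193),
    ("CC-2021-04",   82.6, 0.157),
    ("Books3",       25.7, 0.143),
]

names   = [name for name, _, _ in datasets]
weights = [weight for _, _, weight in datasets]

def sample_source(rng: random.Random) -> str:
    """Pick which dataset the next training example is drawn from,
    proportionally to its sampling weight."""
    return rng.choices(names, weights=weights, k=1)[0]

if __name__ == "__main__":
    rng = random.Random(0)
    draws = [sample_source(rng) for _ in range(10_000)]
    # The observed draw frequencies should roughly match the relative weights.
    for name in names:
        print(f"{name}: {draws.count(name) / len(draws):.1%}")
```

The point of such a scheme is that a small, high-quality corpus (here OpenWebText2) can be seen more often during training than its raw token count would suggest, while a much larger crawl (CC-2021-04) is down-weighted.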
However, despite the large increase in parameters, MT-NLG suffered from the same issues as its predecessors.
“While giant language models are advancing the state of the art on language generation, they also suffer from issues such as bias and toxicity,” the companies explained.
“Our observations with MT-NLG are that the model picks up stereotypes and biases from the data on which it is trained.”
Nvidia and Microsoft say they remain committed to addressing this problem.