Amazon trains 980M parameter LLM with 'emergent abilities'

Ryan Daws is a senior editor at TechForge Media with over a decade of experience in crafting compelling narratives and making complex topics accessible. His articles and interviews with industry leaders have earned him recognition as a key influencer by organisations like Onalytica. Under his leadership, publications have been praised by analyst firms such as Forrester for their excellence and performance. Connect with him on X (@gadget_ry) or Mastodon (@gadgetry@techhub.social).


Researchers at Amazon have trained a new large language model (LLM) for text-to-speech that they claim exhibits “emergent” abilities. 

The 980 million parameter model, called BASE TTS, is the largest text-to-speech model yet created. The researchers trained models of various sizes on up to 100,000 hours of public domain speech data to see if they would observe the same performance leaps that occur in natural language processing models once they grow past a certain scale. 

They found that their medium-sized 400 million parameter model – trained on 10,000 hours of audio – showed a marked improvement in versatility and robustness on tricky test sentences.

The test sentences contained complex lexical, syntactic, and paralinguistic features like compound nouns, emotions, foreign words, and punctuation that normally trip up text-to-speech systems. While BASE TTS did not handle them perfectly, it made significantly fewer errors in stress, intonation, and pronunciation than existing models.

“These sentences are designed to contain challenging tasks—none of which BASE TTS is explicitly trained to perform,” explained the researchers. 

The largest 980 million parameter version of the model – trained on 100,000 hours of audio – did not demonstrate further abilities beyond the 400 million parameter version.

Though still experimental, BASE TTS demonstrates that these models can reach new versatility thresholds as they scale, an encouraging sign for conversational AI. The researchers plan further work to identify the optimal model size for emergent abilities.

The model is also designed to be lightweight and streamable, packaging emotional and prosodic data separately. This could allow the natural-sounding spoken audio to be transmitted across low-bandwidth connections.

You can find the full BASE TTS paper on arXiv.

(Photo by Nik on Unsplash)

See also: OpenAI rolls out ChatGPT memory to select users


