llm Archives - Page 2 of 4

Elon Musk’s xAI open-sources Grok

Elon Musk's startup xAI has made its large language model Grok available as open source software. The 314 billion parameter model can now be freely accessed, modified, and distributed by anyone under an Apache 2.0 license.

The release fulfils Musk's promise to open source Grok in an effort to accelerate AI development and adoption.

XAI announced the move in a blog post, stating: "We are releasing the base model weights and network architecture of Grok-1, our large...

18 March 2024 | Artificial Intelligence

Anthropic’s latest AI model beats rivals and achieves industry first

Anthropic’s latest cutting-edge language model, Claude 3, has surged ahead of competitors like ChatGPT and Google's Gemini to set new industry standards in performance and capability.

According to Anthropic, Claude 3 has not only surpassed its predecessors but has also achieved "near-human" proficiency in various tasks. The company attributes this success to rigorous testing and development, culminating in three distinct chatbot variants: Haiku, Sonnet, and...

5 March 2024 | Applications

AIs in India will need government permission before launching

In an advisory issued by India’s Ministry of Electronics and Information Technology (MeitY) last Friday, it was declared that any AI technology still in development must acquire explicit government permission before being released to the public.

Developers will also only be able to deploy these technologies after labelling the potential fallibility or unreliability of the output generated.

Furthermore, the document outlines plans for implementing a "consent popup"...

4 March 2024 | Applications

Mistral AI unveils LLM rivalling major players

Mistral AI, a France-based startup, has introduced a new large language model (LLM) called Mistral Large that it claims can compete with several top AI systems on the market.

Mistral AI stated that Mistral Large outscored most major LLMs except for OpenAI's recently launched GPT-4 in tests of language understanding. It also performed strongly in maths and coding assessments.

Co-founder and Chief Scientist Guillaume Lample said Mistral Large represents a major...

27 February 2024 | Applications

Reddit is reportedly selling data for AI training

Reddit has negotiated a content licensing deal to allow its data to be used for training AI models, according to a Bloomberg report.

Just ahead of a potential $5 billion initial public offering (IPO) debut in March, Reddit has reportedly signed a $60 million deal with an undisclosed major AI company. This move could be seen as a last-minute effort to showcase potential revenue streams in the rapidly growing AI industry to prospective investors.

Although Reddit has yet to...

19 February 2024 | Artificial Intelligence

Amazon trains 980M parameter LLM with ’emergent abilities’

Researchers at Amazon have trained a new large language model (LLM) for text-to-speech that they claim exhibits "emergent" abilities.

The 980 million parameter model, called BASE TTS, is the largest text-to-speech model yet created. The researchers trained models of various sizes on up to 100,000 hours of public domain speech data to see if they would observe the same performance leaps that occur in natural language processing models once they grow past a certain...

15 February 2024 | AGI

DeepMind framework offers breakthrough in LLMs’ reasoning

A breakthrough approach in enhancing the reasoning abilities of large language models (LLMs) has been unveiled by researchers from Google DeepMind and the University of Southern California.

Their new 'SELF-DISCOVER' prompting framework – published this week on arXiV and Hugging Face – represents a significant leap beyond existing techniques, potentially revolutionising the performance of leading models such as OpenAI’s GPT-4 and Google’s PaLM 2.

The framework...

8 February 2024 | Artificial Intelligence

NCSC: AI to significantly boost cyber threats over next two years

A report published by the UK's National Cyber Security Centre (NCSC) warns that AI will substantially increase cyber threats over the next two years.

The centre warns of a surge in ransomware attacks in particular; involving hackers deploying malicious software to encrypt a victim's files or entire system and demanding a ransom payment for the decryption key.

The NCSC assessment predicts AI will enhance threat actors' capabilities mainly in carrying out more persuasive...

24 January 2024 | Applications

OpenAI’s GPT Store to launch next week after delays

OpenAI has announced that its GPT Store, a platform where users can sell and share custom AI agents created using OpenAI's GPT-4 large language model, will finally launch next week.

An email was sent to individuals enrolled as GPT Builders that urges them to ensure their GPT creations align with brand guidelines and advises them to make their models public:

The GPT Store was unveiled at OpenAI's November developers conference, revealing the company's plan to enable...

5 January 2024 | Applications

AI & Big Data Expo: Demystifying AI and seeing past the hype

In a presentation at AI & Big Data Expo Global, Adam Craven, Director at Y-Align, shed light on the practical applications of AI and the pitfalls often overlooked in the hype surrounding it.

Craven — with an extensive background in engineering and leadership roles at McKinsey & Company, HSBC, Nokia, among others — shared his experiences as a consultant helping C-level executives navigate the complex landscape of AI adoption. The core message revolved around...