Google’s next-gen AI model Gemini outperforms GPT-4

Google has unveiled Gemini, a cutting-edge AI model that stands as the company's most capable and versatile to date.

Demis Hassabis, CEO and Co-Founder of Google DeepMind, introduced Gemini as a multimodal model that is capable of seamlessly understanding and combining various types of information, including text, code, audio, image, and video.

https://youtu.be/jV1vkHv4zq8

Gemini comes in three optimised versions: Ultra, Pro, and Nano. The Ultra model boasts...

OpenAI reveals DALL-E 3 text-to-image model

OpenAI has announced DALL-E 3, the third iteration of its acclaimed text-to-image model. 

DALL-E 3 promises significant enhancements over its predecessors and introduces seamless integration with ChatGPT.

One of the standout features of DALL-E 3 is its ability to better understand and interpret user intentions when confronted with detailed and lengthy prompts:

"A middle-aged woman of Asian descent, her dark hair streaked with silver,...

Stability AI unveils ‘Stable Audio’ model for controllable audio generation

Stability AI has introduced "Stable Audio," a latent diffusion model designed to revolutionise audio generation.

This breakthrough promises to be another leap forward for generative AI and combines text metadata, audio duration, and start time conditioning to offer unprecedented control over the content and length of generated audio—even enabling the creation of complete songs.

Audio diffusion models traditionally faced a significant limitation in generating audio of...

Baidu deploys its ERNIE Bot generative AI to the public

Chinese tech giant Baidu has announced that its generative AI product ERNIE Bot is now open to the public through various app stores and its website.

ERNIE Bot can generate text, images, and videos based on natural language inputs. It is powered by ERNIE (Enhanced Representation through Knowledge Integration), a powerful deep learning model.

The first version of ERNIE was introduced and open-sourced in 2019 by researchers at Tsinghua University to demonstrate the natural...

Meta unveils SeamlessM4T multimodal translation model

Meta researchers have unveiled SeamlessM4T, a pioneering multilingual and multitask model that facilitates seamless translation and transcription across both speech and text. 

The internet, mobile devices, social media, and communication platforms have ushered in an era where access to multilingual content has reached unprecedented levels. SeamlessM4T aims to realise the vision of seamless communication and comprehension across languages.

Boasting an impressive array of...

Baidu to launch powerful ChatGPT rival

Chinese web giant Baidu is preparing to launch a powerful ChatGPT rival in March.

Baidu is often called the “Google of China” because it offers similar services, including search, maps, email, ads, cloud storage, and more. Baidu, like Google, also invests heavily in AI and machine learning.

Earlier this month, AI News reported that Google was changing its AI review processes to speed up the release of new solutions. One of the first products to be released under...

Microsoft releases Azure OpenAI Service and will add ChatGPT ‘soon’

Microsoft has announced the general availability of the Azure OpenAI Service and plans to add ChatGPT in the near future.

Currently, Azure OpenAI Service provides access to some of the most powerful AI models in the world—including Codex and DALL-E 2.

A “fine-tuned” version of GPT-3.5 will also be available through Azure OpenAI Service soon.

https://twitter.com/OpenAI/status/1615160228366147585

Azure OpenAI Service was unveiled in November 2021....

OpenAI upgrades GPT-3 with impressive new skills

OpenAI’s latest upgrade for GPT-3 has given the generalised language model some impressive new creative skills.

This week, OpenAI released a new text model (text-davinci-003) for GPT-3. Researchers have been playing around with the model to see what it can now do.

One user on Hacker News asked GPT-3 to write “a short rhyming poem explaining Einstein's theory of general relativity in easy but accurate terms.”

This was the result:

“If you want...

Stable Diffusion text-to-image generator is now publicly available

Text-to-image generator Stable Diffusion is now available for anyone to put to the test.

Stable Diffusion is developed by Stability AI and was initially released for researchers earlier this month. The image generator claims to deliver a breakthrough in speed and quality that can run on consumer GPUs.

The model is based on the latent diffused model created by CompVis and Runway but enhanced with insights from conditional diffusion models by Stable Diffusion’s lead...

AI21 Labs raises $64M to help it compete against OpenAI

AI21 Labs has raised $64 million in a funding round to help it compete against OpenAI and other NLP leaders.

Competition in NLP (Natural Language Processing) is heating up. OpenAI is currently seen as the industry leader with its GPT-3 model but rivals are gaining traction.

Investors see AI21 Labs as one of the most promising contenders.

"We completed this round during a period of market uncertainty, which highlights the confidence our investors have in AI21's...