Inflection-2 beats Google’s PaLM 2 across common benchmarks

Inflection, an AI startup aiming to create "personal AI for everyone", has announced a new large language model dubbed Inflection-2 that beats Google's PaLM 2.

Inflection-2 was trained on over 5,000 NVIDIA GPUs to reach 1.025 quadrillion floating point operations (FLOPs), putting it in the same league as PaLM 2 Large. However, early benchmarks show Inflection-2 outperforming Google's model on tests of reasoning ability, factual knowledge, and stylistic prowess.

On a...

Azure and NVIDIA deliver next-gen GPU acceleration for AI

Microsoft Azure users are now able to harness the latest advancements in NVIDIA's accelerated computing technology, revolutionising the training and deployment of their generative AI applications.

The integration of Azure ND H100 v5 virtual machines (VMs) with NVIDIA H100 Tensor Core GPUs and Quantum-2 InfiniBand networking promises seamless scaling of generative AI and high-performance computing applications, all at the click of a button.

This cutting-edge collaboration...

US introduces new AI chip export restrictions

NVIDIA has revealed that it’s subject to new laws restricting the export of AI chips to China and Russia.

In an SEC filing, NVIDIA says the US government has informed the chipmaker of a new license requirement that impacts two of its GPUs designed to speed up machine learning tasks: the current A100, and the upcoming H100.

“The license requirement also includes any future NVIDIA integrated circuit achieving both peak performance and chip-to-chip I/O performance equal...