GPT-4o delivers human-like AI interaction with text, audio, and vision integration

OpenAI has launched its new flagship model, GPT-4o, which seamlessly integrates text, audio, and visual inputs and outputs, promising to enhance the naturalness of machine interactions.

GPT-4o, where the “o” stands for “omni,” is designed to cater to a broader spectrum of input and output modalities. “It accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs,” OpenAI announced.

Users can...

OpenAI makes GPT-4 Turbo with Vision API generally available

OpenAI has announced that its powerful GPT-4 Turbo with Vision model is now generally available through the company's API, opening up new opportunities for enterprises and developers to integrate advanced language and vision capabilities into their applications.

The launch of GPT-4 Turbo with Vision on the API follows the initial release of GPT-4's vision and audio upload features last September and the unveiling of the turbocharged GPT-4 Turbo model at OpenAI's developer...

ML Olympiad returns with over 20 challenges

The popular ML Olympiad is back for its third round with over 20 community-hosted machine learning competitions on Kaggle.

The ML Olympiad – organised by groups including ML GDE, TFUG, and other ML communities – aims to provide developers with hands-on opportunities to learn and practice machine learning skills by tackling real-world challenges.

Over the previous two rounds, an impressive 605 teams participated across 32 competitions, generating 105 discussions and...

Google launches Gemini 1.5 with ‘experimental’ 1M token context

Google has unveiled its latest AI model, Gemini 1.5, which features what the company calls an "experimental" one million token context window. 

The new capability allows Gemini 1.5 to process extremely long text passages – up to one million characters – to understand context and meaning. This dwarfs previous AI systems like Claude 2.1 and GPT-4 Turbo, which max out at 200,000 and 128,000 tokens respectively:

“Gemini 1.5 Pro achieves near-perfect recall on...

Developers believe AI will have a positive world impact

AI is among the “next” technologies that developers believe will have a positive world impact.

Some artists, developers, writers, and other creators have expressed concern that generative AIs may pose a threat to their livelihoods. However, an increasing number view such AIs as assistive tools that will help creators rather than replace them.

Stack Overflow surveyed its developer community to find out how developers feel about technologies currently making the...

OpenAI now allows developers to customise GPT-3 models

OpenAI is making it easy for developers to “fine-tune” GPT-3, enabling custom models for their applications.

The company says that existing datasets of “virtually any shape and size” can be used for custom models.

A single command in the OpenAI command-line tool, alongside a user-provided file, is all that it takes to begin training. The custom GPT-3 model will then be available for use in OpenAI’s API immediately.

One customer says that it was...

GTC 2021: Nvidia debuts accelerated computing libraries, partners with Google, IBM, and others to speed up quantum research

Nvidia has unveiled 65 new and updated software development kits at GTC 2021, alongside a partnership with industry leaders to speed up quantum research.

The company’s roster of accelerated computing kits now exceeds 150 and supports the almost three million developers in NVIDIA’s Developer Program.

Four of the major new SDKs are:

ReOpt – Automatically optimises logistical processes using advanced, parallel algorithms. This includes vehicle routes, warehouse...

Unity devs aren’t too happy their work is being sold for military AI purposes

Developers from Unity are calling for more transparency after discovering their AI work is being sold to the military.

Video games have pioneered AI developments since Nim was released in 1951. In the decades since, game developers have worked to improve AIs to provide a more enjoyable experience for a growing number of people around the world.

Just imagine the horror if those developers found out their work was instead being used for real military purposes without their...

Experts debate whether GitHub’s latest AI tool violates copyright law

GitHub’s impressive new code-assisting AI tool called Copilot is receiving both praise and criticism.

Copilot draws context from the code that a developer is working on and can suggest entire lines or functions. The system, from OpenAI, claims to be “significantly more capable than GPT-3” in generating code and can help even veteran programmers to discover new APIs or ways to solve problems.

Critics claim the system is using copyrighted code that GitHub then plans...

Google launches fully managed cloud ML platform Vertex AI

Google Cloud has launched Vertex AI, a fully managed cloud platform that simplifies the deployment and maintenance of machine learning models.

Vertex was announced during this year’s virtual I/O developer conference and somewhat breaks from Google’s tradition of using its keynote to focus more on updates to its mobile and web development solutions. Google announcing the platform during the keynote shows how important the company believes it to be for a wide range of...