neural networks Archives - AI News https://www.artificialintelligence-news.com/tag/neural-networks/ Artificial Intelligence News Fri, 16 Feb 2024 13:42:51 +0000 en-GB hourly 1 https://www.artificialintelligence-news.com/wp-content/uploads/sites/9/2020/09/ai-icon-60x60.png neural networks Archives - AI News https://www.artificialintelligence-news.com/tag/neural-networks/ 32 32 Google launches Gemini 1.5 with ‘experimental’ 1M token context https://www.artificialintelligence-news.com/2024/02/16/google-launches-gemini-1-5-experimental-1m-token-context/ https://www.artificialintelligence-news.com/2024/02/16/google-launches-gemini-1-5-experimental-1m-token-context/#respond Fri, 16 Feb 2024 13:42:49 +0000 https://www.artificialintelligence-news.com/?p=14415 Google has unveiled its latest AI model, Gemini 1.5, which features what the company calls an “experimental” one million token context window.  The new capability allows Gemini 1.5 to process extremely long text passages – up to one million characters – to understand context and meaning. This dwarfs previous AI systems like Claude 2.1 and... Read more »

The post Google launches Gemini 1.5 with ‘experimental’ 1M token context appeared first on AI News.

]]>
Google has unveiled its latest AI model, Gemini 1.5, which features what the company calls an “experimental” one million token context window. 

The new capability allows Gemini 1.5 to process extremely long text passages – up to one million characters – to understand context and meaning. This dwarfs previous AI systems like Claude 2.1 and GPT-4 Turbo, which max out at 200,000 and 128,000 tokens respectively:

“Gemini 1.5 Pro achieves near-perfect recall on long-context retrieval tasks across modalities, improves the state-of-the-art in long-document QA, long-video QA and long-context ASR, and matches or surpasses Gemini 1.0 Ultra’s state-of-the-art performance across a broad set of benchmarks,” said Google researchers in a technical paper (PDF).

The efficiency of Google’s latest model is attributed to its innovative Mixture-of-Experts (MoE) architecture.

“While a traditional Transformer functions as one large neural network, MoE models are divided into smaller ‘expert’ neural networks,” explained Demis Hassabis, CEO of Google DeepMind.

“Depending on the type of input given, MoE models learn to selectively activate only the most relevant expert pathways in its neural network. This specialisation massively enhances the model’s efficiency.”

To demonstrate the power of the 1M token context window, Google showed how Gemini 1.5 could ingest the entire 326,914-token Apollo 11 flight transcript and then accurately answer specific questions about it. It also summarised key details from a 684,000-token silent film when prompted.

Google is initially providing developers and enterprises free access to a limited Gemini 1.5 preview with a one million token context window. A 128,000 token general release for the public will come later, along with pricing details.

For now, the one million token capability remains experimental. But if it lives up to its early promise, Gemini 1.5 could set a new standard for AI’s ability to understand complex, real-world text.

Developers interested in testing Gemini 1.5 Pro can sign up in AI Studio. Google says that enterprise customers can reach out to their Vertex AI account team.

(Image Credit: Google)

See also: Amazon trains 980M parameter LLM with ’emergent abilities’

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

The post Google launches Gemini 1.5 with ‘experimental’ 1M token context appeared first on AI News.

]]>
https://www.artificialintelligence-news.com/2024/02/16/google-launches-gemini-1-5-experimental-1m-token-context/feed/ 0
AI & Big Data Expo: Demystifying AI and seeing past the hype https://www.artificialintelligence-news.com/2023/12/07/ai-big-data-expo-demystifying-ai-seeing-past-hype/ https://www.artificialintelligence-news.com/2023/12/07/ai-big-data-expo-demystifying-ai-seeing-past-hype/#respond Thu, 07 Dec 2023 16:29:45 +0000 https://www.artificialintelligence-news.com/?p=14032 In a presentation at AI & Big Data Expo Global, Adam Craven, Director at Y-Align, shed light on the practical applications of AI and the pitfalls often overlooked in the hype surrounding it. Craven — with an extensive background in engineering and leadership roles at McKinsey & Company, HSBC, Nokia, among others — shared his... Read more »

The post AI & Big Data Expo: Demystifying AI and seeing past the hype appeared first on AI News.

]]>
In a presentation at AI & Big Data Expo Global, Adam Craven, Director at Y-Align, shed light on the practical applications of AI and the pitfalls often overlooked in the hype surrounding it.

Craven — with an extensive background in engineering and leadership roles at McKinsey & Company, HSBC, Nokia, among others — shared his experiences as a consultant helping C-level executives navigate the complex landscape of AI adoption. The core message revolved around understanding AI beyond the hype to make informed decisions that align with organisational goals.

Breaking down the AI hype

Craven introduced a systematic approach to demystifying AI, emphasising the need to break down the overarching concept into smaller, manageable components. He outlined key attributes of neural networks, embeddings, and transformers, focusing on large language models as a shared foundation.

  • Neural networks — described as probabilistic and adaptable — form the backbone of AI, mimicking human learning processes.
  • Embeddings allow computers to navigate between levels of abstraction, somewhat akin to human cognition.
  • Transformers — the “attention” mechanism — are the linchpin of the AI revolution, allowing machines to understand context and meaning.

LLMs as search and research engines

Craven assesses if LLMs alone make good search engines. They understand search intent exceptionally well but don’t have access to vast data, give accurate results, or reference sources—all of which are key search requirements.

However, Craven highlighted that large language models (LLMs) are powerful summarising engines for research. He emphasised their ability to summarise data, translate between languages, and serve as research assistants:

Craven went on to caution against relying solely on LLMs for complex tasks—showcasing a study where consultants using language models underperformed in nuanced analysis.

De-hyping AI: Setting realistic expectations

The presentation concluded with practical use cases for organisations, such as documentation tools, high-level decision-making, code review tools, and multimodal decision-makers. Craven advised a thoughtful evaluation of when LLMs are useful, ensuring they align with organisational values and principles.

However, Craven warns against inflated claims about AI’s performance—citing examples where language models enhanced certain tasks but fell short in others. He urged the audience to consider the context and nuances when evaluating AI’s impact, avoiding unwarranted expectations.

Craven offered actionable insights for implementation, urging organisations to capture data for future use, create test cases for specific use cases, and apply a systematic framework to develop a strategy. The emphasis remained on seeing through the hype, saving millions by strategically incorporating AI into existing workflows.

In a world inundated with AI promises, Adam Craven’s pragmatic approach provides a roadmap for organisations to leverage the power of AI while avoiding common pitfalls.

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with Cyber Security & Cloud Expo and Digital Transformation Week.

Explore other upcoming enterprise technology events and webinars powered by TechForge here.

The post AI & Big Data Expo: Demystifying AI and seeing past the hype appeared first on AI News.

]]>
https://www.artificialintelligence-news.com/2023/12/07/ai-big-data-expo-demystifying-ai-seeing-past-hype/feed/ 0