Google introduces Gemini and updates Bard with Gemini Pro

Google introduces Gemini and updates Bard with Gemini Pro

Google has unveiled Gemini, its most advanced and capable artificial intelligence (AI) model with advanced multimodal capabilities.

This innovative model represents a leap forward in AI technology, offering state-of-the-art performance compared to existing large language models (LLMs).

Sundar Pichai, CEO of Google and Alphabet, emphasized that AI is shaping a profound technological change, which may exceed the impact of the mobile and web revolutions.

He highlighted the importance of AI in driving innovation and economic progress, enhancing human knowledge, creativity and productivity.

What is Google Gemini?

Developed by Google DeepMind, led by CEO and co-founder Demis Hassabis, Gemini is a testament to Google’s ongoing commitment to being an early adopter in AI.

I’m really excited to share our work at Gemini today! Gemini is a family of multimodal models that demonstrate very powerful capabilities in the image, audio, video and text domains. Our most capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks,… pic.twitter.com/sQfxBy9tpT

— Jeff Dean (@🏡) (@JeffDean) December 6, 2023

The model shows an impressive range of capabilities, particularly in its multimodal understanding, a feature that allows it to seamlessly process and combine different types of information, including text, code, audio, image, and video.

Google Gemini Ultra beats GPT-4

Gemini 1.0, the first version of the model, comes in three variants: Gemini Ultra, Gemini Pro and Gemini Nano.

Google screenshot, December 2023

Each is optimized for specific tasks, with Gemini Ultra designed for highly complex tasks, Gemini Pro for a wide range of tasks, and Gemini Nano for device-efficient tasks.

The model’s performance is outstanding, outperforming human experts in massively multitasking language understanding (MMLU) with a score of 90.0%.

Additionally, Gemini Ultra outperforms existing models in 30 of the 32 academic benchmarks widely used in large language model research.

google gemini performanceGoogle screenshot, December 2023

Gemini multimodal capabilities and performance

Gemini’s innovative approach to multimodality sets it apart from previous models.

Traditional multimodal models are often limited by their design, which involves training separate components for different modalities and then joining them together.

Instead, Gemini was built from the ground up to be natively multimodal, allowing it to understand and reason across multiple inputs much more effectively.

Google introduces Gemini and updates Bard with Gemini ProGoogle screenshot, December 2023

This capability positions Gemini as a powerful tool in fields ranging from science to finance, where it can discover insights from large amounts of data and provide advanced reasoning in complex topics such as mathematics and physics.

Examples from the Google DeepMind report on Google Gemin demonstrate Gemini’s multimodal capabilities, such as image generation.

Google introduces Gemini and updates Bard with Gemini ProGoogle screenshot, December 2023

In this video, Google tests Gemini with its Emoji kitchen.

It can also handle text, image and audio as shown below.

Google introduces Gemini and updates Bard with Gemini ProGoogle screenshot, December 2023

This video from Google provides more information on Gemini’s ability to process raw audio.

Gemini Benchmarks against external competitors

How does Google Gemini stack up against top AI models from OpenAI, Inflection, Anthropic, Meta and xAI? Here’s how the Gemini Ultra and Pro perform in text comparisons against their competition.

gemini gpt-4 flexion-2 llama 2 grok 1 claude-2 performance comparison benchmarksGoogle screenshot, December 2023

Geminis excel at coding

In addition to its multimodal capabilities, Gemini excels at encoding tasks. His ability to understand, explain, and generate high-quality code in multiple programming languages ​​positions him as a leading role model for coding.

Google introduces Gemini and updates Bard with Gemini ProGoogle screenshot, December 2023

It also forms the basis for more advanced coding systems, such as AlphaCode 2, which significantly improve competitive programming problems.

The model’s efficiency and scalability are boosted by Google’s in-house designed v4 and v5e Tensor Processing Units (TPUs), making it the most reliable and scalable model to train and serve.

Google Bard now powered by Gemini Pro

Google has also announced a significant update to Bard, integrating Gemini Pro to improve AI capabilities.

Google introduces Gemini and updates Bard with Gemini ProScreenshot from Google Bard, December 2023

This update marks the biggest improvement Bard has received to date.

Gemini Pro has been tuned to Bard to significantly improve its performance in understanding and summarizing information, reasoning, coding and planning.

Google introduces Gemini and updates Bard with Gemini ProScreenshot from Google Bard, December 2023

Users can now experience Bard powered by Gemini Pro for text-based interactions, with plans to extend support to other modalities shortly.

Initially available in English in over 170 countries and territories, this update will soon roll out to other languages ​​and regions, including Europe.

Understanding intent with Gemini for personalized UX

This video demonstrates Gemini’s ability to understand user intent and create personalized user experiences.

It starts with understanding the user’s goal and gathering relevant information before reasoning and creating a custom interface for exploration.

The user can interact with the interface and receive more information based on their needs, showing Gemini’s ability to adapt and provide a personalized experience.

Google Pixel 8 Pro: The first smartphone with built-in AI powered by Gemini Nano

Google’s latest update introduces Gemini Nano, an advanced AI model, now built into the Pixel 8 Pro smartphone.

This update marks the Pixel 8 Pro as the first phone designed for AI with Gemini Nano, leveraging Google Tensor G3 technology.

Key features include “Summarize on Recorder” for summarizing audio recordings on your device and “Smart Reply on Gboard” for context-aware text responses. These features enhance user privacy and functionality without the need for a network connection.

Additionally, Google announced upcoming improvements to the Assistant with Bard experience on the Pixel line, further expanding AI capabilities.

The update also includes AI-powered improvements to photography and video, such as improved video stabilization, Night Sight video, and Photo Unblur for clearer pet images.

For productivity, there are new tools like dual-screen preview in Pixel Fold, improved video calling with Pixel phones as webcams, and document scanning cleanup.

Google Password Manager now supports passkeys, and Pixel devices get new security features like repair mode. The Pixel Watch introduces convenient phone unlocking and call selection features, while the Pixel tablet offers Clear Calling and spatial audio support.

Google too expands language support in its Recorder app and expands Direct My Call and Hold for Me features to more regions and devices.

Responsible AI development

Google has prioritized responsible AI development, ensuring full Gemini safety assessments for bias and toxicity.

The company works with various experts and external partners to rigorously test the model and address potential risks.

How to get Gemini

Gemini 1.0 is being gradually integrated into various Google products and platforms and will soon be accessible to developers and business customers through Google AI Studio and Google Cloud Vertex AI.

As part of Google’s commitment to advancing AI responsibly, Gemini Ultra will undergo extensive trust and security checks before its wider release.

The introduction Google’s Gemini marks a significant milestone in the development of AI.

Its advanced capabilities, ranging from sophisticated multimodal reasoning to efficient coding, signal the beginning of a new era in AI, opening remarkable possibilities for innovation in multiple domains.

Featured image: VDB Photos/Shutterstock



[ad_2]

Source link

You May Also Like

About the Author: Ted Simmons

I follow and report the current news trends on Google news.

Leave a Reply

Your email address will not be published. Required fields are marked *