Google has introduced its largest and most capable multimodal AI - Gemini, their most capable and general model .It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across and combine different data types including text, code, audio, image and video.
Gemini 1.0, the first version, has three different sizes:
Gemini Ultra — the largest and most capable model for highly complex tasks.
Gemini Pro — the best model for scaling across a wide range of tasks.
Gemini Nano — the most efficient model for on-device tasks.
With a score of 90.0%,Gemini Ultra is also the first model to outperform human experts on MMLU (massive multitask language understanding), which uses a combination of 57 subjects such as math, physics, history, law, medicine and ethics for testing both world knowledge and problem-solving abilities. Here is a video by Sundar Pichai and Demis Hassabis introducing the Model.
Learn more about Artificial Intelligence by registering for HURU Schools' Artificial Intelligence Picodegree.