Gemini (language model)
![]() | |
Developer(s) | Google DeepMind Google AI |
---|---|
Initial release | December 6, 2023 |
Predecessor | Google Assistant PaLM |
Available in | English |
Type | Large language model |
License | Proprietary |
Website | gemini |
Google Gemini is a family of multimodal large language models developed by Google DeepMind, serving as the successor to LaMDA and PaLM 2. Comprising Gemini Ultra, Gemini Pro, and Gemini Nano, it was announced on December 6, 2023, positioned as a competitor to OpenAI's GPT-4. It powers the chatbot of the same name.
History
Development
In August 2023,
With news of Gemini's impending launch, OpenAI hastened its work on integrating GPT-4 with multimodal features similar to those of Gemini.[10] The Information reported in September that several companies had been granted early access to "an early version" of the LLM, which Google intended to make available to clients through Google Cloud's Vertex AI service. The publication also stated that Google was arming Gemini to compete with both GPT-4 and Microsoft's GitHub Copilot.[11][12]
Launch
On December 6, 2023, Pichai and Hassabis announced "Gemini 1.0" at a virtual press conference.
Gemini Ultra was said to have outperformed GPT-4,
Updates
Google partnered with
In February, Google launched "Gemini 1.5" in a limited capacity, positioned as a more powerful and capable model than 1.0 Ultra.
At the Google I/O 2024, Gemini 1.5 Flash is released.[38]
Technical specifications
The first generation of Gemini ("Gemini 1") has three models, with the same software architecture. They are decoder-only transformers, with modifications to allow efficient training and inference on TPUs. They have a context length of 32,768 tokens, with multi-query attention. Two versions of Gemini Nano, Nano-1 (1.8 billion parameters) and Nano-2 (3.25 billion parameters), are distilled from larger Gemini models, designed for use by edge devices such as smartphones. As Gemini is multimodal, each context window can contain multiple forms of input. The different modes can be interleaved and do not have to be presented in a fixed order, allowing for a multimodal conversation. For example, the user might open the conversation with a mix of text, picture, video, and audio, presented in any order, and Gemini might reply with the same free ordering.
Input images may be of different
Demis Hassabis claims that training Gemini 1 used "roughly the same amount of compute, maybe slightly more than what was rumored for GPT-4".[40]
The second generation of Gemini ("Gemini 1.5") has two models published so far:[41]
- Gemini 1.5 Pro. It is a multimodal sparse mixture-of-experts, with context length of "multiple millions".
- Gemini 1.5 Flash. It is online distilled[42] from Gemini 1.5 Pro, with context length above 2 million.
Reception
Gemini's launch was preluded by months of intense speculation and anticipation, which
Reacting to its unveiling in December 2023,
Google faced criticism for a demonstrative video of Gemini, which was not conducted in real time.[52]
See also
- Gato, a multimodal neural network developed by DeepMind
References
- from the original on May 10, 2023. Retrieved August 21, 2023.
- ZDNet. Archivedfrom the original on May 10, 2023. Retrieved August 21, 2023.
- ^ from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ Levy, Steven (September 11, 2023). "Sundar Pichai on Google;s AI, Microsoft's AI, OpenAI, and ... Did We Mention AI?". Wired. Archived from the original on September 11, 2023. Retrieved September 12, 2023.
- ^ Knight, Will (June 26, 2023). "Google DeepMind's CEO Says Its Next Algorithm Will Eclipse ChatGPT". Wired. Archived from the original on June 26, 2023. Retrieved August 21, 2023.
- ^ a b c Victor, Jon (August 15, 2023). "How Google is Planning to Beat OpenAI". The Information. Archived from the original on August 15, 2023. Retrieved August 21, 2023.
- from the original on January 20, 2023. Retrieved February 6, 2023.
- from the original on July 21, 2023. Retrieved September 7, 2023.
- ^ Carter, Tom (December 7, 2023). "Google confirms that its cofounder Sergey Brin played a key role in creating its ChatGPT rival". Business Insider. Archived from the original on December 7, 2023. Retrieved December 31, 2023.
- ^ Victor, Jon (September 18, 2023). "OpenAI Hustles to Beat Google to Launch 'Multimodal' LLM". The Information. Archived from the original on September 18, 2023. Retrieved October 15, 2023.
- ^ "Google nears release of AI software Gemini, The Information reports". Reuters. September 14, 2023. Archived from the original on September 15, 2023. Retrieved October 2, 2023.
- ^ Nolan, Beatrice (September 23, 2023). "Google is quietly handing out early demos of its GPT-4 rival called Gemini. Here's what we know so far about the upcoming AI model". Business Insider. Archived from the original on September 23, 2023. Retrieved October 16, 2023.
- ^ from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ a b c d Liedtike, Michael; O'Brien, Matt (December 6, 2023). "Google launches Gemini, upping the stakes in the global AI race". Associated Press. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ Edwards, Benj (December 6, 2023). "Google launches Gemini—a powerful AI model it says can surpass GPT-4". Ars Technica. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ a b Pierce, David (December 6, 2023). "Google launches Gemini, the AI model it hopes will take down GPT-4". The Verge. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ Fung, Brian; Thorbecke, Catherine (December 6, 2023). "Google launches Gemini, its most-advanced AI model yet, as it races to compete with ChatGPT". CNN Business. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ "Google launches Gemini, upping the stakes in the global AI race". CBS News. December 6, 2023. Archived from the original on December 7, 2023. Retrieved December 7, 2023.
- ^ a b c Knight, Will (December 6, 2023). "Google Just Launched Gemini, Its Long-Awaited Answer to ChatGPT". Wired. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ a b Henshall, Will (December 6, 2023). "Google DeepMind Unveils Its Most Powerful AI Offering Yet". Time. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ Elias, Jennifer (December 6, 2023). "Google launches its largest and 'most capable' AI model, Gemini". CNBC. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ Alba, Davey; Ghaffary, Shirin (December 6, 2023). "Google Opens Access to Gemini, Racing to Catch Up to OpenAI". Bloomberg News. Archived from the original on December 6, 2023. Retrieved December 7, 2023.
- ^ Knight, Will (December 6, 2023). "Google DeepMind's Demis Hassabis Says Gemini Is a New Breed of AI". Wired. Archived from the original on December 6, 2023. Retrieved December 7, 2023.
- ^ Gurman, Mark; Love, Julia; Alba, Davey (January 17, 2024). "Samsung Bets on Google-Powered AI Features in Smartphone Revamp". Bloomberg News. Archived from the original on January 17, 2024. Retrieved February 4, 2024.
- ^ Chokkattu, Julian (January 17, 2024). "Samsung's Galaxy S24 Phones Call on Google's AI to Spruce Up Their Smarts". Wired. Archived from the original on January 17, 2024. Retrieved February 4, 2024.
- from the original on February 8, 2024. Retrieved February 8, 2024.
- ^ Dastin, Jeffrey (February 8, 2024). "Google rebrands Bard chatbot as Gemini, rolls out paid subscription". Reuters. Archived from the original on February 8, 2024. Retrieved February 8, 2024.
- 9to5Google. Archivedfrom the original on February 8, 2024. Retrieved February 8, 2024.
- ^ Mehta, Ivan (February 1, 2024). "Google's Bard chatbot gets the Gemini Pro update globally". TechCrunch. Archived from the original on February 1, 2024. Retrieved February 4, 2024.
- ^ Knight, Will (February 15, 2024). "Google's Flagship AI Model Gets a Mighty Fast Upgrade". Wired. Archived from the original on February 15, 2024. Retrieved February 21, 2024.
- ^ Nieva, Richard (February 15, 2024). "Google Unveils Gemini 1.5, But Only Developers And Enterprise Clients Have Access For Now". Forbes. Archived from the original on February 15, 2024. Retrieved February 21, 2024.
- ^ McCracken, Harry (February 15, 2024). "Google's new Gemini 1.5 AI can dive deep into oceans of video and audio". Fast Company. Archived from the original on February 17, 2024. Retrieved February 21, 2024.
- ^ Stokes, Samantha (February 15, 2024). "Here's everything you need to know about Gemini 1.5, Google's newly updated AI model that hopes to challenge OpenAI". Business Insider. Archived from the original on February 19, 2024. Retrieved February 21, 2024.
- ^ Khan, Jeremy (February 21, 2024). "Google unveils new family of open-source AI models called Gemma to take on Meta and others—deciding open-source AI ain't so bad after all". Fast Company. Archived from the original on February 21, 2024. Retrieved February 21, 2024.
- ^ Alba, Davey (February 21, 2024). "Google Delves Deeper Into Open Source with Launch of Gemma AI Model". Bloomberg News. Archived from the original on February 21, 2024. Retrieved February 21, 2024.
- from the original on February 21, 2024. Retrieved February 21, 2024.
- ^ "Gemini Flash". Google DeepMind. Retrieved May 17, 2024.
- ^ Gemini: A Family of Highly Capable Multimodal Models (PDF) (Technical report). Google DeepMind. December 6, 2023. Archived (PDF) from the original on December 6, 2023. Retrieved December 7, 2023.
- ^ Patel, Dwarkesh (February 21, 2024). "Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat". www.dwarkeshpatel.com. Retrieved March 3, 2024.
- ^ Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context (PDF) (Technical report). Google DeepMind. February 15, 2024. Archived (PDF) from the original on February 26, 2024.
- arXiv:1804.03235 [cs.LG].
- ^ a b Heikkilä, Melissa; Heaven, Will Douglas (December 6, 2023). "Google DeepMind's new Gemini model looks amazing—but could signal peak AI hype". MIT Technology Review. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
- ^ howdhury, Hasan (August 29, 2023). "AI bros are at war over declarations that Google's upcoming Gemini AI model smashes OpenAI's GPT-4". Business Insider. Archived from the original on August 29, 2023. Retrieved September 7, 2023.
- ^ Harrison, Maggie (August 31, 2023). "OpenAI Rages at Report that Google's New AI Crushes GPT-4". Fortune. Archived from the original on August 31, 2023. Retrieved September 7, 2023.
- ^ Musk, Elon [@elonmusk] (August 30, 2023). "Are the numbers wrong?" (Tweet). Archived from the original on September 1, 2023. Retrieved October 15, 2023 – via Twitter.
- ^ Langley, Hugh (October 12, 2023). "Google VP teases Gemini's multimodal future: 'I've seen some pretty amazing things.'". Business Insider. Archived from the original on October 12, 2023. Retrieved October 15, 2023.
- ^ Madden, Michael G. (December 15, 2023). "Google's Gemini: is the new AI model really better than ChatGPT?". The Conversation. Archived from the original on December 15, 2023. Retrieved February 4, 2024.
- ^ Sullivan, Mark (December 6, 2023). "Gemini-powered Google phones may make Siri even more of an Achilles' heel for the iPhone". Fast Company. Archived from the original on December 7, 2023. Retrieved December 7, 2023.
- ^ Soni, Aditya (December 7, 2023). "Alphabet soars as Wall Street cheers arrival of AI model Gemini". Reuters. Archived from the original on December 7, 2023. Retrieved February 4, 2024.
- ^ Swartz, Jon (December 7, 2023). "Gemini, Google's long-awaited answer to ChatGPT, is an overnight hit". MarketWatch. Archived from the original on December 7, 2023. Retrieved February 4, 2024.
- ^ Elias, Steve Kovach, Jennifer (December 8, 2023). "Google faces controversy over edited Gemini AI demo video". CNBC. Retrieved December 9, 2023.
{{cite web}}
: CS1 maint: multiple names: authors list (link)
Further reading
- Honan, Matt (December 6, 2023). "Google CEO Sundar Pichai on Gemini and the coming age of AI". MIT Technology Review. Archived from the original on December 6, 2023. Retrieved December 6, 2023.
External links
- Official website
- Press release via The Keyword
- Announcement and demo on YouTube
- White paper for 1.0 and 1.5