Gemini is the large multimodal model powering the Gemini app, but you can also use its API through Google Cloud and integrate it into your applications. Gemini offers different sizes, from Nano to Ultra, including Pro. Its unique feature is its multimodality: you can give it text, images, or videos! This opens up new use cases for you.
In this presentation, we will explore the Gemini model (and its little “open-weights” model sister, Gemma). With our Java hats on, we will learn how to use its API, especially with the LangChain4j library.
How to get the most out of Gemini? We will see how to extract unstructured data, how to classify text, how to extend the model's knowledge with the RAG (Retrieval Augmented Generation) approach, and how to use "function calls" to invoke external services when generating text.
Hold on tight! The Gemini capsule is about to take off!