In this hands-on workshop, we'll explore Google AI Studio, the easy way to prototype and interact with Google's powerful AI, and discuss the Gemini API that drives it. Gemini is Google's advanced, multimodal AI, capable of understanding diverse information like code and images, not just text.

Our focus today is hands-on application: learning how these tools can help you write code faster, debug smarter, communicate clearer, and brainstorm better. By the end, you'll see how to effectively integrate AI into your development process to boost productivity and creativity.

Google AI Studio

Google AI Studio is a free, web-based tool that lets you easily experiment with and build applications using Google's powerful Gemini AI models.

Think of it as a user-friendly playground for developers and enthusiasts alike. You can quickly prototype ideas, test different prompts, and even generate code to integrate the AI's capabilities into your own projects.

Key features include the ability to:

Gemini API

Gemini API is Google's programming interface that lets developers integrate Gemini AI models (like Gemini 2.5, Gemini 2.0, or Gemma) into apps, websites, and services. It enables:

Gemini API

You can build apps that can:

Essentially, the Gemini API gives developers the tools to integrate the "brain" of Google's most advanced AI into their own creations.

Before starting this workshop, ensure you have:

Sign in with your Google account into Google AI Studio

Google AI Studio Interface

The vertical panel on the left side of the interface is where you choose the type of prompt you want to create and access your past work.

Prompt Interfaces

Chat Prompt

This prompt interface is the primary space where you interact with the AI.

Google AI Studio Interface

The panel on the right gives you fine-grained control over how the AI model responds.

Run Settings

Stream Realtime

The stream interface enables live, real-time conversations with Gemini. It lets you use your voice, webcam, and screen for a direct, continuous, multimodal dialog.

Stream realtime

The right panel contains a new set of controls specifically tailored for the "Stream Realtime" audio dialog mode.

Stream realtime

Generate Media

This is where you can create content beyond text, select your model of choice, add a prompt and generate your media. The Imagen model can be used to to generate high quality images across multiple aspect ratios, while Veo can be used to animate images and create video clips.

Generate Media

Build

This section is a developer-focused playground and workbench designed for prototyping and creating applications powered by Google's Gemini family of AI models.

In essence, it's all about:

It's an interactive environment meant to lower the barrier to entry for AI development, enabling creators to easily explore, build, and test the advanced multimodal capabilities of Gemini.

Build

1. Basic Code Generation

Prompt:

Write a Python function that prints "Hello World!"

2. Debugging + Error Explanation

Prompt:

Explain the error in this Python code and fix it:

def divide(a, b):
    return a / b

print(divide(10, 0))

3 Temperature Setting

Prompt with temperature set at 0:

Generate 5 names for a brand of coffee that is exclusively for software developers.

Prompt with temperature set at 2:

Generate 5 names for a brand of coffee that is exclusively for software developers.

4. Structured Output

Turn on the Structured Output tool.

Structured Output

Prompt:

Generate a sample product description for a wireless Bluetooth headset with the following specifications:  
- Brand: 'AudioPro'  
- Model: 'Nexus 2024'  
- Key features: noise cancellation, 30-hour battery life, IPX5 waterproof  
- Price range: mid-tier  

Structured output JSON format:

{
  "type": "object",
  "properties": {
    "product_name": {
      "type": "string"
    },
    "target_audience": {
      "type": "string"
    },
  }
}

5. Grounding with Google Search

Prompt:

Whats the latest React version?

Outdated result:

Outdated result

Turn on Grounding with Google Search tool.

Grounding with Google Search

Updated result:

Updated result

6. Image and Text Prompt

Choose a random image and prompt:

Can you summarize what is in this image?

7. Using System Instructions

System Instruction:

You're a sarcastic joker

System Instruction

Prompt:

Hi, I'm a junior dev

Prompt:

Build a recipe generator app that takes a list of ingredients and generates a recipe with visually appealing images.

After the model generates the app, you can add new features to the app and also inspect the code. code

This is the final product the model generated:

Build an AI Recipe Generator

In this hands-on session, we explored Google AI Studio and the Gemini API. We started by exploring the AI Studio interface, learning how to fine-tune models and adjust settings for optimal results. Through practical exercises, we mastered various prompting techniques, from basic queries to advanced multimodal inputs.

The highlight was building our AI recipe generator, taking ingredients as input and generating complete recipes with visual appeal. This demonstrated Gemini's capabilities in content creation and API integration.

Remember: effective prompting and proper model configuration are key to unlocking Gemini's full potential. We encourage you to apply these techniques to your own projects.

For further learning, visit the Gemini API documentation and continue experimenting in AI Studio.

Thank you for participating!