Gemini Imagen4
Generate stunning images from text descriptions using Google's cutting-edge Imagen 4.0 models. Customize image generation with multiple model variants, aspect ratios, and output formats. Browse and manage generated images locally through the MCP protocol with built-in safety filtering.
Ask AI about Gemini Imagen4
Powered by Claude Β· Grounded in docs
I know everything about Gemini Imagen4. Ask me about installation, configuration, usage, or troubleshooting.
0/500
Reviews
Documentation
π¨ Gemini-Imagen4
Local AI Image Generation MCP Server powered by Google's Imagen 4.0 models
Generate stunning images from text descriptions using Google's cutting-edge Imagen 4.0 models through the Model Context Protocol (MCP). Images are saved locally to ./generated-images/ directory.
β¨ Features
- πΌοΈ High-quality image generation using Google's latest Imagen 4.0 models
- β‘ Multiple model variants for different speed/quality needs
- π― Flexible aspect ratios (1:1, 3:4, 4:3, 9:16, 16:9)
- πΈ Multiple output formats (PNG, JPEG)
- π Built-in safety filtering with reason reporting
- πΎ Local file storage - images saved to
./generated-images/directory - π Easy local setup - run with npx or npm
- π MCP Resources - browse generated images via MCP protocol
π Quick Start
Prerequisites
- Node.js 18+
- Google Gemini API key (Get one here)
Installation & Usage
Option 1: Install from npm (Recommended)
npm install -g gemini-imagen4
Then run:
gemini-imagen4
Option 2: Run directly with npx
npx gemini-imagen4
Option 3: Clone and run locally
git clone https://github.com/chug2k/gemini-imagen4.git
cd gemini-imagen4
npm install
npm run dev
You'll be prompted to enter your Gemini API key when you first run it.
Usage
Once running, the server provides:
Tool: generate_image_from_text
Generate images from text descriptions with optional customization.
Parameters:
prompt(required): Text description of the image to generatemodel(optional): Choose from:imagen-4.0-generate-preview-06-06(default - balanced)imagen-4.0-fast-generate-preview-06-06(faster generation)imagen-4.0-ultra-generate-preview-06-06(highest quality)
aspectRatio(optional):1:1,3:4,4:3,9:16, or16:9outputMimeType(optional):image/png(default) orimage/jpeg
Example:
{
"prompt": "A majestic dragon soaring through a sunset sky",
"model": "imagen-4.0-ultra-generate-preview-06-06",
"aspectRatio": "16:9",
"outputMimeType": "image/png"
}
Response:
Images are saved to ./generated-images/ with timestamped filenames like 1754998591_majestic_dragon_soaring.png
Resource: generated-images
Browse the generated images directory via MCP resources protocol.
π Adding to Claude Desktop
To use this server with Claude Desktop, add it to your MCP configuration:
Edit your claude_desktop_config.json:
{
"mcpServers": {
"gemini-imagen4": {
"command": "npx",
"args": ["gemini-imagen4"],
"env": {
"GEMINI_API_KEY": "your-api-key-here"
}
}
}
}
Replace your-api-key-here with your actual Gemini API key, then restart Claude Desktop.
π§ Configuration
The server requires a Gemini API key and supports the following configuration:
# smithery.yaml
runtime: typescript
startCommand:
type: http
configSchema:
type: object
required: ["geminiApiKey"]
properties:
geminiApiKey:
type: string
title: "Gemini API Key"
description: "Your Google Gemini API key"
modelName:
type: string
title: "Model Name"
description: "Default Imagen model to use"
default: "imagen-4.0-generate-preview-06-06"
enum:
- "imagen-4.0-generate-preview-06-06"
- "imagen-4.0-fast-generate-preview-06-06"
- "imagen-4.0-ultra-generate-preview-06-06"
πββοΈ Development
Run the development server:
npm run dev
π¦ Model Variants
- Standard (
imagen-4.0-generate-preview-06-06): Best balance of quality and speed - Fast (
imagen-4.0-fast-generate-preview-06-06): Optimized for quick generation - Ultra (
imagen-4.0-ultra-generate-preview-06-06): Maximum quality output
π‘οΈ Safety & Content Filtering
All images are processed through Google's built-in safety filters. If content is filtered, the tool will return the reason for filtering instead of an image.
π€ Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
π License
MIT License - see LICENSE file for details.
π Links
Built with β€οΈ using Google's Imagen 4.0 and the Model Context Protocol
