YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Nano Banana 2(Gemini 3.1 Flash Image)

Technical Specifications of Gemini 3.1 Flash Image Preview

Item Gemini 3.1 Flash Image Preview
Provider Google
Model family Gemini 3.1 (Flash tier)
Primary focus Fast multimodal generation with image preview
Input types Text, Image
Output types Text, Image (preview generation)
Context window Up to 1M tokens (Gemini 3.x Flash tier standard)
Latency tier Low-latency, high-throughput
Streaming support Yes
Tool calling Yes (Gemini API tools framework)
Version 3.1

What is Nano Banana 2

Nano Banana 2 is the popular nickname used by the press and developer community for the newly released Gemini-3.1-Flash-Image model. Google positions it as the “Flash”-tier image engine that brings near-Pro visual fidelity to a much lower latency and cost tier — suitable for high-volume generation, rapid iterative editing, and integrated product workflows across Google services. It inherits Gemini 3.1’s multimodal reasoning and adds image-centric capabilities (legible text in images, multi-image composition, wide aspect ratio support, native 4K).

Main features

  • High-speed, multi-resolution generation: Flash-tier speed with options for 0.5K / 1K / 2K / 4K outputs and new extreme aspect ratios (1:4, 4:1, 1:8, 8:1).
  • Real-time web grounding: Integrates both text and image search results to ground generated content in current web information when “Thinking” or search grounding is enabled. Useful for up-to-date references and factual infographics.
  • Improved text rendering: Better short-text and graphic text rendering (fonts, sizes) than earlier Flash models; still imperfect on long paragraphs/small text.
  • Multi-input editing and multi-turn workflows: Strong support for combining several images as inputs and for iterative edits across turns.

📊 Benchmark Performance — Image Generation & Editing (Elo scores)

Capability Gemini 3.1 Flash Image (Nano Banana 2) Gemini 2.5 Flash Image (Nano Banana) Gemini 3 Pro Image (Nano Banana Pro) GPT-Image 1.5 Seedream 5.0 Lite Grok Imagine Image Pro
Text-to-Image — Overall Preference 1079.0 ± 7.0 1073.0 ± 5.0 942.0 ± 6.0 1021.0 ± 5.0 1047.0 ± 5.0 928.0 ± 8.0
Text-to-Image — Visual Quality 1140.0 ± 6.0 1129.0 ± 6.0 929.0 ± 6.0 1043.0 ± 5.0 975.0 ± 5.0 759.0 ± 10.0
Text-to-Image — Infographics (Factuality) 1114.0 ± 14.0 1074.0 ± 12.0 881.0 ± 13.0 1102.0 ± 13.0 985.0 ± 12.0 890.0 ± 22.0
Editing — General 1065.0 ± 9.0 1047.0 ± 9.0 913.0 ± 9.0 1051.0 ± 10.0 995.0 ± 8.0 937.0 ± 9.0
Editing — Character 1056.0 ± 7.0 1049.0 ± 7.0 952.0 ± 7.0 1050.0 ± 8.0 1025.0 ± 7.0 894.0 ± 8.0
Editing — Creative 1023.0 ± 7.0 1031.0 ± 7.0 976.0 ± 7.0 1004.0 ± 7.0 1017.0 ± 7.0 938.0 ± 7.0
Editing — Object/Environment 1029.0 ± 8.0 1018.0 ± 8.0 945.0 ± 8.0 1042.0 ± 10.0 976.0 ± 8.0 946.0 ± 9.0
Editing — Multi-Input 1037.0 ± 8.0 1016.0 ± 8.0 919.0 ± 9.0 1056.0 ± 12.0 1014.0 ± 9.0 N/A
Editing — Stylization 1045.0 ± 7.0 1031.0 ± 7.0 862.0 ± 8.0 1045.0 ± 9.0 996.0 ± 7.0 984.0 ± 7.0

Key takeaways from this benchmark table:

  • Across text-to-image generation and image editing categories, Gemini 3.1 Flash Image consistently leads or matches the highest scores among Flash-tier and many competitive image models.
  • The model shows especially strong results in Visual Quality and Infographic (Factuality) benchmarks—signaling that it excels not only in aesthetic quality but also in rendering structurally accurate content.
  • On Multi-Input editing, Nano Banana 2 also shows robust generalization, with higher scores than its previous Flash generation.

These evaluations are conducted via human side-by-side Elo comparisons on a diverse benchmark suite, reflecting both preference and fidelity across commonly used image generation/editing tasks.

Nano Banana 2 vs Nano Banana vs Nano Banana Pro

Model Positioning Representative benchmark/notes
Gemini 3.1 Flash Image (Nano Banana 2) Flash tier: speed + high visual quality (2K–4K) Overall preference 1079.0 ± 7.0; visual quality 1140 ± 6.0 (internal GenAI-Bench).
Gemini 2.5 Flash Image (Nano Banana) Earlier Flash release (lower fidelity) Slightly lower preference/visual scores vs 3.1.
Gemini 3 Pro Image (Nano Banana Pro) Pro tier: higher perceived fidelity for complex tasks, higher cost/latency Different tradeoffs; some metrics show different relative rankings in specialty tasks.
GPT-Image 1.5 / other commercial models Competitors (open/closed) In Google’s internal benchmarks GPT-Image and others scored below Gemini 3.1 on visual quality and overall preference in the reported eval. Independent third-party comparisons vary.

When to choose Flash Image Preview:

  • Real-time image preview in apps
  • Cost-sensitive large-scale image generation
  • Interactive design assistants

How to access and integrate Nano Banana 2

Step 1: Sign Up for API Key

Log in to cometapi.com. If you are not our user yet, please register first. Sign into your CometAPI console. Get the access credential API key of the interface. Click “Add Token” at the API token in the personal center, get the token key: sk-xxxxx and submit.

Step 2: Send Requests to Nano Banana 2 API

Select the “**gemini-3.1-flash-image-preview8**” endpoint to send the API request and set the request body. The request method and request body are obtained from our website API doc. Our website also provides Apifox test for your convenience. Replace with your actual CometAPI key from your account. Where to call it:Gemini generates image

Nano Banana 2 supports image editing, image generation, and multi-image workflows. For image editing, you need to upload the image URL. For more parameters, please refer to the documentation.

Step 3: Retrieve and Verify Results

Process the API response to get the generated answer. After processing, the API responds with the task status and output data. You can directly download the image to your local machine in the playground (usually in PNG format). An image URL is generated in the API process; please download it promptly.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 1 Ask for provider support