Under the hood

The models behind the images

OKSLOP combines multiple AI models so you don't have to. No LoRA training, no GPU rentals, no infrastructure headaches.

We pick the right model for each job (fast drafts, high-fidelity refinements, text-heavy editorial, or 4× upscaling) and handle all the routing, parameters, and post-processing behind the scenes.
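The routing described above can be pictured as a lookup from job type to model. The sketch below is illustrative only: the job names, the `ROUTES` table, and `pick_model` are hypothetical, and the model picks are just plausible choices from this page, not OKSLOP's actual routing rules.

```python
from dataclasses import dataclass

# Hypothetical routing table. Job kinds and model picks are illustrative
# assumptions based on the models listed on this page.
ROUTES = {
    "fast_draft": "FLUX.2 Klein",
    "high_fidelity": "FLUX.2 Pro",
    "text_heavy": "GPT Image 2",
    "upscale_4x": "Real-ESRGAN",
}


@dataclass(frozen=True)
class Job:
    kind: str    # one of the keys in ROUTES
    prompt: str  # the creative brief or prompt text


def pick_model(job: Job) -> str:
    """Return the model name for a job kind, or raise for unknown kinds."""
    try:
        return ROUTES[job.kind]
    except KeyError:
        raise ValueError(f"no route for job kind: {job.kind!r}") from None


print(pick_model(Job("text_heavy", "conference poster with a long title")))
```

A real router would likely weigh cost, latency, and licensing as well; a flat table is only the simplest possible form of the idea.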

Looking for 3D generation models? Explore the 3D directory →

Image generation

21 models

GPT Image 2

OpenAI · Apr 22, 2026 · source

Max resolution 2048×2048

Text in images excellent

Strengths

+Best-in-class text rendering — multi-word labels, signs, UI, and non-Latin scripts render cleanly
+Strong layout and typography for diagrams, infographics, posters, comics, and multi-panel scenes
+Flexible aspect ratios with high-resolution output up to 2K
+Thinking-mode generation: pairs with a reasoning model to research, transform source material, and self-check outputs
+Topped every Image Arena leaderboard at release with a record +242 Elo lead on text-to-image
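To give the +242 Elo figure some intuition, the sketch below converts a rating gap into an expected head-to-head win rate. It assumes the standard logistic Elo curve with a 400-point scale; the arena's exact rating method may differ, so treat the result as a rough guide.

```python
def elo_win_probability(rating_gap: float, scale: float = 400.0) -> float:
    """Expected win rate of the higher-rated model under the standard Elo curve."""
    return 1.0 / (1.0 + 10.0 ** (-rating_gap / scale))


# A 242-point lead under this assumption works out to roughly an 80% win rate.
print(f"{elo_win_probability(242):.3f}")
```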

Limitations

−Closed-weights and API-only — no self-hosting, fine-tuning, or LoRAs
−Priced for production, not iteration: $30 / 1M output image tokens
−Tier-gated rate limits: 5 images per minute (IPM) on Tier 1, up to 250 IPM on Tier 5, so bulk workloads need an upgraded tier
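The pricing and rate-limit figures above translate into simple arithmetic. The sketch below uses the $30 per 1M output image tokens price and the Tier 1 and Tier 5 limits from this page; the tokens-per-image value is a placeholder assumption, since the real figure depends on resolution and quality settings.

```python
PRICE_PER_MILLION_TOKENS = 30.0  # USD per 1M output image tokens, from this page


def image_cost_usd(output_tokens: int) -> float:
    """Cost of one image given its output image token count."""
    return output_tokens * PRICE_PER_MILLION_TOKENS / 1_000_000


def batch_minutes(num_images: int, images_per_minute: int) -> float:
    """Minimum wall-clock minutes to generate a batch at a given rate limit."""
    return num_images / images_per_minute


tokens_per_image = 4_000  # placeholder assumption, not a documented value
print(f"cost per image: ${image_cost_usd(tokens_per_image):.2f}")
print(f"10,000 images at Tier 1: {batch_minutes(10_000, 5):.0f} min")
print(f"10,000 images at Tier 5: {batch_minutes(10_000, 250):.0f} min")
```

At these limits a 10,000-image batch needs at least 2,000 minutes on Tier 1 and 40 minutes on Tier 5, which is why bulk work depends on the tier.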

Z-Image

Open source

Tongyi MAI (Alibaba) · Jan 27, 2026 · source

Max resolution 2048×2048

Text in images good

Strengths

+Apache-2.0 — no non-commercial strings, unlike FLUX.2 Dev
+Turbo variant: sub-second on H800 at 8 NFEs (function evaluations, i.e. sampling steps), runs in 16GB VRAM
+Ranked #1 open-source on Artificial Analysis Text-to-Image Leaderboard at release
+Four variants cover base, fast, editing, and omni use cases

Limitations

−New and less battle-tested than FLUX — fewer LoRAs, less community tooling

Z-Image Turbo

Open source

Tongyi MAI (Alibaba) · Nov 26, 2025 · Scalable Single-Stream DiT (distilled) · 6B params · source

Max resolution 2048×2048

Text in images good

Strengths

+Sub-second inference on H800 at 8 NFEs (function evaluations, i.e. sampling steps)
+Fits in 16GB VRAM — runs on consumer GPUs
+Apache-2.0 — fully commercial-friendly
+Near-parity with base Z-Image on most prompts

Limitations

−Slight quality drop vs base Z-Image on complex scenes
−Less community tooling than FLUX distillations

Qwen-Image

Open source

Alibaba (Qwen team) · Aug 4, 2025 · source

Max resolution 2048×2048

Text in images excellent

Strengths

+20B parameters: the largest permissively licensed open DiT in this generation
+Best-in-class Chinese text rendering; strong English too
+Mature editing variants (2509, 2511, Edit-2511) with multi-person support
+Apache-2.0 — fully commercial-clean

Limitations

−20B is heavy — self-hosting needs serious VRAM vs Z-Image's 6B

HunyuanImage-3.0

Open source

Tencent Hunyuan · Sep 28, 2025 · source

Max resolution 2048×2048

Text in images good

Strengths

+Largest open-source image MoE — 80B total / 13B active / 64 experts
+Native autoregressive multimodal framework (unlike DiT competitors)
+Prompt self-rewriting and chain-of-thought reasoning built in
+Supports editing and multi-image fusion out of the box

Limitations

−Eye-watering VRAM: ≥3×80GB for base, ≥8×80GB for Instruct
−License is Tencent's custom community license, not Apache/MIT — read carefully
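The VRAM notes on the open models follow largely from parameter count. As a rough sketch, the weights alone take parameters times bytes per parameter; activations, text encoders, and the VAE add to that, so real requirements are higher. The parameter counts come from this page, while the helper name and dtype table are illustrative.

```python
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "int8": 1}


def weight_gb(params_billion: float, dtype: str = "bf16") -> float:
    """Approximate memory for model weights only, in decimal gigabytes."""
    return params_billion * BYTES_PER_PARAM[dtype]


# Parameter counts as listed on this page. For an MoE model, all experts
# must be resident in memory, so the total count is what matters here.
models = {
    "Z-Image Turbo": 6,
    "Qwen-Image": 20,
    "FLUX.2 Dev": 32,
    "HunyuanImage-3.0": 80,
}

for name, params in models.items():
    print(f"{name}: ~{weight_gb(params):.0f} GB of weights at bf16")
```

At bf16 this gives about 12 GB for 6B, 40 GB for 20B, 64 GB for 32B, and 160 GB for 80B. The last figure already fills two 80GB cards before any working memory, which is consistent with the three-GPU minimum quoted above.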

Seedream 4.0

ByteDance Seed · Sep 2025 · source

Max resolution 4K

Text in images good

Strengths

+Native 4K output — higher than most open competitors
+Unified generation + editing architecture
+Batch multi-in / multi-out processing
+High ranking on Artificial Analysis Image Arena at release

Limitations

−Closed weights — API-only, no self-hosting or fine-tuning
−Pricing and availability not publicly documented on the landing page

Midjourney v8

Midjourney · Mar 2026 · Diffusion · source

Max resolution native 2048px (--hd)

Text in images good

Strengths

+Still the benchmark for aesthetic quality and stylistic control
+Strong community, well-documented parameter ecosystem
+Consistent results across artistic styles

Limitations

−No public API — platform-only
−Text rendering weaker than GPT Image 2 / Ideogram
−Iteration loop is slower than API-first competitors

Nano Banana 2

Google DeepMind · 2026 · source

Max resolution up to 2048px

Text in images good

Strengths

+Deep Gemini integration — pairs well with reasoning and search
+Strong photorealism and natural-scene fidelity
+Improved over original Nano Banana

Limitations

−Closed, Google-only distribution
−Lower quality ceiling than the Pro variants

Nano Banana Pro

Google DeepMind · Nov 2025 · source

Max resolution 4K

Text in images excellent

Strengths

+Native 4K output — higher than most competitors
+Excellent multilingual text rendering
+Up to 4 reference images for guided generation
+Prompt-driven editing built in

Limitations

−Closed, API-only via Google Cloud
−Premium pricing tier

MAI Image 2

Microsoft · Apr 2026 · source

Max resolution up to 2048px

Text in images excellent

Strengths

+Ultra-realistic lighting and skin tones
+Precise text rendering in charts and slides
+Physically accurate environments
+Top 3 on Image Arena at release

Limitations

−Closed, Microsoft-only distribution
−API availability limited to Azure

Reve Image 1.0

Reve AI · 2026 · Hybrid Diffusion · 12B params · source

Max resolution up to 2048px

Text in images excellent

Strengths

+#1 on Artificial Analysis Image Arena
+Proprietary typography engine for exceptional text
+Best-in-class prompt adherence
+12B parameters with hybrid diffusion architecture

Limitations

−Closed weights — preview-only access currently
−New startup — less ecosystem than established players

Grok Imagine Image

xAI · Feb 2026 · source

Max resolution up to 2048px

Text in images good

Strengths

+Fast generation
+Native image-to-video pipeline
+Integrated with X/Twitter and Grok chatbot
+Massive scale — billions of generations served

Limitations

−Closed platform tied to X ecosystem
−API access requires xAI subscription

Grok Imagine Image Pro

xAI · Feb 2026 · source

Max resolution up to 2048px

Text in images good

Strengths

+Higher fidelity than standard Grok Imagine
+131K token context window
+Multi-reference image support
+Better consistency for character/style work

Limitations

−Premium pricing tier
−Closed platform tied to X ecosystem

FLUX.2 Dev

Open weights

Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source

Max resolution up to 4MP

Text in images good

Strengths

+Best quality in the FLUX.2 family (32B params)
+Multi-image reference and 4MP editing
+Strongest open-weights text-to-image model in its generation

Limitations

−Non-commercial license
−Slow (~30s per image via Cloudflare Workers AI)
−Heavy — significant VRAM needed for self-hosting
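FLUX.2 limits are quoted in megapixels rather than fixed dimensions, so the usable width and height depend on aspect ratio. The sketch below finds the largest size within a pixel budget. It assumes 1MP means 1,000,000 pixels and that dimensions must be multiples of 16; both are assumptions for illustration, not documented constraints.

```python
import math


def dims_for_budget(megapixels: float, aspect_w: int, aspect_h: int,
                    multiple: int = 16) -> tuple[int, int]:
    """Largest (width, height) at the given aspect ratio within the budget.

    Each side is rounded down to a multiple of `multiple`, so the result
    never exceeds the pixel budget.
    """
    budget = megapixels * 1_000_000
    height = math.sqrt(budget * aspect_h / aspect_w)
    width = height * aspect_w / aspect_h
    w = int(width) // multiple * multiple
    h = int(height) // multiple * multiple
    return w, h


print(dims_for_budget(4, 1, 1))   # square image at 4MP
print(dims_for_budget(4, 16, 9))  # widescreen image at 4MP
print(dims_for_budget(2, 1, 1))   # square image at 2MP
```

Under these assumptions a 4MP budget allows 2000×2000 for a square image or 2656×1488 at 16:9.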

FLUX.2 Klein

Open source

Black Forest Labs · Nov 2025 · Rectified Flow Transformer (distilled) · 4B / 9B params · source

Max resolution up to 2MP

Text in images good

Strengths

+Fast (~0.3–1.2s) and cheap — bulk generation workhorse
+Step-distilled to 4 inference steps
+Unified: text-to-image, editing, and multi-reference
+4B variant is Apache 2.0, runs on Cloudflare Workers AI

Limitations

−Fine detail (hands, small objects) sometimes off
−9B variant is non-commercial license

FLUX.2 Max

Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source

Max resolution up to 4MP

Text in images good

Strengths

+Top-tier quality in the FLUX.2 lineup
+Web-grounded generation — real-time info retrieval
+Up to 10 reference images for guided generation
+46K token context window
+Best editing consistency across the family

Limitations

−Premium pricing — most expensive FLUX.2 tier
−Closed weights — API-only access
−Slower generation than Pro or Flex

FLUX.2 Pro

Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source

Max resolution up to 4MP

Text in images good

Strengths

+High quality with balanced speed/fidelity
+Multi-reference image support
+4MP output resolution
+Good value for production workflows

Limitations

−Closed weights — API-only access
−Not as fast as Klein or Flex

FLUX.2 Flex

Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source

Max resolution up to 4MP

Text in images good

Strengths

+Direct control over inference steps and guidance
+Adjustable speed/quality tradeoff per request
+Up to 10 reference images
+Good for mixed iteration + refinement workflows

Limitations

−Closed weights — API-only access
−Requires tuning to get optimal results

Recraft V3 (Red Panda)

Recraft · Oct 2024 · source

Max resolution up to 2048px

Text in images excellent

Strengths

+Exceptional vector and illustration output
+Best-in-class for brand assets and icons
+Strong text rendering with font control
+Precise style control (realistic, illustration, vector)

Limitations

−Less suited for photorealistic natural scenes
−Smaller community than FLUX or Stable Diffusion
−API-only, no open weights

Ideogram 2.0

Ideogram · Aug 2024 · source

Max resolution up to 1344px

Text in images excellent

Strengths

+Excellent text rendering
+Strong at posters, signs, and typography-heavy images
+API available for integration

Limitations

−Less established ecosystem than FLUX or Stable Diffusion
−Limited fine-tuning options
−Can struggle with very complex multi-subject scenes

Adobe Firefly 3

Adobe · Apr 2024 · source

Max resolution up to 2048px

Text in images good

Strengths

+Commercially safe — trained on licensed content only
+Tight Photoshop/Creative Cloud integration
+Good for design workflows and compositing

Limitations

−Conservative output — avoids anything edgy
−Requires Creative Cloud subscription for full access
−Less photorealistic than FLUX or Midjourney
−Limited API access outside Adobe ecosystem

Upscaling & enhancement

3 models

Real-ESRGAN

Open source

Xintao Wang et al. (Tencent ARC Lab) · 2021 · source

Max resolution 4× input (1024 → 4096px)

Strengths

+4× resolution with minimal artifacts
+Preserves detail and sharpness while enlarging
+Handles AI-generated images well
+Fast, widely available via APIs

Limitations

−Can over-sharpen some textures
−No creative enhancement — purely resolution

Magnific AI

Magnific AI · Nov 2023 · source

Max resolution up to 16×

Strengths

+Adds realistic detail during upscale (hallucinated enhancement)
+Multiple creativity levels for different use cases
+Great for turning rough drafts into detailed images

Limitations

−Expensive per-image cost
−Can hallucinate unwanted detail at high creativity
−Closed platform, no self-hosting

Topaz Gigapixel AI

Topaz Labs · 2020 (continuously updated) · source

Max resolution up to 6×

Strengths

+Excellent for photography upscaling
+Multiple AI models tuned for different content types
+Desktop app — works offline

Limitations

−One-time purchase + subscription model
−Desktop only — no API
−Less suited for AI-generated content

Last updated April 2026. The AI image space moves fast — we'll keep this page current as new models ship and our pipeline evolves.

Skip the setup.
Create something.

Write a creative brief and our AI contributors will generate images using the right models for your needs. No GPU, no config, no guesswork.

