Under the hood
The models behind the images
OKSLOP combines multiple AI models so you don't have to. No LoRA training, no GPU rentals, no infrastructure headaches.
We pick the right model for each job (fast drafts, high-fidelity refinements, text-heavy editorial, or 4× upscaling) and handle all the routing, parameters, and post-processing behind the scenes.
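As a rough illustration of per-job routing, here is a minimal Python sketch. The job names, the model assigned to each job, and the parameter values are all illustrative assumptions, not the actual OKSLOP routing table; `Route`, `ROUTES`, and `pick_route` are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Route:
    """A model choice plus the default parameters sent with it."""
    model: str
    params: dict = field(default_factory=dict)


# Hypothetical routing table: the job-to-model mapping below is an
# assumption made for illustration, not the real pipeline configuration.
ROUTES = {
    "draft": Route("FLUX.2 Klein", {"steps": 4}),
    "refine": Route("FLUX.2 Pro"),
    "text": Route("GPT Image 2"),
    "upscale": Route("Real-ESRGAN", {"scale": 4}),
}


def pick_route(job_type: str) -> Route:
    """Return the route for a job type, or raise on an unknown job."""
    try:
        return ROUTES[job_type]
    except KeyError:
        raise ValueError(f"unknown job type: {job_type!r}") from None


print(pick_route("draft").model)
print(pick_route("upscale").params)
```

Keeping the mapping in a plain table means swapping a model for a job is a one-line change, which suits a field where new models ship often.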
Image generation
21 models
GPT Image 2
OpenAI · Apr 22, 2026 · source
- +Best-in-class text rendering — multi-word labels, signs, UI, and non-Latin scripts render cleanly
- +Strong layout and typography for diagrams, infographics, posters, comics, and multi-panel scenes
- +Flexible aspect ratios with high-resolution output up to 2K
- +Thinking-mode generation: pairs with a reasoning model to research, transform source material, and self-check outputs
- +Topped every Image Arena leaderboard at release with a record +242 Elo lead on text-to-image
- −Closed-weights and API-only — no self-hosting, fine-tuning, or LoRAs
- −Priced for production, not iteration: $30 / 1M output image tokens
- −Tier-gated rate limits (5 IPM on Tier 1, up to 250 IPM on Tier 5) — bulk workloads need an upgraded tier
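To make the pricing and rate-limit numbers above concrete, here is a small back-of-the-envelope calculation. It reads IPM as images per minute and takes the $30 / 1M output image tokens figure from the list above. The tokens-per-image value is a placeholder assumption (the real count depends on resolution and quality settings), and the function names are hypothetical.

```python
import math

# Figures quoted in the list above.
OUTPUT_PRICE_PER_MILLION_TOKENS = 30.0        # USD per 1M output image tokens
TIER_LIMITS_IPM = {"tier1": 5, "tier5": 250}  # images per minute


def minutes_for_batch(n_images: int, images_per_minute: int) -> int:
    """Minimum whole minutes needed to push a batch through a rate limit."""
    return math.ceil(n_images / images_per_minute)


def output_cost_usd(n_images: int, tokens_per_image: int) -> float:
    """Output-token cost of a batch; tokens_per_image is an assumed input."""
    total_tokens = n_images * tokens_per_image
    return total_tokens * OUTPUT_PRICE_PER_MILLION_TOKENS / 1_000_000


batch = 1_000
print(minutes_for_batch(batch, TIER_LIMITS_IPM["tier1"]))  # 200
print(minutes_for_batch(batch, TIER_LIMITS_IPM["tier5"]))  # 4

# 4_000 tokens per image is a placeholder, not a documented figure.
print(output_cost_usd(batch, tokens_per_image=4_000))      # 120.0
```

Under these assumptions, a 1,000-image batch takes over three hours on Tier 1 and about four minutes on Tier 5, which is why bulk workloads need the higher tier.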
Z-Image
Tongyi MAI (Alibaba) · Jan 27, 2026 · source
- +Apache-2.0 — no non-commercial strings, unlike FLUX.2 Dev
- +Turbo variant: sub-second on H800 at 8 NFEs, runs in 16GB VRAM
- +Ranked #1 open-source on Artificial Analysis Text-to-Image Leaderboard at release
- +Four variants cover base, fast, editing, and omni use cases
- −New and less battle-tested than FLUX — fewer LoRAs, less community tooling
Z-Image Turbo
Tongyi MAI (Alibaba) · Nov 26, 2025 · Scalable Single-Stream DiT (distilled) · 6B params · source
- +Sub-second inference on H800 at 8 NFEs
- +Fits in 16GB VRAM — runs on consumer GPUs
- +Apache-2.0 — fully commercial-friendly
- +Near-parity with base Z-Image on most prompts
- −Slight quality drop vs base Z-Image on complex scenes
- −Less community tooling than FLUX distillations
Qwen-Image
Alibaba (Qwen team) · Aug 4, 2025 · source
- +Largest permissively licensed open DiT in this generation at 20B parameters
- +Best-in-class Chinese text rendering; strong English too
- +Mature editing variants (2509, 2511, Edit-2511) with multi-person support
- +Apache-2.0 — fully commercial-clean
- −20B is heavy — self-hosting needs serious VRAM vs Z-Image's 6B
HunyuanImage-3.0
Tencent Hunyuan · Sep 28, 2025 · source
- +Largest open-source image MoE — 80B total / 13B active / 64 experts
- +Native autoregressive multimodal framework (unlike DiT competitors)
- +Prompt self-rewriting and chain-of-thought reasoning built in
- +Supports editing and multi-image fusion out of the box
- −Eye-watering VRAM: ≥3×80GB for base, ≥8×80GB for Instruct
- −License is Tencent's custom community license, not Apache/MIT — read carefully
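The VRAM figures quoted for the open-weights models above follow roughly from parameter count. The sketch below estimates weight memory only, assuming 16-bit weights (2 bytes per parameter); that precision is an assumption, and activations, text encoders, and framework overhead add more on top. `weight_memory_gb` is a hypothetical helper.

```python
def weight_memory_gb(params_billions: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for model weights alone, in GB (1 GB = 1e9 bytes).

    One billion parameters at N bytes each is N GB, so the estimate is
    a simple product. The 2-byte default assumes 16-bit weights.
    """
    return params_billions * bytes_per_param


# Parameter counts as quoted on this page.
models = {
    "Z-Image Turbo": 6,
    "Qwen-Image": 20,
    "HunyuanImage-3.0 (total)": 80,
}

for name, billions in models.items():
    print(f"{name}: ~{weight_memory_gb(billions):.0f} GB of weights")
```

The estimate is a lower bound. For a mixture-of-experts model, all experts typically need to be resident even though only a fraction is active per token, so the 80B total rather than the 13B active figure drives memory.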
Seedream 4.0
ByteDance Seed · Sep 2025 · source
- +Native 4K output — higher than most open competitors
- +Unified generation + editing architecture
- +Batch multi-in / multi-out processing
- +High ranking on Artificial Analysis Image Arena at release
- −Closed weights — API-only, no self-hosting or fine-tuning
- −Pricing and availability not publicly documented on the landing page
Midjourney v8
Midjourney · Mar 2026 · Diffusion · source
- +Still the benchmark for aesthetic quality and stylistic control
- +Strong community, well-documented parameter ecosystem
- +Consistent results across artistic styles
- −No public API — platform-only
- −Text rendering weaker than GPT Image 2 / Ideogram
- −Iteration loop is slower than API-first competitors
Nano Banana 2
Google DeepMind · 2026 · source
- +Deep Gemini integration — pairs well with reasoning and search
- +Strong photorealism and natural-scene fidelity
- +Improved over original Nano Banana
- −Closed, Google-only distribution
- −Nano Banana Pro offers a higher quality ceiling
Nano Banana Pro
Google DeepMind · Nov 2025 · source
- +Native 4K output — higher than most competitors
- +Excellent multilingual text rendering
- +Up to 4 reference images for guided generation
- +Prompt-driven editing built in
- −Closed, API-only via Google Cloud
- −Premium pricing tier
MAI Image 2
Microsoft · Apr 2026 · source
- +Ultra-realistic lighting and skin tones
- +Precise text rendering in charts and slides
- +Physically accurate environments
- +Top 3 on Image Arena at release
- −Closed, Microsoft-only distribution
- −API availability limited to Azure
Reve Image 1.0
Reve AI · 2026 · Hybrid Diffusion · 12B params · source
- +#1 on Artificial Analysis Image Arena
- +Proprietary typography engine for exceptional text rendering
- +Best-in-class prompt adherence
- +12B parameters with hybrid diffusion architecture
- −Closed weights — preview-only access currently
- −New startup — less ecosystem than established players
Grok Imagine Image
xAI · Feb 2026 · source
- +Fast generation
- +Native image-to-video pipeline
- +Integrated with X/Twitter and Grok chatbot
- +Massive scale — billions of generations served
- −Closed platform tied to X ecosystem
- −API access requires xAI subscription
Grok Imagine Image Pro
xAI · Feb 2026 · source
- +Higher fidelity than standard Grok Imagine
- +131K token context window
- +Multi-reference image support
- +Better consistency for character/style work
- −Premium pricing tier
- −Closed platform tied to X ecosystem
FLUX.2 Dev
Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source
- +Best quality in the FLUX.2 family (32B params)
- +Multi-image reference and 4MP editing
- +Strongest open-weights text-to-image model in its generation
- −Non-commercial license
- −Slow (~30s per image via Cloudflare Workers AI)
- −Heavy — significant VRAM needed for self-hosting
FLUX.2 Klein
Black Forest Labs · Nov 2025 · Rectified Flow Transformer (distilled) · 4B / 9B params · source
- +Fast (~0.3–1.2s) and cheap — bulk generation workhorse
- +Step-distilled to 4 inference steps
- +Unified: text-to-image, editing, and multi-reference
- +4B variant is Apache 2.0, runs on Cloudflare Workers AI
- −Fine detail (hands, small objects) sometimes off
- −9B variant is non-commercial license
FLUX.2 Max
Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source
- +Top-tier quality in the FLUX.2 lineup
- +Web-grounded generation — real-time info retrieval
- +Up to 10 reference images for guided generation
- +46K token context window
- +Best editing consistency across the family
- −Premium pricing — most expensive FLUX.2 tier
- −Closed weights — API-only access
- −Slower generation than Pro or Flex
FLUX.2 Pro
Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source
- +High quality with balanced speed/fidelity
- +Multi-reference image support
- +4MP output resolution
- +Good value for production workflows
- −Closed weights — API-only access
- −Not as fast as Klein or Flex
FLUX.2 Flex
Black Forest Labs · Nov 2025 · Rectified Flow Transformer · 32B params · source
- +Direct control over inference steps and guidance
- +Adjustable speed/quality tradeoff per request
- +Up to 10 reference images
- +Good for mixed iteration + refinement workflows
- −Closed weights — API-only access
- −Requires tuning to get optimal results
Recraft V3 (Red Panda)
Recraft · Oct 2024 · source
- +Exceptional vector and illustration output
- +Best-in-class for brand assets and icons
- +Strong text rendering with font control
- +Precise style control (realistic, illustration, vector)
- −Less suited for photorealistic natural scenes
- −Smaller community than FLUX or SD
- −API-only, no open weights
Ideogram 2.0
Ideogram · Aug 2024 · source
- +Excellent text rendering
- +Strong at posters, signs, and typography-heavy images
- +API available for integration
- −Less established ecosystem than FLUX or SD
- −Limited fine-tuning options
- −Can struggle with very complex multi-subject scenes
Adobe Firefly 3
Adobe · Apr 2024 · source
- +Commercially safe — trained on licensed content only
- +Tight Photoshop/Creative Cloud integration
- +Good for design workflows and compositing
- −Conservative output — avoids anything edgy
- −Requires Creative Cloud subscription for full access
- −Less photorealistic than FLUX or Midjourney
- −Limited API access outside Adobe ecosystem
Upscaling & enhancement
3 models
Real-ESRGAN
Xintao Wang et al. (Tencent ARC Lab) · 2021 · source
- +4× resolution with minimal artifacts
- +Preserves detail and sharpness while enlarging
- +Handles AI-generated images well
- +Fast, widely available via APIs
- −Can over-sharpen some textures
- −No creative enhancement — purely resolution
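For a sense of what 4× means in practice, the sketch below assumes the factor applies to each side of the image, so the pixel count grows by 16×. The helper name and the 1024×1024 input size are illustrative.

```python
def upscaled_size(width: int, height: int, scale: int = 4) -> tuple[int, int]:
    """Output dimensions when each side is multiplied by `scale`."""
    return width * scale, height * scale


src_w, src_h = 1024, 1024  # illustrative input size
out_w, out_h = upscaled_size(src_w, src_h)

print(out_w, out_h)                         # 4096 4096
print((out_w * out_h) // (src_w * src_h))   # 16
```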
Magnific AI
Magnific AI · Nov 2023 · source
- +Adds realistic detail during upscale (hallucinated enhancement)
- +Multiple creativity levels for different use cases
- +Great for turning rough drafts into detailed images
- −Expensive per-image cost
- −Can hallucinate unwanted detail at high creativity
- −Closed platform, no self-hosting
Topaz Gigapixel AI
Topaz Labs · 2020 (continuously updated) · source
- +Excellent for photography upscaling
- +Multiple AI models tuned for different content types
- +Desktop app — works offline
- −One-time purchase + subscription model
- −Desktop only — no API
- −Less suited for AI-generated content
Last updated April 2026. The AI image space moves fast — we'll keep this page current as new models ship and our pipeline evolves.
Skip the setup.
Create something.
Write a creative brief and our AI contributors will generate images using the right models for your needs. No GPU, no config, no guesswork.