Serverless Inference Model Library Image Generation 8 min read read

FLUX.1 Dev: specs, benchmarks, and how to run it on Lyceum

The 12B parameter open-weights image generation model by Black Forest Labs.

Caspar Lehmkühler

Caspar Lehmkühler

June 16, 2026 · Head of Product at Lyceum Technology

FLUX.1 Dev is a 12 billion parameter text-to-image model developed by Black Forest Labs, the team behind the original Stable Diffusion architecture. Distilled from the flagship FLUX.1 Pro, the Dev variant offers near-identical output quality and prompt adherence while being more efficient. It utilizes a hybrid architecture of multimodal and parallel diffusion transformer blocks, excelling at photorealism, complex human anatomy, and in-image text generation. Lyceum Technology serves FLUX.1 Dev via our OpenAI-compatible Serverless Inference API, allowing developers to generate high-quality images on GDPR-compliant, EU-hosted infrastructure with zero management overhead.

Get started: call FLUX.1 Dev on Lyceum

You can generate images with FLUX.1 Dev using Lyceum Technology's Serverless Inference API. Because this is a dedicated image generation model, you must use the standard images/generations endpoint via a direct HTTP POST request with Bearer authentication. Do not use the OpenAI chat completions endpoint.

import requests

response = requests.post(
 "https://api.lyceum.technology/api/v2/external/images/generations",
 headers={"Authorization": "Bearer <your lyceum api key>"},
 json={"model": "lyc-flux-1-dev", "prompt": "a sunset over the ocean", "aspect_ratio": "1:1"},
)
print(response.json()["image_url"])

Pricing and region for FLUX.1 Dev

FLUX.1 Dev is hosted on Lyceum's proprietary GPU infrastructure in the eu-north1 region, ensuring complete data residency for European teams. The model is priced at a flat rate of $0.005 per image. Because the API operates on a serverless, pay-per-use model, there are no base fees, no minimum commitments, and no idle compute costs. You are billed exclusively for the successful image generations you request.

By leveraging Lyceum's infrastructure, your engineering team can bypass the complexities of provisioning high-VRAM instances, configuring CUDA environments, or managing container orchestration. The API handles the heavy lifting of model loading and execution, returning the generated image URL directly in the response payload.

What FLUX.1 Dev is good at

State-of-the-art photorealism and anatomy

FLUX.1 Dev excels at generating highly realistic images that overcome the typical "AI look" associated with older diffusion models. It is particularly strong at rendering complex human anatomy, including hands, faces, and dynamic poses, which have traditionally been failure points for text-to-image models. The 12 billion parameter rectified flow transformer architecture provides the massive capacity needed to capture intricate visual details, lighting nuances, and natural textures.

Exceptional prompt adherence

Unlike earlier generation models that often ignore complex instructions or blend concepts together, FLUX.1 Dev closely follows detailed, multi-subject prompts. It accurately maps spatial relationships, specific colors, and distinct attributes to the correct subjects within the generated image. This makes it highly effective for professional workflows that require precise visual outputs based on strict creative briefs, such as advertising mockups or concept art.

In-image text generation

One of the standout capabilities of the FLUX.1 family is its ability to generate legible, accurate text directly within images. Whether you need a storefront sign, a product label, or stylized typography integrated into a graphic design, FLUX.1 Dev handles text rendering with a high degree of reliability. This significantly reduces the need for post-generation editing and allows teams to automate the creation of marketing assets that include specific copy.

Benchmarks and how it compares

FLUX.1 Dev benchmark results

FLUX.1 Dev consistently ranks at the top of open-weights image generation leaderboards, competing directly with, and often surpassing,closed-source proprietary models. On the Artificial Analysis Text-to-Image Arena, which uses blind human preference voting to calculate Elo scores, it achieves a rating that places it among the highest-performing models available globally.

Model Parameters Artificial Analysis Elo License
FLUX.1 Dev 12B ~1028 Non-commercial
Stable Diffusion 3.5 Large 8B ~1024 Community License
FLUX.1 Schnell 12B ~1000 Apache 2.0

Source: Artificial Analysis Text-to-Image Leaderboard (October 2024).

Comparison to sibling models

Compared to FLUX.1 Schnell, the Dev variant produces noticeably higher quality images with better prompt adherence and finer details, but requires significantly more compute time (30-50 steps vs. 1-4 steps). Compared to the closed-source FLUX.1 Pro, Dev offers near-identical quality, as it is guidance-distilled directly from the Pro version - but provides the transparency of open weights for researchers and developers who want to inspect the architecture or build custom workflows.

Using it in production

Production configuration for FLUX.1 Dev

When deploying FLUX.1 Dev in production, managing latency and infrastructure costs are the primary challenges. Because the model requires 30 to 50 steps for high-quality generation, it is highly compute-intensive. Lyceum Technology abstracts this complexity by serving the model via our Serverless Inference API, allowing you to bypass the need to provision, optimize, and manage dedicated H100 or A100 instances yourself.

API parameters and aspect ratios

The Lyceum API endpoint (https://api.lyceum.technology/api/v2/external/images/generations) accepts standard parameters such as prompt and aspect_ratio. For FLUX.1 Dev, it is highly recommended to keep the prompt descriptive and specific, as the model's strong prompt adherence will attempt to render every detail you specify. You can adjust the aspect ratio (e.g., 1:1, 16:9, 9:16) to suit your application's UI requirements, whether you are generating square profile pictures or widescreen marketing assets.

Pricing economics and scaling

On Lyceum, FLUX.1 Dev is priced at a flat rate of $0.005 per image. Unlike our text models which are categorized into Fast or Standard tiers, FLUX.1 Dev operates without a specific tier designation, focusing purely on high-quality generation. For a production application generating 10,000 images per month, the total inference cost would be exactly $50.00. Because the API is serverless, you do not pay for idle GPU time between requests. All generation occurs in the eu-north1 region, ensuring predictable performance without the overhead of maintaining a persistent GPU cluster.

Running FLUX.1 Dev on EU-sovereign infrastructure

Why run FLUX.1 Dev on Lyceum

For European AI startups and enterprise teams, data privacy and regulatory compliance are non-negotiable. Many popular image generation APIs route requests through US-based servers, creating potential GDPR and data residency risks. Lyceum Technology solves this by hosting FLUX.1 Dev entirely on our owned GPU infrastructure in the eu-north1 region.

GDPR compliance and data residency

When you generate images using Lyceum's Serverless Inference API, your prompts and generated outputs never leave Europe. This strict data residency ensures full GDPR compliance, making it safe to process sensitive creative briefs, proprietary product concepts, or user-generated content. We do not use your data to train our models, and our infrastructure is built from the ground up to support the upcoming EU AI Act requirements. European regulation is a competitive advantage, and we provide the compliance foundation you need.

OpenAI-compatible integration and cost advantages

Switching to Lyceum requires zero architectural changes if you are already using standard API patterns. By updating your endpoint URL and providing a Lyceum API key, you can immediately start generating images with FLUX.1 Dev. Furthermore, because Lyceum owns its hardware rather than renting from hyperscalers, we offer a structural pricing advantage. If your team eventually needs to fine-tune models or run custom workloads, you can provision a dedicated VM in 18 seconds.

To learn more about our infrastructure approach, read our guide on GDPR-compliant AI inference in Europe.

Frequently Asked Questions

What is the price of FLUX.1 Dev on Lyceum?

Lyceum Technology charges a flat rate of $0.005 per image generated using the FLUX.1 Dev model. Because our Serverless Inference API uses pay-per-use billing, there are no minimum commitments or base fees. You only pay for the exact number of images you successfully generate.

How do I call the FLUX.1 Dev API?

You can call FLUX.1 Dev using Lyceum's Serverless Inference API by making an HTTP POST request to the images/generations endpoint. Use the model string lyc-flux-1-dev and authenticate with your Lyceum API key via a Bearer token. Do not use the standard chat completions endpoint.

Where is my data processed when using FLUX.1 Dev?

All image generation requests for FLUX.1 Dev are processed on Lyceum Technology's owned GPU infrastructure in the eu-north1 region. This guarantees strict data residency within Europe, ensuring your prompts and generated images remain fully GDPR-compliant and never pass through US-based servers.

What is the difference between FLUX.1 Dev and FLUX.1 Schnell?

FLUX.1 Dev is a guidance-distilled model that requires 30 to 50 inference steps, producing state-of-the-art photorealism and prompt adherence. FLUX.1 Schnell is optimized for speed, requiring only 1 to 4 steps, but trades off some fine detail and quality. Schnell also uses an Apache 2.0 license.

Can I use FLUX.1 Dev for commercial purposes?

FLUX.1 Dev is released under a non-commercial license by Black Forest Labs. This means you cannot use the model weights to power a commercial API or service. However, the outputs, the images you generate, are not subject to this restriction and can be used for commercial projects.

What are the VRAM requirements for self-hosting FLUX.1 Dev?

At 12 billion parameters, FLUX.1 Dev requires significant compute resources. Running it at full FP16 precision requires over 24GB of VRAM, typically necessitating an NVIDIA A100 or H100 GPU. While FP8 quantization can reduce this footprint, using Lyceum's Serverless Inference API bypasses these hardware requirements entirely.

Related Resources

/magazine/image-ultra; /magazine/flux-2-klein; /magazine/wan-image