image-gen
links#
ComfyUI is integrating external API models and demonstrate gpt-image-1
integrated with a ComfyUI workflow. The catch for now is you can’t supply your own API key/endpoint and rather require a ComfyUI account that you load with credits. Pricing reflects OpenAI list price.
The success of the new ChatGPT 4o image generation caused the rollout to be delayed but it’s now available to free users, rate limited to 3 generations per day.





New image model family: Janus-Pro - DeepSeek creators just dropped a stable diffusion competitor.
Janus-Pro, which DeepSeek describes as a “novel autoregressive framework,” can both analyze and create new images… and most Janus-Pro models can only analyze small images with a resolution of up to 384 x 384.
FLUX dropped and it’s blows Stable Diffusion 3 out of the water, though has very high resource requirements. I’m running the schnell version locally. Prompt adherence is great, text capability is incredible.
Stability AI holds on, appointing a new CEO.
SD3 weights dropped last night. I gave it a shot last night myself with their supplied comfyui workflows, as a base model it looks extremely promising, details are next level, though it still doesn’t appear to know jack about hands, faces still need hires fix. Very promising for a base model.
Stable Diffusion 3: Research Paper
Stable Diffusion 3 outperforms state-of-the-art text-to-image generation systems such as DALL·E 3, Midjourney v6, and Ideogram v1 in typography and prompt adherence, based on human preference evaluations.