image-gen
links
New image model family: Janus-Pro - the creators of DeepSeek just dropped a Stable Diffusion competitor.
Janus-Pro, which DeepSeek describes as a “novel autoregressive framework,” can both analyze and create new images… [and] most Janus-Pro models can only analyze small images with a resolution of up to 384 x 384.
FLUX dropped and it blows Stable Diffusion 3 out of the water, though it has very high resource requirements. I’m running the schnell version locally. Prompt adherence is great, and its text rendering is incredible.
Stability AI holds on, appointing a new CEO.
SD3 weights dropped last night. I gave it a shot myself with their supplied ComfyUI workflows. Details are next level, though it still doesn’t appear to know jack about hands, and faces still need a hires fix. Very promising for a base model.
Stable Diffusion 3: Research Paper
Stable Diffusion 3 outperforms state-of-the-art text-to-image generation systems such as DALL·E 3, Midjourney v6, and Ideogram v1 in typography and prompt adherence, based on human preference evaluations.