Awesome-Agentic-Engineering

🎨 Creative AI

Audience: practitioners Β· Evidence class: mixed

Descriptions in this appendix prioritise workflow characteristics and deployment constraints over superlatives or leaderboard language.

Image Generation

Tool Description
Midjourney v7 Strong stylistic control with Discord and web-based image workflows.
DALL-E 3.5 ChatGPT-integrated image generation with reliable prompt following and text rendering.
FLUX 2 Open-weight image models aimed at photorealistic output and flexible deployment.
Stable Diffusion 3.5 Open-source. ControlNet, LoRAs, ComfyUI ecosystem.
Adobe Firefly 3 Licensed data only. Commercial indemnification. Photoshop.
Google Imagen 4 Photorealistic image generation exposed through Google’s AI tooling.
Ideogram v3 Strong text rendering for logos, posters, and design-heavy image outputs.
Leonardo AI Multi-model. Realtime Canvas. 3D gaming assets. Canva-owned.
Recraft Design-focused. Vector art, brand consistency.

Video Generation

Tool Description
Sora 2 Video generation focused on longer-form scene consistency and motion realism.
Google Veo 3.1 High-fidelity video generation with native audio and Vertex AI integration.
Runway Gen-4.5 Video generation and editing suite with motion-directed controls.
Kling 3.0 Longer-form 4K video generation with API access and native audio.
Seedance 2.0 Quad-modal input. Lip sync. 2K resolution.
Pika 2.5 Beginner-friendly. Pikaswaps. Fast renders.
Luma Dream Machine 4K HDR. Physics simulation. 3D/cinematic.
HaiLuo AI Budget video. 10 free/day. MiniMax.
Wan 2.1 Open-source video generation model suitable for self-hosted experimentation.
HunyuanVideo Tencent OSS. Consumer GPU. Multi-style.
LTX Video OSS. Licensed data. Clear commercial terms.

Music and Audio

Tool Description
Suno Text-to-song generation with full tracks and vocal production.
Udio High-fidelity music gen. Fine control.
ElevenLabs Music Vocals, instrumentals. Sectional editing. Stem separation.
Stable Audio High-quality. Commercial license.
Meta AudioCraft OSS. MusicGen + AudioGen.

3D and Design

Tool Description
Meshy Text/image to 3D. Game assets, products.
Tripo AI Fast 3D from text/images. Multi-format export.
Vizcom Real-time AI rendering for industrial designers.