Text-to-Image
Diffusion Single File
comfyui
anime
Raehoshi Anima

Overview

Raehoshi Anima is an enhanced iteration built upon the Anima Base v1.0 architecture. This release focuses on elevating visual style, integrating extensive new concepts, and expanding character knowledge. The ultimate goal is to deliver a more polished, balanced, and visually stunning output while remaining faithful to the core strengths of the base model.

Model Details

  • Developed by: Raelina
  • Model type: Diffusion-based text-to-image generative model
  • Finetuned from: Anima
  • Prompt style: Booru-style tags and Natural Language

Installation & Requirements

Important Note: This model does not include a built-in Text Encoder or VAE. You must download these components separately to achieve the intended results.

File Placement Guide

  • raehoshi-anima-v1.0.safetensors goes in ComfyUI/models/diffusion_models
  • qwen_3_06b_base.safetensors goes in ComfyUI/models/text_encoders
  • qwen_image_vae.safetensors goes in ComfyUI/models/vae

Recommended settings

  • Sampler: Euler a or ER SDE
  • Scheduled type: Beta or Normal
  • Sampling steps: 32
  • CFG: 4-5
  • Resolution: Any resolution up to 1536 (ensure dimensions are divisible by 32)
  • Positive prompts:
masterpiece, best quality, score_7, absurdres
  • Negative prompts:
worst quality, low quality, score_1, score_2, score_3, artist name, blurry, jpeg artifacts, bad anatomy, bad hands, bad proportions, mutation, deformed, extra digits, fewer digits, missing arms, missing legs

Prompting Tips

  • Tag Ordering: For the most consistent results, follow this structured prompt order: [Quality / Meta / Year / Safety tags] ➔ [1girl / 1boy / Character Count] ➔ [Character Name] ➔ [Series / Copyright] ➔ [Artist] ➔ [General Tags]
  • Character Accuracy: Always include the official series/copyright tags alongside the character name to significantly improve details and accuracy.
  • Hybrid Prompting: The model handles hybrid prompting seamlessly. Feel free to mix dan match danbooru-style tags with natural language descriptions (e.g., use tags for characters and natural language for background/action).

Training Details

Raehoshi Anima was trained using a custom personal fork of Diffusion-pipe across a comprehensive two-stage fine-tuning process. The dataset utilizes multi-level captioning with random selection and tag dropout to ensure flexibility.

Stage 1: Concept & Character Expansion

  • Dataset Size: ~25k images
  • Trained Resolution: 1024x1024
  • Hardware: NVIDIA RTX PRO 6000 (96GB VRAM)
  • Batch Size: 32
  • Gradient accumulation steps: 1
  • Learning Rate: 1.5e-6 (LLM Adapter LR: 2e-7)
  • Focus: Introducing new franchises, series, and character knowledge.

Stage 2: Aesthetic & Style Refinement

  • Dataset Size: ~1k high-curation images
  • Trained Resolution: Multi-aspect (1024x1024 & 1536x1536)
  • Hardware: NVIDIA RTX PRO 6000 (96GB VRAM)
  • Batch Size: Per-resolution batch size (24-1536x1536) & (48-1024x1024)
  • Gradient accumulation steps: 1
  • Learning Rate: 1e-6 (LLM Adapter LR: 0)
  • Focus: Mitigating artifacts, balancing composition, and enhancing the overall visual style.

List of New Series/Characters Trained:

Expanded Knowledge Base (Up to May 2026)

The model’s character and lore library has been updated to include the latest data for:

  • Zenless Zone Zero
  • Wuthering Waves
  • Honkai: Star Rail
  • Genshin Impact
  • Arknights: Endfield
  • Neverness to Everness

For character trait details prompts, please refer to the Danbooru site for accurate tags and references.

Support the Development

If you love using this model and want to help fund future iterations, consider supporting the project buy me a coffee via Ko-Fi

License

CircleStone Labs Non-Commercial License

Downloads last month
24
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Raelina/Raehoshi-Anima

Finetuned
(56)
this model