DR

Open SourceImage Generationby Google Research

DreamBooth

DreamBooth is a fine-tuning technique for text-to-image models. Using just a few photos of a subject, it teaches a diffusion model to generate that specific person, pet or object in new scenes, styles and contexts.

dreamboothfine-tuningimage-generationopen-source-aipersonalized-aistable-diffusion

View on GitHub

Quick facts

TypeFine-Tuning Technique

InputA Few Photos

Works withDiffusion Models

ByGoogle

No ratings yet — be the first

Type

Fine-tune method

personalization

Input

A few photos

subject

Runs on

Consumer GPU

minutes-1hr

Google

research

What is DreamBooth?

DreamBooth is a fine-tuning technique for personalising text-to-image diffusion models, introduced by Google researchers. It is not a model itself but a method: given just a handful of photos of a specific subject — your face, your pet, a particular product or object — DreamBooth teaches a model like Stable Diffusion to recognise that exact subject and then generate it in entirely new scenes, poses, styles and contexts. This 'subject-driven generation' lets you put a specific person or item into AI imagery that never existed, with remarkable likeness.

How it works

DreamBooth fine-tunes the diffusion model so a unique identifier token (a rare word) becomes bound to your subject. You supply a few images of the subject paired with a prompt like 'a photo of [V] dog', and the model learns to associate that token with the subject's appearance. A 'prior-preservation' loss is used to stop the model from forgetting the broader class (e.g. dogs in general) while it learns your specific one. Afterward, you prompt with the identifier in new contexts ('[V] dog on the moon') to generate the subject anywhere.

What it is good at

DreamBooth is ideal for personalised, subject-consistent image generation: custom avatars and portraits, putting a product into varied marketing scenes, character consistency for stories or games, pet portraits, and creative edits featuring a specific person or object. Its strength is strong likeness from very few images combined with the full creative range of the underlying text-to-image model — the subject can appear in any style or setting you can prompt.

Licensing & access

DreamBooth is an openly published technique, widely implemented in open-source tools — most popularly via the Hugging Face Diffusers library and many community trainers. Because it fine-tunes a base model, the licence of the underlying model (e.g. Stable Diffusion's) governs use of the result. Training a DreamBooth model is lightweight enough to run on a single consumer GPU in minutes to an hour, and LoRA-based variants make it even cheaper.

Practical considerations

Results depend on good input photos (varied, clear shots of the subject) and balanced training — too little and likeness is weak, too much and the model overfits (rigid poses, artefacts) or forgets the general class. There are serious ethical and consent issues: only personalise on subjects you have the right to use, never create misleading images of real people, and disclose synthetic imagery. The result is also tied to the base model's licence and capabilities.

How it compares

DreamBooth personalises subjects; ControlNet instead adds structural control (pose, edges, depth) to generation; Pix2Pix does paired image translation. They are complementary tools on top of diffusion models: you might DreamBooth a subject and use ControlNet to control its pose. Compared with lighter alternatives like textual inversion or LoRA-only tuning, DreamBooth often gives stronger likeness at the cost of more training — and is frequently combined with LoRA for efficiency.

Getting started

Use a Diffusers DreamBooth training script: gather a few good photos of your subject, pick a unique identifier token and a base model (e.g. Stable Diffusion), and fine-tune on a single GPU (LoRA-based DreamBooth keeps it cheap). Then prompt with your identifier in new scenes to generate the subject. Use prior preservation to retain the general class, avoid overfitting, and only personalise subjects you have consent to use.

Capabilities

🧑

Subject binding

Binds a unique token to a specific subject from a few photos.

🎨

New-context generation

Places the learned subject in any prompted scene, pose or style.

🛡️

Prior preservation

Keeps the model's general class knowledge while learning your subject.

⚡

Lightweight training

Fine-tunes on a single GPU, and pairs with LoRA for efficiency.

Pros & Cons

Pros6

Strong subject likeness from a few photos
Generates a subject in any new scene or style
Runs on a single consumer GPU
Widely supported (Diffusers, community tools)
Combines with LoRA for efficiency
Openly published, flexible technique

Cons4

A technique, not a standalone model
Needs good, varied input photos
Can overfit or forget the general class
Consent/ethics: only personalise with rights

Inspiration

DreamBooth use cases & project ideas

Custom avatars

Generate your likeness in art.

Product scenes

Place a product anywhere.

Pet portraits

Your pet in new styles.

Character consistency

Keep a subject consistent.

FAQ

Frequently asked questions

What is DreamBooth?+

A fine-tuning technique that teaches a text-to-image diffusion model a specific subject from a few photos, so it can generate that subject in new scenes.

How many images do I need?+

Does it need a lot of compute?+

What licence applies to results?+

What about ethics?+

More to explore

Learn more

From our blog

Tutorials