Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

Ask HN: Any tool to generate AI images with exact same uploaded product images?

Jun 30, 2024 - news.ycombinator.com
The article discusses the use of different methodologies for incorporating new concepts into Stable Diffusion, specifically adding images of people. The methods mentioned include Hypernetworks, Textual Inversion, Dreambooth, and a more recent technique, LoRa. The author found Dreambooth to be the most effective for images of people, while Hypernetworks and Textual Inversion were less successful.

The author also shares their personal experience in creating models using an Nvidia RTX 3060 card, which often required hours or even a day to produce dozens of decent pictures. The quality of the output was largely dependent on the quality and variety of the input photos, with different angles and types of shots needed for each subject. The author also noted that the output often mirrored the most common features in the input photos, such as glasses.

Key takeaways:

  • There are various methodologies for adding new concepts to Stable Diffusion, including Hypernetworks, textual inversion, dreambooth, and a more recent technique, LoRa.
  • Dreambooth is considered better than hypernetworks and textual inversion for images of people.
  • Creating each model using an Nvidia RTX 3060 card can take hours or even a day, but can yield dozens of decent pictures of each subject.
  • The quality of the output depends heavily on the quality and variety of the input photos, including different angles, body shots, and whether the subject is wearing glasses or not.
View Full Article

Comments (0)

Be the first to comment!