Thursday, June 19, 2025
HomeTechnologyProduct images utilizing AI — Planningo’s tech notes | Planning Co., Ltd....

Product images utilizing AI — Planningo’s tech notes | Planning Co., Ltd. | January 2024



Planning Co., Ltd.There are many picture era AIs as of late, however if you wish to management not solely textual content but in addition pictures, you don’t have any alternative however to make use of Control-net’s steady dissemination.We have created a service to stably disseminate product pictures.We will proceed to analysis and make the most of them sooner or later. AI mannequin for creating product content material

There has been information for years that AI can create photo-realistic pictures. However, not too long ago, AI modules have appeared that may generate sensible and pure pictures.

Notable examples embody DALL-E, Midjourney, and Stable Diffusion. Major firms resembling Google and Meta are additionally reportedly growing their very own AI methods for picture era.

Planningno, which focuses on creating product content material, performed a survey to seek out out which AI can be utilized and how you can simply create product content material. If product content material might be simply created utilizing AI, it’s anticipated that will probably be broadly used when creating detailed product pages and promotional supplies for on-line shops.

The consultant text-to-image fashions we now have recognized are:

DALL-E is a text-to-image mannequin created by OpenAI, well-known for its chat-gpt mannequin.

Image created from the DALL-E mannequin

This mannequin creatively generates pictures that match particular sentences or phrases. For instance, if the textual content enter is “banana-shaped couch”, a picture of a banana-shaped couch in the lounge shall be generated, as proven within the picture above. DALL-E can rework textual content into visible components to create inventive and distinctive pictures.

This mannequin is an AI mannequin that makes use of generative adversarial networks (GAN).

Image comprised of mid-journey

GANs are used to generate sensible pictures which might be tough to differentiate from the actual factor, and Midjourney makes use of this expertise to create sensible and artistic pictures. In the previous, an AI-generated work prompted controversy when it was chosen because the profitable work, however that work was created throughout a visit.

Stable Diffusion is a picture era mannequin developed by Stability AI that focuses on steady picture era. This mannequin goals to scale back the instability points that happen through the picture era course of and obtain extra constant picture era.

Image created from Photio’s steady diffusion mannequin

Furthermore, it’s also outfitted with Control-net that may partially management the picture era AI.

After contemplating the traits of those fashions, we determined to undertake the steady diffusion mannequin for our service. It is obvious that different picture era fashions even have good efficiency.

So why did we select steady diffusion?

In actuality, I had to make use of steady diffusion.

Product content material era requires image-to-image conversion, not text-to-image conversion.

To create product content material, it’s good to generate a pure background across the product based mostly on the enter product picture. Therefore, a picture should be current as enter.

Control-net was additionally essential. Through Control-net, Planningo was ready to make use of steady diffusion fashions to create extra product-specific pictures.

What is management web?

controlnet canny mannequin pattern picture — from controlnet git hub

As proven within the picture above, Control-net is a mannequin that permits you to management how Stable Diffusion generates pictures.

We wished to create a extra pure background across the product, producing a picture that would seem as if the product was initially positioned in that background. I used his Canny mannequin from Control-net to generate the pictures.



Source hyperlink

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Most Popular