Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch. The main novelty seems to be an extra layer of indirection with the prior network (whether it is an autoregressive transformer or a diffusion network), which predicts an image embedding based on the text embedding from CLIP. Specifically, this repository will only build out the diffusion prior network, as it is the best performing variant (but which incidentally involves a causal transformer as the denoising network) To train DALLE-2 is a 3 step process, with the training of CLIP being the most important. To train CLIP, you can either use x-clip package, or join the LAION discord, where a lot of replication efforts are already underway. Then, you will need to train the decoder, which learns to generate images based on the image embedding coming from the trained CLIP.

Features

  • Generate the DALL-E2 images from text
  • You can also train the decoder on images of greater than the size (say 512x512)
  • For the layperson, no worries, training will all be automated into a CLI tool
  • Training on Preprocessed CLIP Embeddings
  • Alternatively, you can also use Open Clip
  • Inpainting is also built into the Decoder

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow DALL-E 2 - Pytorch

DALL-E 2 - Pytorch Web Site

Other Useful Business Software
Infor M3 ERP Icon
Infor M3 ERP

Enterprise manufacturers and distributors requiring a solution to manage and execute complex processes

Efficiently executing the complex processes of enterprise manufacturers and distributors. Infor M3 is a cloud-based, manufacturing and distribution ERP system that leverages the latest technologies to provide an exceptional user experience and powerful analytics in a multicompany, multicountry, and multisite platform. Infor M3 and related CloudSuite™ industry solutions include industry-leading functionality for the chemical, distribution, equipment, fashion, food and beverage, and industrial manufacturing industries. Staying ahead of the competition means staying agile. Our new capabilities bring improved data-driven insights and streamlined workflows to help you make informed decisions and take quick action.
Learn More
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of DALL-E 2 - Pytorch!

Additional Project Details

Programming Language

Python

Related Categories

Python AI Image Generators, Python Generative AI, Python Inpainting Tool

Registered

2022-08-01