
Google Launches Muse, A New Text-to-Image Transformer Model






Since the beginning of 2021, numerous text-to-image models powered by deep learning, including Midjourney, Stable Diffusion, and DALL-E 2, to name a few, have reshaped the landscape of AI research. Google's Muse, a text-to-image Transformer model that aims for state-of-the-art image generation performance, is another name to add to the list.

Muse is a new text-to-image model created by Google Research. It is intended to produce images comparable in quality to those from current models, but it is reported to be faster and more efficient. It is trained on a sizable dataset of text-image pairs and operates in a compressed, discrete latent space. It is meant to offer image synthesis capabilities for a variety of purposes, from turning complicated concepts into graphics to creating images from text descriptions.




Muse is trained on a masked modelling task in discrete token space: given the text embedding extracted from a pre-trained large language model (LLM), it learns to predict randomly masked image tokens. According to its authors, Muse is more efficient than pixel-space diffusion models such as Imagen and DALL-E 2 because it uses discrete tokens and requires fewer sampling iterations. By iteratively resampling image tokens conditioned on a text prompt, the model also supports zero-shot, mask-free editing at no extra cost.
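To make the idea concrete, here is a minimal toy sketch in Python of the two pieces described above: randomly masking discrete image tokens (the training-time corruption) and iteratively filling in a fully masked token grid at sampling time. It is not Muse's actual code. The names `MASK`, `mask_tokens`, `toy_predictor`, and `iterative_decode` are hypothetical, the predictor is a random stand-in for the Transformer, and the cosine unmasking schedule is an assumption made for illustration.

```python
import math
import random

MASK = -1  # hypothetical sentinel id for a masked image token


def mask_tokens(tokens, mask_ratio, rng):
    """Replace a random subset of image tokens with MASK (training-time corruption)."""
    n_mask = max(1, round(mask_ratio * len(tokens)))
    masked_positions = set(rng.sample(range(len(tokens)), n_mask))
    corrupted = [MASK if i in masked_positions else t for i, t in enumerate(tokens)]
    return corrupted, sorted(masked_positions)


def toy_predictor(corrupted, text_embedding, vocab_size, rng):
    """Stand-in for the Transformer: one (token, confidence) guess per position.

    A real model would condition on text_embedding and on the unmasked tokens;
    the guesses here are random so that the sketch stays self-contained.
    """
    return [(rng.randrange(vocab_size), rng.random()) for _ in corrupted]


def iterative_decode(num_tokens, text_embedding, vocab_size, steps, rng):
    """Start fully masked; at each step commit the most confident predictions."""
    tokens = [MASK] * num_tokens
    for step in range(1, steps + 1):
        masked = [i for i, t in enumerate(tokens) if t == MASK]
        if not masked:
            break
        guesses = toy_predictor(tokens, text_embedding, vocab_size, rng)
        # Assumed cosine schedule: how many tokens may remain masked after this step.
        keep_masked = math.floor(num_tokens * math.cos(math.pi / 2 * step / steps))
        n_commit = max(1, len(masked) - keep_masked)
        by_confidence = sorted(masked, key=lambda i: guesses[i][1], reverse=True)
        for i in by_confidence[:n_commit]:
            tokens[i] = guesses[i][0]
    return tokens


if __name__ == "__main__":
    rng = random.Random(0)
    corrupted, positions = mask_tokens(list(range(10)), mask_ratio=0.5, rng=rng)
    print("masked positions:", positions)
    image_tokens = iterative_decode(16, text_embedding=None, vocab_size=8192, steps=8, rng=rng)
    print("tokens still masked:", image_tokens.count(MASK))
```

Because many tokens are committed in parallel at every step, the whole grid is filled in a small, fixed number of passes (8 in this toy run), which is the intuition behind the claim of needing fewer sampling iterations.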

Model Architecture:




More information: https://muse-model.github.io





