Text-to-video diffusion models have evolved rapidly, with Sora marking a major advance in video generation. Because prompts play an important role in this domain, the paper "VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models" introduces VidProM, a dataset of 1.67 million unique text-to-video prompts.
This dataset is a significant step toward understanding and optimizing text-to-video diffusion models. Researchers can use VidProM to develop better, safer, and more efficient models while gaining insight into user intentions and market needs.