From the course: AI Trends
OpenAI Sora: Text-to-video
- Imagine you could describe a video and have it seemingly appear out of thin air. Well, OpenAI's Sora allows you to do just that. Sora is a text-to-video diffusion model. It receives a prompt, starts off with a noisy sequence, then iteratively removes noise until it has a clean and crisp video. Sora leans heavily on the transformer architecture, a model architecture that has yielded incredible technologies, such as ChatGPT and DALL-E. While this is extremely exciting, there is some room for caution as this technology could be used to generate misinformation. Now, at the time of watching this, you may not yet have access to Sora or a video generation tool. There are some ways, however, you can get ready for this technology. For starters, you can work on your prompt engineering skills using a text-to-image generation tool. These skills will likely carry over to something like Sora. Another thing you can do is get familiar with the vocabulary of cinematography. Find out the names of different camera angles. Figure out which cameras and lenses were used to shoot your favorite film. As someone who creates video and tells stories through video, I'm extremely excited for this technology and cannot wait to see what it will bring about.
Contents
-
-
-
Microsoft Build 2024: New computers and developer tools6m 45s
-
NPUs vs. GPUs vs. CPUs2m 45s
-
New Google Gemini Models and Google I/O Announcements4m 44s
-
GPT-4o, multimodal AI, and more5m 4s
-
OpenAI Sora: Text-to-video1m 34s
-
Google Gemini3m 40s
-
Multimodal prompting3m 11s
-
Assistant GPTs3m 21s
-
Claude4m 8s
-
OpenAI API3m 21s
-
Microsoft Security Copilot3m 20s
-
Bing and OpenAI2m 51s
-
AI agents6m 4s
-
The LLM landscape2m 43s
-
Google AI products: Bard, PaLM, and more3m 56s
-
PaLM 2 and Bard3m 8s
-
AI regulations6m 48s
-
Azure AI Studio6m 28s
-
General artificial intelligence3m 43s
-
ChatGPT plugins3m 41s
-
GPT-45m 7s
-
ChatGPT3m 54s
-
Prompt engineering3m 25s
-
-
-