GeenStijl: Un-i-ma-gi-na-ble. OpenAI launches first (!) "text-to-video model" Sora. Already indistinguishable from real video

And the company hasn't even received that $7 TRILLION investment [$7,000,000,000,000, 7 times Dutch GDP] yet.

Just 32 words = these photorealistic 17 seconds

Prompt: “A movie trailer featuring the adventures of the 30 year old space man wearing a red wool knitted motorcycle helmet, blue sky, salt desert, cinematic style, shot on 35mm film, vivid colors.” pic.twitter.com/0JzpwPUGPB
— OpenAI (@OpenAI) February 15, 2024

Good morning this morning, and you are witnessing the first real quantum leap since OpenAI's ChatGPT. We make that claim without hesitation because we have followed every AI development closely here in the dossier "De Toekomst Van".

There you followed the steady progress of AI's applications, but what you see above and below is of an entirely different order. For context, the previous stop for text-to-video was roughly Runway's Gen-2 and OpenAI's own DALL·E 3, which came out just four months (!) ago and delivered very short, subtly moving """videos""".

But now OpenAI is launching its first actual text-to-video model, called Sora:

"Sora can generate videos up to a minute long while maintaining visual quality and adherence to the user’s prompt. (...) Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.

The model has a deep understanding of language, enabling it to accurately interpret prompts and generate compelling characters that express vibrant emotions. Sora can also create multiple shots within a single generated video that accurately persist characters and visual style."

About the weaknesses of what is only their very first model, they write:

"The current model has weaknesses. It may struggle with accurately simulating the physics of a complex scene, and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark.

The model may also confuse spatial details of a prompt, for example, mixing up left and right, and may struggle with precise descriptions of events that take place over time, like following a specific camera trajectory."

More unimaginable footage after the break.

On top of everything else, also note the movement of the earrings

Prompt: “A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. she wears a black leather jacket, a long red dress, and black boots, and carries a black purse. she wears sunglasses and red lipstick. she walks confidently and casually.… pic.twitter.com/cjIdgYFaWq
— OpenAI (@OpenAI) February 15, 2024

For God's sake, look at the texture and movement of the 'water'

If you think OpenAI Sora is a creative toy like DALLE, ... think again. Sora is a data-driven physics engine. It is a simulation of many worlds, real or fantastical. The simulator learns intricate rendering, "intuitive" physics, long-horizon reasoning, and semantic grounding, all… pic.twitter.com/pRuiXhUqYR
— Jim Fan (@DrJimFan) February 15, 2024

Flawless motor skills of a dog in complex motion. Interaction with the environment is indeed not perfect yet

Wow..

OpenAI' Sora Text-to-Video.

Prompt: The camera directly faces colorful buildings in burano italy. An adorable dalmation looks through a window on a building on the ground floor. Many people are walking and cycling along the canal streets in front of the buildings.

1/6 pic.twitter.com/5ZKsHXx23d
— Yama 🌴 (@Yamapama) February 15, 2024

This is already indistinguishable from the real thing

And it's able to generate videos of up to 1 minute!

Prompt: This close-up shot of a Victoria crowned pigeon showcases its striking blue plumage and red chest. Its crest is made of delicate, lacy feathers, while its eye is a striking red color. The bird’s head is tilted slightly… pic.twitter.com/b78fRGsFBc
— Kris Lukanov (@HDRobots) February 15, 2024

Even complex slow motion is virtually flawless

The photorealism is on another level.

Prompt: A litter of golden retriever puppies playing in the snow. Their heads pop out of the snow, covered in. pic.twitter.com/P1eTsMLEDG
— Kris Lukanov (@HDRobots) February 15, 2024

Future historical video

Old school movies will not be a problem also.

Prompt: Historical footage of California during the gold rush. pic.twitter.com/LhtVspPiAQ
— Kris Lukanov (@HDRobots) February 15, 2024

This is ex-treme-ly complex water-reflection work

So it turns out that OpenAI's Sora is even more insane than I thought. Here are some of the absolutely nutty things it can do ⬇️ pic.twitter.com/EugpRM71qt
— Joseph Mambwe (@MrMambwe) February 16, 2024

And that too

Sora can also seamless transition between two completely unrelated videos ( the middle videos) ... man, Ima need a cold shower pic.twitter.com/7qF0zPJ7Lg
— Joseph Mambwe (@MrMambwe) February 16, 2024

Guys, this is version 1 (! ! ! !), remember.

this could be the "holy shit" moment of AI. OpenAI has just announced Sora, its text-to-video AI model. This video isn't real, it's based on a prompt of "a cat waking up its sleeping owner demanding breakfast..." 🤯 https://t.co/xKy3iQBKwT pic.twitter.com/HPm2p1jbgo
— Tom Warren (@tomwarren) February 15, 2024

Version 1 (! ! !) still struggles with complex object interaction, such as the candles actually going out when she blows

The only weakness is generating complex interactions between muiltiple objects and characters. pic.twitter.com/3ZVlT5jRKO
— Kris Lukanov (@HDRobots) February 15, 2024

Even its mistakes are beautiful

even the sora mistakes are mesmerizing pic.twitter.com/OvPSbaa0L9
— Charlie Holtz (@charlieholtz) February 15, 2024

haha

21-year-old woman with dark hair playing a piano as it drives down the road. cinematic lighting, 35mm film, vivid colors https://t.co/JzBW6XD2Ym
— frye (@___frye) February 15, 2024

Tags: OpenAI, ChatGPT, Sora, text-to-video

@Spartacus | 16-02-24 | 09:00 | 196 comments
