Right after you switch pages, the page is rendered with CSR (client-side rendering).
Please reload your browser to see how it works.
In theory you can also animate such scenes, but how to actually do that is still a research problem.
Whether this will end up being better than well-optimized polygon-based systems like Nanite+photogrammetry is also an open question. The existing polygon pipelines are pretty damn good already.
- 3D scene reconstruction from a few images: https://dust3r.europe.naverlabs.com/
- Gaussian avatars: https://shenhanqian.github.io/gaussian-avatars
- relightable Gaussian codec: https://shunsukesaito.github.io/rgca/
- track anything: https://co-tracker.github.io/ https://omnimotion.github.io/
- segment anything: https://github.com/facebookresearch/segment-anything
- good human pose estimation models: YOLOv8, Google's MediaPipe models
- realistic TTS: https://huggingface.co/coqui/XTTS-v2, Bark TTS (hit or miss)
- great open STT (mostly Whisper-based)
- machine translation (e.g. SeamlessM4T from Meta)
It's crazy to see how much is coming out of Meta's R&D alone.