CVPR Daily - Friday

12 DAILY CVPR Friday During the evaluation stage, the team discovered that Unified-IO 2 could perform well in tasks they had not initially targeted, such as video tracking and some embodied tasks. They will showcase these surprising results with iPad demonstrations at their poster session. “We’ve tested the model multiple times, but maybe only with a few modalities and target tasks,” Charles reveals. “It’s a surprise that the model is so good at other tasks we’ve not focused on before. There are lots of interesting behaviors of the models and some very cool visualizations that the model can follow some novel instructions.” The paradigm behind Unified-IO 2, where all modalities are integrated into a single transformer without relying on external unimodal models, is a promising direction for future AI research. “It’s in contention with other ways of training generalist models, and people are still exploring and building on that,” Christopher adds. “I think Unified-IO 2, in particular, has a lot of modalities and tasks and really pushes that way of building models to an extreme.” To learn more about the team’s work, visit Poster Session 6 & Exhibit Hall (Arch 4A-E) from 17:15 to 18:45 [Poster 222]. Highlight Presentation

RkJQdWJsaXNoZXIy NTc3NzU=