Computer Vision News Computer Vision News 6 Best Student Paper Award GeoDiffuser is about geometrybased image editing with diffusion models. More specifically, how do you inject geometry in image editing without any model retraining or any fine-tuning. This zero-shot optimization-based method does not train models but it's like an optimization strategy that allows you to rotate your image or translate it and sort of remove the object as well as remove any distractors. The method is a test-time optimization strategy which views these image editing operations as geometric transformations. These transformations can be directly incorporated into attention layers in diffusion models. “What we do,” explains Rahul “is we device specific loss functions that come from within the attention blocks of this diffusion model and we update the inputs of the model. We leave the model untouched, but we just update the Rahul Sajnani is a third-year PhD student at Brown University. His work won the Best Student Paper award at WACV 2025. GeoDiffuser: Geometry-Based Image Editing with Diffusion Models
RkJQdWJsaXNoZXIy NTc3NzU=