Computer Vision News - December 2020
RL-CycleGAN 5 The authors make use of two important concepts from the ĞĞƉ >ĞĂƌŶŝŶŐ world which we are probably already familiar with: 1) the Domain Adaptation technique, and 2) Reinforcement Learning. Domain adaptation methods aim at training models using many examples from a source domain (here simulation) and few examples from a target domain (here reality). They can be based on a pixel-level adaptation, as does the LJĐůĞ' E , or on a feature-level adaptation. Let’s look at both of these techniques more in depth! ࣦ ீே ሺ ܩ ǡ ܦ ǡ ܺǡ ܻሻ ൌ ॱ ௬̱ ሾ ܦ ሺ ݕ ሻሿ As per the original paper, CycleGAN includes 2 GANs, here called ^ŝŵϮZĞĂů and ZĞĂůϮ^ŝŵ . These learn the mappings from the simulation to real domains and vice versa. The architecture also includes 2 adversarial discriminators. Below we report the mathematical formulations, assuming that X and Y are the two image domains (simulation and real). Generator Sim2Real Generation Real2Sim Adversarial discriminator 1 Adversarial discriminator 2 Adversarial Loss for Sim2Real ॱ ௫̱ ሾ ቀͳ െ ܦ ൫ ܩ ሺ ݔ ሻ൯ቁሿ Adversarial Loss for Real2Sim Cycle Consistency Loss ࣦ ீே ሺ ܨ ǡ ܦ ǡ ܻǡ ܺሻ ൌ ॱ ௫̱ ሾ ܦ ሺ ݔ ሻሿ ॱ ௬̱ ሾ ቀͳ െ ܦ ௫ ൫ ܨ ሺ ݕ ሻ൯ቁሿ ࣦ ௬ ሺ ܩ ǡ ܨ ሻ ൌ ॱ ௫̱ ೞ ݀൫ ܨ ൫ ܩ ሺ ݔ ሻ൯ǡ ݔ ൯ ॱ ௬̱ ೝೌ ݀൫ ܩ ൫ ܨ ሺ ݕ ሻ൯ǡ ݕ ൯ G: X -> Y F: Y -> X Dx: {x} from {F(y)} Dy: {y} from {G(x)} Sim2Real is trained by: Real2Sim is trained by: The last item in the table is the cycle consistency loss, used to ensure that, from the generated images, the original ones can always be recovered, i.e.: ݉݅݊ ீ ݉ܽ ݔ ೊ ࣦ ீே ሺ ܩ ǡ ܦ ǡ ܺǡ ܻሻ ݉݅݊ ி ݉ܽ ݔ ࣦ ீே ሺ ܨ ǡ ܦ ǡ ܻǡ ܺሻ ܩ ՜ ݔ ሺ ݔ ሻ ՜ ܨ ൫ ܩ ሺ ݔ ሻ൯ ൎ ݔ ǡ ܨ ՜ ݕ ሺ ݕ ሻ ՜ ܩ ൫ ܨ ሺ ݕ ሻ൯ ൎ ݕ More on CycleGAN
Made with FlippingBook
RkJQdWJsaXNoZXIy NTc3NzU=