Self-Supervised Audio-Reactive Music Video Synthesis