Submitted by weimin wang 37 Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation Character.AI 1.7k 12