How do Multimodal AI models work? Simple explanation | AssemblyAI Transcripts