Multimodal AI Essentials: Merging Text, Image, and Audio for Next-Generation AI Applications
.MP4, AVC, 1280x720, 30 fps | English, AAC, 2 Ch | 5h 33m | 1.08 GB
Instructor: Sinan Ozdemir
This course shows you how combining modalities such as text, audio, video, and images enables AI systems to achieve remarkable capabilities. Gain hands-on experience building visual question-answering models, generating personalized images with diffusion models, designing end-to-end multimodal applications, and even fine-tuning multimodal models for specific tasks. By the end, you will have the tools, knowledge, and confidence to design and deploy your own state-of-the-art multimodal AI systems.
Learning objectives
- Apply multimodal AI concepts.
- Build a voice-to-voice app.
- Apply visual question answering (VQA) concepts and architectures (a minimal sketch follows this list).
- Construct, fine-tune, and evaluate diffusion models with DreamBooth.
- Fine-tune a text-to-speech model with SpeechT5 (a second sketch follows this list).
- Build visual agents from the ground up.
- Evaluate the performance of multimodal models.
- Extend multimodal systems with advanced techniques like computer use.
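
To give a flavor of the VQA objective above, here is a minimal inference sketch using the publicly available ViLT checkpoint on the Hugging Face Hub (dandelin/vilt-b32-finetuned-vqa). The model choice and example image URL are illustrative assumptions, not the specific stack used in the course:

```python
# Minimal VQA inference sketch (illustrative; not the exact course code).
import requests
from PIL import Image
from transformers import ViltProcessor, ViltForQuestionAnswering

# Any RGB image works; this COCO image URL is just an example.
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
question = "How many cats are in the picture?"

# ViLT fine-tuned on VQAv2, available on the Hugging Face Hub.
processor = ViltProcessor.from_pretrained("dandelin/vilt-b32-finetuned-vqa")
model = ViltForQuestionAnswering.from_pretrained("dandelin/vilt-b32-finetuned-vqa")

# Encode the image-question pair and pick the highest-scoring answer label.
inputs = processor(image, question, return_tensors="pt")
logits = model(**inputs).logits
print("Answer:", model.config.id2label[logits.argmax(-1).item()])
```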
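
Similarly, the SpeechT5 objective can be previewed with the stock microsoft/speecht5_tts checkpoint and its HiFi-GAN vocoder. The speaker-embedding dataset, sample index, and output filename below are assumptions for illustration; the course covers fine-tuning on your own data:

```python
# Minimal SpeechT5 text-to-speech sketch (illustrative; fine-tuning is covered in the course).
import torch
import soundfile as sf
from datasets import load_dataset
from transformers import SpeechT5Processor, SpeechT5ForTextToSpeech, SpeechT5HifiGan

processor = SpeechT5Processor.from_pretrained("microsoft/speecht5_tts")
model = SpeechT5ForTextToSpeech.from_pretrained("microsoft/speecht5_tts")
vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")

# A pre-computed x-vector stands in for a learned speaker embedding.
embeddings = load_dataset("Matthijs/cmu-arctic-xvectors", split="validation")
speaker_embedding = torch.tensor(embeddings[7306]["xvector"]).unsqueeze(0)

# Convert text to a waveform and save it as 16 kHz audio.
inputs = processor(text="Multimodal AI combines text, images, and audio.", return_tensors="pt")
speech = model.generate_speech(inputs["input_ids"], speaker_embedding, vocoder=vocoder)
sf.write("tts_example.wav", speech.numpy(), samplerate=16000)
```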