Generative AI Unleashed: crafting voices and music from text & images
While generative AI continues to evolve predominantly around text and imagery, the horizon is broadening with innovations in audio, synthesized voices, and music creation. This session provides an overview of the realm of generative AI for audio. Let's dive into hands-on examples, harnessing the power of open-source pre-trained models, and discover how the Microsoft ecosystem, among others, can be a powerful tool in this journey. Gianni Rosa Gallina R&D Technical Lead @ Deltatre Gianni is a Microsoft MVP for the AI category, former Windows Embedded/Development since 2011, focused on emerging technologies, AI and Virtual/Augmented/Mixed Reality since 2013. Currently he is R&D Technical Lead in Deltatre's Innovation Lab, designing and prototyping next generation solutions for sport related experiences and services, from apps, tools and end-to-end cloud architectures. Besides that, he's an active member of the local community “Torino Technologies Group” (TTG), Pluralsight online courses author, writes articles on his blog and he's speaker in national and international tech conferences and events. Clemente Giorio Senior Principal Engineer (R&D focus) @EYES visiON Passionate about bridging the gap between science, technology, and community engagement. With a knack for Artificial Intelligence, Computer Vision, and Mixed Reality, I strive to innovate and contribute to cutting-edge tech projects.