In this module, we provide a comprehensive overview of the generative AI landscape covering both theory and practical applications. We dive into the fundamental ideas related to autoencoders, generative adversarial networks, and autoregressive
models. We explore advanced techniques, including transformers, diffusion models, and multimodal models, illustrating their
capabilities across various tasks.
The focus is on generative AI techniques and their applications in our everyday life as well as in professional environments spanning fields such as advertising, fashion design, creative writing, music production, and software engineering. We will discuss, understand, and implement various useful generative AI methods like styleGANs, large language models (LLMs) such as GPT and LLaMA, vision-language models like DALL·E, Stable Diffusion, etc. Participants will gain a deeper understanding of these cutting-edge generative AI methods and their practical utility.
Python skills are necessary. This module is taught in English. Most of the AI Literature is in English. An intermediate level is recommended. The students are free to write reports in German or English. The exam questions will be in English, the students are free to write answers in German or English.
Lösen einer Challenge mit Kurzbericht und Präsentation
Schriftliche Prüfung 50%
Challenge 50%
After the successful completion of this module the students are able to:
1. Understand the current state of generative AI
2. Embrace the generative AI technology to explore new ideas much faster and iterate over concepts in a way that previously was not possible.
3. Appreciate the importance of ethical and practical challenges faced by generative AI
4. Train different generative AI models, measure their performance, and deploy them locally
5. Learn to generate and adapt images, create text, create music with the help of generative AI
In this module, we delve into specific generative AI topics and apply them to concrete tangible examples. The emphasis will be on widely employed deep learning techniques and their (potential) applications in real-world scenarios. While the exact list of
topics may vary, the following topics are anticipated to be covered:
1. Learn how large language models like ChatGPT are trained
2. Train your own model for text generation
3. Understand and implement prompting, prompt tuning and few-shot learning and AI agents
4. Understand and implement (a chatbot-based) retrieval augmented search with LLMs
5. Discover how Variational Autoencoders can alter images
6. Train generative adversarial networks (GANs) to generate new images based on a given dataset and explore the latest StyleGAN
7. Build diffusion models to produce new diverse variations of a certain image
8. Compose music using Transformers and MuseGAN
9. Dive into multimodal models such as DALL·E 2, Imagen, and Stable Diffusion
10. Appreciate ethical concerns of generative AI like privacy, fairness, bias, ownership, fake content, etc.
During the semester the students will solve a graded challenge in a non-technical, interdisciplinary context.
Durchführung gemäss Stundenplan