Creative AI &
multimodality:
looking ahead
Roelof Pieters
@graphific
Imperial College London, 1 Dec 2015
roelof@graph-technologies.com
http://artificialexperience.com/
http://www.csc.kth.se/~roelof/
Creative AI
AI
I kinda expect the audience to know AI & Machine Learning

Let’s move on, shall we?
AI
All references link to:
- arXiv, or
- GitXiv if the “code” or “dataset” is available
Collaborative Open Computer Science
more info (Medium)
AI > today’s focus
“Deep learning is a set of
algorithms in machine learning
that attempt to learn in multiple
levels, corresponding to
different levels of abstraction.”
AI > today’s focus
use of several modes (media) to
create a single artifact.
Multimodality
“Mode”
Socially and culturally shaped
resource for making meaning.
— Gunther Kress
Creativity
Creativity
• Many definitions: philosophical, sociological, historical,
practical
Creativity
1. Making unfamiliar combinations of familiar ideas (“Remix”).
2. Exploring a structured conceptual space (“Exploration”).
3. (Radically) transforming one’s structured conceptual space (“Transformation”).
(Margaret Boden, “The Creative Mind”)
• Skill
• Appreciation
• Imagination
• Learning
• Innovation
• Accountability
• Subjectivity
• Intentionality
Creativity > “Traits” software has to exhibit in order to
avoid easy criticism of being “non-creative”.
(Simon Colton)
Creative AI
Creative AI > Current possibilities
• Appropriating “standard” nets for creative use
• Reinforcement Learning: Creativity as a Game
• RNNs/LSTMs/GRUs
• Sequence-to-Sequence: Creativity as a Translation Task
• Auto-Encoders
• Attention-based Models
• Generative Adversarial Nets
Creative AI > Current possibilities > Appropriating “standard” nets for creative use
Deep Dream
see also: www.csc.kth.se/~roelof/deepdream/
(code) (YouTube) Roelof Pieters, 2015
C.M. Kosemen & Roelof Pieters (2015)
Gizmodo
Creative AI > Current possibilities > Appropriating “standard” nets for creative use
Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, 2015.
A Neural Algorithm of Artistic Style (GitXiv)
Style Net
Gene Kogan, 2015. Why is a Raven Like a Writing Desk? (vimeo)
Creative AI > Current possibilities
• Appropriating “standard” nets for creative use
• Reinforcement Learning: Creativity as a Game
• RNNs/LSTMs/GRUs
• Sequence-to-Sequence: Creativity as a Translation Task
• Auto-Encoders
• Attention-based Models
• Generative Adversarial Nets
Creative AI > Current possibilities > Reinforcement Learning
• AMN: Emilio Parisotto, Jimmy Lei Ba, Ruslan Salakhutdinov 2015, Actor-Mimic:
Deep Multitask and Transfer Reinforcement Learning (arxiv)
• DQN: Mnih, Volodymyr, Kavukcuoglu, Koray, Silver, David, Rusu, Andrei A., Veness,
Joel, Bellemare, Marc G., Graves, Alex, Riedmiller, Martin, Fidjeland, Andreas K.,
Ostrovski, Georg, Petersen, Stig, Beattie, Charles, Sadik, Amir, Antonoglou, Ioannis,
King, Helen, Kumaran, Dharshan, Wierstra, Daan, Legg, Shane, and Hassabis,
Demis. Human-level control through deep reinforcement learning. Nature, 518(7540):
529–533, 2015.
Creative AI > Current possibilities > Reinforcement Learning
Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan Korjus, Juhan Aru, Jaan Aru, Raul Vicente, 2015.
Multiagent Cooperation and Competition with Deep Reinforcement Learning (GitXiv)
(YouTube)
Reinforcement Learning
Ning Xie, Hirotaka Hachiya, Masashi Sugiyama, 2013.
Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation
in Oriental Ink Painting (Paper, Lecture, YouTube)
Creative AI > Current possibilities
• Appropriating “standard” nets for creative use
• Reinforcement Learning: Creativity as a Game
• RNNs/LSTMs/GRUs
• Sequence-to-Sequence: Creativity as a Translation Task
• Auto-encoders
• Attention-based Models
• Generative Adversarial Nets
Creative AI > Current possibilities
• Standard (“denoising”) Autoencoders
• Variational Autoencoder (VAE) / Stochastic Gradient VB
• Deep Convolutional Inverse Graphics Network
• Variational RNN (VRNN)
Vincent et al, 2010. Stacked Denoising Autoencoders: Learning Useful Representations in
a Deep Network with a Local Denoising Criterion (paper) (code)
Creative AI > Current possibilities
• Standard “denoising” Autoencoders
• Variational Autoencoder (VAE) / Stochastic Gradient VB
• Deep Convolutional Inverse Graphics Network
• Variational RNN (VRNN)
Diederik P. Kingma, Max Welling, 2013.
Auto-Encoding Variational Bayes (GitXiv)
Creative AI > Current possibilities
• Standard “denoising” Autoencoders
• Variational Autoencoder (VAE)
• Deep Convolutional Inverse Graphics Network (modified VAE)
• Variational RNN (VRNN)
Tejas D. Kulkarni, Will Whitney, Pushmeet Kohli, Joshua B. Tenenbaum, 2015.
Deep Convolutional Inverse Graphics Network (GitXiv)
Creative AI > Current possibilities
• Standard “denoising” Autoencoders
• Variational Autoencoder (VAE)
• Deep Convolutional Inverse Graphics Network
• Variational RNN (VRNN) (VAE at every time step)
Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron Courville, Yoshua Bengio, 2015.
A Recurrent Latent Variable Model for Sequential Data (GitXiv) (Audio Samples)
Creative AI > Current possibilities
• Appropriating “standard” nets for creative use
• Reinforcement Learning: Creativity as a Game
• RNNs/LSTMs/GRUs
• Sequence-to-Sequence: Creativity as a Translation Task
• Auto-Encoders
• Attention-based Models
• Generative Adversarial Nets
Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, Daan Wierstra, 2015.
DRAW: A Recurrent Neural Network For Image Generation (GitXiv)
Variational Auto-Encoder
Deep Recurrent Attentive Writer
(DRAW) Network
(YouTube)
Creative AI > Current possibilities
• Appropriating “standard” nets for creative use
• Reinforcement Learning: Creativity as a Game
• RNNs/LSTMs/GRUs
• Sequence-to-Sequence: Creativity as a Translation Task
• Auto-Encoders
• Attention-based Models
• Generative Adversarial Nets
Emily Denton, Soumith Chintala, Arthur Szlam, Rob Fergus, 2015.
Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks (GitXiv)
Alec Radford, Luke Metz, Soumith Chintala, 2015.
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (GitXiv)
“Turn” vector created from four averaged samples of faces looking
left vs looking right.
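The “turn” vector caption can be read as simple arithmetic in the latent (z) space. Below is a minimal sketch under stated assumptions: the z vectors are random stand-ins rather than codes from a trained generator, the direction is taken as mean(right-facing) minus mean(left-facing), and all function names are hypothetical. Decoding the shifted codes back into images would need the trained generator, which is not shown.

```python
import random

def mean_vector(vectors):
    """Element-wise mean of equally sized latent vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def attribute_direction(with_attr, without_attr):
    """Direction in z-space: mean(with attribute) - mean(without attribute)."""
    a = mean_vector(with_attr)
    b = mean_vector(without_attr)
    return [x - y for x, y in zip(a, b)]

def shift(z, direction, alpha):
    """Move a latent code z along a direction by step size alpha."""
    return [zi + alpha * di for zi, di in zip(z, direction)]

def interpolate(z_start, z_end, steps):
    """Linear walk between two latent codes (steps must be >= 2)."""
    ts = [i / (steps - 1) for i in range(steps)]
    return [[(1 - t) * a + t * b for a, b in zip(z_start, z_end)] for t in ts]

# Hypothetical stand-ins for the z vectors of four left- and four right-facing samples.
rng = random.Random(0)
dim = 8
z_left = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(4)]
z_right = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(4)]

turn = attribute_direction(z_right, z_left)
z_new = [rng.gauss(0, 1) for _ in range(dim)]
# Five codes that, once decoded, would gradually "turn" the same sample.
walk = [shift(z_new, turn, alpha) for alpha in (-1.0, -0.5, 0.0, 0.5, 1.0)]
```

Averaging several samples per side, rather than subtracting a single pair, is what keeps the direction from picking up the quirks of any one sample.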
walking through the manifold
top: unmodified samples
bottom: same samples with the “window” filters dropped out
Autonomy Supervision
Creativity?
- unsupervised training
- generator/discriminator
- latent/z space
- auto encoders
- multimodality
- query - target/class
Creativity?
Process Result
Creative AI > Needs as I see it
Creative AI as a “tool” or “brush” to paint with
A system which marries the need for a creative
process with the need for a creative output
• with as little human input as possible (data)
• with its own style
• with the possibility for human-level supervision
for rapid experimentation
Creative AI > a “brush”
Creative AI > a “brush” > data
• reuse nets as much as possible
• combining unsupervised & supervised
• multiple modalities
• plug in external knowledge bases
Creative AI > a “brush” > data input
• unlabeled & labeled data
• external knowledge bases (dbpedia, wikipedia)
• one-shot learning
• zero-shot learning
Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng, 2013.
Zero-Shot Learning Through Cross-Modal Transfer
a zero-shot model that can predict both seen and unseen classes
Creative AI > a “brush” > data input
Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng, 2013.
Zero-Shot Learning Through Cross-Modal Transfer (slides)
A system which marries the need for a creative
process with the need for a creative output
• with as little human input as possible (data)
• with its own style
• with the possibility for human-level supervision
for rapid experimentation
Creative AI > a “brush”
Creative AI > a “brush” > data
• “rich” latent (“z”) space
• easy user supervision over output:
• priors
• constrain network (units, layers, etc)
• guided input
• mixed input
• latent space
Creative AI > a “brush” > data
Deep Dream
Alexander Mordvintsev, Christopher Olah, Mike Tyka, 2015.
Inceptionism: Going Deeper into Neural Networks (Google Research Blog)
Creative AI > a “brush” > data
Deep Dream
Roelof Pieters, 2015 DeepDream - Class visualization Experiment (link)
Creative AI > a “brush” > data
• “rich” latent (“z”) space
• easy user supervision over output:
• priors
• constrain network (units, layers, etc)
• guided input
• mixed input
• latent space
Creative AI > a “brush” > data
Deep Dream
Roelof Pieters, 2015 DeepDream - Overview of standard bvlc googlenet (inception) layers (link)
Constrain Layers
Creative AI > a “brush” > data
Deep Dream
Roelof Pieters, 2015 Single Unit Activations (early layer) (Flickr Album)
Constrain Units
Creative AI > a “brush” > data
• “rich” latent (“z”) space
• easy user supervision over output:
• priors
• constrain network (units, layers, etc)
• guided input
• mixed input
• latent space
Creative AI > a “brush” > data
Deep Dream
Roelof Pieters, 2015 DeepDream Video (GitHub)
Creative AI > a “brush” > data
• “rich” latent (“z”) space
• easy user supervision over output:
• priors
• constrain network (units, layers, etc)
• guided input
• mixed input
• latent space
Creative AI > a “brush” > data
Style Net
Roelof Pieters (graphific) (tweet) Roelof Pieters (graphific) (tweet)
Creative AI > a “brush” > data
• “rich” latent (“z”) space
• easy user supervision over output:
• priors
• constrain network (units, layers, etc)
• guided input
• mixed input
• latent space
Image -> Text
“A person riding a motorcycle on a dirt road.”???
Image -> Text
“Two hockey players are fighting over the puck.”???
Image -> Text
Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron
Courville, Ruslan Salakhutdinov, Richard Zemel, Yoshua Bengio,
Show, Attend and Tell: Neural Image Caption Generation with
Visual Attention (arxiv) (info) (code)
Andrej Karpathy, Li Fei-Fei, 2015.
Deep Visual-Semantic Alignments for Generating Image Descriptions (pdf) (info) (code)
Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan,
2015. Show and Tell: A Neural Image Caption Generator (arxiv)
Text -> Image
“A stop sign is flying in blue skies.”
“A herd of elephants flying in the blue skies.”
Elman Mansimov, Emilio Parisotto, Jimmy Lei Ba, Ruslan Salakhutdinov, 2015.
Generating Images from Captions with Attention (arxiv) (examples)
Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond Mooney,
Trevor Darrell, Kate Saenko, 2015. Sequence to Sequence -- Video to Text (GitXiv)
Video -> Text
A system which marries the need for a creative
process with the need for a creative output
• with as little human input as possible (data)
• with its own style
• with the possibility for human-level supervision
for rapid experimentation
Creative AI > a “brush”
Creative AI > a “brush” > rapid experimentation
Widening
Deepening
Tianqi Chen, Ian Goodfellow, Jonathon Shlens, 2015. Net2Net: Accelerating Learning via
Knowledge Transfer (arxiv) / code (torch)
Reusing Nets:
Bigger Net
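The “widening” idea on this slide can be illustrated with a small function-preserving transformation: new hidden units are copies of existing ones, and the outgoing weights of every copied unit are divided by the number of copies so the output stays the same. This is a minimal sketch under assumptions, not the Net2Net reference code: it uses a toy two-layer ReLU network on plain Python lists, and the names `forward` and `widen` are hypothetical.

```python
import random

def relu(values):
    return [max(0.0, v) for v in values]

def forward(x, w1, b1, w2):
    """Two-layer net: relu(x @ w1 + b1) @ w2, with matrices stored as lists of rows."""
    hidden = relu([sum(xi * w1[i][j] for i, xi in enumerate(x)) + b1[j]
                   for j in range(len(b1))])
    return [sum(hj * w2[j][k] for j, hj in enumerate(hidden))
            for k in range(len(w2[0]))]

def widen(w1, b1, w2, new_width, rng):
    """Grow the hidden layer to new_width units without changing the function."""
    old_width = len(b1)
    # mapping[j] = index of the original hidden unit that new unit j copies
    mapping = list(range(old_width)) + [rng.randrange(old_width)
                                        for _ in range(new_width - old_width)]
    counts = [mapping.count(j) for j in range(old_width)]
    new_w1 = [[row[m] for m in mapping] for row in w1]          # copy incoming weights
    new_b1 = [b1[m] for m in mapping]                           # copy biases
    new_w2 = [[w / counts[m] for w in w2[m]] for m in mapping]  # split outgoing weights
    return new_w1, new_b1, new_w2

rng = random.Random(0)
n_in, n_hidden, n_out = 4, 3, 2
w1 = [[rng.uniform(-1, 1) for _ in range(n_hidden)] for _ in range(n_in)]
b1 = [rng.uniform(-1, 1) for _ in range(n_hidden)]
w2 = [[rng.uniform(-1, 1) for _ in range(n_out)] for _ in range(n_hidden)]
x = [rng.uniform(-1, 1) for _ in range(n_in)]

wide_w1, wide_b1, wide_w2 = widen(w1, b1, w2, new_width=5, rng=rng)
small_out = forward(x, w1, b1, w2)
wide_out = forward(x, wide_w1, wide_b1, wide_w2)  # matches small_out up to rounding
```

Because the wider net starts out computing the same function as the smaller one, training can continue from there instead of from scratch, which is the appeal for rapid experimentation.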
Teacher and Student net Hint training
Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta,
Yoshua Bengio, 2014. FitNets: Hints for Thin Deep Nets (arxiv)
Knowledge distillation
[Tables: SVHN error, MNIST error]
Reusing Nets:
Smaller Net
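As a rough illustration of the knowledge-distillation part of teacher/student training, the sketch below computes a soft-target loss: the teacher's logits are softened with a temperature and the student is scored against them with cross-entropy. The logits, the temperature value, and the function names are assumptions made for the example. The hint loss on intermediate layers that FitNets adds is not shown.

```python
import math

def softmax(logits, temperature=1.0):
    """Softmax with temperature; higher temperature gives a flatter distribution."""
    scaled = [value / temperature for value in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature):
    """Cross-entropy between softened teacher targets and softened student outputs."""
    soft_targets = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    return -sum(t * math.log(p) for t, p in zip(soft_targets, student_probs))

teacher_logits = [2.0, 1.0, 0.0]      # made-up values
good_student = [2.0, 1.0, 0.0]
bad_student = [0.0, 1.0, 2.0]
loss_good = distillation_loss(good_student, teacher_logits, temperature=4.0)
loss_bad = distillation_loss(bad_student, teacher_logits, temperature=4.0)
```

In practice this term is usually mixed with the ordinary loss on the true labels; the mixing weight is another hyperparameter left out here.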
Hashed Net
Wenlin Chen, James T. Wilson, Stephen Tyree, Kilian Q. Weinberger, Yixin Chen, 2015.
Compressing Neural Networks with the Hashing Trick (arxiv)
Shrinking Nets:
Hashing
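The hashing trick can be sketched as a layer whose many “virtual” weights are looked up in a much smaller list of real parameters, with a hash of the position (i, j) choosing the bucket. The example below is a simplified illustration under assumptions: it uses `zlib.crc32` as a stand-in hash function, leaves out any sign hashing and the training step, and the names `bucket` and `hashed_layer` are hypothetical.

```python
import zlib

def bucket(i, j, num_buckets, seed=0):
    """Map virtual weight position (i, j) to an index into the shared parameter list."""
    key = f"{seed}:{i}:{j}".encode("utf-8")
    return zlib.crc32(key) % num_buckets

def hashed_layer(x, shared_weights, n_out):
    """Linear layer whose len(x) * n_out virtual weights live in a small shared list."""
    k = len(shared_weights)
    return [sum(xi * shared_weights[bucket(i, j, k)] for i, xi in enumerate(x))
            for j in range(n_out)]

shared = [0.5, -1.0, 0.25]             # only 3 real parameters
x = [1.0, 2.0, 3.0, 4.0]
y = hashed_layer(x, shared, n_out=5)   # behaves like a layer with 4 * 5 = 20 weights
```

The weight matrix is never stored; only the shared list and the hash function are needed, which is where the compression comes from.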
Song Han, Huizi Mao, William J. Dally, 2015. Deep Compression: Compressing Deep Neural
Networks with Pruning, Trained Quantization and Huffman Coding (arxiv)
Shrinking Nets:
Pruning,
Quantization &
Huffman coding
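Two of the three stages named on this slide can be shown on a flat list of weights: magnitude pruning (zero the smallest weights) and quantization to a small codebook (store an index per weight instead of a float). This is a simplified sketch, not the paper's pipeline: the codebook here is fixed by hand rather than learned, there is no retraining between stages, Huffman coding of the indices is omitted, and the function names are hypothetical.

```python
def magnitude_prune(weights, sparsity):
    """Zero out the fraction `sparsity` of weights with the smallest magnitude."""
    k = int(len(weights) * sparsity)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    dropped = set(order[:k])
    return [0.0 if i in dropped else w for i, w in enumerate(weights)]

def quantize(weights, codebook):
    """Replace each weight by the index of its nearest codebook value."""
    return [min(range(len(codebook)), key=lambda c: abs(w - codebook[c]))
            for w in weights]

def dequantize(indices, codebook):
    """Recover approximate weights from codebook indices."""
    return [codebook[i] for i in indices]

weights = [0.1, -2.0, 0.05, 3.0]
pruned = magnitude_prune(weights, sparsity=0.5)   # the two smallest weights become 0.0

codebook = [-0.5, 0.0, 1.0]                       # hand-picked for illustration
indices = quantize([0.1, 0.9, -0.4], codebook)
approx = dequantize(indices, codebook)
```

With a codebook of 2^b entries each surviving weight costs b bits plus the shared codebook, and the index stream is what a Huffman coder would then compress further.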
Creative AI > a “brush” > rapid experimentation
• experiments need “tooling”, specialised design
software to
• try things
• explore latent spaces (z-space)
• push the AI in the right direction
• be surprised by AI
Creative AI > a “brush” > rapid experimentation
human-machine collaboration
Creative AI > a “brush” > rapid experimentation
(YouTube, Paper)
Creative AI > a “brush” > rapid experimentation
(Vimeo, Paper)
Creative AI > a “brush” > rapid experimentation
• Advertising and marketing
• Architecture
• Crafts
• Design: product, graphic and fashion design
• Film, TV, video, radio and photography
• IT, software and computer services
• Publishing
• Museums, galleries and libraries
• Music, performing and visual arts
Questions?
love letters? existential dilemmas? academic questions? gifts? find me at:

www.csc.kth.se/~roelof/
roelof@kth.se

Más contenido relacionado

La actualidad más candente

Generative AI and law.pptx
Generative AI and law.pptxGenerative AI and law.pptx
Generative AI and law.pptxChris Marsden
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...David Talby
 
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer RoadmapEpisode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer RoadmapAnant Corporation
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & ConcernsAjitesh Kumar
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...ssuser4edc93
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfDung Hoang
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesDianaGray10
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxColleen Farrelly
 
An Introduction to Generative AI - May 18, 2023
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023CoriFaklaris1
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence PresentationMiraz Hossain
 
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!taozen
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdfQualcomm Research
 
ChatGPT - AI.pdf
ChatGPT - AI.pdfChatGPT - AI.pdf
ChatGPT - AI.pdfBannoon1
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMsLoic Merckel
 
AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...
AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...
AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...DianaGray10
 
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...DataScienceConferenc1
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsAdventureWorld5
 

La actualidad más candente (20)

Generative AI and law.pptx
Generative AI and law.pptxGenerative AI and law.pptx
Generative AI and law.pptx
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
 
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer RoadmapEpisode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
 
Generative AI Risks & Concerns
Generative AI Risks & ConcernsGenerative AI Risks & Concerns
Generative AI Risks & Concerns
 
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...How Does Generative AI Actually Work? (a quick semi-technical introduction to...
How Does Generative AI Actually Work? (a quick semi-technical introduction to...
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptx
 
An Introduction to Generative AI - May 18, 2023
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
The Rise of the LLMs - How I Learned to Stop Worrying & Love the GPT!
 
Webinar on ChatGPT.pptx
Webinar on ChatGPT.pptxWebinar on ChatGPT.pptx
Webinar on ChatGPT.pptx
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
 
ChatGPT - AI.pdf
ChatGPT - AI.pdfChatGPT - AI.pdf
ChatGPT - AI.pdf
 
Semantic AI
Semantic AISemantic AI
Semantic AI
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMs
 
AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...
AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...
AI and ML Series - Leveraging Generative AI and LLMs Using the UiPath Platfor...
 
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language models
 
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra ConsultingAI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
 

Destacado

Multi-modal embeddings: from discriminative to generative models and creative ai
Multi-modal embeddings: from discriminative to generative models and creative aiMulti-modal embeddings: from discriminative to generative models and creative ai
Multi-modal embeddings: from discriminative to generative models and creative aiRoelof Pieters
 
Deep Learning for Information Retrieval
Deep Learning for Information RetrievalDeep Learning for Information Retrieval
Deep Learning for Information RetrievalRoelof Pieters
 
Python for Image Understanding: Deep Learning with Convolutional Neural Nets
Python for Image Understanding: Deep Learning with Convolutional Neural NetsPython for Image Understanding: Deep Learning with Convolutional Neural Nets
Python for Image Understanding: Deep Learning with Convolutional Neural NetsRoelof Pieters
 
Graph, Data-science, and Deep Learning
Graph, Data-science, and Deep LearningGraph, Data-science, and Deep Learning
Graph, Data-science, and Deep LearningRoelof Pieters
 
Deep Learning as a Cat/Dog Detector
Deep Learning as a Cat/Dog DetectorDeep Learning as a Cat/Dog Detector
Deep Learning as a Cat/Dog DetectorRoelof Pieters
 
Explore Data: Data Science + Visualization
Explore Data: Data Science + VisualizationExplore Data: Data Science + Visualization
Explore Data: Data Science + VisualizationRoelof Pieters
 
Multi modal retrieval and generation with deep distributed models
Multi modal retrieval and generation with deep distributed modelsMulti modal retrieval and generation with deep distributed models
Multi modal retrieval and generation with deep distributed modelsRoelof Pieters
 
Deep learning for natural language embeddings
Deep learning for natural language embeddingsDeep learning for natural language embeddings
Deep learning for natural language embeddingsRoelof Pieters
 
Deep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsDeep Learning for Natural Language Processing: Word Embeddings
Deep Learning for Natural Language Processing: Word EmbeddingsRoelof Pieters
 
Deep Learning: a birds eye view
Deep Learning: a birds eye viewDeep Learning: a birds eye view
Deep Learning: a birds eye viewRoelof Pieters
 
Visual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageVisual-Semantic Embeddings: some thoughts on Language
Visual-Semantic Embeddings: some thoughts on LanguageRoelof Pieters
 
Deep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with styleDeep Neural Networks 
that talk (Back)… with style
Deep Neural Networks 
that talk (Back)… with styleRoelof Pieters
 
Learning to understand phrases by embedding the dictionary
Learning to understand phrases by embedding the dictionaryLearning to understand phrases by embedding the dictionary
Learning to understand phrases by embedding the dictionaryRoelof Pieters
 
Zero shot learning through cross-modal transfer
Zero shot learning through cross-modal transferZero shot learning through cross-modal transfer
Zero shot learning through cross-modal transferRoelof Pieters
 
Building a Deep Learning (Dream) Machine
Building a Deep Learning (Dream) MachineBuilding a Deep Learning (Dream) Machine
Building a Deep Learning (Dream) MachineRoelof Pieters
 
Deep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersDeep Learning, an interactive introduction for NLP-ers
Deep Learning, an interactive introduction for NLP-ersRoelof Pieters
 
Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!Roelof Pieters
 
Deep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsDeep Learning for NLP: An Introduction to Neural Word Embeddings
Deep Learning for NLP: An Introduction to Neural Word EmbeddingsRoelof Pieters
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networksSi Haem
 
Deep Learning - Convolutional Neural Networks
Deep Learning - Convolutional Neural NetworksDeep Learning - Convolutional Neural Networks
Deep Learning - Convolutional Neural NetworksChristian Perone
 

Destacado (20)

Multi-modal embeddings: from discriminative to generative models and creative ai
Multi-modal embeddings: from discriminative to generative models and creative aiMulti-modal embeddings: from discriminative to generative models and creative ai
Multi-modal embeddings: from discriminative to generative models and creative ai
 
Deep Learning for Information Retrieval
Deep Learning for Information RetrievalDeep Learning for Information Retrieval
Deep Learning for Information Retrieval
 
Python for Image Understanding: Deep Learning with Convolutional Neural Nets
Python for Image Understanding: Deep Learning with Convolutional Neural NetsPython for Image Understanding: Deep Learning with Convolutional Neural Nets
Python for Image Understanding: Deep Learning with Convolutional Neural Nets
 
Graph, Data-science, and Deep Learning
Graph, Data-science, and Deep LearningGraph, Data-science, and Deep Learning
Graph, Data-science, and Deep Learning
 
Deep Learning as a Cat/Dog Detector
Deep Learning as a Cat/Dog DetectorDeep Learning as a Cat/Dog Detector
Deep Learning as a Cat/Dog Detector
 
Explore Data: Data Science + Visualization
Explore Data: Data Science + VisualizationExplore Data: Data Science + Visualization
Explore Data: Data Science + Visualization
 
Creative AI & multimodality: looking ahead

  • 1. Creative AI & multimodality: looking ahead Roelof Pieters @graphific Imperial College London, 1 Dec 2015 roelof@graph-technologies.com http://artificialexperience.com/ http://www.csc.kth.se/~roelof/
  • 3. AI I kinda expect the audience to know AI & Machine Learning. Let’s move on, shall we?
  • 4. AI All references to: - Arxiv or - GitXiv if the “code” or “dataset” is available Collaborative Open Computer Science more info (Medium)
  • 7. “Deep learning is a set of algorithms in machine learning that attempt to learn in multiple levels, corresponding to different levels of abstraction.”
  • 8. AI > today’s focus use of several modes (media) to create a single artifact. Multimodality “Mode” Socially and culturally shaped resource for making meaning. — Gunther Kress
  • 10. Creativity • Many definitions: philosophical, sociological, historical, practical
  • 11. Creativity 1. Making unfamiliar combinations of familiar ideas (“Remix”). 2. Exploring a structured conceptual space (“Exploration”). 3. (Radically) transforming one’s structured conceptual space (“Transformation”). (Margaret Boden, “The Creative Mind”)
  • 12. Creativity > “Traits” software has to exhibit in order to avoid easy criticism of being “non-creative” (Simon Colton): • Skill • Appreciation • Imagination • Learning • Innovation • Accountability • Subjectivity • Intentionality
  • 22. Creative AI > Current possibilities • Appropriating “standard” nets for creative use • Reinforcement Learning: Creativity as a Game • RNNs/LSTMs/GRUs • Sequence-to-Sequence: Creativity as a Translation Task • Auto-Encoders • Attention-based Models • Generative Adversarial Nets
  • 24. Creative AI > Current possibilities > Appropriating “standard” nets for creative use Deep Dream see also: www.csc.kth.se/~roelof/deepdream/
  • 25. Creative AI > Current possibilities > Appropriating “standard” nets for creative use Deep Dream see also: www.csc.kth.se/~roelof/deepdream/ (code) (youtube) Roelof Pieters, 2015
  • 26. Creative AI > Current possibilities > Appropriating “standard” nets for creative use Deep Dream see also: www.csc.kth.se/~roelof/deepdream/ C. M. Kosemen & Roelof Pieters, 2015 (Gizmodo)
  • 27. Creative AI > Current possibilities > Appropriating “standard” nets for creative use Style Net Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, 2015. A Neural Algorithm of Artistic Style (GitXiv)
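As a rough illustration of the "Style Net" idea: the Gatys et al. paper describes style through Gram matrices of a layer's feature maps, i.e. inner products between channels, which discard spatial layout. The sketch below computes one in plain Python. The toy `features` values and the function names are made up for illustration, and no real network is involved.

```python
def gram_matrix(features):
    """features: list of C channels, each a flat list of N activations.
    Returns the C x C matrix of channel-by-channel inner products."""
    return [[sum(a * b for a, b in zip(fi, fj)) for fj in features]
            for fi in features]


def style_distance(features_a, features_b):
    """Sum of squared differences between two Gram matrices (unnormalised)."""
    ga, gb = gram_matrix(features_a), gram_matrix(features_b)
    return sum((x - y) ** 2
               for row_a, row_b in zip(ga, gb)
               for x, y in zip(row_a, row_b))


# Hypothetical feature maps: 2 channels, 3 spatial positions each.
features = [[1.0, 0.0, 2.0],
            [0.0, 1.0, 1.0]]
# Same activations with the spatial positions permuted.
shuffled = [[2.0, 1.0, 0.0],
            [1.0, 0.0, 1.0]]

print(gram_matrix(features))
print(style_distance(features, shuffled))
```

Because the Gram matrix sums over positions, permuting the positions leaves it unchanged, which is one way to see why it captures texture-like statistics rather than content layout.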
  • 28.
  • 29. Gene Kogan, 2015. Why is a Raven Like a Writing Desk? (vimeo)
  • 30. Creative AI > Current possibilities • Appropriating “standard” nets for creative use • Reinforcement Learning: Creativity as a Game • RNNs/LSTMs/GRUs • Sequence-to-Sequence: Creativity as a Translation Task • Auto-Encoders • Attention-based Models • Generative Adversarial Nets
  • 31.
  • 32. Creative AI > Current possibilities > Reinforcement Learning • AMN: Emilio Parisotto, Jimmy Lei Ba, Ruslan Salakhutdinov 2015, Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning (arxiv) • DQN: Mnih, Volodymyr, Kavukcuoglu, Koray, Silver, David, Rusu, Andrei A., Veness, Joel, Bellemare, Marc G., Graves, Alex, Riedmiller, Martin, Fidjeland, Andreas K., Ostrovski, Georg, Petersen, Stig, Beattie, Charles, Sadik, Amir, Antonoglou, Ioannis, King, Helen, Kumaran, Dharshan, Wierstra, Daan, Legg, Shane, and Hassabis, Demis. Human-level control through deep reinforcement learning. Nature, 518(7540): 529–533, 2015.
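For readers less familiar with the reinforcement learning papers cited here, the following is a minimal sketch of the Q-learning target that DQN builds on: the reward plus the discounted best next-state value. It is a tabular-style simplification, not the DQN implementation (which estimates the values with a deep network and uses experience replay); the function names and default values are assumptions for illustration.

```python
def td_target(reward, next_q_values, gamma=0.99, terminal=False):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a')."""
    if terminal:
        return reward  # no future value after a terminal state
    return reward + gamma * max(next_q_values)


def q_update(q_value, target, lr=0.1):
    """Move the current estimate a fraction `lr` towards the target."""
    return q_value + lr * (target - q_value)


# Hypothetical transition: reward 1.0, two actions available in the next state.
target = td_target(1.0, [0.5, 2.0], gamma=0.5)
new_q = q_update(0.0, target, lr=0.5)
print(target, new_q)
```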
  • 33. Creative AI > Current possibilities > Reinforcement Learning Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan Korjus, Juhan Aru, Jaan Aru, Raul Vicente, 2015. Multiagent Cooperation and Competition with Deep Reinforcement Learning (GitXiv) (YouTube)
  • 34. Reinforcement Learning Ning Xie, Hirotaka Hachiya, Masashi Sugiyama, 2013. Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation in Oriental Ink Painting (Paper, Lecture, YouTube)
  • 36. Ning Xie, Hirotaka Hachiya, Masashi Sugiyama, 2013. Artist Agent: A Reinforcement Learning Approach to Automatic Stroke Generation in Oriental Ink Painting (Paper, Lecture, YouTube)
  • 39. Creative AI > Current possibilities • Appropriating “standard” nets for creative use • Reinforcement Learning: Creativity as a Game • RNNs/LSTMs/GRUs • Sequence-to-Sequence: Creativity as a Translation Task • Auto-encoders • Attention-based Models • Generative Adversarial Nets
  • 40. Creative AI > Current possibilities • Standard (“denoising”) Autoencoders • Variational Autoencoder (VAE) / Stochastic Gradient VB • Deep Convolutional Inverse Graphics Network • Variational RNN (VRNN) Vincent et al, 2010. Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion (paper) (code)
  • 41. Creative AI > Current possibilities • Standard “denoising” Autoencoders • Variational Autoencoder (VAE) / Stochastic Gradient VB • Deep Convolutional Inverse Graphics Network • Variational RNN (VRNN) Diederik P. Kingma, Max Welling, 2013. Auto-Encoding Variational Bayes (GitXiv)
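A minimal sketch of two pieces of the variational autoencoder from Auto-Encoding Variational Bayes, assuming a diagonal Gaussian posterior: the reparameterisation z = mu + sigma * eps, and the closed-form KL divergence to a standard normal prior. The encoder and decoder networks are omitted entirely; function names and the example inputs are illustrative assumptions.

```python
import math
import random


def sample_latent(mu, log_var, rng=random):
    """Reparameterised sample: z = mu + sigma * eps with eps ~ N(0, 1),
    where sigma = exp(0.5 * log_var)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(sigma^2)) || N(0, I) )
    = -0.5 * sum(1 + log_var - mu^2 - exp(log_var))."""
    return -0.5 * sum(1.0 + lv - m * m - math.exp(lv)
                      for m, lv in zip(mu, log_var))


# Hypothetical encoder output for one input: 2 latent dimensions.
mu, log_var = [1.0, 0.0], [0.0, 0.0]
z = sample_latent(mu, log_var)
print(z, kl_to_standard_normal(mu, log_var))
```

Writing the sample as a deterministic function of `mu`, `log_var` and external noise is what lets gradients flow through the sampling step during training.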
  • 42. Creative AI > Current possibilities • Standard “denoising” Autoencoders • Variational Autoencoder (VAE) • Deep Convolutional Inverse Graphics Network (modified VAE) • Variational RNN (VRNN) Tejas D. Kulkarni, Will Whitney, Pushmeet Kohli, Joshua B. Tenenbaum, 2015 Deep Convolutional Inverse Graphics Network (GitXiv)
  • 43. Creative AI > Current possibilities • Standard “denoising” Autoencoders • Variational Autoencoder (VAE) • Deep Convolutional Inverse Graphics Network • Variational RNN (VRNN) (VAE at every time step) Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron Courville, Yoshua Bengio, 2015. A Recurrent Latent Variable Model for Sequential Data (GitXiv)
  • 44. Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron Courville, Yoshua Bengio, 2015. A Recurrent Latent Variable Model for Sequential Data (GitXiv) (Audio Samples)
  • 45. Creative AI > Current possibilities • Appropriating “standard” nets for creative use • Reinforcement Learning: Creativity as a Game • RNNs/LSTMs/GRUs • Sequence-to-Sequence: Creativity as a Translation Task • Auto-Encoders • Attention-based Models • Generative Adversarial Nets
  • 46. Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, Daan Wierstra, 2015. DRAW: A Recurrent Neural Network For Image Generation (GitXiv) Figure: Variational Auto-Encoder vs. Deep Recurrent Attentive Writer (DRAW) Network
  • 48. Creative AI > Current possibilities • Appropriating “standard” nets for creative use • Reinforcement Learning: Creativity as a Game • RNNs/LSTMs/GRUs • Sequence-to-Sequence: Creativity as a Translation Task • Auto-Encoders • Attention-based Models • Generative Adversarial Nets
  • 49. Emily Denton, Soumith Chintala, Arthur Szlam, Rob Fergus, 2015. Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks (GitXiv)
  • 50. Alec Radford, Luke Metz, Soumith Chintala, 2015. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (GitXiv)
  • 51. Alec Radford, Luke Metz, Soumith Chintala, 2015. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (GitXiv)
  • 52. “Turn” vector created from four averaged samples of faces looking left vs. looking right. Alec Radford, Luke Metz, Soumith Chintala, 2015. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks (GitXiv)
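The “turn” vector can be sketched as simple arithmetic in latent (z) space: average the z vectors of samples looking left, average those looking right, and take the difference. The two-dimensional vectors below are invented stand-ins for real latent codes, and the `shift` helper is an assumption about how such a direction would typically be applied to a sample; no generator network is included.

```python
def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def direction(from_samples, to_samples):
    """Difference between the mean latent codes of two groups."""
    start, end = mean_vector(from_samples), mean_vector(to_samples)
    return [e - s for s, e in zip(start, end)]


def shift(z, direction_vec, alpha):
    """Move a latent code `alpha` steps along a direction."""
    return [zi + alpha * di for zi, di in zip(z, direction_vec)]


# Hypothetical latent codes: four "looking left", four "looking right".
looking_left = [[0.0, 1.0], [2.0, 1.0], [1.0, 0.0], [1.0, 2.0]]
looking_right = [[3.0, 1.0], [3.0, 3.0], [2.0, 2.0], [4.0, 2.0]]

turn = direction(looking_left, looking_right)
# Feeding shift(z, turn, alpha) for increasing alpha to a generator
# would be expected to rotate the generated face gradually.
print(turn, shift([0.0, 0.0], turn, 0.5))
```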
  • 54. Top: unmodified samples. Bottom: the same samples with the “window” filters dropped out.
  • 55. Autonomy Supervision Creativity? - unsupervised training - generator/discriminator - latent/z space - auto encoders - multimodality - query - target/class
  • 57. Creative AI > Needs as I see it Creative AI as a “tool” or “brush” to paint with
  • 58. Creative AI > a “brush” A system which marries the need for a creative process with the need for a creative output • with as little human input as possible (data) • with its own style • with the possibility for human-level supervision for rapid experimentation
  • 59. Creative AI > a “brush” A system which marries the need for a creative process with the need for a creative output • with as little human input as possible (data) • with its own style • with the possibility for human-level supervision for rapid experimentation
  • 60. Creative AI > a “brush” > data • reuse nets as much as possible • combining unsupervised & supervised • multiple modalities • plug in external knowledge bases
  • 61. Creative AI > a “brush” > data input • unlabeled & labeled data • external knowledge bases (dbpedia, wikipedia) • one-shot learning • zero-shot learning Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng, 2013. Zero-Shot Learning Through Cross-Modal Transfer: a zero-shot model that can predict both seen and unseen classes
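A heavily simplified sketch of the cross-modal idea behind zero-shot classification: project image features into a word-vector space and pick the class whose word vector is closest, including classes that had no training images. The linear `weights` map, the two-dimensional word vectors, and the class names are invented for illustration; the cited paper's actual model differs in its details (for example, in how it decides whether an image belongs to a seen or unseen class).

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def project(image_features, weights):
    """Linear map from image-feature space into word-vector space."""
    return [sum(w * x for w, x in zip(row, image_features))
            for row in weights]


def classify(image_features, weights, class_vectors):
    """Return the class whose word vector is most similar to the projection."""
    z = project(image_features, weights)
    return max(class_vectors, key=lambda name: cosine(z, class_vectors[name]))


# Hypothetical word vectors; "truck" is treated as a class with no training images.
class_vectors = {"cat": [1.0, 0.0], "dog": [0.8, 0.6], "truck": [0.0, 1.0]}
# Hypothetical learned projection from 3-d image features to 2-d word space.
weights = [[1.0, 0.0, 0.0],
           [0.0, 1.0, 0.0]]

print(classify([0.1, 2.0, 5.0], weights, class_vectors))
print(classify([3.0, 0.2, 0.0], weights, class_vectors))
```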
  • 62. Creative AI > a “brush” > data input Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng, 2013. Zero-Shot Learning Through Cross-Modal Transfer (slides)
  • 65. Creative AI > a “brush” A system which marries the need for a creative process with the need for a creative output • with as little human input as possible (data) • with its own style • with the possibility for human-level supervision for rapid experimentation
  • 66. Creative AI > a “brush” > data • “rich” latent (“z”) space • easy user supervision over output: • priors • constrain network (units, layers, etc) • guided input • mixed input • latent space
  • 68. Creative AI > a “brush” > data Deep Dream Alexander Mordvintsev, Christopher Olah, Mike Tyka, 2015. Inceptionism: Going Deeper into Neural Networks (Google Research Blog)
  • 69. Creative AI > a “brush” > data Deep Dream Roelof Pieters, 2015 DeepDream - Class visualization Experiment (link)
  • 70. Roelof Pieters, 2015 DeepDream - Class visualization Experiment (link)
  • 71. Creative AI > a “brush” > data • “rich” latent (“z”) space • easy user supervision over output: • priors • constrain network (units, layers, etc) • guided input • mixed input • latent space
  • 72. Creative AI > a “brush” > data Deep Dream Roelof Pieters, 2015 DeepDream - Overview of standard bvlc googlenet (inception) layers (link) Constrain Layers
  • 73. Creative AI > a “brush” > data Deep Dream Roelof Pieters, 2015 Single Unit Activations (early layer) (Flickr Album) Constrain Units
  • 74. Creative AI > a “brush” > data • “rich” latent (“z”) space • easy user supervision over output: • priors • constrain network (units, layers, etc) • guided input • mixed input • latent space
  • 75. Creative AI > a “brush” > data Deep Dream Roelof Pieters, 2015 DeepDream Video (GitHub)
  • 76. Creative AI > a “brush” > data • “rich” latent (“z”) space • easy user supervision over output: • priors • constrain network (units, layers, etc) • guided input • mixed input • latent space
  • 77. Creative AI > a “brush” > data Style Net Roelof Pieters (graphific) (tweet) Roelof Pieters (graphific) (tweet)
  • 78. Creative AI > a “brush” > data • “rich” latent (“z”) space • easy user supervision over output: • priors • constrain network (units, layers, etc) • guided input • mixed input • latent space
  • 79. Image -> Text “A person riding a motorcycle on a dirt road.”???
  • 80. Image -> Text “Two hockey players are fighting over the puck.”???
  • 81. Image -> Text • Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, Richard Zemel, Yoshua Bengio, 2015. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention (arxiv) (info) (code) • Andrej Karpathy, Li Fei-Fei, 2015. Deep Visual-Semantic Alignments for Generating Image Descriptions (pdf) (info) (code) • Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan, 2015. Show and Tell: A Neural Image Caption Generator (arxiv)
  • 82. Text -> Image “A stop sign is flying in blue skies.” “A herd of elephants flying in the blue skies.” Elman Mansimov, Emilio Parisotto, Jimmy Lei Ba, Ruslan Salakhutdinov, 2015. Generating Images from Captions with Attention (arxiv) (examples)
  • 83. Text -> Image Elman Mansimov, Emilio Parisotto, Jimmy Lei Ba, Ruslan Salakhutdinov, 2015. Generating Images from Captions with Attention (arxiv) (examples)
  • 84. Video -> Text Subhashini Venugopalan, Marcus Rohrbach, Jeff Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko, 2015. Sequence to Sequence -- Video to Text (GitXiv)
  • 85. Creative AI > a “brush” A system which marries the need for a creative process with the need for a creative output • with as little human input as possible (data) • with its own style • with the possibility for human level supervision for rapid experimentation
  • 86. Creative AI > a “brush” > rapid experimentation
  • 87. Reusing Nets: Bigger Net • Widening • Deepening • Tianqi Chen, Ian Goodfellow, Jonathon Shlens, 2015. Net2Net: Accelerating Learning via Knowledge Transfer (arxiv) / code (torch)
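
The "widening" idea can be sketched for a single hidden layer: new hidden units are copies of existing ones, and the outgoing weights of each copied unit are divided by its number of copies, so the wider net computes the same function as the original. This is a minimal NumPy sketch under that assumption, not the linked Torch code; `net2wider` and `forward` are hypothetical helper names.

```python
import numpy as np

def net2wider(W1, b1, W2, new_width, rng):
    """Widen the hidden layer of y = W2 @ relu(W1 @ x + b1) + b2 without changing y."""
    n = W1.shape[0]
    # g maps every new hidden unit to an existing one: identity first, then random copies
    g = np.concatenate([np.arange(n), rng.integers(0, n, size=new_width - n)])
    counts = np.bincount(g, minlength=n)   # number of copies of each original unit
    W1_new = W1[g]                         # replicate incoming weights
    b1_new = b1[g]                         # replicate biases
    W2_new = W2[:, g] / counts[g]          # share outgoing weights among the copies
    return W1_new, b1_new, W2_new

def forward(W1, b1, W2, b2, x):
    return W2 @ np.maximum(W1 @ x + b1, 0.0) + b2

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(4, 3)), rng.normal(size=4)
W2, b2 = rng.normal(size=(2, 4)), rng.normal(size=2)
x = rng.normal(size=3)
W1w, b1w, W2w = net2wider(W1, b1, W2, new_width=7, rng=rng)
y_small = forward(W1, b1, W2, b2, x)
y_wide = forward(W1w, b1w, W2w, b2, x)
```

Because the outputs match, the wider net can continue training from where the smaller one stopped instead of starting from scratch.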
  • 88. Reusing Nets: Smaller Net • Teacher and student net • Hint training • Knowledge distillation • Adriana Romero, Nicolas Ballas, Samira Ebrahimi Kahou, Antoine Chassang, Carlo Gatta, Yoshua Bengio, 2014. FitNets: Hints for Thin Deep Nets (arxiv) • [Result tables: SVHN error, MNIST error]
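
The knowledge distillation part of the teacher/student setup can be sketched as a cross-entropy between temperature-softened teacher and student outputs. The sketch below covers only that term, not the hint training stage; the temperature `T=3.0` and the function names are assumptions chosen for illustration.

```python
import numpy as np

def softmax(logits, T=1.0):
    """Softmax with temperature T; a higher T gives a softer distribution."""
    z = logits / T
    z = z - z.max(axis=-1, keepdims=True)   # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, T=3.0):
    """Cross-entropy of the student's softened outputs against the teacher's."""
    p_teacher = softmax(teacher_logits, T)
    log_p_student = np.log(softmax(student_logits, T) + 1e-12)
    return float(-(p_teacher * log_p_student).sum(axis=-1).mean())

teacher = np.array([[5.0, 1.0, 0.0]])
good_student = np.array([[5.0, 1.0, 0.0]])   # agrees with the teacher
bad_student = np.array([[0.0, 1.0, 5.0]])    # ranks the classes the other way round
loss_good = distillation_loss(good_student, teacher)
loss_bad = distillation_loss(bad_student, teacher)
```

In practice this term is usually combined with the ordinary cross-entropy on the true labels; the weighting between the two is a further hyperparameter not shown here.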
  • 89. Shrinking Nets: Hashing • Hashed Net • Wenlin Chen, James T. Wilson, Stephen Tyree, Kilian Q. Weinberger, Yixin Chen, 2015. Compressing Neural Networks with the Hashing Trick (arxiv)
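
The hashing trick can be sketched as weight sharing: a large "virtual" weight matrix is never stored, and each of its entries is looked up in a much smaller parameter vector through a hash of its (row, column) index. The hash below is a cheap stand-in chosen for illustration, not the hash function used in the paper, and `hashed_weights` is a hypothetical name.

```python
import numpy as np

def hashed_weights(shape, params, seed=0):
    """Build a virtual weight matrix whose entries all come from `params`."""
    rows, cols = np.indices(shape)
    # Stand-in hash (an assumption): mix the indices with large primes via XOR
    h = (rows * 73856093) ^ (cols * 19349663) ^ (seed * 83492791)
    return params[h % params.size]         # bucket index -> shared real parameter

rng = np.random.default_rng(0)
params = rng.normal(size=32)               # 32 real, trainable parameters
W = hashed_weights((16, 16), params)       # 256 virtual weights
```

During training, the gradient for each real parameter would be the sum of the gradients of all virtual weights hashed to it.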
  • 90. Shrinking Nets: Pruning, Quantization & Huffman coding • Song Han, Huizi Mao, William J. Dally, 2015. Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding (arxiv)
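
The first two stages named in the title can be sketched on a single weight matrix: magnitude pruning zeroes the smallest weights, and quantization replaces the survivors with a small codebook of shared values found by 1-D k-means. This is a simplified sketch; it leaves out retraining between stages and the Huffman coding step, and the sparsity level, cluster count and function names are hypothetical.

```python
import numpy as np

def prune(W, sparsity=0.8):
    """Zero out the fraction `sparsity` of weights with the smallest magnitude."""
    threshold = np.quantile(np.abs(W), sparsity)
    mask = np.abs(W) > threshold
    return W * mask, mask

def quantize(W, mask, n_clusters=8, iters=20):
    """Replace surviving weights by their nearest of n_clusters shared centroids."""
    values = W[mask]
    centroids = np.linspace(values.min(), values.max(), n_clusters)  # linear init
    for _ in range(iters):                                           # plain 1-D k-means
        assign = np.abs(values[:, None] - centroids[None, :]).argmin(axis=1)
        for k in range(n_clusters):
            if np.any(assign == k):
                centroids[k] = values[assign == k].mean()
    assign = np.abs(values[:, None] - centroids[None, :]).argmin(axis=1)
    Wq = np.zeros_like(W)
    Wq[mask] = centroids[assign]
    return Wq, centroids

rng = np.random.default_rng(0)
W = rng.normal(size=(64, 64))
Wp, mask = prune(W, sparsity=0.8)
Wq, codebook = quantize(Wp, mask, n_clusters=8)
```

With 8 clusters each surviving weight needs only a 3-bit index into the codebook, which is where the storage saving comes from before any entropy coding.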
  • 91. Creative AI > a “brush” > rapid experimentation • experiments need “tooling”, specialised design software to • try things • explore latent spaces (z-space) • push the AI in the right direction • be surprised by AI
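
One simple way such tooling could let a user explore a latent space is to walk in a straight line between two latent vectors and decode each point along the way. The sketch below only produces the intermediate `z` vectors; the decoder that would turn them into images or sound is assumed to exist elsewhere, and `interpolate_latents` is a hypothetical helper.

```python
import numpy as np

def interpolate_latents(z_a, z_b, steps=5):
    """Return `steps` latent vectors evenly spaced on the line from z_a to z_b."""
    ts = np.linspace(0.0, 1.0, steps)
    return np.stack([(1.0 - t) * z_a + t * z_b for t in ts])

rng = np.random.default_rng(0)
z_a = rng.normal(size=10)    # latent code of a first (hypothetical) sample
z_b = rng.normal(size=10)    # latent code of a second (hypothetical) sample
path = interpolate_latents(z_a, z_b, steps=5)
```

Each row of `path` would be passed to the generative model's decoder to render one frame of the transition.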
  • 92. Creative AI > a “brush” > rapid experimentation human-machine collaboration
  • 93. Creative AI > a “brush” > rapid experimentation (YouTube, Paper)
  • 94. Creative AI > a “brush” > rapid experimentation (YouTube, Paper)
  • 95. Creative AI > a “brush” > rapid experimentation (Vimeo, Paper)
  • 96. Creative AI > a “brush” > rapid experimentation • Advertising and marketing • Architecture • Crafts • Design: product, graphic and fashion design • Film, TV, video, radio and photography • IT, software and computer services • Publishing • Museums, galleries and libraries • Music, performing and visual arts
  • 97. Questions? Love letters? Existential dilemmas? Academic questions? Gifts? Find me at: www.csc.kth.se/~roelof/ roelof@kth.se