SlideShare una empresa de Scribd logo
1 de 34
Descargar para leer sin conexión
School of Computer Science and Engineering Register No: 19BCE1367
Deep Neural Network-based Limerick
Generation for an Image
Name: Divyanshi Thapa
Register No: 19BCE1367
Programme and Specialization: B.Tech CSE
CAPSTONE PROJECT
REVIEW 2
Guide Name:
Dr. Praveen Joe I R
School of Computer Science and Engineering Register No: 19BCE1367
01
Introduction
Outline
�
�
02
03 05
06 08
07
Problem
Statement
Research
Challenges
What to be
done next
Guide
Approval
Proposed
System
Research
Paper Status
04
Research
Objectives
09
References
School of Computer Science and Engineering Register No: 19BCE1367
Introduction
01
School of Computer Science and Engineering Register No: 19BCE1367
● Creative writing using artificial intelligence (AI) is
one of the most popular and rapidly growing
research fields. It is highly intriguing but also
challenging as we go more to the side of
generating human-like texts with constraints as
we have in poems.
● Among creative writing tasks, paraphrasing and
writing stories are easier than writing poetry
because poems have many restrictions such as
rhyming structures, number of lines, type of
language, etc.
Introduction
School of Computer Science and Engineering Register No: 19BCE1367
● Several poem frameworks have been developed to assist AI in generating
human-like poems to address the issue.
Introduction
● Poems in literature can be broadly
classified into nine categories depending
on their rhyming structure and the
number of lines. Among all the nine
categories, one of the most challenging
tasks is to generate a limerick using
artificial intelligence and deep learning as
a limerick is a five-lined poem that has a
strict rhyming structure of AABBA
School of Computer Science and Engineering Register No: 19BCE1367
● Image captioning has also helped to
automatically generate well-formed sentences
from a given image which is widely used in
many NLP tasks such as VQA.
● Language models based on neural networks
have improved the state of the art with regard to
predictive language modeling, while topic
models are successful at capturing clear-cut,
semantic dimensions.
● NLP + DL = a system which can understand and
analyze an image and can generate a creative
human like poem based of the theme of the
image.
NLP + Deep learning
School of Computer Science and Engineering Register No: 19BCE1367
Problem Statement
02
School of Computer Science and Engineering Register No: 19BCE1367
● For a poem to be meaningful, both linguistic and literary
aspects need to be taken into account.
● With the advancement in image captioning, the NLP
tasks such as Question Answering has gone to it’s phase 2
that is Visual Question Answering.
● “To create a deep learning model which can create
limericks (a form of poem) for the given input image in
English language. ”
Problem Statement
School of Computer Science and Engineering Register No: 19BCE1367
Current approaches of generating rhyming English poetry with a neural network
involve constraining output to enforce the condition of rhyme.
The generated poem should be:
● According to the context or theme of the given input image
● Error free
● Coherent
● Follows the rhyming structure of the limerick (AABBA)
Problem Statement
School of Computer Science and Engineering Register No: 19BCE1367
Research Challenges
03
School of Computer Science and Engineering Register No: 19BCE1367
1. Mapping the theme of the image with the topic of
poem.
2. Both linguistic and literary aspects need to be taken
into account so that the poem is meaningful.
3. Syntactic well-formedness and topical coherence
throughout the poem.
4. Rhyming constraint (Maintaining rhyming scheme)
5. Certain amount of creativity in literature for making
poem interesting.
Research Challenges
School of Computer Science and Engineering Register No: 19BCE1367
Research Objectives
04
School of Computer Science and Engineering Register No: 19BCE1367
1. An attempt to mimic human creative writing by creating a
simple framework for image to poem generation for English
language.
2. Using a transformer models for better image captioning and
limerick generation .
3. A framework to generate poems (limericks) efficiently so that
it can be deployed as a public application after the post-
processing.
4. Major focus on maintaining the coherency, rhyming structure
of limerick and the efficiency of the framework.
Research Objectives
School of Computer Science and Engineering Register No: 19BCE1367
Proposed System
05
School of Computer Science and Engineering Register No: 19BCE1367
● The goal is also to make a speed-efficient framework and to do so, the
transformer models are the choice for image analysis and limerick generation.
The features of the image are extracted and the description is generated by
the Vision encoder-decoder model which is a combination of a vision
transformer as an encoder for image feature extraction and GPT-2 as a
decoder for generating human-like captions.
● This caption is treated as the first line of the limerick and is fed to another GPT-
2 model for generating a pool of 20 limericks.
● Best limerick is selected as the final output after post-processing.
Proposed System Introduction
School of Computer Science and Engineering Register No: 19BCE1367
Proposed System Diagram
School of Computer Science and Engineering Register No: 19BCE1367
Module 1 (M1): Image Captioning
Module 2 (M2): GPT-2 reverse language modeling
Module 3 (M3): Post-processing
Module 3.1 (M3.1): Grammar and spelling error detection
Module 3.2 (M3.2): BERT based word embeddings
Module 4 (M4): Evaluation
List of Modules
School of Computer Science and Engineering Register No: 19BCE1367
● The vision encoder-decoder model is used via HuggingFace API
which has ViT as its vision encoder model and GPT-2 as the text
decoder model It is trained on the popular Common Objects in
Context (COCO) dataset which contains more than 120
thousand images with their descriptions.
● The PyTorch version is used for generating the captions for the
given input image.
M1: Image Captioning
School of Computer Science and Engineering Register No: 19BCE1367
Problem: GPT2 is a forward language model as it utilizes the standard left-to-right
order of tokens present in a limerick for fine-tuning. This helps in maintaining the
subject’s continuity and coherency but it cannot maintain the rhyming structure
of the poem.
M2: GPT-2 reverse language modeling
School of Computer Science and Engineering Register No: 19BCE1367
● Solution: The GPT-2 model can be fine-tuned with the corpus of reverse order
(right to left) of tokens present in the limerick. This technique helps the GPT-2
model to learn the rhyming structure.
● The caption generated from the image caption model is fed into this fine-
tuned reverse GPT-2 model as a seed sentence to generate limericks and a
pool of 20 limericks is generated
M2: GPT-2 reverse language modeling
School of Computer Science and Engineering Register No: 19BCE1367
M3.1: Grammar and spelling error detection
- The generated limerick should be syntactically
correct and in order to do so, an open-source spelling
and grammar checker is used to assign scores to
each limerick. The limerick with no errors are chosen
for further processing.
M3: Post-processing
School of Computer Science and Engineering Register No: 19BCE1367
● Bidirectional Encoder Representations from Transformers (BERT) model can be
used to generate in-context embeddings.
● The subject continuity is quantified throughout the limerick as the average noun
centroid distance in the embedding space[5].
● If:
○ mean = high, nouns far from the average subject of the limerick.
○ standard deviation = high, many subjects present in the limerick.
● The limericks with lowest mean and standard deviation is selected as final output..
M3: Post-processing
M3.2: BERT based word embeddings
School of Computer Science and Engineering Register No: 19BCE1367
Automatic evaluation methods :
- BLEU (Bilingual Evaluation Understudy ) score
- Cosine Similarity
- Semantic Similarity (using Sentence BERT)
The MultiM-Poem dataset is a collection of 8292 images scraped from the Flikr and
each image is mapped to a related human-written poem. The image will be the
user input image and the related poem will be the ground truth.
M4: Evaluation
School of Computer Science and Engineering Register No: 19BCE1367
What to be done next?
06
School of Computer Science and Engineering Register No: 19BCE1367
1. Compilation of the results.
2. Research paper completion.
What to be done next?
School of Computer Science and Engineering Register No: 19BCE1367
Research Paper Status
07
School of Computer Science and Engineering Register No: 19BCE1367
1. Abstract.
2. Introduction.
3. Related work.
4. Approach.
a. Architecture
b. Image captioning
c. Language model
5. Experiment.
6. Result
7. Conclusion and Future work.
Research Paper Status
School of Computer Science and Engineering Register No: 19BCE1367
Guide Approval
08
School of Computer Science and Engineering Register No: 19BCE1367
Guide Approval mail screenshot
School of Computer Science and Engineering Register No: 19BCE1367
[1] Wang, H., Zhang, Y., & Yu, X. (2020). An overview of image caption generation
methods. Computational intelligence and neuroscience, 2020.
[2] Van de Cruys, T. (2020, July). Automatic poetry generation from prosaic text. In
Proceedings of the 58th annual meeting of the association for computational linguistics
(pp. 2471-2480).
[3] Beheitt, M. E. G., & Hmida, M. B. H. (2022). Automatic Arabic Poem Generation with
GPT-2. In ICAART (2) (pp. 366-374).
[4] Liu, D., Guo, Q., Li, W., & Lv, J. (2018, July). A multi-modal chinese poetry generation
model. In 2018 International Joint Conference on Neural Networks (IJCNN) (pp. 1-8).
IEEE.
[5] Lo, K. L., Ariss, R., & Kurz, P. (2022). GPoeT-2: A GPT-2 Based Poem Generator. arXiv
preprint arXiv:2205.08847.
References
(Reference papers)
School of Computer Science and Engineering Register No: 19BCE1367
[6] Meyer, J. B. (2019). Generating Free Verse Poetry with Transformer Networks (Doctoral
dissertation, Reed College).
[7] Talafha, S., & Rekabdar, B. (2019, January). Arabic poem generation with hierarchical
recurrent attentional network. In 2019 IEEE 13th International Conference on Semantic
Computing (ICSC) (pp. 316-323). IEEE.
[8] Gao, L., Fan, K., Song, J., Liu, X., Xu, X., & Shen, H. T. (2019, July). Deliberate attention
networks for image captioning. In Proceedings of the AAAI conference on artificial
intelligence (Vol. 33, No. 01, pp. 8320-8327).
[9] Jhamtani, H., Mehta, S. V., Carbonell, J., & Berg-Kirkpatrick, T. (2019). Learning rhyming
constraints using structured adversaries. arXiv preprint arXiv:1909.06743.
[10] Lau, J. H., Cohn, T., Baldwin, T., Brooke, J., & Hammond, A. (2018). Deep-speare: A joint
neural model of poetic language, meter and rhyme. arXiv preprint arXiv:1807.03491.
References
(Reference papers)
School of Computer Science and Engineering Register No: 19BCE1367
[11] Talafha, S., & Rekabdar, B. (2021, January). Poetry generation model via deep learning
incorporating extended phonetic and semantic embeddings. In 2021 IEEE 15th
International Conference on Semantic Computing (ICSC) (pp. 48-55). IEEE.
[12] Min, K., Dang, M., & Moon, H. (2021). Deep Learning-Based Short Story Generation for an
Image Using the Encoder-Decoder Structure. IEEE Access, 9, 113550-113557.
[13] Zhang, D., Ni, B., Zhi, Q., Plummer, T., Li, Q., Zheng, H., ... & Wang, D. (2019, August).
Through the eyes of a poet: Classical poetry recommendation with visual input on social
media. In 2019 IEEE/ACM International Conference on Advances in Social Networks
Analysis and Mining (ASONAM) (pp. 333-340). IEEE.
[14] Ghazvininejad, M., Shi, X., Priyadarshi, J., & Knight, K. (2017, July). Hafez: an interactive
poetry generation system. In Proceedings of ACL 2017, System Demonstrations (pp. 43-48).
[15] Liu, Z., Fu, Z., Cao, J., de Melo, G., Tam, Y. C., Niu, C., & Zhou, J. (2019, July). Rhetorically
controlled encoder-decoder for modern chinese poetry generation. In Proceedings of the
57th Annual Meeting of the Association for Computational Linguistics (pp. 1992-2001).
References
(Reference papers)
School of Computer Science and Engineering Register No: 19BCE1367
1. https://scottmduda.medium.com/generating-an-edgar-allen-poe-styled-
poem-using-gpt-2-289801ded82c
2. https://timesofindia.indiatimes.com/readersblog/newtech/artificial-
intelligence-in-education-39512/
3. https://news.climate.columbia.edu/2022/04/22/haiku-ai-generated-poetry/
4. https://towardsdatascience.com/transformers-89034557de14
5. https://github.com/minimaxir/gpt-2-simple
6. https://languagetool.org/
References
(Websites and articles)
School of Computer Science and Engineering Register No: 19BCE1367
THANK YOU

Más contenido relacionado

Similar a 19BCE1367_Capstone_Review 2_Final.pdf

BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...
BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...
BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...ciyamala kushbu
 
IRJET - A Review on Chatbot Design and Implementation Techniques
IRJET -  	  A Review on Chatbot Design and Implementation TechniquesIRJET -  	  A Review on Chatbot Design and Implementation Techniques
IRJET - A Review on Chatbot Design and Implementation TechniquesIRJET Journal
 
Intelligent Code Generation for Model Driven Web Development
Intelligent Code Generation for Model Driven Web DevelopmentIntelligent Code Generation for Model Driven Web Development
Intelligent Code Generation for Model Driven Web DevelopmentencompassH2020
 
Automated Context-based Question-Distractor Generation using Extractive Summa...
Automated Context-based Question-Distractor Generation using Extractive Summa...Automated Context-based Question-Distractor Generation using Extractive Summa...
Automated Context-based Question-Distractor Generation using Extractive Summa...IRJET Journal
 
Finding Resource Manipulation Bugs in Linux Code
Finding Resource Manipulation Bugs in Linux CodeFinding Resource Manipulation Bugs in Linux Code
Finding Resource Manipulation Bugs in Linux CodeAndrzej Wasowski
 
IRJET - Visual Question Answering – Implementation using Keras
IRJET -  	  Visual Question Answering – Implementation using KerasIRJET -  	  Visual Question Answering – Implementation using Keras
IRJET - Visual Question Answering – Implementation using KerasIRJET Journal
 
Deepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine LearningDeepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine LearningIRJET Journal
 
NL to OCL via SBVR
NL to OCL via SBVRNL to OCL via SBVR
NL to OCL via SBVRImran Bajwa
 
IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...IRJET Journal
 
Finding the shortest path in a graph and its visualization using C# and WPF
Finding the shortest path in a graph and its visualization using C# and WPF Finding the shortest path in a graph and its visualization using C# and WPF
Finding the shortest path in a graph and its visualization using C# and WPF IJECEIAES
 
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...IRJET Journal
 
NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)IT Industry
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerFrancesco Osborne
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET Journal
 
Learn Prompt Engineering Basics chatgpt llms guide
Learn Prompt Engineering Basics chatgpt llms guideLearn Prompt Engineering Basics chatgpt llms guide
Learn Prompt Engineering Basics chatgpt llms guideSaweraKhadium
 
CSE202.pptx
CSE202.pptxCSE202.pptx
CSE202.pptxJoyBoy45
 

Similar a 19BCE1367_Capstone_Review 2_Final.pdf (20)

BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...
BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...
BCS302- Digital Design and computer organization -VTU-2022 scheme-Expectation...
 
IRJET - A Review on Chatbot Design and Implementation Techniques
IRJET -  	  A Review on Chatbot Design and Implementation TechniquesIRJET -  	  A Review on Chatbot Design and Implementation Techniques
IRJET - A Review on Chatbot Design and Implementation Techniques
 
Intelligent Code Generation for Model Driven Web Development
Intelligent Code Generation for Model Driven Web DevelopmentIntelligent Code Generation for Model Driven Web Development
Intelligent Code Generation for Model Driven Web Development
 
Computer engineering university
Computer engineering university Computer engineering university
Computer engineering university
 
Automated Context-based Question-Distractor Generation using Extractive Summa...
Automated Context-based Question-Distractor Generation using Extractive Summa...Automated Context-based Question-Distractor Generation using Extractive Summa...
Automated Context-based Question-Distractor Generation using Extractive Summa...
 
Finding Resource Manipulation Bugs in Linux Code
Finding Resource Manipulation Bugs in Linux CodeFinding Resource Manipulation Bugs in Linux Code
Finding Resource Manipulation Bugs in Linux Code
 
IRJET - Visual Question Answering – Implementation using Keras
IRJET -  	  Visual Question Answering – Implementation using KerasIRJET -  	  Visual Question Answering – Implementation using Keras
IRJET - Visual Question Answering – Implementation using Keras
 
Resume
ResumeResume
Resume
 
Deepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine LearningDeepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine Learning
 
cv
cvcv
cv
 
Liu11-3.pdf
Liu11-3.pdfLiu11-3.pdf
Liu11-3.pdf
 
NL to OCL via SBVR
NL to OCL via SBVRNL to OCL via SBVR
NL to OCL via SBVR
 
IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...IRJET- Image Caption Generation System using Neural Network with Attention Me...
IRJET- Image Caption Generation System using Neural Network with Attention Me...
 
Finding the shortest path in a graph and its visualization using C# and WPF
Finding the shortest path in a graph and its visualization using C# and WPF Finding the shortest path in a graph and its visualization using C# and WPF
Finding the shortest path in a graph and its visualization using C# and WPF
 
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up...
 
NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)NL based Object Oriented modeling - EJSR 35(1)
NL based Object Oriented modeling - EJSR 35(1)
 
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic MinerAutomatic Classification of Springer Nature Proceedings with Smart Topic Miner
Automatic Classification of Springer Nature Proceedings with Smart Topic Miner
 
IRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for BlindIRJET- Wearable AI Device for Blind
IRJET- Wearable AI Device for Blind
 
Learn Prompt Engineering Basics chatgpt llms guide
Learn Prompt Engineering Basics chatgpt llms guideLearn Prompt Engineering Basics chatgpt llms guide
Learn Prompt Engineering Basics chatgpt llms guide
 
CSE202.pptx
CSE202.pptxCSE202.pptx
CSE202.pptx
 

Último

2-tool presenthdbdbdbdbddhdhddation.pptx
2-tool presenthdbdbdbdbddhdhddation.pptx2-tool presenthdbdbdbdbddhdhddation.pptx
2-tool presenthdbdbdbdbddhdhddation.pptxsuhanimunjal27
 
Verified Trusted Call Girls Adugodi💘 9352852248 Good Looking standard Profil...
Verified Trusted Call Girls Adugodi💘 9352852248  Good Looking standard Profil...Verified Trusted Call Girls Adugodi💘 9352852248  Good Looking standard Profil...
Verified Trusted Call Girls Adugodi💘 9352852248 Good Looking standard Profil...kumaririma588
 
call girls in Dakshinpuri (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️
call girls in Dakshinpuri  (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️call girls in Dakshinpuri  (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️
call girls in Dakshinpuri (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...Delhi Call girls
 
Peaches App development presentation deck
Peaches App development presentation deckPeaches App development presentation deck
Peaches App development presentation decktbatkhuu1
 
Pastel Portfolio _ by Slidesgo.pptx. Xxx
Pastel Portfolio _ by Slidesgo.pptx. XxxPastel Portfolio _ by Slidesgo.pptx. Xxx
Pastel Portfolio _ by Slidesgo.pptx. XxxSegundoManuelFaichin1
 
💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...
💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...
💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...sonalitrivedi431
 
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...Call Girls in Nagpur High Profile
 
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...Pooja Nehwal
 
Case Study of Hotel Taj Vivanta, Pune
Case Study of Hotel Taj Vivanta, PuneCase Study of Hotel Taj Vivanta, Pune
Case Study of Hotel Taj Vivanta, PuneLukeKholes
 
AMBER GRAIN EMBROIDERY | Growing folklore elements | Root-based materials, w...
AMBER GRAIN EMBROIDERY | Growing folklore elements |  Root-based materials, w...AMBER GRAIN EMBROIDERY | Growing folklore elements |  Root-based materials, w...
AMBER GRAIN EMBROIDERY | Growing folklore elements | Root-based materials, w...BarusRa
 
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵anilsa9823
 
(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...
(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...
(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...ranjana rawat
 
Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...amitlee9823
 
Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...
Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...
Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...babafaisel
 
DragonBall PowerPoint Template for demo.pptx
DragonBall PowerPoint Template for demo.pptxDragonBall PowerPoint Template for demo.pptx
DragonBall PowerPoint Template for demo.pptxmirandajeremy200221
 
Top Rated Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Call Girls in Nagpur High Profile
 
Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 

Último (20)

2-tool presenthdbdbdbdbddhdhddation.pptx
2-tool presenthdbdbdbdbddhdhddation.pptx2-tool presenthdbdbdbdbddhdhddation.pptx
2-tool presenthdbdbdbdbddhdhddation.pptx
 
Call Girls Service Mukherjee Nagar @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
Call Girls Service Mukherjee Nagar @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...Call Girls Service Mukherjee Nagar @9999965857 Delhi 🫦 No Advance  VVIP 🍎 SER...
Call Girls Service Mukherjee Nagar @9999965857 Delhi 🫦 No Advance VVIP 🍎 SER...
 
Verified Trusted Call Girls Adugodi💘 9352852248 Good Looking standard Profil...
Verified Trusted Call Girls Adugodi💘 9352852248  Good Looking standard Profil...Verified Trusted Call Girls Adugodi💘 9352852248  Good Looking standard Profil...
Verified Trusted Call Girls Adugodi💘 9352852248 Good Looking standard Profil...
 
call girls in Dakshinpuri (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️
call girls in Dakshinpuri  (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️call girls in Dakshinpuri  (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️
call girls in Dakshinpuri (DELHI) 🔝 >༒9953056974 🔝 genuine Escort Service 🔝✔️✔️
 
call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
call girls in Kaushambi (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝...
 
Peaches App development presentation deck
Peaches App development presentation deckPeaches App development presentation deck
Peaches App development presentation deck
 
B. Smith. (Architectural Portfolio.).pdf
B. Smith. (Architectural Portfolio.).pdfB. Smith. (Architectural Portfolio.).pdf
B. Smith. (Architectural Portfolio.).pdf
 
Pastel Portfolio _ by Slidesgo.pptx. Xxx
Pastel Portfolio _ by Slidesgo.pptx. XxxPastel Portfolio _ by Slidesgo.pptx. Xxx
Pastel Portfolio _ by Slidesgo.pptx. Xxx
 
💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...
💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...
💫✅jodhpur 24×7 BEST GENUINE PERSON LOW PRICE CALL GIRL SERVICE FULL SATISFACT...
 
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
VVIP Pune Call Girls Hadapsar (7001035870) Pune Escorts Nearby with Complete ...
 
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...
Pooja 9892124323, Call girls Services and Mumbai Escort Service Near Hotel Gi...
 
Case Study of Hotel Taj Vivanta, Pune
Case Study of Hotel Taj Vivanta, PuneCase Study of Hotel Taj Vivanta, Pune
Case Study of Hotel Taj Vivanta, Pune
 
AMBER GRAIN EMBROIDERY | Growing folklore elements | Root-based materials, w...
AMBER GRAIN EMBROIDERY | Growing folklore elements |  Root-based materials, w...AMBER GRAIN EMBROIDERY | Growing folklore elements |  Root-based materials, w...
AMBER GRAIN EMBROIDERY | Growing folklore elements | Root-based materials, w...
 
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service  🧵
CALL ON ➥8923113531 🔝Call Girls Kalyanpur Lucknow best Female service 🧵
 
(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...
(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...
(AISHA) Ambegaon Khurd Call Girls Just Call 7001035870 [ Cash on Delivery ] P...
 
Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
Call Girls Basavanagudi Just Call 👗 7737669865 👗 Top Class Call Girl Service ...
 
Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...
Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...
Kala jadu for love marriage | Real amil baba | Famous amil baba | kala jadu n...
 
DragonBall PowerPoint Template for demo.pptx
DragonBall PowerPoint Template for demo.pptxDragonBall PowerPoint Template for demo.pptx
DragonBall PowerPoint Template for demo.pptx
 
Top Rated Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated  Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...Top Rated  Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
Top Rated Pune Call Girls Saswad ⟟ 6297143586 ⟟ Call Me For Genuine Sex Serv...
 
Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Brookefield Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 

19BCE1367_Capstone_Review 2_Final.pdf

  • 1. School of Computer Science and Engineering Register No: 19BCE1367 Deep Neural Network-based Limerick Generation for an Image Name: Divyanshi Thapa Register No: 19BCE1367 Programme and Specialization: B.Tech CSE CAPSTONE PROJECT REVIEW 2 Guide Name: Dr. Praveen Joe I R
  • 2. School of Computer Science and Engineering Register No: 19BCE1367 01 Introduction Outline � � 02 03 05 06 08 07 Problem Statement Research Challenges What to be done next Guide Approval Proposed System Research Paper Status 04 Research Objectives 09 References
  • 3. School of Computer Science and Engineering Register No: 19BCE1367 Introduction 01
  • 4. School of Computer Science and Engineering Register No: 19BCE1367 ● Creative writing using artificial intelligence (AI) is one of the most popular and rapidly growing research fields. It is highly intriguing but also challenging as we go more to the side of generating human-like texts with constraints as we have in poems. ● Among creative writing tasks, paraphrasing and writing stories are easier than writing poetry because poems have many restrictions such as rhyming structures, number of lines, type of language, etc. Introduction
  • 5. School of Computer Science and Engineering Register No: 19BCE1367 ● Several poem frameworks have been developed to assist AI in generating human-like poems to address the issue. Introduction ● Poems in literature can be broadly classified into nine categories depending on their rhyming structure and the number of lines. Among all the nine categories, one of the most challenging tasks is to generate a limerick using artificial intelligence and deep learning as a limerick is a five-lined poem that has a strict rhyming structure of AABBA
  • 6. School of Computer Science and Engineering Register No: 19BCE1367 ● Image captioning has also helped to automatically generate well-formed sentences from a given image which is widely used in many NLP tasks such as VQA. ● Language models based on neural networks have improved the state of the art with regard to predictive language modeling, while topic models are successful at capturing clear-cut, semantic dimensions. ● NLP + DL = a system which can understand and analyze an image and can generate a creative human like poem based of the theme of the image. NLP + Deep learning
  • 7. School of Computer Science and Engineering Register No: 19BCE1367 Problem Statement 02
  • 8. School of Computer Science and Engineering Register No: 19BCE1367 ● For a poem to be meaningful, both linguistic and literary aspects need to be taken into account. ● With the advancement in image captioning, the NLP tasks such as Question Answering has gone to it’s phase 2 that is Visual Question Answering. ● “To create a deep learning model which can create limericks (a form of poem) for the given input image in English language. ” Problem Statement
  • 9. School of Computer Science and Engineering Register No: 19BCE1367 Current approaches of generating rhyming English poetry with a neural network involve constraining output to enforce the condition of rhyme. The generated poem should be: ● According to the context or theme of the given input image ● Error free ● Coherent ● Follows the rhyming structure of the limerick (AABBA) Problem Statement
  • 10. School of Computer Science and Engineering Register No: 19BCE1367 Research Challenges 03
  • 11. School of Computer Science and Engineering Register No: 19BCE1367 1. Mapping the theme of the image with the topic of poem. 2. Both linguistic and literary aspects need to be taken into account so that the poem is meaningful. 3. Syntactic well-formedness and topical coherence throughout the poem. 4. Rhyming constraint (Maintaining rhyming scheme) 5. Certain amount of creativity in literature for making poem interesting. Research Challenges
  • 12. School of Computer Science and Engineering Register No: 19BCE1367 Research Objectives 04
  • 13. School of Computer Science and Engineering Register No: 19BCE1367 1. An attempt to mimic human creative writing by creating a simple framework for image to poem generation for English language. 2. Using a transformer models for better image captioning and limerick generation . 3. A framework to generate poems (limericks) efficiently so that it can be deployed as a public application after the post- processing. 4. Major focus on maintaining the coherency, rhyming structure of limerick and the efficiency of the framework. Research Objectives
  • 14. School of Computer Science and Engineering Register No: 19BCE1367 Proposed System 05
  • 15. School of Computer Science and Engineering Register No: 19BCE1367 ● The goal is also to make a speed-efficient framework and to do so, the transformer models are the choice for image analysis and limerick generation. The features of the image are extracted and the description is generated by the Vision encoder-decoder model which is a combination of a vision transformer as an encoder for image feature extraction and GPT-2 as a decoder for generating human-like captions. ● This caption is treated as the first line of the limerick and is fed to another GPT- 2 model for generating a pool of 20 limericks. ● Best limerick is selected as the final output after post-processing. Proposed System Introduction
  • 16. School of Computer Science and Engineering Register No: 19BCE1367 Proposed System Diagram
  • 17. School of Computer Science and Engineering Register No: 19BCE1367 Module 1 (M1): Image Captioning Module 2 (M2): GPT-2 reverse language modeling Module 3 (M3): Post-processing Module 3.1 (M3.1): Grammar and spelling error detection Module 3.2 (M3.2): BERT based word embeddings Module 4 (M4): Evaluation List of Modules
  • 18. School of Computer Science and Engineering Register No: 19BCE1367 ● The vision encoder-decoder model is used via HuggingFace API which has ViT as its vision encoder model and GPT-2 as the text decoder model It is trained on the popular Common Objects in Context (COCO) dataset which contains more than 120 thousand images with their descriptions. ● The PyTorch version is used for generating the captions for the given input image. M1: Image Captioning
  • 19. School of Computer Science and Engineering Register No: 19BCE1367 Problem: GPT2 is a forward language model as it utilizes the standard left-to-right order of tokens present in a limerick for fine-tuning. This helps in maintaining the subject’s continuity and coherency but it cannot maintain the rhyming structure of the poem. M2: GPT-2 reverse language modeling
  • 20. School of Computer Science and Engineering Register No: 19BCE1367 ● Solution: The GPT-2 model can be fine-tuned with the corpus of reverse order (right to left) of tokens present in the limerick. This technique helps the GPT-2 model to learn the rhyming structure. ● The caption generated from the image caption model is fed into this fine- tuned reverse GPT-2 model as a seed sentence to generate limericks and a pool of 20 limericks is generated M2: GPT-2 reverse language modeling
  • 21. School of Computer Science and Engineering Register No: 19BCE1367 M3.1: Grammar and spelling error detection - The generated limerick should be syntactically correct and in order to do so, an open-source spelling and grammar checker is used to assign scores to each limerick. The limerick with no errors are chosen for further processing. M3: Post-processing
  • 22. School of Computer Science and Engineering Register No: 19BCE1367 ● Bidirectional Encoder Representations from Transformers (BERT) model can be used to generate in-context embeddings. ● The subject continuity is quantified throughout the limerick as the average noun centroid distance in the embedding space[5]. ● If: ○ mean = high, nouns far from the average subject of the limerick. ○ standard deviation = high, many subjects present in the limerick. ● The limericks with lowest mean and standard deviation is selected as final output.. M3: Post-processing M3.2: BERT based word embeddings
  • 23. School of Computer Science and Engineering Register No: 19BCE1367 Automatic evaluation methods : - BLEU (Bilingual Evaluation Understudy ) score - Cosine Similarity - Semantic Similarity (using Sentence BERT) The MultiM-Poem dataset is a collection of 8292 images scraped from the Flikr and each image is mapped to a related human-written poem. The image will be the user input image and the related poem will be the ground truth. M4: Evaluation
  • 24. School of Computer Science and Engineering Register No: 19BCE1367 What to be done next? 06
  • 25. School of Computer Science and Engineering Register No: 19BCE1367 1. Compilation of the results. 2. Research paper completion. What to be done next?
  • 26. School of Computer Science and Engineering Register No: 19BCE1367 Research Paper Status 07
  • 27. School of Computer Science and Engineering Register No: 19BCE1367 1. Abstract. 2. Introduction. 3. Related work. 4. Approach. a. Architecture b. Image captioning c. Language model 5. Experiment. 6. Result 7. Conclusion and Future work. Research Paper Status
  • 28. School of Computer Science and Engineering Register No: 19BCE1367 Guide Approval 08
  • 29. School of Computer Science and Engineering Register No: 19BCE1367 Guide Approval mail screenshot
  • 30. School of Computer Science and Engineering Register No: 19BCE1367 [1] Wang, H., Zhang, Y., & Yu, X. (2020). An overview of image caption generation methods. Computational intelligence and neuroscience, 2020. [2] Van de Cruys, T. (2020, July). Automatic poetry generation from prosaic text. In Proceedings of the 58th annual meeting of the association for computational linguistics (pp. 2471-2480). [3] Beheitt, M. E. G., & Hmida, M. B. H. (2022). Automatic Arabic Poem Generation with GPT-2. In ICAART (2) (pp. 366-374). [4] Liu, D., Guo, Q., Li, W., & Lv, J. (2018, July). A multi-modal chinese poetry generation model. In 2018 International Joint Conference on Neural Networks (IJCNN) (pp. 1-8). IEEE. [5] Lo, K. L., Ariss, R., & Kurz, P. (2022). GPoeT-2: A GPT-2 Based Poem Generator. arXiv preprint arXiv:2205.08847. References (Reference papers)
  • 31. School of Computer Science and Engineering Register No: 19BCE1367 [6] Meyer, J. B. (2019). Generating Free Verse Poetry with Transformer Networks (Doctoral dissertation, Reed College). [7] Talafha, S., & Rekabdar, B. (2019, January). Arabic poem generation with hierarchical recurrent attentional network. In 2019 IEEE 13th International Conference on Semantic Computing (ICSC) (pp. 316-323). IEEE. [8] Gao, L., Fan, K., Song, J., Liu, X., Xu, X., & Shen, H. T. (2019, July). Deliberate attention networks for image captioning. In Proceedings of the AAAI conference on artificial intelligence (Vol. 33, No. 01, pp. 8320-8327). [9] Jhamtani, H., Mehta, S. V., Carbonell, J., & Berg-Kirkpatrick, T. (2019). Learning rhyming constraints using structured adversaries. arXiv preprint arXiv:1909.06743. [10] Lau, J. H., Cohn, T., Baldwin, T., Brooke, J., & Hammond, A. (2018). Deep-speare: A joint neural model of poetic language, meter and rhyme. arXiv preprint arXiv:1807.03491. References (Reference papers)
  • 32. School of Computer Science and Engineering Register No: 19BCE1367 [11] Talafha, S., & Rekabdar, B. (2021, January). Poetry generation model via deep learning incorporating extended phonetic and semantic embeddings. In 2021 IEEE 15th International Conference on Semantic Computing (ICSC) (pp. 48-55). IEEE. [12] Min, K., Dang, M., & Moon, H. (2021). Deep Learning-Based Short Story Generation for an Image Using the Encoder-Decoder Structure. IEEE Access, 9, 113550-113557. [13] Zhang, D., Ni, B., Zhi, Q., Plummer, T., Li, Q., Zheng, H., ... & Wang, D. (2019, August). Through the eyes of a poet: Classical poetry recommendation with visual input on social media. In 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (pp. 333-340). IEEE. [14] Ghazvininejad, M., Shi, X., Priyadarshi, J., & Knight, K. (2017, July). Hafez: an interactive poetry generation system. In Proceedings of ACL 2017, System Demonstrations (pp. 43-48). [15] Liu, Z., Fu, Z., Cao, J., de Melo, G., Tam, Y. C., Niu, C., & Zhou, J. (2019, July). Rhetorically controlled encoder-decoder for modern chinese poetry generation. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp. 1992-2001). References (Reference papers)
  • 33. School of Computer Science and Engineering Register No: 19BCE1367 1. https://scottmduda.medium.com/generating-an-edgar-allen-poe-styled- poem-using-gpt-2-289801ded82c 2. https://timesofindia.indiatimes.com/readersblog/newtech/artificial- intelligence-in-education-39512/ 3. https://news.climate.columbia.edu/2022/04/22/haiku-ai-generated-poetry/ 4. https://towardsdatascience.com/transformers-89034557de14 5. https://github.com/minimaxir/gpt-2-simple 6. https://languagetool.org/ References (Websites and articles)
  • 34. School of Computer Science and Engineering Register No: 19BCE1367 THANK YOU