SlideShare una empresa de Scribd logo
1 de 14
Plangsarn: Thai
Romanization Library
                       Application

  Srichan Chancheewa, THAMMASAT UNIVERSITYDr.
Choochart Haruechaiyasak, NECTEC, THAILAND
Why do we need this application?
before                   after
Background and Motivation
• Romanization is the representation of a written word or
  spoken speech with the Roman (Latin) script.
• Two main methods of romanization are
   (1) Transliteration: for representing written text,
   (2) Transcription: for representing the spoken word,
       (2.1) phonemic transcription: records the
       phonemes or units of semantic meaning in speech
       (2.2) phonetic transcription: records speech sounds
       with precision.
Background and Motivation
• Phonetic transcription attempts to depict all phones in the
  source language.
• We adopt the International Phonetic Alphabet (IPA).
• IPA is an alphabetic system of phonetic notation based
  primarily on the Latin alphabet.
• IPA was devised by the International Phonetic Association as
  a standardized representation of the sounds of oral
  language.
• IPA symbols are composed of one or more elements of two
  basic types, letters and diacritics.
Background and Motivation
• For library cataloging, there is a guideline for preparing
  romanization:
   –ALA-LC Romanization Tables: Transliteration
    Schemes for Non-Roman Scripts, which is approved
    by the Library of Congress and the American Library
    Association.
   –ALA-LC is a set of standards for romanization, or
    the representation of text in other writing systems
    using the Latin alphabet.
Objective
• Design and implement a system for automatically
  romanizing Thai texts with the focus on bibliographic
  data.
• The system is intended to support librarians who
  perform the cataloguing task.
• The system is called Plangsarn (          ).
   –plang (      ) = transform
   –sarn (     ) = text, message
Plangsarn’s System Architecture




• Lexicon contains Thai word list with romanization.
• G2P (Grapheme-to-Phoneme) converts grapheme unit into phoneme unit.
• Tokenizer performs the word segmentation on Thai written texts.
Plangsarn’s process flow
1. Get Thai text input from the user via browser
   interface.
2. Perform word segmentation on the input text.
3. For each word token, look up in the lexicon for
   existing romanized representation.
4. If the romanized representation already exists, then
   use it. Otherwise send the word to G2P module.
5. Concatenate all romanized representations and
   send the final result to the user via the browser
   interface.
Example of Plangsarn process
Input Thai text:



Tokenized text:


Romanized text:     kāndoēnthāng pai mahāwitthayālai
Plangsarn’s Web interface:
http://164.115.23.167/plangsarn/
User Evaluation
• Plangsarn is under the process of user testing and evaluation.
• Two main reported problems so far are
   – Incorrect word segmented unit.
       •         (Kitchen) => hǭng khrūa (should be hǭngkhrūa)

   – Unable to recognize and capitalize named entities such as the names
     of person, organization, place
      •         (Bangkok) => krung thēp (should be Krungthēp)

• To solve the problem, the system allows authorized users to manually
  include words with its correct romanization into the lexicon.
Plangsarn’s Summarized Features
• Automatic romanization of Thai text using the G2P
  (Grapheme-to-Phoneme) technique
• Use the lexicon-based Thai word segmentation to tokenize
  the text
• Allow authorized users to manually add and edit the list of
  romanized lexicon
• Support both IPA and OCLC Connexion
• Automatic capitalization for named entities (e.g., names of
  person, organization, place)
• Provide the machine translation of Thai to English
PLANGSARN IS NOW AVAILABLE WITHIN
 THAI GOVERNMENT CLOUD SERVICE
 HTTP://164.115.23.167/PLANGSARN/
SPECIAL THANKS TO
                 Dr. Choochart Haruechaiyasak
                       Senior Researcher
        Speech and Audio Technology Laboratory (SPT)
National Electronics and Computer Technology Center (NECTEC)
          112 Phahonyothin Rd., Klong 1, Klong Luang,
                       Pathumthani 12120
                           THAILAND
email: choochart.haruechaiyasak@nectec.or.th
               Tel. (+66) 2 564-6900 Ext. 2240
                     Fax. (+66) 2 564-6873

Más contenido relacionado

Similar a Plangsarn: Thai Romanization Library Application - Srichan Chincheewa

Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
Abdullah al Mamun
 
Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation Outputs
Parisa Niksefat
 
Machine transliteration survey
Machine transliteration surveyMachine transliteration survey
Machine transliteration survey
unyil96
 
Onward presentation.en
Onward presentation.enOnward presentation.en
Onward presentation.en
ClarkTony
 

Similar a Plangsarn: Thai Romanization Library Application - Srichan Chincheewa (20)

Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
Speech-Recognition.pptx
Speech-Recognition.pptxSpeech-Recognition.pptx
Speech-Recognition.pptx
 
ReseachPaper
ReseachPaperReseachPaper
ReseachPaper
 
Acceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation SystemAcceptance Testing Of A Spoken Language Translation System
Acceptance Testing Of A Spoken Language Translation System
 
Ch 2 Names scopes and bindings.pptx
Ch 2 Names scopes and bindings.pptxCh 2 Names scopes and bindings.pptx
Ch 2 Names scopes and bindings.pptx
 
Multi lingual corpus for machine aided translation
Multi lingual corpus for machine aided translationMulti lingual corpus for machine aided translation
Multi lingual corpus for machine aided translation
 
Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation Outputs
 
**JUNK** (no subject)
**JUNK** (no subject)**JUNK** (no subject)
**JUNK** (no subject)
 
International journal on natural language computing(ijnlc)
International journal on natural language computing(ijnlc)International journal on natural language computing(ijnlc)
International journal on natural language computing(ijnlc)
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Machine transliteration survey
Machine transliteration surveyMachine transliteration survey
Machine transliteration survey
 
An expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabicAn expert system for automatic reading of a text written in standard arabic
An expert system for automatic reading of a text written in standard arabic
 
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
 
Implementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large DictionaryImplementation of Marathi Language Speech Databases for Large Dictionary
Implementation of Marathi Language Speech Databases for Large Dictionary
 
Ijetcas14 444
Ijetcas14 444Ijetcas14 444
Ijetcas14 444
 
AMHARIC TEXT TO SPEECH SYNTHESIS FOR SYSTEM DEVELOPMENT
AMHARIC TEXT TO SPEECH SYNTHESIS FOR SYSTEM DEVELOPMENTAMHARIC TEXT TO SPEECH SYNTHESIS FOR SYSTEM DEVELOPMENT
AMHARIC TEXT TO SPEECH SYNTHESIS FOR SYSTEM DEVELOPMENT
 
Onward presentation.en
Onward presentation.enOnward presentation.en
Onward presentation.en
 
4. processor
4. processor4. processor
4. processor
 
What is machine translation
What is machine translationWhat is machine translation
What is machine translation
 
I1 geetha3 revathi
I1 geetha3 revathiI1 geetha3 revathi
I1 geetha3 revathi
 

Más de tulipbiru64

Más de tulipbiru64 (20)

Towards a Structured Information Security Awareness Programme
Towards a Structured Information Security Awareness ProgrammeTowards a Structured Information Security Awareness Programme
Towards a Structured Information Security Awareness Programme
 
Towards The Curated web
Towards The Curated webTowards The Curated web
Towards The Curated web
 
Kajian Kepuasan Pengguna Terhadap Kualiti Perkhidmatan Ruang Pembelajaran di ...
Kajian Kepuasan Pengguna Terhadap Kualiti Perkhidmatan Ruang Pembelajaran di ...Kajian Kepuasan Pengguna Terhadap Kualiti Perkhidmatan Ruang Pembelajaran di ...
Kajian Kepuasan Pengguna Terhadap Kualiti Perkhidmatan Ruang Pembelajaran di ...
 
Multi-factor Information Security Risk in Information System
Multi-factor Information Security Risk in Information SystemMulti-factor Information Security Risk in Information System
Multi-factor Information Security Risk in Information System
 
Informative Centers' Intelligent Agent Based Model - a preliminary study
Informative Centers' Intelligent Agent Based Model - a preliminary studyInformative Centers' Intelligent Agent Based Model - a preliminary study
Informative Centers' Intelligent Agent Based Model - a preliminary study
 
Research Data Management: Our Role
Research Data Management: Our RoleResearch Data Management: Our Role
Research Data Management: Our Role
 
Transforming The Academic Library Services For Generation Y Using Knowledge M...
Transforming The Academic Library Services For Generation Y Using Knowledge M...Transforming The Academic Library Services For Generation Y Using Knowledge M...
Transforming The Academic Library Services For Generation Y Using Knowledge M...
 
Repositori Institusi Isu Dan Cabaran: Kajian Kes Perpustakaan Universiti Tekn...
Repositori Institusi Isu Dan Cabaran: Kajian Kes Perpustakaan Universiti Tekn...Repositori Institusi Isu Dan Cabaran: Kajian Kes Perpustakaan Universiti Tekn...
Repositori Institusi Isu Dan Cabaran: Kajian Kes Perpustakaan Universiti Tekn...
 
Social Tagging/Bookmarking Application: The Usage In Academic Libraries
Social Tagging/Bookmarking Application: The Usage In Academic LibrariesSocial Tagging/Bookmarking Application: The Usage In Academic Libraries
Social Tagging/Bookmarking Application: The Usage In Academic Libraries
 
e-Books: Putting Librarians And Researchers 'In The Know'
e-Books: Putting Librarians And Researchers 'In The Know'e-Books: Putting Librarians And Researchers 'In The Know'
e-Books: Putting Librarians And Researchers 'In The Know'
 
Buku Masa Depan
Buku Masa DepanBuku Masa Depan
Buku Masa Depan
 
Measurement Of Values And Performance For The Institutions Of Higher Educatio...
Measurement Of Values And Performance For The Institutions Of Higher Educatio...Measurement Of Values And Performance For The Institutions Of Higher Educatio...
Measurement Of Values And Performance For The Institutions Of Higher Educatio...
 
Keynote Speech by YBhg. Profesor Datuk Dr. Shahrin Sahib, Vice Chancellor UTeM
Keynote Speech by YBhg. Profesor Datuk Dr. Shahrin Sahib, Vice Chancellor UTeMKeynote Speech by YBhg. Profesor Datuk Dr. Shahrin Sahib, Vice Chancellor UTeM
Keynote Speech by YBhg. Profesor Datuk Dr. Shahrin Sahib, Vice Chancellor UTeM
 
Repository : A Brief Comparative Study Between The National University Of Mal...
Repository : A Brief Comparative Study Between The National University Of Mal...Repository : A Brief Comparative Study Between The National University Of Mal...
Repository : A Brief Comparative Study Between The National University Of Mal...
 
Mobile OPAC Prototype Based On Koha Open Source Integrated Library System
Mobile OPAC Prototype Based On Koha Open Source Integrated Library SystemMobile OPAC Prototype Based On Koha Open Source Integrated Library System
Mobile OPAC Prototype Based On Koha Open Source Integrated Library System
 
Kajian Tinjauan Tanggapan Pengguna Terhadap Profesion Pustakawan Dalam Kalang...
Kajian Tinjauan Tanggapan Pengguna Terhadap Profesion Pustakawan Dalam Kalang...Kajian Tinjauan Tanggapan Pengguna Terhadap Profesion Pustakawan Dalam Kalang...
Kajian Tinjauan Tanggapan Pengguna Terhadap Profesion Pustakawan Dalam Kalang...
 
Corporate Social Responsibility (CSR) And Library Collaborative Partnership
Corporate Social Responsibility (CSR) And Library Collaborative PartnershipCorporate Social Responsibility (CSR) And Library Collaborative Partnership
Corporate Social Responsibility (CSR) And Library Collaborative Partnership
 
The Effectiveness Of Searching Arabic Resources Through OPAC : A Case Study I...
The Effectiveness Of Searching Arabic Resources Through OPAC : A Case Study I...The Effectiveness Of Searching Arabic Resources Through OPAC : A Case Study I...
The Effectiveness Of Searching Arabic Resources Through OPAC : A Case Study I...
 
Library Usage Among Medical Students In The Faculty Of Medicine And Health Sc...
Library Usage Among Medical Students In The Faculty Of Medicine And Health Sc...Library Usage Among Medical Students In The Faculty Of Medicine And Health Sc...
Library Usage Among Medical Students In The Faculty Of Medicine And Health Sc...
 
Kajian Kepuasan Pelanggan Di Perpustakaan UTHM Dalam Meningkatkan Kualiti Per...
Kajian Kepuasan Pelanggan Di Perpustakaan UTHM Dalam Meningkatkan Kualiti Per...Kajian Kepuasan Pelanggan Di Perpustakaan UTHM Dalam Meningkatkan Kualiti Per...
Kajian Kepuasan Pelanggan Di Perpustakaan UTHM Dalam Meningkatkan Kualiti Per...
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 

Plangsarn: Thai Romanization Library Application - Srichan Chincheewa

  • 1. Plangsarn: Thai Romanization Library Application Srichan Chancheewa, THAMMASAT UNIVERSITYDr. Choochart Haruechaiyasak, NECTEC, THAILAND
  • 2. Why do we need this application? before after
  • 3. Background and Motivation • Romanization is the representation of a written word or spoken speech with the Roman (Latin) script. • Two main methods of romanization are (1) Transliteration: for representing written text, (2) Transcription: for representing the spoken word, (2.1) phonemic transcription: records the phonemes or units of semantic meaning in speech (2.2) phonetic transcription: records speech sounds with precision.
  • 4. Background and Motivation • Phonetic transcription attempts to depict all phones in the source language. • We adopt the International Phonetic Alphabet (IPA). • IPA is an alphabetic system of phonetic notation based primarily on the Latin alphabet. • IPA was devised by the International Phonetic Association as a standardized representation of the sounds of oral language. • IPA symbols are composed of one or more elements of two basic types, letters and diacritics.
  • 5. Background and Motivation • For library cataloging, there is a guideline for preparing romanization: –ALA-LC Romanization Tables: Transliteration Schemes for Non-Roman Scripts, which is approved by the Library of Congress and the American Library Association. –ALA-LC is a set of standards for romanization, or the representation of text in other writing systems using the Latin alphabet.
  • 6. Objective • Design and implement a system for automatically romanizing Thai texts with the focus on bibliographic data. • The system is intended to support librarians who perform the cataloguing task. • The system is called Plangsarn ( ). –plang ( ) = transform –sarn ( ) = text, message
  • 7. Plangsarn’s System Architecture • Lexicon contains Thai word list with romanization. • G2P (Grapheme-to-Phoneme) converts grapheme unit into phoneme unit. • Tokenizer performs the word segmentation on Thai written texts.
  • 8. Plangsarn’s process flow 1. Get Thai text input from the user via browser interface. 2. Perform word segmentation on the input text. 3. For each word token, look up in the lexicon for existing romanized representation. 4. If the romanized representation already exists, then use it. Otherwise send the word to G2P module. 5. Concatenate all romanized representations and send the final result to the user via the browser interface.
  • 9. Example of Plangsarn process Input Thai text: Tokenized text: Romanized text: kāndoēnthāng pai mahāwitthayālai
  • 11. User Evaluation • Plangsarn is under the process of user testing and evaluation. • Two main reported problems so far are – Incorrect word segmented unit. • (Kitchen) => hǭng khrūa (should be hǭngkhrūa) – Unable to recognize and capitalize named entities such as the names of person, organization, place • (Bangkok) => krung thēp (should be Krungthēp) • To solve the problem, the system allows authorized users to manually include words with its correct romanization into the lexicon.
  • 12. Plangsarn’s Summarized Features • Automatic romanization of Thai text using the G2P (Grapheme-to-Phoneme) technique • Use the lexicon-based Thai word segmentation to tokenize the text • Allow authorized users to manually add and edit the list of romanized lexicon • Support both IPA and OCLC Connexion • Automatic capitalization for named entities (e.g., names of person, organization, place) • Provide the machine translation of Thai to English
  • 13. PLANGSARN IS NOW AVAILABLE WITHIN THAI GOVERNMENT CLOUD SERVICE HTTP://164.115.23.167/PLANGSARN/
  • 14. SPECIAL THANKS TO Dr. Choochart Haruechaiyasak Senior Researcher Speech and Audio Technology Laboratory (SPT) National Electronics and Computer Technology Center (NECTEC) 112 Phahonyothin Rd., Klong 1, Klong Luang, Pathumthani 12120 THAILAND email: choochart.haruechaiyasak@nectec.or.th Tel. (+66) 2 564-6900 Ext. 2240 Fax. (+66) 2 564-6873

Notas del editor

  1. In this paper, we adopt the approach of phonetic transcription.
  2. Plangsarn is designed based on the phonetic transcription approach by using the IPA and ALA-LC guideline.
  3. การเดินทางไปมหาวิทยาลัย = travelling to the university
  4. การเดินทางไปมหาวิทยาลัย = a trip to a university