FARAH DIYANA BINTI AHMAD JEFIRUDDIN
DEFINITION
The study of language as expressed in samples (corpora) of
"real world" text.
HISTORY
HENRY KUČERA AND W. NELSON FRANCIS
- published Computational Analysis of Present-Day American
English (1967)
- the work contains a variety of computational analyses, combining
elements of linguistics, language teaching, psychology,
statistics, and sociology
RANDOLPH QUIRK
- published "Towards a Description of English Usage" (1960), in
which he introduced the Survey of English Usage
HOUGHTON-MIFFLIN
- published the American Heritage Dictionary (AHD), the first
dictionary to be compiled using corpus linguistics
- approached Kučera to supply a million-word, three-line citation
base for the dictionary
- the AHD combines prescriptive elements with
descriptive information
COLLINS
- published the COBUILD monolingual learner's dictionary
- designed for users learning English as a foreign
language and compiled using the Bank of English
SURVEY OF ENGLISH USAGE CORPUS
- used in the development of A Comprehensive Grammar of
the English Language
MONTREAL FRENCH PROJECT
- the first computerized corpus of transcribed spoken
language
- contains one million words
ANDERSEN-FORBES
- a computerized corpus: a database of the Hebrew Bible
- every clause is parsed using graphs representing
seven levels of syntax, and each segment is tagged
with seven fields of information
THE QURANIC ARABIC CORPUS
- an annotated corpus for the Classical Arabic
language of the Quran
- a recent project with multiple layers of annotation,
including morphological segmentation, part-of-speech
tagging, and syntactic analysis using dependency grammar
METHODS
1) Annotation
2) Abstraction
3) Analysis
1) ANNOTATION
Annotation consists of the application of a scheme to
texts.
Annotations may include structural mark-up, part-of-speech
tagging, parsing, and numerous other representations.
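To make the annotation step concrete, here is a minimal sketch in Python that applies a tagging scheme to a text. The lexicon, the tag names, and the functions `tokenize` and `annotate` are all hypothetical and chosen only for illustration; real projects use far richer schemes and trained taggers.

```python
import re

# Hypothetical toy scheme: a tiny hand-made lexicon, not a real tagset.
LEXICON = {
    "the": "DET",
    "a": "DET",
    "corpus": "NOUN",
    "linguist": "NOUN",
    "text": "NOUN",
    "annotates": "VERB",
    "reads": "VERB",
}


def tokenize(text):
    """Split text into lowercase word tokens (letters and apostrophes only)."""
    return re.findall(r"[a-z']+", text.lower())


def annotate(text, lexicon=LEXICON, default="UNK"):
    """Apply the scheme: pair every token with its tag, or `default` if unlisted."""
    return [(tok, lexicon.get(tok, default)) for tok in tokenize(text)]


tagged = annotate("The linguist annotates a corpus.")
print(tagged)
```

Words missing from the lexicon receive the placeholder tag `UNK`, which makes gaps in the scheme visible instead of hiding them.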
2) ABSTRACTION
Abstraction consists of the translation (mapping) of
terms in the scheme to terms in a theoretically
motivated model or dataset.
It typically includes linguist-directed search but may
also include, e.g., rule-learning for parsers.
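As a rough illustration of abstraction, the sketch below maps tags from an annotation scheme onto the categories of a simpler model. The mapping (content vs. function words), the tag names, and the function `abstract` are assumptions made for this example only, not part of any particular corpus project.

```python
# Hypothetical mapping from annotation-scheme tags to the categories of a
# simpler, theory-driven model (content words vs. function words).
SCHEME_TO_MODEL = {
    "NOUN": "content",
    "VERB": "content",
    "ADJ": "content",
    "DET": "function",
    "PREP": "function",
}


def abstract(tagged_tokens, mapping=SCHEME_TO_MODEL, default="other"):
    """Translate each (token, tag) pair into a (token, model_category) record."""
    return [(tok, mapping.get(tag, default)) for tok, tag in tagged_tokens]


# Hand-written annotated input standing in for the output of an annotation step.
tagged = [
    ("the", "DET"),
    ("linguist", "NOUN"),
    ("annotates", "VERB"),
    ("a", "DET"),
    ("corpus", "NOUN"),
    ("quickly", "ADV"),
]
dataset = abstract(tagged)
print(dataset)
```

Tags with no counterpart in the model (here `ADV`) fall into the category `other`, so the resulting dataset stays complete even when the mapping is partial.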
3) ANALYSIS
Analysis consists of statistically probing, manipulating
and generalising from the dataset.
It might include statistical evaluations, optimisation of
rule-bases or knowledge discovery methods.
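One of the simplest forms of statistical probing is counting. The sketch below computes raw and relative word frequencies for a small sample; the eight-token sample and the function `relative_frequencies` are hypothetical, and a real analysis would run over a full dataset and usually add significance testing.

```python
from collections import Counter


def relative_frequencies(tokens):
    """Return each token's share of the sample (count / total tokens)."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()}


# Hypothetical eight-token sample standing in for an abstracted dataset.
sample = ["the", "corpus", "the", "text", "a", "corpus", "the", "linguist"]

freqs = relative_frequencies(sample)
top = Counter(sample).most_common(2)  # the two most frequent tokens

print(top)
print(freqs["the"])
```

Relative rather than raw frequencies are used so that samples of different sizes can be compared on the same scale.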
Corpus linguistic
