SlideShare a Scribd company logo
1 of 37
1




   Writing with Open
   Tools
             (Part One)




09/11/2011      http://www.flickr.com/photos/mikekline/265954619/   Alannah Fitzgerald
2   Overview (part one)
    Introducing Corpus Linguistics
    Lexical knowledge: collocations, derivatives,
    register
    The Flexible Language Acquisition Project
    (FLAX)
    The British National Corpus (BNC)
    The Lextutor
    The Academic Wordlist (AWL)
    EAP practice resources
Intro to corpus linguistics
Let‟s start with three questions about English:

1.    What is the meaning of goalless?
2.    How is the word shall used in present-day British
      English? Think of some examples.
3.    Which is more commonly expressed in everyday
      English?
     a.   “I was a little disappointed…”
     b.   “I was very disappointed…”

     Adapted from Hoffmann et al., 2008
British National Corpus

http://www.natcorp.ox.ac.uk/
Focus on representation
The British National Corpus (BNC)
100 million-word static corpus 1978-1992
  Spoken (10%); Written (90%); Domain representation
BNCweb concordancer – free download

        http://bncweb.info/
BNC header information
http://flax.nzdl.org/greenstone3/flax
?a=fp&sa=home
Focus on automation
The Flexible Language Acquisition Project
(FLAX)
Web n-gram corpora generated and supplied by 2006
Google web dump
  500,000 words and 380 million five-grams
  GALL - Google Assisted Language Learning
    (Chinnery, 2008; Shei, 2008)
„Goalless‟ keyword search in FLAX
     http://flax2.nzdl.org/greenstone3/flax?
Distribution of shall I/we in the spoken component
                     of the BNC
Distribution of I/we shall in the spoken component
                     of the BNC
FLAX - Samples retrieved for I was a little
disappointed
BNC - Samples retrieved for I was a little
disappointed
BNC – Samples retrieved for I was very
disappointed
 FLAX Web Collocations Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
FLAX vs BNC?

•   Limitations with representativeness
     Identifyingregister on the Web is difficult
     Successful corpora are based on
      domains, genres, collections of document types
     The web is a “dirty corpus” Kilgariff & Grefenstette
      (2003, p. 342)

   FLAX cleaned by 30% using BNC wordlist
     Linked   externally to BNC, Yahoo
       Complementary   sources, both with limitations
Google‟s terms of services
“You agree not to access (or attempt to access)
any of the Services by any means other than
through the interface that is provided by
Google, unless you have been specifically
allowed to do so in a separate agreement with
Google.”

http:www.google.com/accounts/TOS Clause 5.3
Typical lexical errors
18



                                        telling
     a. He‟s very humorous. He‟s always doing
        jokes.                                collocation

           conversed

     b. We conversated for almost word families / derivatives
                                  one hour.
                                                  without delay

     c. …and compromise, the issue was resolved in
                                               register
     a jiffy.
http://flax.nzdl.org/greenstone3/flax
?a=fp&sa=home
OSS Mozilla




          http://www.flickr.com/photos/hindrik/2586245939/
21   FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
Noticing Text Types – Issues of Register and
                  Genre




     FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
22
FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
23
Web Pronouns Phrases OER
24




     http://www.youtube.com/watch?v=Ns4nXsZ
Kibbitzers (Tim John‟s EAP
25
     pages)




http://www.lexically.net/TimJohns/Kibbitzer/timeap3.htm
Web Collocations (fact vs idea)
26




http://flax2.nzdl.org/greenstone3/flax?a=g&rt=r&sa=CollocationSearch&s=CollocationTypes&s1.wordClass=n&c=c
                                         ollodb&s1.query=&s1.multiple=on
Web Collocations (fact vs idea)
27
Compleat Lexical Tutor (Tom
Cobb)




       http://www.lextutor.ca/
Web Collocations OER
     http://www.lextutor.ca/vp/
29




     http://www.youtube.com/watch?v=iyZgZhHM
AWL Exercises (Nottingham)
30




http://www.nottingham.ac.uk/~alzsh3/acvocab/index.htm
UEFAP (Andy Gillett)
31




        http://www.uefap.com/index.htm
Specific EAP vocab (UEFAP)
32




                     http://www.uefap.com/vocab/vocfram.htm
FLAX User guides & demos




  FLAX Web Collocations & Phrases Excercises (by Shaoqun Wu http://www.cs.waikato.ac.nz/~shaoqun/tmp/instruction.html)
Speaking & Listening OER for
EAP




       http://openspires.oucs.ox.ac.uk/crunch/
Web Phrases OER
35




         http://www.youtube.com/watch?v=n67FBqBFm6I
36   FLAX Web Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
37   Preparation (part two)
     •   Samples of your own writing – soft copy
     •   Build your own corpus – collect ten
         academic articles in your discipline
     •   Writing analysis tools
     •   Specific academic word lists

More Related Content

Similar to Intro to corpus linguistics tools for EAP

ICT Tools for Teaching Vocabulary
ICT Tools for Teaching VocabularyICT Tools for Teaching Vocabulary
ICT Tools for Teaching VocabularyNatalia Katasonova
 
Flexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual WorldFlexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual WorldAlannah Fitzgerald
 
Free Software Presentation Dkg
Free Software Presentation DkgFree Software Presentation Dkg
Free Software Presentation Dkglightybug
 
Improving Flickr discovery through Wikipedias
Improving Flickr discovery through WikipediasImproving Flickr discovery through Wikipedias
Improving Flickr discovery through WikipediasFederico Gobbo
 
Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Tobias Wunner
 
Wreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionWreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionStephen Marquard
 
Arabic morphology and POS-tagging
Arabic morphology and POS-taggingArabic morphology and POS-tagging
Arabic morphology and POS-taggingbutest
 
CS4200 2019 Lecture 1: Introduction
CS4200 2019 Lecture 1: IntroductionCS4200 2019 Lecture 1: Introduction
CS4200 2019 Lecture 1: IntroductionEelco Visser
 
Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Alannah Fitzgerald
 
Blending in the Open
Blending in the OpenBlending in the Open
Blending in the Openbbridges51
 
Corpora in language teaching
Corpora in language teachingCorpora in language teaching
Corpora in language teachingJonathan Smart
 
Technology for Open Education - Training with Open E-resources (TOETOE) in La...
Technology for Open Education - Training with Open E-resources (TOETOE) in La...Technology for Open Education - Training with Open E-resources (TOETOE) in La...
Technology for Open Education - Training with Open E-resources (TOETOE) in La...Alannah Fitzgerald
 
What you Can Make Out of Linked Data
What you Can Make Out of Linked DataWhat you Can Make Out of Linked Data
What you Can Make Out of Linked DataMarco Fossati
 
The Great Beyond with Open English Language Resources
The Great Beyond with Open English Language ResourcesThe Great Beyond with Open English Language Resources
The Great Beyond with Open English Language ResourcesAlannah Fitzgerald
 
GSoC: How to get prepared and write a good proposal (or how to start contribu...
GSoC: How to get prepared and write a good proposal (or how to start contribu...GSoC: How to get prepared and write a good proposal (or how to start contribu...
GSoC: How to get prepared and write a good proposal (or how to start contribu...João Paulo Rechi Vita
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...Tobias Kuhn
 
The TOETOE project - SCORE final presentation
The TOETOE project - SCORE final presentationThe TOETOE project - SCORE final presentation
The TOETOE project - SCORE final presentationAlannah Fitzgerald
 

Similar to Intro to corpus linguistics tools for EAP (20)

ICT Tools for Teaching Vocabulary
ICT Tools for Teaching VocabularyICT Tools for Teaching Vocabulary
ICT Tools for Teaching Vocabulary
 
E tools
E toolsE tools
E tools
 
Flexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual WorldFlexible Open Language Education for a MultiLingual World
Flexible Open Language Education for a MultiLingual World
 
Free Software Presentation Dkg
Free Software Presentation DkgFree Software Presentation Dkg
Free Software Presentation Dkg
 
Improving Flickr discovery through Wikipedias
Improving Flickr discovery through WikipediasImproving Flickr discovery through Wikipedias
Improving Flickr discovery through Wikipedias
 
Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1Enriching the semantic web tutorial session 1
Enriching the semantic web tutorial session 1
 
Wreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognitionWreck a nice beach: adventures in speech recognition
Wreck a nice beach: adventures in speech recognition
 
Arabic morphology and POS-tagging
Arabic morphology and POS-taggingArabic morphology and POS-tagging
Arabic morphology and POS-tagging
 
CS4200 2019 Lecture 1: Introduction
CS4200 2019 Lecture 1: IntroductionCS4200 2019 Lecture 1: Introduction
CS4200 2019 Lecture 1: Introduction
 
Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...Open English Language Resources and Practices for Professional and Academic S...
Open English Language Resources and Practices for Professional and Academic S...
 
Blending in the Open
Blending in the OpenBlending in the Open
Blending in the Open
 
Corpora in language teaching
Corpora in language teachingCorpora in language teaching
Corpora in language teaching
 
Technology for Open Education - Training with Open E-resources (TOETOE) in La...
Technology for Open Education - Training with Open E-resources (TOETOE) in La...Technology for Open Education - Training with Open E-resources (TOETOE) in La...
Technology for Open Education - Training with Open E-resources (TOETOE) in La...
 
What you Can Make Out of Linked Data
What you Can Make Out of Linked DataWhat you Can Make Out of Linked Data
What you Can Make Out of Linked Data
 
The Great Beyond with Open English Language Resources
The Great Beyond with Open English Language ResourcesThe Great Beyond with Open English Language Resources
The Great Beyond with Open English Language Resources
 
GSoC: How to get prepared and write a good proposal (or how to start contribu...
GSoC: How to get prepared and write a good proposal (or how to start contribu...GSoC: How to get prepared and write a good proposal (or how to start contribu...
GSoC: How to get prepared and write a good proposal (or how to start contribu...
 
Procedural programming
Procedural programmingProcedural programming
Procedural programming
 
SECCLL 2010
SECCLL 2010SECCLL 2010
SECCLL 2010
 
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
A Multilingual Semantic Wiki based on Attempto Controlled English and Grammat...
 
The TOETOE project - SCORE final presentation
The TOETOE project - SCORE final presentationThe TOETOE project - SCORE final presentation
The TOETOE project - SCORE final presentation
 

More from Alannah Fitzgerald

F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
F-Lingo: Integrating lexical feature identification into MOOC platforms for l...F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
F-Lingo: Integrating lexical feature identification into MOOC platforms for l...Alannah Fitzgerald
 
F-Lingo & FLAX: Automated open data-driven language learning in MOOCs
F-Lingo & FLAX: Automated open data-driven language learning in MOOCsF-Lingo & FLAX: Automated open data-driven language learning in MOOCs
F-Lingo & FLAX: Automated open data-driven language learning in MOOCsAlannah Fitzgerald
 
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...Alannah Fitzgerald
 
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...Alannah Fitzgerald
 
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)Alannah Fitzgerald
 
From clarion calls to auto-complete errors: a nascent discourse on openness ...
From clarion calls to auto-complete errors: a nascent discourse on openness ...From clarion calls to auto-complete errors: a nascent discourse on openness ...
From clarion calls to auto-complete errors: a nascent discourse on openness ...Alannah Fitzgerald
 
Converging cultures of open in language resources development
Converging cultures of open in language resources developmentConverging cultures of open in language resources development
Converging cultures of open in language resources developmentAlannah Fitzgerald
 
Developing Open Access Content into Academic English Resources for Data-Drive...
Developing Open Access Content into Academic English Resources for Data-Drive...Developing Open Access Content into Academic English Resources for Data-Drive...
Developing Open Access Content into Academic English Resources for Data-Drive...Alannah Fitzgerald
 
When a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creatorsWhen a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creatorsAlannah Fitzgerald
 
Serendipitous Innovation with Academic English Resources
Serendipitous Innovation with Academic English ResourcesSerendipitous Innovation with Academic English Resources
Serendipitous Innovation with Academic English ResourcesAlannah Fitzgerald
 
Bridging Formal and Informal Learning for Second Language Writing in FLAX
Bridging Formal and Informal Learning for Second Language Writing in FLAX Bridging Formal and Informal Learning for Second Language Writing in FLAX
Bridging Formal and Informal Learning for Second Language Writing in FLAX Alannah Fitzgerald
 
Setting a Precedent with Open Resources Development in English for Specific A...
Setting a Precedent with Open Resources Development in English for Specific A...Setting a Precedent with Open Resources Development in English for Specific A...
Setting a Precedent with Open Resources Development in English for Specific A...Alannah Fitzgerald
 
The Open-Source FLAX Language System
The Open-Source FLAX Language System The Open-Source FLAX Language System
The Open-Source FLAX Language System Alannah Fitzgerald
 
FLAX: Flexible Language Acquisition with Open Data-Driven Learning
FLAX: Flexible Language Acquisition with Open Data-Driven LearningFLAX: Flexible Language Acquisition with Open Data-Driven Learning
FLAX: Flexible Language Acquisition with Open Data-Driven LearningAlannah Fitzgerald
 
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Alannah Fitzgerald
 
Sharing an Open Methodology for Building Domain-specific Corpora for EAP
Sharing an Open Methodology for Building Domain-specific Corpora for EAP Sharing an Open Methodology for Building Domain-specific Corpora for EAP
Sharing an Open Methodology for Building Domain-specific Corpora for EAP Alannah Fitzgerald
 
Resources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic EnglishResources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic EnglishAlannah Fitzgerald
 
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...Alannah Fitzgerald
 
Designing Open Linguistic Support
Designing Open Linguistic SupportDesigning Open Linguistic Support
Designing Open Linguistic SupportAlannah Fitzgerald
 

More from Alannah Fitzgerald (20)

F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
F-Lingo: Integrating lexical feature identification into MOOC platforms for l...F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
F-Lingo: Integrating lexical feature identification into MOOC platforms for l...
 
F-Lingo & FLAX: Automated open data-driven language learning in MOOCs
F-Lingo & FLAX: Automated open data-driven language learning in MOOCsF-Lingo & FLAX: Automated open data-driven language learning in MOOCs
F-Lingo & FLAX: Automated open data-driven language learning in MOOCs
 
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
The PhD Abstracts Collections in FLAX: Academic English with the Open Access ...
 
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
EThOS for EAP: The PhD Abstracts Collections in FLAX with the British Library...
 
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
Flexible, Free and Open Data-Driven Learning for the Masses (MOOCs)
 
EThOS for Academic English
EThOS for Academic EnglishEThOS for Academic English
EThOS for Academic English
 
From clarion calls to auto-complete errors: a nascent discourse on openness ...
From clarion calls to auto-complete errors: a nascent discourse on openness ...From clarion calls to auto-complete errors: a nascent discourse on openness ...
From clarion calls to auto-complete errors: a nascent discourse on openness ...
 
Converging cultures of open in language resources development
Converging cultures of open in language resources developmentConverging cultures of open in language resources development
Converging cultures of open in language resources development
 
Developing Open Access Content into Academic English Resources for Data-Drive...
Developing Open Access Content into Academic English Resources for Data-Drive...Developing Open Access Content into Academic English Resources for Data-Drive...
Developing Open Access Content into Academic English Resources for Data-Drive...
 
When a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creatorsWhen a MOOC became a GROOC we all became co-creators
When a MOOC became a GROOC we all became co-creators
 
Serendipitous Innovation with Academic English Resources
Serendipitous Innovation with Academic English ResourcesSerendipitous Innovation with Academic English Resources
Serendipitous Innovation with Academic English Resources
 
Bridging Formal and Informal Learning for Second Language Writing in FLAX
Bridging Formal and Informal Learning for Second Language Writing in FLAX Bridging Formal and Informal Learning for Second Language Writing in FLAX
Bridging Formal and Informal Learning for Second Language Writing in FLAX
 
Setting a Precedent with Open Resources Development in English for Specific A...
Setting a Precedent with Open Resources Development in English for Specific A...Setting a Precedent with Open Resources Development in English for Specific A...
Setting a Precedent with Open Resources Development in English for Specific A...
 
The Open-Source FLAX Language System
The Open-Source FLAX Language System The Open-Source FLAX Language System
The Open-Source FLAX Language System
 
FLAX: Flexible Language Acquisition with Open Data-Driven Learning
FLAX: Flexible Language Acquisition with Open Data-Driven LearningFLAX: Flexible Language Acquisition with Open Data-Driven Learning
FLAX: Flexible Language Acquisition with Open Data-Driven Learning
 
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
Bridging Informal MOOCs & Formal English for Academic Purposes Programmes wit...
 
Sharing an Open Methodology for Building Domain-specific Corpora for EAP
Sharing an Open Methodology for Building Domain-specific Corpora for EAP Sharing an Open Methodology for Building Domain-specific Corpora for EAP
Sharing an Open Methodology for Building Domain-specific Corpora for EAP
 
Resources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic EnglishResources at the Interface of Openness for Academic English
Resources at the Interface of Openness for Academic English
 
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...Downstream with Open Educational Resources and Practices: rEAPing the rewards...
Downstream with Open Educational Resources and Practices: rEAPing the rewards...
 
Designing Open Linguistic Support
Designing Open Linguistic SupportDesigning Open Linguistic Support
Designing Open Linguistic Support
 

Recently uploaded

Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 

Recently uploaded (20)

INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 

Intro to corpus linguistics tools for EAP

  • 1. 1 Writing with Open Tools (Part One) 09/11/2011 http://www.flickr.com/photos/mikekline/265954619/ Alannah Fitzgerald
  • 2. 2 Overview (part one) Introducing Corpus Linguistics Lexical knowledge: collocations, derivatives, register The Flexible Language Acquisition Project (FLAX) The British National Corpus (BNC) The Lextutor The Academic Wordlist (AWL) EAP practice resources
  • 3. Intro to corpus linguistics Let‟s start with three questions about English: 1. What is the meaning of goalless? 2. How is the word shall used in present-day British English? Think of some examples. 3. Which is more commonly expressed in everyday English? a. “I was a little disappointed…” b. “I was very disappointed…” Adapted from Hoffmann et al., 2008
  • 5. Focus on representation The British National Corpus (BNC) 100 million-word static corpus 1978-1992 Spoken (10%); Written (90%); Domain representation
  • 6. BNCweb concordancer – free download http://bncweb.info/
  • 9. Focus on automation The Flexible Language Acquisition Project (FLAX) Web n-gram corpora generated and supplied by 2006 Google web dump 500,000 words and 380 million five-grams GALL - Google Assisted Language Learning (Chinnery, 2008; Shei, 2008)
  • 10. „Goalless‟ keyword search in FLAX http://flax2.nzdl.org/greenstone3/flax?
  • 11. Distribution of shall I/we in the spoken component of the BNC
  • 12. Distribution of I/we shall in the spoken component of the BNC
  • 13. FLAX - Samples retrieved for I was a little disappointed
  • 14. BNC - Samples retrieved for I was a little disappointed
  • 15. BNC – Samples retrieved for I was very disappointed FLAX Web Collocations Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 16. FLAX vs BNC? • Limitations with representativeness  Identifyingregister on the Web is difficult  Successful corpora are based on domains, genres, collections of document types  The web is a “dirty corpus” Kilgariff & Grefenstette (2003, p. 342)  FLAX cleaned by 30% using BNC wordlist  Linked externally to BNC, Yahoo  Complementary sources, both with limitations
  • 17. Google‟s terms of services “You agree not to access (or attempt to access) any of the Services by any means other than through the interface that is provided by Google, unless you have been specifically allowed to do so in a separate agreement with Google.” http:www.google.com/accounts/TOS Clause 5.3
  • 18. Typical lexical errors 18 telling a. He‟s very humorous. He‟s always doing jokes. collocation conversed b. We conversated for almost word families / derivatives one hour. without delay c. …and compromise, the issue was resolved in register a jiffy.
  • 20. OSS Mozilla http://www.flickr.com/photos/hindrik/2586245939/
  • 21. 21 FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 22. Noticing Text Types – Issues of Register and Genre FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=) 22
  • 23. FLAX Web Pronoun Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=) 23
  • 24. Web Pronouns Phrases OER 24 http://www.youtube.com/watch?v=Ns4nXsZ
  • 25. Kibbitzers (Tim John‟s EAP 25 pages) http://www.lexically.net/TimJohns/Kibbitzer/timeap3.htm
  • 26. Web Collocations (fact vs idea) 26 http://flax2.nzdl.org/greenstone3/flax?a=g&rt=r&sa=CollocationSearch&s=CollocationTypes&s1.wordClass=n&c=c ollodb&s1.query=&s1.multiple=on
  • 27. Web Collocations (fact vs idea) 27
  • 28. Compleat Lexical Tutor (Tom Cobb) http://www.lextutor.ca/
  • 29. Web Collocations OER http://www.lextutor.ca/vp/ 29 http://www.youtube.com/watch?v=iyZgZhHM
  • 31. UEFAP (Andy Gillett) 31 http://www.uefap.com/index.htm
  • 32. Specific EAP vocab (UEFAP) 32 http://www.uefap.com/vocab/vocfram.htm
  • 33. FLAX User guides & demos FLAX Web Collocations & Phrases Excercises (by Shaoqun Wu http://www.cs.waikato.ac.nz/~shaoqun/tmp/instruction.html)
  • 34. Speaking & Listening OER for EAP http://openspires.oucs.ox.ac.uk/crunch/
  • 35. Web Phrases OER 35 http://www.youtube.com/watch?v=n67FBqBFm6I
  • 36. 36 FLAX Web Phrases Collection Search (http://flax2.nzdl.org/greenstone3/flax?a=p&sa=home&module=)
  • 37. 37 Preparation (part two) • Samples of your own writing – soft copy • Build your own corpus – collect ten academic articles in your discipline • Writing analysis tools • Specific academic word lists