Translation Quality Assessment:
Five Easy Steps
Using Multidimensional Quality Metrics to
improve quality assessment and management
Prepared by the QTLaunchPad project (info@qt21.eu)
Version 1.0 (26 April 2013)
Who does this apply to?
• Requesters of translation services looking for relevant quality metrics
• Language Service Providers (LSPs) delivering translation services to their clients
• The following materials will apply to negotiation between requesters and providers
• This description does not apply to individual translators (although they may want to be aware of the contents)
Step 1: Specifications
Basic questions about your project, e.g.:
• What languages are you working in?
• What is your subject field?
• What sort of project is it (e.g., user interface, documentation, advertising)?
• What technology are you using (MT, CAT, etc.)?
• What register and style are you using?
Step 2: Select Metrics
Based on your specifications…
• The MQM recommendation tool will:
  • suggest a pre-defined metric used for similar projects, or
  • recommend a custom metric that applies to your project
• You are free to modify the metric as needed
• Create a metrics specification file that
  • defines the issues to be examined
  • provides weights (descriptions of how important the issues are)
• The metrics specification file can be used by an MQM-compliant tool
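A metrics specification could be represented along the following lines. This is a minimal sketch in Python, not the MQM file format: the `IssueSpec` class, its field names, and the `is_in_metric` helper are hypothetical, and the issue types and numeric weights are borrowed from the worked example later in this deck.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IssueSpec:
    """One issue type in a metric (hypothetical structure, for illustration)."""
    name: str
    dimension: str  # e.g. "Fluency" or "Accuracy"
    weight: float   # relative importance of the issue type


# Issue types and weights as used in the worked example of this deck.
METRIC = [
    IssueSpec("Orthography", "Fluency", 1.0),
    IssueSpec("Grammar", "Fluency", 1.0),
    IssueSpec("Mistranslation", "Accuracy", 1.0),
    IssueSpec("Omission", "Accuracy", 0.2),
    IssueSpec("Untranslated", "Accuracy", 1.0),
    IssueSpec("Legal requirements", "Accuracy", 1.0),
]


def is_in_metric(issue_name: str) -> bool:
    """Return True when the issue type is part of the selected metric."""
    return any(spec.name == issue_name for spec in METRIC)
```

A check such as `is_in_metric` reflects the rule stated under Step 4: issues that are not in the chosen metric are not counted as errors.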
Step 3: Evaluation Method
Three options:
1. Sampling: Examine a portion of the text to determine whether to pass or fail the entire text. Sampling can utilize quality estimation for better results
2. Full error analysis: Review the entire text (needed for critical legal or safety texts)
3. Rubric: Rate the text on a numerical scale (useful for a quick assessment of suitability)
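Option 1 can be illustrated with a small sketch. It uses plain random sampling of segments with a fixed seed so that the draw is reproducible; it does not implement quality estimation. The function name `draw_sample`, the 10% default, and the seed are assumptions made for illustration.

```python
import random


def draw_sample(segments, fraction=0.1, seed=0):
    """Pick a reproducible random subset of segments for review."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    if not segments:
        return []
    # Review at least one segment, even for very short texts.
    k = max(1, round(len(segments) * fraction))
    rng = random.Random(seed)
    # Sorting the chosen indices keeps the sample in document order.
    picked = sorted(rng.sample(range(len(segments)), k))
    return [segments[i] for i in picked]
```

Fixing the seed lets the requester and the provider draw the same sample independently, which supports the audit trail described later in the deck.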
Automated Metrics
• If sampling is used, MQM’s quality estimation tools will help focus sampling on those parts of the text that need attention
• Automatic metrics can be used in some cases where human evaluation is too expensive or time-consuming
Step 4: Evaluation
Evaluation…
• Can be conducted by the requester or LSP in accordance with the agreement between the parties
• Follows the method chosen in Step 3 (evaluation method)
• Issues must match the metric chosen in Step 2: issues not found in the metric should not be considered errors
MQM provides capabilities
• For human evaluation
  • Inline markup provides an audit trail:
    • Allows independent verification of errors
    • Helps ensure that issues are corrected
  • Full reporting functions:
    • See what types of errors are reported
    • Understand where errors come from
• For automatic evaluation
  • Integrated use of existing quality metrics to help provide evaluation
translate5
• These capabilities are being integrated into an open-source editing tool, translate5 (http://www.translate5.net)
• All results are free to implement in additional tools (both open source and proprietary)
• Parties interested in development should contact info@qt21.eu
The source matters
• Full MQM evaluation includes the source
• Source quality evaluation can help identify reasons for problems and resolve them
• Translators can be rewarded for addressing source deficiencies (scores over 100% are possible!)
Step 5: Scoring
Scoring Formula
• (Q = the set of issues being counted within the larger formula)
• Provides consistency with the LISA QA Model scoring method
• Can be customized to support other legacy systems
• Can be applied to individual parts of the overall formula, i.e., subscores for fluency, accuracy, grammar, etc. can be derived
• Weights (not shown) can be used to adjust the importance of various issue types
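One form of the scoring formula that is consistent with the worked example later in this deck is sketched below. It is an inference, not the deck's own formula: the severity multipliers of 1, 5, and 10 for minor, major, and critical issues are read off the Penalty column of that example, and the symbols $w_i$ (weight of issue type $i$), $n_i$ (issue counts per severity), and $W$ (word count of the evaluated sample) are introduced here for illustration.

```latex
\[
\mathrm{Score}_Q \;=\; 1 \;-\; \frac{\displaystyle\sum_{i \in Q} w_i \left( n_i^{\mathrm{minor}} + 5\, n_i^{\mathrm{major}} + 10\, n_i^{\mathrm{critical}} \right)}{W}
\]
```

For example, with $Q$ = {Orthography}, a weight of 1.0, counts of 8 minor, 2 major, and 1 critical issue, and $W = 1000$, the penalty is $8 + 10 + 10 = 28$ and the score is $1 - 28/1000 = 97.2\%$.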
Scores help guide decisions
• Scores are given on a 100% basis
• Scores can be broken down into more fine-grained reports
  • E.g., a score of 96% could have 100% accuracy but 92% fluency
• Helps target actions for quality control
Example
1. Specifications
Parameter                Value
Language/Locale          Source: English; Target: Japanese
Subject field/domain     Medical
Text type                Narrative
Audience                 Educated readers with an interest in medicine
Purpose                  Education about a new procedure for managing diabetes
Register                 Moderately formal
Style                    No specified style; match source if possible
Content correspondence   Literal translation
Output modality          Subtitles (speech to text)
File format              Time-coded XML for dotSub
Production technology    Human translation
2. Recommended Metric
Issue type            Weight (high, medium, low)  Notes
Fluency
  Orthography         High
  Grammar             High
Accuracy
  Mistranslation      High
  Omission            Low                         Due to nature as captions, some information loss is expected. Captions should be 60% of spoken dialogue
  Untranslated        High
  Legal requirements  High                        Must make sure that legal claims are admissible under Japanese law
Chosen from…
Issue types are a subset of the full catalog of types
Quality Formula (1)
TQ = (A_tr + A_t - A_s) + (F_t - F_s)
with respect to specifications
• TQ = translation quality
• A_tr = accuracy (transfer)
• A_t = accuracy for the target text
• A_s = accuracy for the source text
• F_t = fluency score for target text
• F_s = fluency score for source text
Quality Formula (2)
TQ = (A_tr + A_t - A_s) + (F_t - F_s)
with respect to specifications
Definition: A quality translation demonstrates required accuracy and fluency for the audience and purpose and complies with all other negotiated specifications, taking into account end-user needs.
The gold portion = dimensions (specifications)
3. Evaluation method
• In this example, portions of the text are marketing: sampling is an acceptable evaluation method for these parts
• Other portions contain legal and regulatory claims: full error analysis is required for those portions
• Inline markup can be used via the MQM namespace (because the text is in XML) to ensure corrections are made
4. Evaluation
• Evaluation includes subsegment markup with issues from the metric
• Issues are stored in the MQM namespace to allow audit and revision
• Users can select from three severity levels:
  • critical: the issue renders the text unusable
  • major: the issue leaves the text usable, but is an obstacle to understanding
  • minor: the issue does not impact usability of the text
[Screenshot: translate5.net showing the MQM markup tool]
5. Scoring
Issue type            Weight  Minor  Major  Critical  Penalty  Adjusted  Total
Fluency
  Orthography         1.0     8      2      1         28       28        97.2%
  Grammar             1.0     6      2      0         16       16        98.4%
  Subtotal                                                     44        95.6%
Accuracy
  Mistranslation      1.0     4      0      0         4        4         99.6%
  Omission            0.2     12     4      1         42       8.4       99.2%
  Untranslated        1.0     1      0      0         1        1         99.9%
  Legal requirements  1.0     0      0      1         10       10        99.0%
  Subtotal                                                     23.4      97.7%
Total                                                          67.4      93.3%
Assumes 1000-word sample
Because Omission is considered a low priority in this case, it is given a low weight
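The figures in the scoring table can be reproduced with a short script. This is a sketch under stated assumptions: the severity multipliers (minor = 1, major = 5, critical = 10) are inferred from the Penalty column rather than stated in the deck, and the names `COUNTS`, `penalty`, `total_points`, and `score` are hypothetical.

```python
# Severity multipliers inferred from the Penalty column (assumption).
SEVERITY = {"minor": 1, "major": 5, "critical": 10}
SAMPLE_WORDS = 1000  # the table assumes a 1000-word sample

# issue type -> (dimension, weight, minor, major, critical), copied from the table
COUNTS = {
    "Orthography": ("Fluency", 1.0, 8, 2, 1),
    "Grammar": ("Fluency", 1.0, 6, 2, 0),
    "Mistranslation": ("Accuracy", 1.0, 4, 0, 0),
    "Omission": ("Accuracy", 0.2, 12, 4, 1),
    "Untranslated": ("Accuracy", 1.0, 1, 0, 0),
    "Legal requirements": ("Accuracy", 1.0, 0, 0, 1),
}


def penalty(minor, major, critical):
    """Raw penalty points for one issue type, before weighting."""
    return (minor * SEVERITY["minor"]
            + major * SEVERITY["major"]
            + critical * SEVERITY["critical"])


def total_points(weighted=True, dimension=None):
    """Sum penalty points, optionally for one dimension or without weights."""
    total = 0.0
    for dim, weight, minor, major, critical in COUNTS.values():
        if dimension is None or dim == dimension:
            total += penalty(minor, major, critical) * (weight if weighted else 1.0)
    return total


def score(points, words=SAMPLE_WORDS):
    """Percentage score: 100% minus penalty points per word."""
    return round(100 * (1 - points / words), 1)


print(score(total_points(dimension="Fluency")))   # 95.6
print(score(total_points(dimension="Accuracy")))  # 97.7
print(score(total_points()))                      # 93.3
print(score(total_points(weighted=False)))        # 89.9
```

The last line shows the effect of the Omission weight: counting omissions at full weight raises the penalty from 67.4 to 101 points and lowers the score from 93.3% to 89.9%.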
5. Scoring
• Without weighting of Omission, the score would be 89.9%
• We can see that the translator has more problems with fluency than with accuracy
5. Full scoring (including source)
Issue type            Source   Target   Adjusted
Fluency
  Orthography         96.1%    97.2%    101.1%
  Grammar             99.0%    98.4%    99.4%
  Subtotal            95.1%    95.6%    100.5%
Accuracy
  Mistranslation      (100%)   99.6%    99.6%
  Omission            (100%)   99.2%    99.2%
  Untranslated        (100%)   99.9%    99.9%
  Legal requirements  (100%)   99.0%    99.0%
  Subtotal            100%     97.7%    97.7%
Total                 95.1%    93.3%    98.2%
Assumes 1000-word sample. Source accuracy set to 100% for computational purposes.
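The Adjusted column appears to follow a simple rule: target score minus source score, plus 100. This mirrors the (F_t - F_s) term of the quality formula, but the rule is inferred from the table rather than stated in the deck, and the function name `source_adjusted` is hypothetical.

```python
def source_adjusted(target_pct, source_pct):
    """Credit the translator for source defects: target - source + 100."""
    return round(target_pct - source_pct + 100.0, 1)


# Orthography: the target scores higher than the source, so the
# adjusted score exceeds 100%.
print(source_adjusted(97.2, 96.1))   # 101.1
# Fluency subtotal.
print(source_adjusted(95.6, 95.1))   # 100.5
# Accuracy subtotal: source accuracy is set to 100%, so nothing changes.
print(source_adjusted(97.7, 100.0))  # 97.7
```

An adjusted score above 100% indicates that the target text has fewer penalty points than the source for that issue type.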
5. Scoring (including source)
• In many cases, some problems in a translation are not caused by the translator.
• In this case, the translator fixed problems in the source, resulting in better quality for fluency in the target. The translator should be recognized for this work.
For more information
• Please visit http://www.qt21.eu/launchpad/
• Write to info@qt21.eu

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 

Overview of Multidimensional Quality Metrics (QTLaunchPad)

  • 1. Translation Quality Assessment: Five Easy Steps. Using Multidimensional Quality Metrics to improve quality assessment and management. Prepared by the QTLaunchPad project (info@qt21.eu), version 1.0 (26 April 2013)
  • 2. Who does this apply to?
      - Requesters of translation services looking for relevant quality metrics
      - Language Service Providers (LSPs) delivering translation services to their clients
      - The following materials apply to negotiation between requesters and providers
      - This description does not apply to individual translators (although they may want to be aware of the contents)
  • 4. Basic questions about your project, e.g.:
      - What languages are you working in?
      - What is your subject field?
      - What sort of project is it (e.g., user interface, documentation, advertising)?
      - What technology are you using (MT, CAT, etc.)?
      - What register and style are you using?
  • 5. Step 2. Select Metrics
  • 6. Based on your specifications…
      - The MQM recommendation tool will:
          - suggest a pre-defined metric used for similar projects, or
          - recommend a custom metric that applies to your project
      - You are free to modify the metric as needed
      - Create a metrics specification file that
          - defines the issues to be examined
          - provides weights (descriptions of how important the issues are)
      - The metrics specification file can be used by an MQM-compliant tool
  • 8. Three options:
      1. Sampling: Examine a portion of the text to determine whether to pass or fail the entire text. Sampling can utilize quality estimation for better results
      2. Full error analysis: Review the entire text (needed for critical legal or safety texts)
      3. Rubric: Rate the text on a numerical scale (suitable for quick assessment of suitability)
  • 9. Automated Metrics
      - If sampling is used, MQM’s quality estimation tools will help focus sampling on those parts of the text that need attention
      - Automatic metrics can be used in some cases where human evaluation is too expensive or time-consuming
  • 11. Evaluation…
      - Can be conducted by the requester or LSP in accordance with the agreement between the parties
      - Follows the method chosen in Step 3 (evaluation method)
      - Issues must match the metric chosen in Step 2: issues not found in the metric should not be considered errors
  • 12. MQM provides capabilities
      - For human evaluation
          - Inline markup provides an audit trail:
              - Allows independent verification of errors
              - Helps ensure that issues are corrected
          - Full reporting functions:
              - See what types of errors are reported
              - Understand where errors come from
      - For automatic evaluation
          - Integrated use of existing quality metrics to help provide evaluation
  • 13. translate5
      - These capabilities are being integrated into an open-source editing tool, translate5 (http://www.translate5.net)
      - All results are free to implement in additional tools (both open source and proprietary)
      - Parties interested in development should contact info@qt21.eu
  • 14. The source matters
      - Full MQM evaluation includes the source
      - Source quality evaluation can help identify reasons for problems and resolve them
      - Translators can be rewarded for addressing source deficiencies (scores over 100% are possible!)
  • 16. Scoring Formula
      - (Q = whatever set of issues is being counted within the bigger formula)
      - Provides consistency with the LISA QA Model scoring method
      - Can be customized to support other legacy systems
      - Can be applied to individual parts of the overall formula: i.e., fluency, accuracy, grammar, etc. subscores can be derived
      - Weights (not shown) can be used to adjust the importance of various issue types
  • 17. Scores help guide decisions
      - Scores are given on a 100% basis
      - Scores can be broken down into more fine-grained reports
      - E.g., a score of 96% could have 100% accuracy but 92% fluency
      - Helps target actions for quality control
  • 19. 1. Specifications
      Parameter               Value
      Language/Locale         Source: English; Target: Japanese
      Subject field/domain    Medical
      Text type               Narrative
      Audience                Educated readers with an interest in medicine
      Purpose                 Education about a new procedure for managing diabetes
      Register                Moderately formal
      Style                   No specified style (match source if possible)
      Content correspondence  Literal translation
      Output modality         Subtitles (speech to text)
      File format             Time-coded XML for dotSub
      Production technology   Human translation
  • 20. 2. Recommended Metric
      Issue type              Weight (high, medium, low)  Notes
      Fluency
        Orthography           High
        Grammar               High
      Accuracy
        Mistranslation        High
        Omission              Low                         Due to nature as captions, some information loss is expected. Captions should be 60% of spoken dialogue
        Untranslated          High
        Legal requirements    High                        Must make sure that legal claims are admissible under Japanese law
  • 21. Chosen from… Issue types are a subset of the full catalog of types
  • 23. Quality Formula (1)
      TQ = (A_tr + A_t - A_s) + (F_t - F_s), with respect to specifications
      - TQ = translation quality
      - A_tr = accuracy (transfer)
      - A_t = accuracy for the target text
      - A_s = accuracy for the source text
      - F_t = fluency score for target text
      - F_s = fluency score for source text
  • 24. Quality Formula (2)
      TQ = (A_tr + A_t - A_s) + (F_t - F_s), with respect to specifications
      Definition: A quality translation demonstrates required accuracy and fluency for the audience and purpose and complies with all other negotiated specifications, taking into account end-user needs.
      The gold portion = dimensions (specifications)
  • 25. 3. Evaluation method
      - In this example, portions of the text are marketing: sampling is an acceptable evaluation method for these parts
      - Other portions contain legal and regulatory claims: full error analysis is required for those portions
      - Inline markup can be used via the MQM namespace (because the text is in XML) to ensure corrections are made
  • 26. 4. Evaluation
      - Evaluation includes subsegment markup with issues in the metric
      - Issues are stored in the MQM namespace to allow audit and revision
      - Users can select three severity levels:
          - critical: the issue renders the text unusable
          - major: the issue leaves the text usable, but is an obstacle to understanding
          - minor: the issue does not impact usability of the text
      [Screenshot: translate5.net showing the MQM markup tool]
  • 27. 5. Scoring
      Issue type            Weight  Minor  Major  Critical  Penalty  Adjusted  Total
      Fluency
        Orthography         1.0     8      2      1         28       28        97.2%
        Grammar             1.0     6      2      0         16       16        98.4%
        Subtotal                                                     44        95.6%
      Accuracy
        Mistranslation      1.0     4      0      0         4        4         99.6%
        Omission            0.2     12     4      1         42       8.4       99.2%
        Untranslated        1.0     1      0      0         1        1         99.9%
        Legal requirements  1.0     0      0      1         10       10        99.0%
        Subtotal                                                     23.4      97.7%
      Total                                                          67.4      93.3%
      Assumes 1000-word sample. Because Omission is considered a low priority in this case, it is given a low weight.
  • 28. 5. Scoring
      - Without weighting of Omission, the score would be 89.9%
      - We can see that the translator has more problems with fluency than with accuracy
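The arithmetic behind the scoring table on slides 27 and 28 can be sketched in a few lines of Python. This is a minimal illustration, not the project's tooling: the severity points (minor = 1, major = 5, critical = 10) and the rule "score = 100% minus weighted penalty points per sample word" are assumptions inferred from the table's own numbers, and the names `SEVERITY_POINTS`, `ISSUES`, `penalty`, and `score` are hypothetical.

```python
# Assumed severity points, inferred from the Penalty column of the table
# (e.g. Orthography: 8*1 + 2*5 + 1*10 = 28). Not stated on the slides.
SEVERITY_POINTS = {"minor": 1, "major": 5, "critical": 10}
SAMPLE_WORDS = 1000  # the slides assume a 1000-word sample

# issue type -> (weight, minor count, major count, critical count)
ISSUES = {
    "Orthography":        (1.0, 8, 2, 1),
    "Grammar":            (1.0, 6, 2, 0),
    "Mistranslation":     (1.0, 4, 0, 0),
    "Omission":           (0.2, 12, 4, 1),
    "Untranslated":       (1.0, 1, 0, 0),
    "Legal requirements": (1.0, 0, 0, 1),
}


def penalty(minor, major, critical):
    """Raw penalty points for one issue type."""
    return (minor * SEVERITY_POINTS["minor"]
            + major * SEVERITY_POINTS["major"]
            + critical * SEVERITY_POINTS["critical"])


def score(points, words=SAMPLE_WORDS):
    """Percentage score after subtracting penalty points per word."""
    return 100 * (1 - points / words)


# Weighted ("Adjusted") and unweighted penalty totals over all issue types.
weighted_total = sum(w * penalty(mi, ma, cr) for w, mi, ma, cr in ISSUES.values())
unweighted_total = sum(penalty(mi, ma, cr) for _, mi, ma, cr in ISSUES.values())

print(f"weighted penalty: {weighted_total:.1f}")            # 67.4
print(f"weighted score:   {score(weighted_total):.1f}%")    # 93.3%
print(f"unweighted score: {score(unweighted_total):.1f}%")  # 89.9%
```

Under these assumptions the sketch reproduces the table's total of 67.4 penalty points (93.3%) and the unweighted figure of 89.9% quoted on slide 28.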
  • 29. 5. Full scoring (including source)
      Issue type            Source   Target   Adjusted
      Fluency
        Orthography         96.1%    97.2%    101.1%
        Grammar             99.0%    98.4%    99.6%
        Subtotal            95.1%    95.6%    ☞ 100.5%
      Accuracy
        Mistranslation      (100%)   99.6%    99.6%
        Omission            (100%)   99.2%    99.2%
        Untranslated        (100%)   99.9%    99.9%
        Legal requirements  (100%)   99.0%    99.0%
        Subtotal            100%     97.7%    97.7%
      Total                 95.1%    89.9%    98.2%
      Assumes 1000-word sample. Source accuracy set to 100% for computational purposes.
  • 30. 5. Scoring (including source)
      - In many cases, some problems in a translation are not caused by the translator.
      - In this case, the translator fixed problems in the source, resulting in better quality for fluency in the target. The translator should be recognized for this work.
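How a score can exceed 100% once the source is taken into account can be illustrated with a short sketch. It assumes the adjusted score is 100% plus the target score minus the source score, in line with the (F_t - F_s) term of the quality formula; this rule is inferred from the Orthography and Fluency subtotal rows of the slide-29 table and is not stated on the slides. The function name `source_adjusted` is hypothetical.

```python
def source_adjusted(target_pct, source_pct):
    """Assumed adjustment: credit the translator for the gap between the
    target score and the source score, on a 100% baseline."""
    return 100 + target_pct - source_pct


# Orthography row: source 96.1%, target 97.2%
orthography = source_adjusted(97.2, 96.1)
# Fluency subtotal row: source 95.1%, target 95.6%
fluency_subtotal = source_adjusted(95.6, 95.1)

print(f"Orthography (adjusted):      {orthography:.1f}%")       # 101.1%
print(f"Fluency subtotal (adjusted): {fluency_subtotal:.1f}%")  # 100.5%
```

Because the target text scores higher than the source on orthography, the adjusted figure rises above 100%, which is the effect slide 30 describes.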
  • 31. For more information
      - Please visit http://www.qt21.eu/launchpad/
      - Write to info@qt21.eu