SlideShare una empresa de Scribd logo
1 de 25
Automation of Data Spike Flagging on
TAO Refresh
Sathishkumar Samiappan
PhD Candidate in ECE
Mississippi State University
Outline
• Introduction - TAO
• TAO Refresh
• TAO Refresh Data Flow
• Quality Control
• Data Spikes
• Types of Data Spikes
• Proposed Auto QC & Auto QC Algorithms
Introduction
• TAO – Tropical Atmosphere Ocean project
• ≈ 55 moorings in the tropical pacific ocean
• To study and predict El Niño Southern
Oscillation (ENSO)
• Sensors – sea surface & subsurface
temperatures, salinity, water pressure,
ocean currents, radiation, air temperature,
wind direction, wind speed, rainfall, relative
humidity and down welling radiation
TAO Refresh
137E 147E 156E 165E 180W 170W 155W 140W 125W 110W 95W
9N DM037A 9N
8N DM038A DM047A 8N
5N DM027A DM046A DM036A DM048A 5N
2N DM041A DM026A DM033A DM049A DM050A 2N
0 DM031B DM039A DM045A DM051A 0
2S DM028B DM030A DM040A 2S
5S DM044A DM035A DM023B 5S
8S DM029A DM042A DM043A DM025A DM034A DM032A 8S
137E 147E 156E 165E 180W 170W 155W 140W 125W 110W 95W
TAO Refresh Latitudes and Longitudes
TAO Refresh Data Flow
Iridium Data
Gateway
TAO
Database
Quality
Control (QC)
GTS
NDBC
Database
FTP WWW Access
Submissions
to
NODC/NCDC
Quality Control
• Currently, an automatic gross range check is preformed
on all incoming data before sending it to GTS.
• After few days, a manually QC data is updated on NDBC
database
• Data sent to GTS is not modified
• Data on NDBC database is open and can be accessed
through web, FTP and submitted to NODC/NCDC
Data Spikes
• A spike is an abnormality in the data values either due to
sensor failure /error or transmission error.
• The data stored in the buoy on-board sensor memory is
available when the buoy is recovered and this data can be
compared with the data transmitted to understand the
different transmission errors or spikes occurred and
sensor failures.
• The data stored on-board sensor memory is commonly
referred as sensor data or tube data and the data
transmitted from buoy is referred as real-time data.
Data Spike Examples
Data Spike Examples
Data Spike Examples
Data Spike Examples
Data Spike Examples
Types of Data Spikes and Flags
Type Name
1 Missing Data
2 Deviation from actual value or outliers
3 Repeated data, truncated data and values shifted in time
Flag Name Condition
Good Data When Confidence = 100%
Probably Good Data When Confidence <100%
Questionable Data Type -2 near thresholds
Bad Data Type -2 or Type 3 or Way off threshold
Missing Data Type - 1
Proposed Automatic QC
Iridium Data
Gateway
TAO
Database
Proposed
QC
GTS
NDBC
Database
FTP WWW Access
Submissions
to
NODC/NCDC
TAO
Database
MySQL –
MATLAB
Connector
MATLAB Auto
QC Algorithms
Proposed QC System
Climatology Analysis Performed by Yee H. Lau and Patrick J. Fitzpatrick - GRI
TAO Database
Thresholding Pass-3
Automatic QC for Extreme Values
Compute 90 Day
Seasonal Range
(Performed once a day)
Compute Range from
monthly Climatology
Data (1999 – 2011)
Compute Thermo
cline for outlier
detection
Thresholding Pass-1
?
Thresholding Pass-2
Reject
below 3σ
Reject
outside
1.5*IQR
Below Above
Automatic QC for Repeating values
Automatic QC for Repeating values
• Repeating values are unmodified if it falls
within the above range
• A and B are two different numbers & R is
the repeating value
• R is considered okay if it satisfies the
equation
A
R
R
B
Work Progress
Work Completed
• Implemented all the algorithms on matlab and
tested individually on 8 datasets that spans 2
years.
Yet to be completed
• Combine all the algorithms along with
database connection
• To develop a GUI console that enables
modification of parameters
References
• TAO project overview web page at
“tao.ndbc.gov/proj_overview/proj_overview_ndbc.shtml”
• Unidata – NetCDF information at
“http://www.unidata.ucar.edu/software/netcdf/”
• OceanSITES ftp server “ftp://data.ndbc.noaa.gov/data/oceansites/DATA/”
• NDBC threads server
“http://dods.ndbc.noaa.gov/thredds/catalog/data/oceansites/DATA/catalog.
html”
• NDBC threads server(TAO Legacy)
“http://dods.ndbc.noaa.gov/thredds/catalog/data/tao/catalog.html”
Acknowledgements
• Karen Grissom*, Dawn Petriatis* and Raymond Beets*
– For their overall ideas and support
• Jing Zhou* – For his support with MySQL database
connectivity
• Yee H. Lau† and Patrick J. Fitzpatrick† – For their
support in computing thresholds using climatology data
* - National Data Buoy Center
† - Geosystems Research Institute – Mississippi State University
Thank You
TAO Buoys on Map

Más contenido relacionado

La actualidad más candente

Statistical Learning Based Anomaly Detection @ Twitter
Statistical Learning Based Anomaly Detection @ TwitterStatistical Learning Based Anomaly Detection @ Twitter
Statistical Learning Based Anomaly Detection @ TwitterArun Kejariwal
 
Forecasting Fine Grained Air Quality Based on Big Data
Forecasting Fine Grained Air Quality Based on Big DataForecasting Fine Grained Air Quality Based on Big Data
Forecasting Fine Grained Air Quality Based on Big DataWei-Yuan Chang
 
Anomaly Detection @Twitter
Anomaly Detection @TwitterAnomaly Detection @Twitter
Anomaly Detection @TwitterZhan Zhang
 
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]APNIC
 
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...APNIC
 
DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...
DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...
DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...Deltares
 
Air Quality Data Acquisition and Management Systems for Tribes
Air Quality Data Acquisition and Management Systems for TribesAir Quality Data Acquisition and Management Systems for Tribes
Air Quality Data Acquisition and Management Systems for TribesAgilaire LLC
 
4 Step Translation Process for Language Professionals
4 Step Translation Process for Language Professionals4 Step Translation Process for Language Professionals
4 Step Translation Process for Language ProfessionalsHSS Translation
 

La actualidad más candente (9)

Statistical Learning Based Anomaly Detection @ Twitter
Statistical Learning Based Anomaly Detection @ TwitterStatistical Learning Based Anomaly Detection @ Twitter
Statistical Learning Based Anomaly Detection @ Twitter
 
Short-Haul Flights
Short-Haul FlightsShort-Haul Flights
Short-Haul Flights
 
Forecasting Fine Grained Air Quality Based on Big Data
Forecasting Fine Grained Air Quality Based on Big DataForecasting Fine Grained Air Quality Based on Big Data
Forecasting Fine Grained Air Quality Based on Big Data
 
Anomaly Detection @Twitter
Anomaly Detection @TwitterAnomaly Detection @Twitter
Anomaly Detection @Twitter
 
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / APIPv6TF]
 
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...
Taiwan IPv6 Measurement, by Ching-Heng Ku [APNIC 38 / IPv6 Readiness Measurem...
 
DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...
DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...
DSD-INT 2020 Towards more accurate riverine flood forecasting over the Lower ...
 
Air Quality Data Acquisition and Management Systems for Tribes
Air Quality Data Acquisition and Management Systems for TribesAir Quality Data Acquisition and Management Systems for Tribes
Air Quality Data Acquisition and Management Systems for Tribes
 
4 Step Translation Process for Language Professionals
4 Step Translation Process for Language Professionals4 Step Translation Process for Language Professionals
4 Step Translation Process for Language Professionals
 

Destacado

Continuous Delivery and Feature Flagging
Continuous Delivery and Feature FlaggingContinuous Delivery and Feature Flagging
Continuous Delivery and Feature FlaggingLaunchDarkly
 
Feature flags to speed up & de risk development
Feature flags to speed up & de risk developmentFeature flags to speed up & de risk development
Feature flags to speed up & de risk developmentLaunchDarkly
 
Feature Flags Are Flawed: Let's Make Them Better
Feature Flags Are Flawed: Let's Make Them BetterFeature Flags Are Flawed: Let's Make Them Better
Feature Flags Are Flawed: Let's Make Them BetterStephen Young
 
Feature Flagging to Reduce Risk in Database Migrations
Feature Flagging to Reduce Risk in Database Migrations Feature Flagging to Reduce Risk in Database Migrations
Feature Flagging to Reduce Risk in Database Migrations LaunchDarkly
 
The Staging Server is Dead! Long Live the Staging Server!
The Staging Server is Dead! Long Live the Staging Server!The Staging Server is Dead! Long Live the Staging Server!
The Staging Server is Dead! Long Live the Staging Server!LaunchDarkly
 
Faster & Less Risky Releases with Feature Flags
Faster & Less Risky Releases with Feature FlagsFaster & Less Risky Releases with Feature Flags
Faster & Less Risky Releases with Feature FlagsLaunchDarkly
 
Feature Toggle XP Conference 2016 Kalpana Gulati
Feature Toggle  XP Conference 2016 Kalpana GulatiFeature Toggle  XP Conference 2016 Kalpana Gulati
Feature Toggle XP Conference 2016 Kalpana GulatiXP Conference India
 
Waffle: A feature switch/flag/toggle application for Django
Waffle: A feature switch/flag/toggle application for DjangoWaffle: A feature switch/flag/toggle application for Django
Waffle: A feature switch/flag/toggle application for Djangotobych
 
The Cultural Changes of Feature Flagging
The Cultural Changes of Feature FlaggingThe Cultural Changes of Feature Flagging
The Cultural Changes of Feature FlaggingLaunchDarkly
 
Feature Flags. Reducing risks during shipping changes/
Feature Flags. Reducing risks during shipping changes/Feature Flags. Reducing risks during shipping changes/
Feature Flags. Reducing risks during shipping changes/Aleksandr Makhomet
 
Becoming a Bitter Programmer
Becoming a Bitter ProgrammerBecoming a Bitter Programmer
Becoming a Bitter ProgrammerChris Oldwood
 
Overcoming the fear of deployments
Overcoming the fear of deploymentsOvercoming the fear of deployments
Overcoming the fear of deploymentsAndrei Tognolo
 
Continuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.io
Continuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.ioContinuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.io
Continuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.ioMike Fotinakis
 
Entregas Contínuas com feature toggles
Entregas Contínuas com feature togglesEntregas Contínuas com feature toggles
Entregas Contínuas com feature togglessolon_aguiar
 
Pull requests and testers can be friends
Pull requests and testers can be friendsPull requests and testers can be friends
Pull requests and testers can be friendsAlan Parkinson
 
Who will test your tests?
Who will test your tests?Who will test your tests?
Who will test your tests?Yahya Poonawala
 
Continuous Integration, Delivery and Deployment
Continuous Integration, Delivery and DeploymentContinuous Integration, Delivery and Deployment
Continuous Integration, Delivery and DeploymentEero Laukkanen
 

Destacado (20)

Continuous Delivery and Feature Flagging
Continuous Delivery and Feature FlaggingContinuous Delivery and Feature Flagging
Continuous Delivery and Feature Flagging
 
Feature flags to speed up & de risk development
Feature flags to speed up & de risk developmentFeature flags to speed up & de risk development
Feature flags to speed up & de risk development
 
Feature Flags Are Flawed: Let's Make Them Better
Feature Flags Are Flawed: Let's Make Them BetterFeature Flags Are Flawed: Let's Make Them Better
Feature Flags Are Flawed: Let's Make Them Better
 
Feature Flagging to Reduce Risk in Database Migrations
Feature Flagging to Reduce Risk in Database Migrations Feature Flagging to Reduce Risk in Database Migrations
Feature Flagging to Reduce Risk in Database Migrations
 
The Staging Server is Dead! Long Live the Staging Server!
The Staging Server is Dead! Long Live the Staging Server!The Staging Server is Dead! Long Live the Staging Server!
The Staging Server is Dead! Long Live the Staging Server!
 
Faster & Less Risky Releases with Feature Flags
Faster & Less Risky Releases with Feature FlagsFaster & Less Risky Releases with Feature Flags
Faster & Less Risky Releases with Feature Flags
 
Feature Toggle XP Conference 2016 Kalpana Gulati
Feature Toggle  XP Conference 2016 Kalpana GulatiFeature Toggle  XP Conference 2016 Kalpana Gulati
Feature Toggle XP Conference 2016 Kalpana Gulati
 
Waffle: A feature switch/flag/toggle application for Django
Waffle: A feature switch/flag/toggle application for DjangoWaffle: A feature switch/flag/toggle application for Django
Waffle: A feature switch/flag/toggle application for Django
 
The Cultural Changes of Feature Flagging
The Cultural Changes of Feature FlaggingThe Cultural Changes of Feature Flagging
The Cultural Changes of Feature Flagging
 
Feature toggling
Feature togglingFeature toggling
Feature toggling
 
Feature Flags. Reducing risks during shipping changes/
Feature Flags. Reducing risks during shipping changes/Feature Flags. Reducing risks during shipping changes/
Feature Flags. Reducing risks during shipping changes/
 
El Tao De La FíSica 2
El Tao De La FíSica 2El Tao De La FíSica 2
El Tao De La FíSica 2
 
Test-Driven SQL
Test-Driven SQLTest-Driven SQL
Test-Driven SQL
 
Becoming a Bitter Programmer
Becoming a Bitter ProgrammerBecoming a Bitter Programmer
Becoming a Bitter Programmer
 
Overcoming the fear of deployments
Overcoming the fear of deploymentsOvercoming the fear of deployments
Overcoming the fear of deployments
 
Continuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.io
Continuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.ioContinuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.io
Continuous Visual Integration - RailsConf 2016 - Mike Fotinakis - Percy.io
 
Entregas Contínuas com feature toggles
Entregas Contínuas com feature togglesEntregas Contínuas com feature toggles
Entregas Contínuas com feature toggles
 
Pull requests and testers can be friends
Pull requests and testers can be friendsPull requests and testers can be friends
Pull requests and testers can be friends
 
Who will test your tests?
Who will test your tests?Who will test your tests?
Who will test your tests?
 
Continuous Integration, Delivery and Deployment
Continuous Integration, Delivery and DeploymentContinuous Integration, Delivery and Deployment
Continuous Integration, Delivery and Deployment
 

Similar a TAO Refresh - Automation of Data Spike Flagging Quality

Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910John B. Cook, PE, CEO
 
Datalogger error introductory course
Datalogger error introductory courseDatalogger error introductory course
Datalogger error introductory courseHydroG Resources
 
Peng etal UPQ_AMS2014_P332
Peng etal UPQ_AMS2014_P332Peng etal UPQ_AMS2014_P332
Peng etal UPQ_AMS2014_P332Ge Peng
 
Building highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin FrançoisBuilding highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin FrançoisParis Data Engineers !
 
Improving continuous process operation using data analytics delta v applicati...
Improving continuous process operation using data analytics delta v applicati...Improving continuous process operation using data analytics delta v applicati...
Improving continuous process operation using data analytics delta v applicati...Emerson Exchange
 
03_Real-Time_System_Operations_Using_Synchrophasors.pdf
03_Real-Time_System_Operations_Using_Synchrophasors.pdf03_Real-Time_System_Operations_Using_Synchrophasors.pdf
03_Real-Time_System_Operations_Using_Synchrophasors.pdfGustavoAguayo3
 
Site Verification - Protocols & Tools for Complex Metering Installations
Site Verification - Protocols & Tools for Complex Metering InstallationsSite Verification - Protocols & Tools for Complex Metering Installations
Site Verification - Protocols & Tools for Complex Metering InstallationsTESCO - The Eastern Specialty Company
 
PE230516_Flowserve_WC_presentation_FINAL.pdf
PE230516_Flowserve_WC_presentation_FINAL.pdfPE230516_Flowserve_WC_presentation_FINAL.pdf
PE230516_Flowserve_WC_presentation_FINAL.pdfClaudiomarConceioCos
 
Roberti lter asm_september2015_jr_ds_jt_jc
Roberti lter asm_september2015_jr_ds_jt_jcRoberti lter asm_september2015_jr_ds_jt_jc
Roberti lter asm_september2015_jr_ds_jt_jcquestRCN
 
Data Stream Management
Data Stream ManagementData Stream Management
Data Stream Managementk_tauhid
 
T1-4_Maslennikov_et_al.pdf
T1-4_Maslennikov_et_al.pdfT1-4_Maslennikov_et_al.pdf
T1-4_Maslennikov_et_al.pdfMareLunare
 
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...Deltares
 
Clinical data management and software packages final edc and rdc
Clinical data management and software packages final edc and rdcClinical data management and software packages final edc and rdc
Clinical data management and software packages final edc and rdcPristyn Research Solutions
 
Electronic Data Capture & Remote Data Capture
Electronic Data Capture & Remote  Data CaptureElectronic Data Capture & Remote  Data Capture
Electronic Data Capture & Remote Data CaptureCRB Tech
 

Similar a TAO Refresh - Automation of Data Spike Flagging Quality (20)

Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910Wqtc2013 invest ofperformanceprobswitheds-20130910
Wqtc2013 invest ofperformanceprobswitheds-20130910
 
Datalogger error introductory course
Datalogger error introductory courseDatalogger error introductory course
Datalogger error introductory course
 
Peng etal UPQ_AMS2014_P332
Peng etal UPQ_AMS2014_P332Peng etal UPQ_AMS2014_P332
Peng etal UPQ_AMS2014_P332
 
Final presentation
Final presentationFinal presentation
Final presentation
 
Building highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin FrançoisBuilding highly reliable data pipeline @datadog par Quentin François
Building highly reliable data pipeline @datadog par Quentin François
 
Final presentation
Final presentationFinal presentation
Final presentation
 
Spc
SpcSpc
Spc
 
MAOP Process Validation
MAOP Process ValidationMAOP Process Validation
MAOP Process Validation
 
Improving continuous process operation using data analytics delta v applicati...
Improving continuous process operation using data analytics delta v applicati...Improving continuous process operation using data analytics delta v applicati...
Improving continuous process operation using data analytics delta v applicati...
 
03_Real-Time_System_Operations_Using_Synchrophasors.pdf
03_Real-Time_System_Operations_Using_Synchrophasors.pdf03_Real-Time_System_Operations_Using_Synchrophasors.pdf
03_Real-Time_System_Operations_Using_Synchrophasors.pdf
 
Site Verification - Protocols & Tools for Complex Metering Installations
Site Verification - Protocols & Tools for Complex Metering InstallationsSite Verification - Protocols & Tools for Complex Metering Installations
Site Verification - Protocols & Tools for Complex Metering Installations
 
PE230516_Flowserve_WC_presentation_FINAL.pdf
PE230516_Flowserve_WC_presentation_FINAL.pdfPE230516_Flowserve_WC_presentation_FINAL.pdf
PE230516_Flowserve_WC_presentation_FINAL.pdf
 
Roberti lter asm_september2015_jr_ds_jt_jc
Roberti lter asm_september2015_jr_ds_jt_jcRoberti lter asm_september2015_jr_ds_jt_jc
Roberti lter asm_september2015_jr_ds_jt_jc
 
Data Stream Management
Data Stream ManagementData Stream Management
Data Stream Management
 
T1-4_Maslennikov_et_al.pdf
T1-4_Maslennikov_et_al.pdfT1-4_Maslennikov_et_al.pdf
T1-4_Maslennikov_et_al.pdf
 
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
DSD-INT 2020 Real Time Hydrologic, Hydraulic and Water Quality Forecasting in...
 
Clinical data management and software packages final edc and rdc
Clinical data management and software packages final edc and rdcClinical data management and software packages final edc and rdc
Clinical data management and software packages final edc and rdc
 
Midterm presentation (1)
Midterm presentation (1)Midterm presentation (1)
Midterm presentation (1)
 
Electronic Data Capture & Remote Data Capture
Electronic Data Capture & Remote  Data CaptureElectronic Data Capture & Remote  Data Capture
Electronic Data Capture & Remote Data Capture
 
Reliable and Scalable Data Ingestion at Airbnb
Reliable and Scalable Data Ingestion at AirbnbReliable and Scalable Data Ingestion at Airbnb
Reliable and Scalable Data Ingestion at Airbnb
 

Último

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 

Último (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

TAO Refresh - Automation of Data Spike Flagging Quality

  • 1. Automation of Data Spike Flagging on TAO Refresh Sathishkumar Samiappan PhD Candidate in ECE Mississippi State University
  • 2. Outline • Introduction - TAO • TAO Refresh • TAO Refresh Data Flow • Quality Control • Data Spikes • Types of Data Spikes • Proposed Auto QC & Auto QC Algorithms
  • 3. Introduction • TAO – Tropical Atmosphere Ocean project • ≈ 55 moorings in the tropical pacific ocean • To study and predict El Niño Southern Oscillation (ENSO) • Sensors – sea surface & subsurface temperatures, salinity, water pressure, ocean currents, radiation, air temperature, wind direction, wind speed, rainfall, relative humidity and down welling radiation
  • 5. 137E 147E 156E 165E 180W 170W 155W 140W 125W 110W 95W 9N DM037A 9N 8N DM038A DM047A 8N 5N DM027A DM046A DM036A DM048A 5N 2N DM041A DM026A DM033A DM049A DM050A 2N 0 DM031B DM039A DM045A DM051A 0 2S DM028B DM030A DM040A 2S 5S DM044A DM035A DM023B 5S 8S DM029A DM042A DM043A DM025A DM034A DM032A 8S 137E 147E 156E 165E 180W 170W 155W 140W 125W 110W 95W TAO Refresh Latitudes and Longitudes
  • 6. TAO Refresh Data Flow Iridium Data Gateway TAO Database Quality Control (QC) GTS NDBC Database FTP WWW Access Submissions to NODC/NCDC
  • 7. Quality Control • Currently, an automatic gross range check is preformed on all incoming data before sending it to GTS. • After few days, a manually QC data is updated on NDBC database • Data sent to GTS is not modified • Data on NDBC database is open and can be accessed through web, FTP and submitted to NODC/NCDC
  • 8. Data Spikes • A spike is an abnormality in the data values either due to sensor failure /error or transmission error. • The data stored in the buoy on-board sensor memory is available when the buoy is recovered and this data can be compared with the data transmitted to understand the different transmission errors or spikes occurred and sensor failures. • The data stored on-board sensor memory is commonly referred as sensor data or tube data and the data transmitted from buoy is referred as real-time data.
  • 14. Types of Data Spikes and Flags Type Name 1 Missing Data 2 Deviation from actual value or outliers 3 Repeated data, truncated data and values shifted in time Flag Name Condition Good Data When Confidence = 100% Probably Good Data When Confidence <100% Questionable Data Type -2 near thresholds Bad Data Type -2 or Type 3 or Way off threshold Missing Data Type - 1
  • 15. Proposed Automatic QC Iridium Data Gateway TAO Database Proposed QC GTS NDBC Database FTP WWW Access Submissions to NODC/NCDC TAO Database MySQL – MATLAB Connector MATLAB Auto QC Algorithms Proposed QC System
  • 16. Climatology Analysis Performed by Yee H. Lau and Patrick J. Fitzpatrick - GRI
  • 17. TAO Database Thresholding Pass-3 Automatic QC for Extreme Values Compute 90 Day Seasonal Range (Performed once a day) Compute Range from monthly Climatology Data (1999 – 2011) Compute Thermo cline for outlier detection Thresholding Pass-1 ? Thresholding Pass-2 Reject below 3σ Reject outside 1.5*IQR Below Above
  • 18. Automatic QC for Repeating values
  • 19. Automatic QC for Repeating values • Repeating values are unmodified if it falls within the above range • A and B are two different numbers & R is the repeating value • R is considered okay if it satisfies the equation A R R B
  • 20. Work Progress Work Completed • Implemented all the algorithms on matlab and tested individually on 8 datasets that spans 2 years. Yet to be completed • Combine all the algorithms along with database connection • To develop a GUI console that enables modification of parameters
  • 21.
  • 22. References • TAO project overview web page at “tao.ndbc.gov/proj_overview/proj_overview_ndbc.shtml” • Unidata – NetCDF information at “http://www.unidata.ucar.edu/software/netcdf/” • OceanSITES ftp server “ftp://data.ndbc.noaa.gov/data/oceansites/DATA/” • NDBC threads server “http://dods.ndbc.noaa.gov/thredds/catalog/data/oceansites/DATA/catalog. html” • NDBC threads server(TAO Legacy) “http://dods.ndbc.noaa.gov/thredds/catalog/data/tao/catalog.html”
  • 23. Acknowledgements • Karen Grissom*, Dawn Petriatis* and Raymond Beets* – For their overall ideas and support • Jing Zhou* – For his support with MySQL database connectivity • Yee H. Lau† and Patrick J. Fitzpatrick† – For their support in computing thresholds using climatology data * - National Data Buoy Center † - Geosystems Research Institute – Mississippi State University