SlideShare una empresa de Scribd logo
1 de 24
Descargar para leer sin conexión
Accessibility
Considerations forVery
Large Datasets
Puneet Kishor
University of Wisconsin-Madison and Creative Commons
Monday, October 25, 2010
Acknowledgments to
CODATA for inviting me, Creative
Commons for funding my trip,
University of Wisconsin-Madison for
paying my salary, and most
importantly, the US Federal
Goverment for making all the data
available to anyone, anywhere
without any pre-conditions
Monday, October 25, 2010
Research context:
ecosystem process
modeling of very large
terrestrial ecosystems
Monday, October 25, 2010
Information by numbers
Monday, October 25, 2010
7
daily variablestm
axtm
in
taveprcpsrad
vpd
dayl
Monday, October 25, 2010
1
km2 cell
1 km
1 km
tm
axtm
in
taveprcpsrad
vpd
dayl
Monday, October 25, 2010
13million cells
4587
2889
.25
Monday, October 25, 2010
8401days
8400
Monday, October 25, 2010
111billion septets
.32
tm
axtm
in
taveprcpsrad
vpd
dayl
111.32b
Monday, October 25, 2010
725raw gigabytes
.78
Monday, October 25, 2010
10times as much in
a database
Monday, October 25, 2010
84GB of NetCDF format
in tar gzipped archives
Monday, October 25, 2010
2 3 4 5
8 9 10 11
6
12
7
13 14
1
4˚square chunks
Monday, October 25, 2010
“½”incomplete
documentation
Monday, October 25, 2010
0ways to query
the data
?X
Monday, October 25, 2010
1. Acquire NetCDF file of lat/lon values for each
cell from the weather data 1 km2 estimates
2. Dump lat/lon values to CSV with Panoply
3. Import into ArcMap as XY data
4. Export as shapefile
5. Assign WGS84 datum to shapefile in ArcCatalog
6. Reproject to Lambert Spherical (“US National
Atlas Equal Area”)
7. Separate by 2x2 degree tile using "tile_num"
attribute (so grid will match the netCDF met
files) using defination query in ArcMap and
exporting to individual shapefiles (256 tiles) as
"mask".
8. Open lambert points in qGIS and make 1km grid
(shapefile) for each 2x2 tile
9. Assign projection to output (EPSG:2163)
10. Add each new grid shapefile (one at a time) to
ArcMap with 2x2 Grid as separate layer
11. Select by location (select from grid x that
intersect mask x)
12. Export selected features of grid x (now will be
numbered sequentially by record in a way that
matches the met NetCDF “ncells”)
13. Clean up: delete extra fields from qGIS
(ID,MAXX,MINX,MAXY,MINY) add ncell_id (FID
+1) block_id, block_name
“10”times the work to
unpack the data
Monday, October 25, 2010
Many kinds of queries
f<variable> <location> <point in time>
avg(srad) at x,y on Dec 2, 2001
tmin for area on May 19, 1992
tmax at x,y on May 19, 1992
f<variable> <point location> <duration of time>
tave at x,y during the first quarter of 1983
sum(vpd) at x,y during the last week of Mar, 2003
Monday, October 25, 2010
accessible¦aksesəbəl¦
adjective
1 (of a place) able to be reached or entered : the town is
accessible by bus | the building has been made accessible to
disabled people.
• (of an object, service, or facility) able to be easily obtained or
used : making learning opportunities more accessible to adults.
• easily understood : his Latin grammar is lucid and accessible.
• able to be reached or entered by people in wheelchairs : it
provides specialized features such as nonslip floors and accessible
entrances.
2 (of a person, typically one in a position of authority or
importance) friendly and easy to talk to; approachable : he is more
accessible than most tycoons.
Monday, October 25, 2010
Accessible information
is easy to: find,
determine what one
can do with it, acquire,
and use
Monday, October 25, 2010
Factors that affect
accessibility: law;
technology; culture;
semantics; and
economics
Monday, October 25, 2010
Law makes sharing
permissible; technology
makes it possible; culture
makes it acceptable;
semantics make it
understandable; and
economics affordable
Monday, October 25, 2010
It is permissible,
acceptable, and
affordable to access
public sector
information, but not
necessarily possible or
understandableMonday, October 25, 2010
Goals of the new
storage: make the
information
technologically and
semantically accessible
Monday, October 25, 2010
Allow access by
providing user-
interface, application
programming interface
and documentation
Monday, October 25, 2010

Más contenido relacionado

Destacado

2 sharif
2 sharif2 sharif
G T C N Exec Summ J M 1
G T C N  Exec  Summ  J M 1G T C N  Exec  Summ  J M 1
G T C N Exec Summ J M 1
Jeffery Massey
 

Destacado (7)

2 sharif
2 sharif2 sharif
2 sharif
 
10
1010
10
 
G T C N Exec Summ J M 1
G T C N  Exec  Summ  J M 1G T C N  Exec  Summ  J M 1
G T C N Exec Summ J M 1
 
How Many Errors Can Be In My Paper?
How Many Errors Can Be In My Paper?How Many Errors Can Be In My Paper?
How Many Errors Can Be In My Paper?
 
Your True Reality Short Version
Your True Reality  Short VersionYour True Reality  Short Version
Your True Reality Short Version
 
Funky Buddha Fashion Collection SS 14
Funky Buddha Fashion Collection SS 14Funky Buddha Fashion Collection SS 14
Funky Buddha Fashion Collection SS 14
 
Auto Enrolment: Are You Ready?
Auto Enrolment: Are You Ready?Auto Enrolment: Are You Ready?
Auto Enrolment: Are You Ready?
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Último (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

3 kishor

  • 1. Accessibility Considerations forVery Large Datasets Puneet Kishor University of Wisconsin-Madison and Creative Commons Monday, October 25, 2010
  • 2. Acknowledgments to CODATA for inviting me, Creative Commons for funding my trip, University of Wisconsin-Madison for paying my salary, and most importantly, the US Federal Goverment for making all the data available to anyone, anywhere without any pre-conditions Monday, October 25, 2010
  • 3. Research context: ecosystem process modeling of very large terrestrial ecosystems Monday, October 25, 2010
  • 6. 1 km2 cell 1 km 1 km tm axtm in taveprcpsrad vpd dayl Monday, October 25, 2010
  • 11. 10times as much in a database Monday, October 25, 2010
  • 12. 84GB of NetCDF format in tar gzipped archives Monday, October 25, 2010
  • 13. 2 3 4 5 8 9 10 11 6 12 7 13 14 1 4˚square chunks Monday, October 25, 2010
  • 15. 0ways to query the data ?X Monday, October 25, 2010
  • 16. 1. Acquire NetCDF file of lat/lon values for each cell from the weather data 1 km2 estimates 2. Dump lat/lon values to CSV with Panoply 3. Import into ArcMap as XY data 4. Export as shapefile 5. Assign WGS84 datum to shapefile in ArcCatalog 6. Reproject to Lambert Spherical (“US National Atlas Equal Area”) 7. Separate by 2x2 degree tile using "tile_num" attribute (so grid will match the netCDF met files) using defination query in ArcMap and exporting to individual shapefiles (256 tiles) as "mask". 8. Open lambert points in qGIS and make 1km grid (shapefile) for each 2x2 tile 9. Assign projection to output (EPSG:2163) 10. Add each new grid shapefile (one at a time) to ArcMap with 2x2 Grid as separate layer 11. Select by location (select from grid x that intersect mask x) 12. Export selected features of grid x (now will be numbered sequentially by record in a way that matches the met NetCDF “ncells”) 13. Clean up: delete extra fields from qGIS (ID,MAXX,MINX,MAXY,MINY) add ncell_id (FID +1) block_id, block_name “10”times the work to unpack the data Monday, October 25, 2010
  • 17. Many kinds of queries f<variable> <location> <point in time> avg(srad) at x,y on Dec 2, 2001 tmin for area on May 19, 1992 tmax at x,y on May 19, 1992 f<variable> <point location> <duration of time> tave at x,y during the first quarter of 1983 sum(vpd) at x,y during the last week of Mar, 2003 Monday, October 25, 2010
  • 18. accessible¦aksesəbəl¦ adjective 1 (of a place) able to be reached or entered : the town is accessible by bus | the building has been made accessible to disabled people. • (of an object, service, or facility) able to be easily obtained or used : making learning opportunities more accessible to adults. • easily understood : his Latin grammar is lucid and accessible. • able to be reached or entered by people in wheelchairs : it provides specialized features such as nonslip floors and accessible entrances. 2 (of a person, typically one in a position of authority or importance) friendly and easy to talk to; approachable : he is more accessible than most tycoons. Monday, October 25, 2010
  • 19. Accessible information is easy to: find, determine what one can do with it, acquire, and use Monday, October 25, 2010
  • 20. Factors that affect accessibility: law; technology; culture; semantics; and economics Monday, October 25, 2010
  • 21. Law makes sharing permissible; technology makes it possible; culture makes it acceptable; semantics make it understandable; and economics affordable Monday, October 25, 2010
  • 22. It is permissible, acceptable, and affordable to access public sector information, but not necessarily possible or understandableMonday, October 25, 2010
  • 23. Goals of the new storage: make the information technologically and semantically accessible Monday, October 25, 2010
  • 24. Allow access by providing user- interface, application programming interface and documentation Monday, October 25, 2010