2. Why do we sometimes need synthetic daily
weather?
• Synthetic: weather records that are statistically the same as (or
similar to) historical records
• Needed to fill data gaps, for interpolation, for long-term
simulations, for short-term simulations, for help in simulating future
scenarios
• To drive models of agricultural processes: growth and
development of plants, of animals, pests & diseases, …
3. Weather generation
Which variables, and how often?
• Plant growth models
Daily: rainfall, max and min temperatures, solar
radiation (or sunshine hours)
• Livestock and ecosystem models
Weekly, monthly
• Special purpose models
< 1 day time step: wind speed, leaf wetness, etc
4. Rainfall generation
• Rainfall is a major determinant of agriculture globally
• Difficult to model well; most rainfall generators severely under-
estimate monthly and annual variance
• In risk studies, tails of the distributions (especially the lower) are
of particular importance: impacts on household incomes, food
security, …
Need for representative rainfall generation
5. How do weather generators work?
• Very many different weather generators have been built and
applied, often using similar principles
• They use sequences of pseudo-random numbers to operate; a
sequence that is “seeded” will eventually repeat itself
Same seed same random number sequence same weather
sequence
6. A common method of generating rainfall
The chance of rain depends on whether it rained yesterday (a “first-
order Markov chain”):
Pi (w | w) = 1 – Pi (d | w)
Pi (w | d) = 1 – Pi (d | d)
Y esterday T oday S tate
wet 1 wet 1 1
001101000101110 … dry 0 2
dry 0 wet 1 3
4 2 3 1 dry 0 4
7. Is today wet?
Yesterday was dry: Pi (w | d) = 0.3
Generate a random number R, distributed
uniformly between 0 and 1
Y esterday T oday S tate
If R < 0.3, today is wet wet 1 wet 1 1
dry 0 2
dry 0 wet 1 3
dry 0 4
8. If today is wet, how much rain falls?
Use a two-parameter gamma
probability distribution, defined by
• a shape parameter p
• a scale (or location) parameter av
9. What about temperatures and solar
radiation?
For example, WGEN (a common generator) takes into account:
• serial correlation
• cross correlations on the same day,
and on one day and the previous day
Mean SR
Generated from monthly means
and monthly standard deviations dry
(northern hemisphere)
wet
1 365
Day of year
10. Weather generator parameters
Depending on the weather generator, a set of parameters for any
site might include long-term means for
• Monthly rainfall amounts
• Number of rain days per month
• Monthly max and min temperatures, averages and
standard deviations
• Monthly solar radiation, averages and standard deviations
Where might these come from, for a site?
• Historical daily data for several years for these variables
• Climate surfaces based on station data, with interpolation
techniques used to fill in the “grid”
11. What is the “minimum” set of climate
variables for generating daily data?
Depends on the methods used, but could include:
• Monthly rainfall amounts – from climate grids
• Number of rain days per month
• Monthly average max and min temperatures – from climate
grids, standard deviations from regressions
• Monthly solar radiation – can be estimated from
• sunshine hours, day length, and 2 empirical parameters, or
• daily maximum and minimum air temperatures, latitude,
longitude, and various empirical parameters
12. MarkSim is a tool that combines a daily
weather generator with climate surfaces of
the common variables (rainfall, tmax and
tmin), and uses climate typing to estimate
the full parameter set of the model
13. Climate surfaces in MarkSim v1
About 10,000 stations for Latin America 7,000 stations for Africa
4,500 stations for Asia Different resolutions
10’ pixel
2.5’ pixel
2.5’ pixel
10’ pixel
10’ pixel 0
Climate
270 Phase 90
Angle
180
15. 140 Northvil e
120
100
80
Rainfall mm
60
40
20
0
Jan Mar May Jul Sep Nov
140 Southvil e
120
100
80
Rainfall mm
60
40
20
0
Jan Mar May Jul Sep Nov
16. Northville Southville
150 JAN 150 JAN
DEC FEB DEC FEB
100 100
NOV MAR NOV MAR
50 50
OCT APR OCT APR
SEP MAY SEP MAY
AUG JUN AUG JUN
JUL JUL
Rotation of climate record based on 12-point Fourier transform
Convert 12 monthly values to a series of sine & cosine functions and subtract the first phase angle
17. Northville Southville
150 JAN 150 AUG
DEC FEB JUL SEP
100 100
NOV MAR JUN OCT
50 50
OCT APR MAY NOV
SEP MAY APR DEC
AUG JUN MAR JAN
JUL FEB
19. MarkSim the rainfall generator
• Based on a third-order Markov process: probability of
rain on any day is dependent on occurrence of rain
on the three previous days
• To model observed rainfall variances in parts of the tropics
and subtropics, some of the parameters of the rainfall
model are randomly sampled
• The model has been fitted to more than 9000 data sets
world-wide and has been extensively tested
• May model parameters offer some insight into the nature of
long-term climate change?
20. Probability of a Wet Day
The probability of a day being wet is
-1
P(W/D1D2D3) = Φ (bi + ai-1d1 + ai-2d2 + ai-3d3)
Where:
-1
Φ is the inverse of the probit function
bi is the monthly probit of a wet day following 3 consecutive dry days
am are binary coefficients for rain (1) or no rain (0) on day m
dm are lag constants.
-1
The probability of a wet day following three dry days is Φ (bi)
-1
The probability of a wet day following three wet days is Φ (bi + d1 + d2 + d3)
21. 1.0
0.8
0.6
Cumulative Probability
0.4
0.2
0.0
-3 -2 -1 0 1 2 3
Normal Probability (Probit)
22. Sampling Model Parameters
For each year of generated rainfall required, the baseline probits are randomly
sampled from a 12-dimensional normal distribution:
*
b i = si RNi + bi, i=1,12
Where:
*
bi is the sampled value of bi, the baseline probability of rain.
si is the standard deviation of bI (from the fitting of the model).
RNi is a random normal number.
The sampled baseline monthly probabilities are correlated using the correlation
matrix of raindays per month.
27. (1)
200
GUATEMALA
TOTAL 1175 mm
(n=38)
100
RAINFALL (mm)
0
PALMIRA
(2) TOTAL 993 mm
200 (n=52)
100
RAINFALL (mm)
0
TILLABERY
(3) TOTAL 483 mm
200 (n=35)
100
RAINFALL (mm)
0
J F M A M J J A S O N D
28. ANNUAL VARIANCE OF RAINFALL FOR THREE SITES
50
HISTORICAL
WGEN
40
MarkSim
TOMM
30
ANNUAL RAINFALL
VARIANCE X 1000
20
10
0
GUATEMALA PALMIRA, TILLABERY,
CITY COLOMBIA NIGER
29. Combining climate surfaces and the weather generator to provide
daily weather data that are characteristic of any location
• MarkSim v1 on CD-ROM, data for Latin America, Africa, Asia (2002)
• The original MarkSim calibration data station map
30. Weather typing
~10,000 weather stations with > 12 years daily rainfall data
Lat, long, elevation
Long-term monthly temp Estimate rainfall model parameters
Monthly diurnal temp range
Monthly rainfall
Cluster analysis in 36-D space, giving ~700 climate clusters
Calculate regression coefficients for rainfall
model
parameters for each cluster
Store in cluster
files
31. Generating daily weather
Pick a location: lat, long (elevation)
Read the climate surface to find long-term means
Read cluster coverage to find cluster number
Read cluster files for regression coefficients
Reconstitute rainfall model parameters
Generate daily rainfall; generate daily temps from long-term means
Generate daily solar radiation from temps
Application
32. Where do the MarkSim v1 model
parameters come from?
From climate grids, or from the user directly:
• Monthly rainfall amounts
• Monthly average max and min temperatures
From the climate typing clusters:
• Number of rain days per month
• Monthly correlation matrix of raindays per month
• Baseline normal probabilities (probits) of a wet day following
three dry days and the “lag parameters”
Derived parameters:
• Monthly solar radiation
33. Applications
Weather data to drive crop, livestock, household, ecosystem
models
providing information on
• system performance with respect to technological, climate,
policy changes
• constraint identification and characterisation
• identification of possible intervention points
• answer “what-if” questions relating to risk, sustainability,
trade-offs
34. Simulated Cereal Yields at Three Sites
1.0
Til abery
(mil et) Guatemala City
0.8
(maize)
0.6
Palmira
CUMULATIVE PROBABILITY
0.4 (maize)
0.2
Historical weather
"Interpolated" weather
0.0
0 1 2 3 4 5 6 7
GRAIN YIELD (t/ha)
35. Versions of MarkSim
Version 1.0 GIS-based CD edition (2002)
• Getting old
• Some known problems with the code and the climate grids
• Support?
Version 1.5 MarkSimGCM (2011)
• Google-Earth based web application that generates data in
relation to future climates from GCMs as well as data for
“current conditions”
• Many v1 bugs fixed, data produced in DSSAT weather file
format
36. Versions of MarkSim
Version 1.7 MarkSimGCM batch mode (late 2011)
• Run MarkSimGCM as a DLL and an EXE (depending on OS)
• Can then be run as part of the DSSAT v4.5 system or of any
other model system
Version 2 MarkSim2012
• Use an expanded gauge dataset (~54,000 stations) to refit
MarkSim globally
• Will incorporate the features of MarkSimGCM and AR5
climate data
Version 3 onwards
• Current research ideas: improve temperature simulation;
data filling; utilising satellite data in places where gauge data
are sparse; MarkSim spatial