SlideShare una empresa de Scribd logo
1 de 78
Descargar para leer sin conexión
THE GENETIC 
ARCHITECTURES OF 
PSYCHOLOGICAL TRAITS 
James J. Lee 
University of Minnesota Twin Cities
THREE LAWS OF BEHAVIOR 
GENETICS 
• First Law. All behavioral traits 
are heritable. 
• Second Law. The effect of being 
raised in the same family is 
smaller than the effect of 
genes. 
• Third Law. A substantial portion 
of the variance in behavioral 
traits is not accounted for by 
genes or families. 
Eric Turkheimer, the coiner of the Three 
Laws of Behavior Genetics.
EVIDENCE FROM CLASSICAL 
QUANTITATIVE GENETICS 
90 100 110 120 130 
The Minnesota Adolescent Adoption Study (Scarr & Weinberg, 1978; Scarr, 1997) 
80 90 100 110 120 130 
BIOLOGICAL FAMILIES 
MIDPARENT IQ 
OFFSPRING IQ 
β = 0.61± 0.07 
90 100 110 120 130 
80 90 100 110 120 130 
ADOPTIVE FAMILIES 
MIDPARENT IQ 
OFFSPRING IQ 
β = 0.13 ± 0.08
EVIDENCE FROM CLASSICAL 
QUANTITATIVE GENETICS 
BIOLOGICAL FAMILIES 
ADOPTIVE FAMILIES 
The Sibling Interaction and Behavior Study (McGue et al., 2007)
THE SEARCH FOR CAUSAL 
VARIANTS AT THE DNA LEVEL 
• If studies of twins and other 
kinships support the Three 
Laws, it seems justified to 
search for the causal loci at 
the DNA level. 
• This is the aim of genome-wide 
association studies 
(GWAS). 
A research subject provides DNA by 
spitting into a tube with a preservative.
BACKLASH AGAINST GWAS 
OF PSYCHOLOGICAL TRAITS 
• Correlations between 
common variants and 
phenotypes such as general 
cognitive ability (g) and 
schizophrenia have turned out 
to be very small. 
• We have just reported three 
common SNPs that each 
account for ~0.02% of IQ 
variance (Rietveld et al., 2014).
BACKLASH AGAINST GWAS 
OF PSYCHOLOGICAL TRAITS 
• In response, a fellow at the 
Center for Genetics and 
Society wrote a blog post 
called “The Stupidity of Smart 
Genes.” 
• Some academics are scarcely 
more charitable. Kevin Mitchell 
of Trinity College Dublin: “The 
idea that this trait is 
determined by common 
variants … is really unproven.”
MY RESPONSE TO THE 
BACKLASH 
• We seem to have a paradox: 
if traits are as heritable as 
implied by classical studies, 
then where are the genes? 
• I argue that the heritability is 
hiding in plain sight: there are 
thousands of causal variants, 
each of which exerts a small 
effect—which means that it is 
difficult to find any single one.
MY RESPONSE TO THE 
BACKLASH 
• I provide an estimate of the 
total GWAS sample size 
required to capture the entire 
heritability (due to common 
variants) of a phenotype like g. 
• Most importantly, I argue that 
chasing down thousands of 
DNA variants with small 
effects is a worthy scientific 
enterprise.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• A critic might claim that 
GWAS of psychological 
traits cannot be guaranteed 
to produce more “hits” as 
sample size grows. 
• Perhaps “indirect” heritability 
estimates from studies of 
twins, adoptees, etc. are 
flawed and there are not 
that many causal loci after all. Richard Nixon and Forrest Gump may be 
slightly less similar at the DNA level than 
most other random pairs of people.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• In recent years a new 
method, often called GCTA 
(after the software package 
Genome-wide Complex 
Trait Analysis), obtains 
“direct” estimates of 
heritability from DNA data. 
• Think of two people who 
are not related to you. Richard Nixon and Forrest Gump may be 
slightly less similar at the DNA level than 
most other random pairs of people.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• If we genotype/sequence all 
three individuals, you will 
turn out by chance to be 
slightly more similar at the 
genetic level to person A 
than to person B. 
• Are you also 
phenotypically more 
similar to A than to B? Richard Nixon and Forrest Gump may be 
slightly less similar at the DNA level than 
most other random pairs of people.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• In a large sample of 
unrelated people, we look 
at all pairs of people and 
calculate their genetic and 
phenotypic similarities. 
• Higher heritability means 
that genetically similar 
people will tend to be 
phenotypically similar. Richard Nixon and Forrest Gump may be 
slightly less similar at the DNA level than 
most other random pairs of people.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
E (yy0) = A2A 
+ I2E 
, 
where Aij = 
1 
p 
Xp 
k=1 
zikzjk 
! 
• y: the vector of phenotypic values 
• σA2: additive genetic variance 
• σE2: residual variance 
• A: the matrix of “relatedness” coefficients 
• I: the identity matrix 
• zik: the standardized gene count of person i at locus k
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• According to GCTA, the 
heritability of g is roughly 
0.45 (Davies et al., 2011; 
Chabris et al., 2012). 
• This is actually a lower 
bound on h2 because many 
causal variants (particularly 
those where one allele is 
rare) are probably not 
captured by SNP chips. 
Peter Visscher, quantitative geneticist and a 
developer of GCTA.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• We have studied the 
conditions under which GCTA 
provides a valid estimate of 
SNP-based heritability (Lee  
Chow, 2014). 
• If the causal variants tend to 
be less well tagged (a realistic 
case), then GCTA will be 
biased downward. 
• Thus, h2GCTA  h2SNP  h2. 
My postdoctoral supervisor, Carson Chow, 
goes to the supermarket.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
0.0 0.2 0.4 0.6 0.8 1.0 
GREML HERITABILIY ESTIMATE 
VERY WEAKLY 
TAGGED 
WEAKLY 
TAGGED 
MODERATELY 
TAGGED 
STRONGLY 
TAGGED 
VERY STRONGLY 
TAGGED 
The purple horizontal line corresponds to the true h2SNP in our simulations.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• “Direct” estimates based on 
DNA data and “indirect” 
estimates based on the 
correlations between relatives 
are thus fully consistent. 
• “Our results unequivocally 
confirm that a substantial 
proportion of individual 
differences in human 
intelligence is due to genetic 
variation” (Davies et al., 2011). 
Peter Visscher, quantitative geneticist and a 
developer of GCTA.
HERITABILITY ESTIMATED 
DIRECTLY FROM DNA DATA 
• Some trait-associated SNPs 
might only be correlated (in 
linkage disequilibrium) with 
untyped causal variants. 
• How can we be sure that 
GCTA-estimated heritability 
reflects common variants? 
• Basic principle of psychometrics. 
Two dichotomously scored items 
can show a strong correlation 
only if their pass rates are similar. 
0 0.5 1 
b 
0.2 
0 
0.2 
0 
frequency 
SNP index i 
SNP index j 
C 
2000 4000 6000 8000 
2000 
4000 
6000 
8000 
1 
0.8 
0.6 
0.4 
0.2 
0 
A color-coded correlation matrix of SNPs 
on chromosome 22.
HERITABILITY ESTIMATED 
0.2 
0 
DIRECTLY FROM 0 DNA 0.5 1 
b 
DATA 
• This same principle also 
applies to genetics! 
• Two SNPs can show strong 
linkage disequilibrium (LD) 
only if their allele 
frequencies are similar. 
• Therefore, a substantial 
h2GCTA implies that common 
variants play a large role. 
0.2 
0 
frequency 
SNP index i 
SNP index j 
C 
2000 4000 6000 8000 
2000 
4000 
6000 
8000 
1 
0.8 
0.6 
0.4 
0.2 
0 
A color-coded correlation matrix of SNPs 
on chromosome 22.
… T A … 
… T A … 
… T A … 
… C G … 
… C … 
G 
… C … 
G 
… C G 
… 
Locus 1 
MAF = 3/7 
Locus 2 
MAF = 3/7
… T A … 
… T A … 
… T A … 
… T G … 
… T … 
G 
… T … 
G 
… C G 
… 
Locus 1 
MAF = 1/7 
Locus 2 
MAF = 3/7
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• The simulations and 
mathematical arguments by 
Lee and Chow (2014) show 
that GCTA can be valid even 
if there is just one trait-associated 
SNP. 
• Can we find other evidence 
supporting the notion that 
missing heritability is 
distributed among many 
variants of very small effect? 
Peter Visscher, quantitative geneticist and a 
developer of GCTA.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• GCTA has an advantage 
over classical pedigree-based 
methods. It can 
partition h2 among 
different parts of the 
genome. 
• E.g., we can determine how 
much heritability is 
contributed by each 
chromosome. 
Peter Visscher, quantitative geneticist and a 
developer of GCTA.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• Basic idea. Calculate 
separate realized genetic 
similarities for different parts 
of the genome. 
• Suppose that there are many 
causal loci on chr1, but none 
on chr2. Then chr1 genetic 
similarity will predict 
phenotypic similarity, whereas 
chr2 genetic similarity will not. 
Peter Visscher, quantitative geneticist and a 
developer of GCTA.
PARTITIONING SCHIZOPHRENIA 
HERITABILITY AMONG CHROMOSOMES 
Lee et al. (2012)
PARTITIONING SCHIZOPHRENIA 
HERITABILITY AMONG CHROMOSOMES 
• The remarkable correlation 
between chromosome length 
and heritability contribution 
suggests that many loci 
contribute to SCZ liability 
(Gottesman  Shields, 1967). 
• E.g., if there were only ten 
loci, each on a different 
chromosome, we would not 
see such a relationship. 
Prof. Emeritus Irving Gottesman, a pioneer 
in the genetic study of mental illness.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• We know that there are 
many causal variants. But 
can we get more precise? 
• Even if a GWAS dataset has 
too little power to yield 
many “hits,” it still contains 
substantial information 
about the trait’s genetic 
architecture. Naomi Wray and Peter Visscher 
introduced a method to estimate 
parameters of genetic architectures in 
their 2009 study of schizophrenia.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• We have seen how GCTA 
exploits this information in the 
estimation of heritability. 
• It is possible to get out more 
than just h2. 
• Approximate Bayesian 
polygenic analysis (ABPA) 
estimates the total number of 
genotyped SNPs that are 
associated with the trait (Stahl 
et al., 2012). Naomi Wray and Peter Visscher 
introduced a method to estimate 
parameters of genetic architectures in 
their 2009 study of schizophrenia.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• Suppose that we estimate 
SNP regression coefficients 
in a GWAS and use them 
to predict the phenotypes 
of individuals in a new 
sample. 
• The cross-validation R2 is 
the predictive power of the 
estimated coefficients in the 
new sample. 
Eli Stahl introduced ABPA in 2012, extending a 
method devised by Visscher and colleagues.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• Suppose that we bin the SNP 
effects estimated in the 
GWAS (“training sample”) by 
p-value. 
• If the GWAS results in every 
p-value bin—even in the bins 
corresponding to large p-values— 
show at least a small 
cross-validation R2, then the 
trait must be highly polygenic. 
Eli Stahl introduced ABPA in 2012, extending a 
method devised by Visscher and colleagues.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• What if the heritability were 
due to just a few variants of 
large effect? These variants 
would be in a bin with low 
p-values, and all other bins 
would show no cross-validation. 
• A failure to observe this 
pattern implies polygenicity. 
Eli Stahl introduced ABPA in 2012, extending a 
method devised by Visscher and colleagues.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• This logic extends to larger 
sample sizes. 
• What if the bins corresponding 
to p ≥ .05 no longer cross-validate? 
Then all trait-associated 
SNPs must have p  .05! 
• The number of SNPs meeting 
the cutoff p  .05 is then an 
upper bound on the total 
number of SNPs with nonzero 
regression coefficients. 
Eli Stahl introduced ABPA in 2012, extending a 
method devised by Visscher and colleagues.
THE NUMBER OF CAUSAL VARIANTS: 
THE “POLY” IN POLYGENIC 
• Simulations can be used to 
determine what values of 
summary statistics (e.g., 
cross-validation R2 values of 
different p-value bins) are likely 
given the parameters (e.g., 
number of trait-associated 
SNPs). 
• Working backward from the 
simulation results leads to 
Bayesian posterior distributions. 
Eli Stahl introduced ABPA in 2012, extending a 
method devised by Visscher and colleagues.
THE POLYGENIC ARCHITECTURE OF 
SCHIZOPHRENIA 
Application of ABPA to schizophrenia GWAS data has yielded 
an estimate of 8,300 common variants (Ripke et al., 2013).
A FOURTH LAW OF 
BEHAVIOR GENETICS 
• Results from GWAS of mental 
illness, education, and 
intelligence justify an additional 
“law.” 
• Fourth Law. Genetic variation is 
caused by thousands of sites 
across the genome, all of which 
are individually responsible for 
a minuscule fraction of the 
variance (Chabris, Lee, Cesarini, 
Benjamin,  Laibson, in press). 
My colleague Christopher Chabris, the coiner 
of the Fourth Law.
A FOURTH LAW OF 
BEHAVIOR GENETICS 
• The coiner of the original 
Three Laws has already 
commented on some of the 
evidence supporting our 
proposed Fourth Law 
(Turkheimer, 2012). 
• Turkheimer suggests that 
this evidence points toward 
deemphasizing GWAS. 
Eric Turkheimer, the coiner of the Three 
Laws of Behavior Genetics.
A FOURTH LAW OF 
BEHAVIOR GENETICS 
• Turkheimer’s arguments are 
important. They are related 
to recently expressed 
concerns regarding the 
trustworthiness of the 
scientific enterprise (Pashler 
 Wagenmakers, 2012). 
• Close scrutiny, however, 
shows that these arguments 
do not apply to GWAS. 
Eric Turkheimer, the coiner of the Three 
Laws of Behavior Genetics.
ISSUE #1: REPLICABILITY OF 
GWAS FINDINGS 
• Some have argued that 
GWAS findings show a poor 
track record of replication. 
• Kernel of truth. The small 
effects described by the 
Fourth Law are difficult to 
distinguish from noise in 
poorly powered studies and 
require large samples to be 
replicated.
ISSUE #1: REPLICABILITY 
Given adequate sample sizes, however, the degree of 
quantitative replication in GWAS is nothing short of astounding.
ISSUE #1: REPLICABILITY 
The best-fitting straight line is close to the line of zero 
intercept and unit slope (Marigorta  Navarro, 2013).
WILL REPLICABILITY EXTEND 
TO PSYCHOLOGICAL TRAITS? 
• There have been few 
GWAS of behavioral traits 
in distinct populations. 
• It is possible, however, to 
use GCTA to estimate the 
genetic correlation 
between populations with 
respect to a certain 
phenotype.
WILL REPLICABILITY EXTEND 
TO PSYCHOLOGICAL TRAITS? 
YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} 
European breeding value 
! 
• YEUR : European individual’s SCZ liability 
• Xj : number of SCZ + genes (0, 1, or 2) at the jth locus 
• αj : average effect of gene substitution on SCZ liability at 
the jth locus 
• E : individual’s “residual” with respect to SCZ liability—a 
composite of environmental effects, nonlinear (non-additive) 
interactions, etc. 
+E 
YAFR = 0 + |W11 + · ·{·z+WKK} 
African breeding value 
+E
WILL REPLICABILITY EXTEND 
TO PSYCHOLOGICAL TRAITS? 
YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} 
European breeding value 
! 
• YAFR : African individual’s SCZ liability 
• Wj : number of SCZ + genes (0, 1, or 2) at the jth locus 
• βj : average effect of gene substitution on SCZ liability at 
the jth locus 
• E : individual’s “residual” with respect to SCZ liability—a 
composite of environmental effects, nonlinear (non-additive) 
interactions, etc. 
+E 
YAFR = 0 + |W11 + · ·{·z+WKK} 
African breeding value 
+E
WILL REPLICABILITY EXTEND 
TO PSYCHOLOGICAL TRAITS? 
YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} 
European breeding value 
+E 
YAFR = 0 + |W11 + · ·{·z+WKK} 
African breeding value 
+E 
The genetic correlation between two phenotypes is 
simply the correlation between their respective 
breeding values.
WILL REPLICABILITY EXTEND 
TO PSYCHOLOGICAL TRAITS? 
YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} 
European breeding value 
+E 
YAFR = 0 + |W11 + · ·{·z+WKK} 
African breeding value 
+E 
de Candia et al. (2013) used GCTA to estimate that 
the correlation between European and African 
breeding values with respect to schizophrenia is 
greater than 0.60.
WILL REPLICABILITY EXTEND 
TO PSYCHOLOGICAL TRAITS? 
• The latest GWAS meta-analysis 
of schizophrenia 
included a number of East 
Asian samples (Ripke et al., 
2014). 
• The concordance between 
Europeans and East Asians 
is strong.
ISSUE #2: CORRELATION VS. 
CAUSATION 
• GWAS of unrelated 
individuals can only tell us 
that a given SNP is 
correlated with the 
phenotype. 
• But we want to know 
whether variation at the 
genomic site causes 
variation in the phenotype. 
Sir Ronald Fisher, the founder of both 
population genetics and modern statistics.
ISSUE #2: CORRELATION VS. 
CAUSATION 
• Since a given SNP is 
correlated with many other 
variants in its genomic 
region, picking out the causal 
variant (if any) is a challenge. 
• Here I address the problem 
of whether a GWAS signal 
might be attributable to 
confounding with an 
environmental variable. 
Sir Ronald Fisher, the founder of both 
population genetics and modern statistics.
ISSUE #2: CORRELATION VS. 
CAUSATION 
• The simplest means of 
addressing confounding is 
the family-based design. 
• By Mendel’s Law of 
Segregation, a parent passes 
on a random gene from 
each homologous pair to a 
given offspring. 
father’s 
genome 
offspring’s 
genome 
mother’s 
genome
ISSUE #2: CORRELATION VS. 
CAUSATION 
• Whether a heterozygous parent 
(“+−”) passes on the “+” or “−” 
gene to its offspring is equivalent 
to randomized treatment 
status in experimental design. 
• If there is no selection bias, a 
within-family correlation 
between “+” transmission and 
the phenotype means that the 
marker must be linked and 
associated with a causal variant. 
father’s 
genome 
offspring’s 
genome 
mother’s 
genome
ISSUE #2: CORRELATION VS. 
CAUSATION 
• Within-family designs are not 
statistically powerful, but they 
can be used to check that 
studies of unrelated 
individuals are not unduly 
contaminated by confounding. 
• So far, family-based studies 
have affirmed the results of 
standard GWAS (Rietveld et 
al., 2013). 
father’s 
genome 
offspring’s 
genome 
mother’s 
genome
BUT WHY IS CAUSAL 
INFERENCE SO SIMPLE HERE? 
SNP 1 SNP 2 SNP 3 SNP 4 SNP 5 SNP 6 SNP 7 SNP 8 SNP 9 
phenotype 
This is the simplest possible causal system 
(directed acyclic graph). If there is no confounding, 
every partial regression coefficient is equal to its 
corresponding average effect.
BUT WHY IS CAUSAL 
INFERENCE SO SIMPLE HERE? 
• Why are genetic and 
environmental causes not 
confounded more severely? 
• Anthropomorphic answer. 
When Nature pushes up the 
frequencies of some alleles 
and pushes down others, she 
can only tell which alleles are 
correlated with fitness. She 
cannot tell which alleles cause 
higher fitness. 
The Papilio caterpillar, which has evolved to 
look like a snake.
BUT WHY IS CAUSAL 
INFERENCE SO SIMPLE HERE? 
• Nevertheless, Nature seems 
to adjust allele frequencies 
in the correct way more 
often than not. 
• She can only do this if gene-trait 
correlation is a robust 
guide to gene-trait 
causation. Be thankful that 
we live in such a universe! 
The Papilio caterpillar, which has evolved to 
look like a snake.
ISSUE #3: THE SCIENTIFIC 
WORTH OF SMALL EFFECTS 
• One might object that only 
large effect sizes are 
scientifically significant (as 
opposed to statistically 
significant in a large enough 
sample). 
• On this view the Fourth Law 
automatically discredits 
further inquiry into the 
genetic causes of behavior. 
The clinical psychologist Paul Meehl, a 
vocal critic of significance testing.
ISSUE #3: THE SCIENTIFIC 
WORTH OF SMALL EFFECTS 
• This critique draws on the 
penetrating writings of 
Meehl (1978, 1990). 
• Meehl thought that the null 
hypothesis is often a 
strawman because of 
ubiquitous biases and an 
abundance of alternative 
explanations. 
The clinical psychologist Paul Meehl, a 
vocal critic of significance testing.
ISSUE #3: THE SCIENTIFIC 
WORTH OF SMALL EFFECTS 
• In such cases the rejection 
of the null hypothesis is not 
scientifically valuable. 
• In GWAS, however, we 
have every reason to 
believe that the null 
hypothesis is true more 
often than not. 
The clinical psychologist Paul Meehl, a 
vocal critic of significance testing.
THE POLYGENIC ARCHITECTURE OF 
SCHIZOPHRENIA 
~8,300 common variants seems to be a lot—but there are 
~8 million common variants in the entire genome!
ISSUE #3: THE SCIENTIFIC 
WORTH OF SMALL EFFECTS 
• Against a large background 
of null effects, accepting the 
alternative hypothesis of a 
small effect is an inherently 
meaningful step toward the 
underlying biology. 
• Perhaps to the surprise of 
some, the latest GWAS meta-analysis 
of schizophrenia 
implicates acquired 
immunity (Ripke et al., 2014). 
The clinical psychologist Paul Meehl, a 
vocal critic of significance testing.
WHAT KINDS OF ENHANCERS 
HARBOR SCHIZOPHRENIA VARIANTS?
COMPRESSED SENSING: 
ADDRESSING THE N ≪ P PROBLEM 
• Point 1. Heritability is not 
missing; it is hiding in plain sight 
among thousands of variants 
(many of them common). 
• Point 2. Replicability crisis? 
Distinguishing causation from 
correlation? The Lykken-Meehl 
crud factor? Unlike much of 
behavioral science, GWAS is 
remarkably free from these 
problems. 
Over a million people attend the 
Minnesota State Fair each year.
COMPRESSED SENSING: 
ADDRESSING THE N ≪ P PROBLEM 
• But it is one thing to say 
that there is scientific gold 
buried somewhere. It is 
quite another to dig it up! 
• Can we identify enough 
variants to make meaningful 
scientific inferences without 
n greater than the number 
of protons in the Universe? 
Over a million people attend the 
Minnesota State Fair each year.
COMPRESSED SENSING: 
ADDRESSING THE N ≪ P PROBLEM 
• In Statistics 101, many of us 
learned that the sample size (n) 
must exceed the number of 
RHS variables (p) for the partial 
regression coefficients to be 
identified. 
• Recent work in the theory of 
compressed sensing (CS) has 
shown that coefficient recovery 
is possible in the n ≪ p case 
(Candes, Romberg,  Tao, 2006). Terence Tao is the most distinguished 
SMPY participant and perhaps the most 
famous mathematician in the world.
COMPRESSED SENSING: 
ADDRESSING THE N ≪ P PROBLEM 
Consider the noisy linear system y = Ax+e, where A 2 Rn⇥p is the design 
matrix and x 2 Rp has s nonzero elements. If n  Cslog p for some constant 
C, then the solution of the LASSO problem 
min 
ˆx 
! 
ky − Aˆxk2 
L2 + !kˆxkL1 
 
with a suitable choice of ! obeys 
kˆx − xk2 
L2  
2E 
n 
s polylog p, 
where 2E 
is the variance of the residuals in e.
COMPRESSED 0.2 
SENSING: 
0.2 
ADDRESSING THE N ≪ P PROBLEM 
• Simply statable CS theorems 
assume that the RHS variables (e.g., 
genetic variants) are uncorrelated. 
But in reality a genetic variant is in 
LD with nearby genetic variants. So 
do CS ideas apply here? 
• If you squint at the GWAS 
covariance matrix from a distance, 
it looks diagonal. So it might be 
reasonable to expect that LASSO 
will still perform well (up to 
GWAS precision). 
0 0.5 1 
b 
0 
0 
frequency 
SNP index i 
SNP index j 
C 
2000 4000 6000 8000 
2000 
4000 
6000 
8000 
1 
0.8 
0.6 
0.4 
0.2 
0 
A color-coded correlation matrix of SNPs 
on chromosome 22.
COMPRESSED SENSING: 
ADDRESSING THE N ≪ P PROBLEM 
Vattikuti, Lee, Chang, Hsu,  Chow (2014)
GIANT SNP 
L1 SNP, proxy 
L1 SNP, not proxy 
MR SNP
COMPRESSED SENSING: 
ADDRESSING THE N ≪ P PROBLEM 
• Can we tell when a GWAS has 
crossed n  C s log p? 
• Yes! Certain observable 
quantities (e.g., the typical p-value 
of called nonzeros) begin 
to decline sharply. 
• Applying this method to real 
GWAS data indicates that for a 
trait with h2≈0.50, n  30s 
triggers the phase transition to 
good performance. The theoretical physicist Stephen Hsu 
entertains a visitor to Michigan State 
University.
IMPORTANT SCIENTIFIC QUESTIONS: 
WHY TAKE THE ROAD TO 30S? 
“Man may be excused for 
feeling some pride at having 
risen … to the very summit 
of the organic scale; and the 
fact of his having thus risen, 
instead of having been 
aboriginally placed there, 
may give him hope for a still 
higher destiny in the distant 
future.
IMPORTANT SCIENTIFIC QUESTIONS: 
WHY TAKE THE ROAD TO 30S? 
“[But] man with all his noble 
qualities, with sympathy 
which feels for the most 
debased, with benevolence, 
which extends not only to 
other men but to the 
humblest living creature, with 
his god-like intellect which 
has penetrated into the 
movements and constitution 
of the solar system …
IMPORTANT SCIENTIFIC QUESTIONS: 
WHY TAKE THE ROAD TO 30S? 
“… with all these exalted 
powers, Man still bears in 
his bodily frame the 
indelible stamp of his lowly 
origin.”—CHARLES DARWIN, 
THE DESCENT OF MAN
IMPORTANT SCIENTIFIC QUESTIONS: 
WHY TAKE THE ROAD TO 30S? 
• Darwin knew no genes; we 
do. Can we trace the 
genetic basis of the 
evolutionary change that 
Darwin described? 
• Recent spectacular 
advances in the sequencing 
of ancient hominin DNA 
suggest that the answer 
may be yes.
THE HUMAN FAMILY TREE 
1.8 mya? 
500 kya 
380 kya 
Prüfer et al. (2014)
THE GENETICS OF ANCIENT 
HOMININS 
• Usable DNA was recently 
recovered from a 
Denisovan-like hominin 
who died more than 300 
kya (Meyer et al., 2014). 
• I will now show you a 
comparison of sequences 
from Neanderthals and 
modern humans. An artist’s reconstruction of a 
human-Neanderthal hybrid child.
THE GENETICS OF ANCIENT 
HOMININS 
This is the modern human sequence encompassing rs1487441, one of the “IQ hits” 
identified by Rietveld et al. (2014). A is the “plus” allele; G is the “minus” allele. 
TTCTTCCACTCACTCATCACCATAAA 
The ancestors of Neanderthals and Denisovans split from our lineage ~500 kya. 
Neanderthals probably did a lot of evolving since then … but it is still fun to ask: 
What allele did Neanderthals carry at this site? 
TTCTTCCACTCACTCG TCACCATAAA
IMPORTANT SCIENTIFIC QUESTIONS: 
WHY TAKE THE ROAD TO 30S?
PLEASE CITE THESE PAPERS! 
• Vattikuti S, Lee JJ, Chang CC, Hsu SDH, Chow CC (2014). Applying compressed 
sensing to genome-wide association studies. GigaScience, 3, 10. 
• Lee JJ, Chow CC (2014). Conditions for the validity of SNP-based heritability 
estimation. Human Genetics, 133, 1011-1022. 
• Rietveld CA, Esko T, Davies G, Pers TH, Benyamin B, Chabris CF, Emilsson V, 
Johnson AD, Lee JJ, de Leeuw C, et al. (2014). Common genetic variants 
associated with cognitive performance identified using the proxy-phenotype 
method. Proceedings of the National Academy of Sciences USA, 111, 13790-13794. 
• Chabris CF, Lee JJ, Cesarini D, Benjamin DJ, Laibson DI (in press). The fourth law 
of behavior genetics. Current Directions in Psychological Science.

Más contenido relacionado

Destacado

20100515 bioinformatics kapushesky_lecture06
20100515 bioinformatics kapushesky_lecture0620100515 bioinformatics kapushesky_lecture06
20100515 bioinformatics kapushesky_lecture06Computer Science Club
 
Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...Mutiple Sclerosis
 
Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.Varsha Gayatonde
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...Meningitis Research Foundation
 
GEE & GLMM in GWAS
GEE & GLMM in GWASGEE & GLMM in GWAS
GEE & GLMM in GWASJinseob Kim
 
Accelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsAccelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsPriscill Orue Esquivel
 
Epi519 Gwas Talk
Epi519 Gwas TalkEpi519 Gwas Talk
Epi519 Gwas Talkjoshbis
 
01 introduction-f. laurens-20130221
01 introduction-f. laurens-2013022101 introduction-f. laurens-20130221
01 introduction-f. laurens-20130221fruitbreedomics
 
Introduction to association mapping and tutorial using tassel
Introduction to association mapping and tutorial using tasselIntroduction to association mapping and tutorial using tassel
Introduction to association mapping and tutorial using tasselAwais Khan
 
BioSMACK - Linux Live CD for GWAS
BioSMACK - Linux Live CD for GWASBioSMACK - Linux Live CD for GWAS
BioSMACK - Linux Live CD for GWASHong ChangBum
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminarVarsha Gayatonde
 

Destacado (20)

20100515 bioinformatics kapushesky_lecture06
20100515 bioinformatics kapushesky_lecture0620100515 bioinformatics kapushesky_lecture06
20100515 bioinformatics kapushesky_lecture06
 
13 aranzana
13 aranzana13 aranzana
13 aranzana
 
Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...Contribution of genome-wide association studies to scientific research: a pra...
Contribution of genome-wide association studies to scientific research: a pra...
 
Gwas.emes.comp
Gwas.emes.compGwas.emes.comp
Gwas.emes.comp
 
Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...
 
GEE & GLMM in GWAS
GEE & GLMM in GWASGEE & GLMM in GWAS
GEE & GLMM in GWAS
 
Accelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsAccelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methods
 
Intro gwas
Intro gwasIntro gwas
Intro gwas
 
Crowdsourcing GWAS
Crowdsourcing GWASCrowdsourcing GWAS
Crowdsourcing GWAS
 
Epi519 Gwas Talk
Epi519 Gwas TalkEpi519 Gwas Talk
Epi519 Gwas Talk
 
Genome wide association mapping
Genome wide association mappingGenome wide association mapping
Genome wide association mapping
 
Lecture 7 gwas full
Lecture 7 gwas fullLecture 7 gwas full
Lecture 7 gwas full
 
01 introduction-f. laurens-20130221
01 introduction-f. laurens-2013022101 introduction-f. laurens-20130221
01 introduction-f. laurens-20130221
 
Introduction to association mapping and tutorial using tassel
Introduction to association mapping and tutorial using tasselIntroduction to association mapping and tutorial using tassel
Introduction to association mapping and tutorial using tassel
 
GWAS
GWASGWAS
GWAS
 
BioSMACK - Linux Live CD for GWAS
BioSMACK - Linux Live CD for GWASBioSMACK - Linux Live CD for GWAS
BioSMACK - Linux Live CD for GWAS
 
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
Application of Genome-Wide Association Study (GWAS) and transcriptomics to st...
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminar
 
Omics era
Omics eraOmics era
Omics era
 

Similar a THE GENETIC ARCHITECTURES OF PSYCHOLOGICAL TRAITS

Current Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxCurrent Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxannettsparrow
 
recombinantdnatech-200721165223 (2).pdf
recombinantdnatech-200721165223 (2).pdfrecombinantdnatech-200721165223 (2).pdf
recombinantdnatech-200721165223 (2).pdfssusered2921
 
Mapping and Cloning of Human disease gene
Mapping and Cloning of Human disease geneMapping and Cloning of Human disease gene
Mapping and Cloning of Human disease geneVASANTKUMAR31
 
human genetics and population genetics
human genetics and population geneticshuman genetics and population genetics
human genetics and population geneticsDEEPAK SAINI
 
Genetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chickenGenetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chickenMartin Johnsson
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2Eli Rosenthal
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2teamchaotex
 
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.Varsha Gayatonde
 
Interpreting genetic-genealogy-results web-optimized
Interpreting genetic-genealogy-results web-optimizedInterpreting genetic-genealogy-results web-optimized
Interpreting genetic-genealogy-results web-optimizeddonnie harold harris
 
Dna fingerprinting
Dna fingerprintingDna fingerprinting
Dna fingerprintingWaqas Shams
 
Evolution lectures WK6
Evolution lectures WK6Evolution lectures WK6
Evolution lectures WK6Andrea Hatlen
 
Evolutionary arguments in medical genomics
Evolutionary arguments in medical genomicsEvolutionary arguments in medical genomics
Evolutionary arguments in medical genomicsNikita Khromov-Borisov
 
23_clark_missing_epistasis.pptx
23_clark_missing_epistasis.pptx23_clark_missing_epistasis.pptx
23_clark_missing_epistasis.pptxsanarao25
 
GENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdfGENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdfShanuSoni7
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxwrite4
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxwrite5
 
Montgomery expression
Montgomery expressionMontgomery expression
Montgomery expressionmorenorossi
 
Dan Geschwind, MD, PhD: Advances in Genetics 2016
Dan Geschwind, MD, PhD: Advances in Genetics 2016Dan Geschwind, MD, PhD: Advances in Genetics 2016
Dan Geschwind, MD, PhD: Advances in Genetics 2016Semel Admin
 

Similar a THE GENETIC ARCHITECTURES OF PSYCHOLOGICAL TRAITS (20)

Current Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxCurrent Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docx
 
recombinantdnatech-200721165223 (2).pdf
recombinantdnatech-200721165223 (2).pdfrecombinantdnatech-200721165223 (2).pdf
recombinantdnatech-200721165223 (2).pdf
 
Mapping and Cloning of Human disease gene
Mapping and Cloning of Human disease geneMapping and Cloning of Human disease gene
Mapping and Cloning of Human disease gene
 
human genetics and population genetics
human genetics and population geneticshuman genetics and population genetics
human genetics and population genetics
 
Genetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chickenGenetic mapping of behaviour and gene expression in the chicken
Genetic mapping of behaviour and gene expression in the chicken
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2
 
Genetics in Psychiatry
Genetics in PsychiatryGenetics in Psychiatry
Genetics in Psychiatry
 
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
 
Schizophrenia - Genetics
Schizophrenia - GeneticsSchizophrenia - Genetics
Schizophrenia - Genetics
 
Interpreting genetic-genealogy-results web-optimized
Interpreting genetic-genealogy-results web-optimizedInterpreting genetic-genealogy-results web-optimized
Interpreting genetic-genealogy-results web-optimized
 
Dna fingerprinting
Dna fingerprintingDna fingerprinting
Dna fingerprinting
 
Evolution lectures WK6
Evolution lectures WK6Evolution lectures WK6
Evolution lectures WK6
 
Evolutionary arguments in medical genomics
Evolutionary arguments in medical genomicsEvolutionary arguments in medical genomics
Evolutionary arguments in medical genomics
 
23_clark_missing_epistasis.pptx
23_clark_missing_epistasis.pptx23_clark_missing_epistasis.pptx
23_clark_missing_epistasis.pptx
 
GENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdfGENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdf
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
 
Montgomery expression
Montgomery expressionMontgomery expression
Montgomery expression
 
Dan Geschwind, MD, PhD: Advances in Genetics 2016
Dan Geschwind, MD, PhD: Advances in Genetics 2016Dan Geschwind, MD, PhD: Advances in Genetics 2016
Dan Geschwind, MD, PhD: Advances in Genetics 2016
 

Más de Nikolaos Tselios

Study id12322 global-internet-usage-statista-dossier
Study id12322 global-internet-usage-statista-dossierStudy id12322 global-internet-usage-statista-dossier
Study id12322 global-internet-usage-statista-dossierNikolaos Tselios
 
4 vasikes arhes shediasmoy diktiakon topon
4 vasikes arhes shediasmoy diktiakon topon4 vasikes arhes shediasmoy diktiakon topon
4 vasikes arhes shediasmoy diktiakon toponNikolaos Tselios
 
5 addie model design development phase
5 addie model design development phase5 addie model design development phase
5 addie model design development phaseNikolaos Tselios
 
seminar econ_allhlepidrash anthrwpoy_ypologisth_v2
seminar econ_allhlepidrash anthrwpoy_ypologisth_v2seminar econ_allhlepidrash anthrwpoy_ypologisth_v2
seminar econ_allhlepidrash anthrwpoy_ypologisth_v2Nikolaos Tselios
 
heuristic evaluation example
heuristic evaluation exampleheuristic evaluation example
heuristic evaluation exampleNikolaos Tselios
 
Διδακτορική διατριβή Αλτανοπούλου
Διδακτορική διατριβή ΑλτανοπούλουΔιδακτορική διατριβή Αλτανοπούλου
Διδακτορική διατριβή ΑλτανοπούλουNikolaos Tselios
 
παρουσιαση σεμιναριο Revythi
παρουσιαση σεμιναριο Revythiπαρουσιαση σεμιναριο Revythi
παρουσιαση σεμιναριο RevythiNikolaos Tselios
 
Using the internet to collect data_greek education departments_hindex_....pptx
Using the internet to collect data_greek education departments_hindex_....pptxUsing the internet to collect data_greek education departments_hindex_....pptx
Using the internet to collect data_greek education departments_hindex_....pptxNikolaos Tselios
 
2016 sapsani parousiash-diplomatikh
2016 sapsani  parousiash-diplomatikh2016 sapsani  parousiash-diplomatikh
2016 sapsani parousiash-diplomatikhNikolaos Tselios
 
PhD progress_2015-2016 Altanopoulou
PhD progress_2015-2016 AltanopoulouPhD progress_2015-2016 Altanopoulou
PhD progress_2015-2016 AltanopoulouNikolaos Tselios
 
Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...
Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...
Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...Nikolaos Tselios
 
2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής Σχολής
2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής Σχολής2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής Σχολής
2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής ΣχολήςNikolaos Tselios
 
2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...
2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...
2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...Nikolaos Tselios
 
Σεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπων
Σεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπωνΣεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπων
Σεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπωνNikolaos Tselios
 
Tselios teeaph current_research_and_activities_2014_2015
Tselios teeaph current_research_and_activities_2014_2015Tselios teeaph current_research_and_activities_2014_2015
Tselios teeaph current_research_and_activities_2014_2015Nikolaos Tselios
 
Εθισμός στο Διαδίκτυο
Εθισμός στο ΔιαδίκτυοΕθισμός στο Διαδίκτυο
Εθισμός στο ΔιαδίκτυοNikolaos Tselios
 

Más de Nikolaos Tselios (20)

Study id12322 global-internet-usage-statista-dossier
Study id12322 global-internet-usage-statista-dossierStudy id12322 global-internet-usage-statista-dossier
Study id12322 global-internet-usage-statista-dossier
 
4 vasikes arhes shediasmoy diktiakon topon
4 vasikes arhes shediasmoy diktiakon topon4 vasikes arhes shediasmoy diktiakon topon
4 vasikes arhes shediasmoy diktiakon topon
 
5 addie model design development phase
5 addie model design development phase5 addie model design development phase
5 addie model design development phase
 
seminar econ_allhlepidrash anthrwpoy_ypologisth_v2
seminar econ_allhlepidrash anthrwpoy_ypologisth_v2seminar econ_allhlepidrash anthrwpoy_ypologisth_v2
seminar econ_allhlepidrash anthrwpoy_ypologisth_v2
 
heuristic evaluation example
heuristic evaluation exampleheuristic evaluation example
heuristic evaluation example
 
Διδακτορική διατριβή Αλτανοπούλου
Διδακτορική διατριβή ΑλτανοπούλουΔιδακτορική διατριβή Αλτανοπούλου
Διδακτορική διατριβή Αλτανοπούλου
 
παρουσιαση σεμιναριο Revythi
παρουσιαση σεμιναριο Revythiπαρουσιαση σεμιναριο Revythi
παρουσιαση σεμιναριο Revythi
 
Hcicte2016 altanopoulou
Hcicte2016 altanopoulouHcicte2016 altanopoulou
Hcicte2016 altanopoulou
 
Using the internet to collect data_greek education departments_hindex_....pptx
Using the internet to collect data_greek education departments_hindex_....pptxUsing the internet to collect data_greek education departments_hindex_....pptx
Using the internet to collect data_greek education departments_hindex_....pptx
 
2016 sapsani parousiash-diplomatikh
2016 sapsani  parousiash-diplomatikh2016 sapsani  parousiash-diplomatikh
2016 sapsani parousiash-diplomatikh
 
Phdprogress altanopoulou
Phdprogress altanopoulouPhdprogress altanopoulou
Phdprogress altanopoulou
 
PhD progress_2015-2016 Altanopoulou
PhD progress_2015-2016 AltanopoulouPhD progress_2015-2016 Altanopoulou
PhD progress_2015-2016 Altanopoulou
 
Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...
Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...
Εισαγωγή στην αλληλεπίδραση Ανθρώπου Υπολογιστή; 2015 Σεμινάριο στο ΜΠΣ 'Εφαρ...
 
2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής Σχολής
2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής Σχολής2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής Σχολής
2015 Βιβλιομετρική επισκόπηση Τμημάτων Θετικών Επιστημών και Πολυτεχνικής Σχολής
 
2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...
2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...
2015 Πτυχιακή εργασία / Χρήση Τεχνικών Learning Analytics για την εκτίμηση το...
 
Weebly
WeeblyWeebly
Weebly
 
Σεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπων
Σεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπωνΣεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπων
Σεμινάριο OMEP: Βασικές αρχές σχεδιασμού δικτυακών τόπων
 
Tselios teeaph current_research_and_activities_2014_2015
Tselios teeaph current_research_and_activities_2014_2015Tselios teeaph current_research_and_activities_2014_2015
Tselios teeaph current_research_and_activities_2014_2015
 
ΠΛΗ42 ΟΣΣ1
ΠΛΗ42 ΟΣΣ1ΠΛΗ42 ΟΣΣ1
ΠΛΗ42 ΟΣΣ1
 
Εθισμός στο Διαδίκτυο
Εθισμός στο ΔιαδίκτυοΕθισμός στο Διαδίκτυο
Εθισμός στο Διαδίκτυο
 

Último

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 

Último (20)

Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 

THE GENETIC ARCHITECTURES OF PSYCHOLOGICAL TRAITS

  • 1. THE GENETIC ARCHITECTURES OF PSYCHOLOGICAL TRAITS James J. Lee University of Minnesota Twin Cities
  • 2. THREE LAWS OF BEHAVIOR GENETICS • First Law. All behavioral traits are heritable. • Second Law. The effect of being raised in the same family is smaller than the effect of genes. • Third Law. A substantial portion of the variance in behavioral traits is not accounted for by genes or families. Eric Turkheimer, the coiner of the Three Laws of Behavior Genetics.
  • 3. EVIDENCE FROM CLASSICAL QUANTITATIVE GENETICS 90 100 110 120 130 The Minnesota Adolescent Adoption Study (Scarr & Weinberg, 1978; Scarr, 1997) 80 90 100 110 120 130 BIOLOGICAL FAMILIES MIDPARENT IQ OFFSPRING IQ β = 0.61± 0.07 90 100 110 120 130 80 90 100 110 120 130 ADOPTIVE FAMILIES MIDPARENT IQ OFFSPRING IQ β = 0.13 ± 0.08
  • 4. EVIDENCE FROM CLASSICAL QUANTITATIVE GENETICS BIOLOGICAL FAMILIES ADOPTIVE FAMILIES The Sibling Interaction and Behavior Study (McGue et al., 2007)
  • 5. THE SEARCH FOR CAUSAL VARIANTS AT THE DNA LEVEL • If studies of twins and other kinships support the Three Laws, it seems justified to search for the causal loci at the DNA level. • This is the aim of genome-wide association studies (GWAS). A research subject provides DNA by spitting into a tube with a preservative.
  • 6. BACKLASH AGAINST GWAS OF PSYCHOLOGICAL TRAITS • Correlations between common variants and phenotypes such as general cognitive ability (g) and schizophrenia have turned out to be very small. • We have just reported three common SNPs that each account for ~0.02% of IQ variance (Rietveld et al., 2014).
  • 7. BACKLASH AGAINST GWAS OF PSYCHOLOGICAL TRAITS • In response, a fellow at the Center for Genetics and Society wrote a blog post called “The Stupidity of Smart Genes.” • Some academics are scarcely more charitable. Kevin Mitchell of Trinity College Dublin: “The idea that this trait is determined by common variants … is really unproven.”
  • 8. MY RESPONSE TO THE BACKLASH • We seem to have a paradox: if traits are as heritable as implied by classical studies, then where are the genes? • I argue that the heritability is hiding in plain sight: there are thousands of causal variants, each of which exerts a small effect—which means that it is difficult to find any single one.
  • 9. MY RESPONSE TO THE BACKLASH • I provide an estimate of the total GWAS sample size required to capture the entire heritability (due to common variants) of a phenotype like g. • Most importantly, I argue that chasing down thousands of DNA variants with small effects is a worthy scientific enterprise.
  • 10. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • A critic might claim that GWAS of psychological traits cannot be guaranteed to produce more “hits” as sample size grows. • Perhaps “indirect” heritability estimates from studies of twins, adoptees, etc. are flawed and there are not that many causal loci after all. Richard Nixon and Forrest Gump may be slightly less similar at the DNA level than most other random pairs of people.
  • 11. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • In recent years a new method, often called GCTA (after the software package Genome-wide Complex Trait Analysis), obtains “direct” estimates of heritability from DNA data. • Think of two people who are not related to you. Richard Nixon and Forrest Gump may be slightly less similar at the DNA level than most other random pairs of people.
  • 12. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • If we genotype/sequence all three individuals, you will turn out by chance to be slightly more similar at the genetic level to person A than to person B. • Are you also phenotypically more similar to A than to B? Richard Nixon and Forrest Gump may be slightly less similar at the DNA level than most other random pairs of people.
  • 13. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • In a large sample of unrelated people, we look at all pairs of people and calculate their genetic and phenotypic similarities. • Higher heritability means that genetically similar people will tend to be phenotypically similar. Richard Nixon and Forrest Gump may be slightly less similar at the DNA level than most other random pairs of people.
  • 14. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA E (yy0) = A2A + I2E , where Aij = 1 p Xp k=1 zikzjk ! • y: the vector of phenotypic values • σA2: additive genetic variance • σE2: residual variance • A: the matrix of “relatedness” coefficients • I: the identity matrix • zik: the standardized gene count of person i at locus k
  • 15. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • According to GCTA, the heritability of g is roughly 0.45 (Davies et al., 2011; Chabris et al., 2012). • This is actually a lower bound on h2 because many causal variants (particularly those where one allele is rare) are probably not captured by SNP chips. Peter Visscher, quantitative geneticist and a developer of GCTA.
  • 16. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • We have studied the conditions under which GCTA provides a valid estimate of SNP-based heritability (Lee Chow, 2014). • If the causal variants tend to be less well tagged (a realistic case), then GCTA will be biased downward. • Thus, h2GCTA h2SNP h2. My postdoctoral supervisor, Carson Chow, goes to the supermarket.
  • 17. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA 0.0 0.2 0.4 0.6 0.8 1.0 GREML HERITABILIY ESTIMATE VERY WEAKLY TAGGED WEAKLY TAGGED MODERATELY TAGGED STRONGLY TAGGED VERY STRONGLY TAGGED The purple horizontal line corresponds to the true h2SNP in our simulations.
  • 18. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • “Direct” estimates based on DNA data and “indirect” estimates based on the correlations between relatives are thus fully consistent. • “Our results unequivocally confirm that a substantial proportion of individual differences in human intelligence is due to genetic variation” (Davies et al., 2011). Peter Visscher, quantitative geneticist and a developer of GCTA.
  • 19. HERITABILITY ESTIMATED DIRECTLY FROM DNA DATA • Some trait-associated SNPs might only be correlated (in linkage disequilibrium) with untyped causal variants. • How can we be sure that GCTA-estimated heritability reflects common variants? • Basic principle of psychometrics. Two dichotomously scored items can show a strong correlation only if their pass rates are similar. 0 0.5 1 b 0.2 0 0.2 0 frequency SNP index i SNP index j C 2000 4000 6000 8000 2000 4000 6000 8000 1 0.8 0.6 0.4 0.2 0 A color-coded correlation matrix of SNPs on chromosome 22.
  • 20. HERITABILITY ESTIMATED 0.2 0 DIRECTLY FROM 0 DNA 0.5 1 b DATA • This same principle also applies to genetics! • Two SNPs can show strong linkage disequilibrium (LD) only if their allele frequencies are similar. • Therefore, a substantial h2GCTA implies that common variants play a large role. 0.2 0 frequency SNP index i SNP index j C 2000 4000 6000 8000 2000 4000 6000 8000 1 0.8 0.6 0.4 0.2 0 A color-coded correlation matrix of SNPs on chromosome 22.
  • 21. … T A … … T A … … T A … … C G … … C … G … C … G … C G … Locus 1 MAF = 3/7 Locus 2 MAF = 3/7
  • 22. … T A … … T A … … T A … … T G … … T … G … T … G … C G … Locus 1 MAF = 1/7 Locus 2 MAF = 3/7
  • 23. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • The simulations and mathematical arguments by Lee and Chow (2014) show that GCTA can be valid even if there is just one trait-associated SNP. • Can we find other evidence supporting the notion that missing heritability is distributed among many variants of very small effect? Peter Visscher, quantitative geneticist and a developer of GCTA.
  • 24. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • GCTA has an advantage over classical pedigree-based methods. It can partition h2 among different parts of the genome. • E.g., we can determine how much heritability is contributed by each chromosome. Peter Visscher, quantitative geneticist and a developer of GCTA.
  • 25. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • Basic idea. Calculate separate realized genetic similarities for different parts of the genome. • Suppose that there are many causal loci on chr1, but none on chr2. Then chr1 genetic similarity will predict phenotypic similarity, whereas chr2 genetic similarity will not. Peter Visscher, quantitative geneticist and a developer of GCTA.
  • 26. PARTITIONING SCHIZOPHRENIA HERITABILITY AMONG CHROMOSOMES Lee et al. (2012)
  • 27. PARTITIONING SCHIZOPHRENIA HERITABILITY AMONG CHROMOSOMES • The remarkable correlation between chromosome length and heritability contribution suggests that many loci contribute to SCZ liability (Gottesman Shields, 1967). • E.g., if there were only ten loci, each on a different chromosome, we would not see such a relationship. Prof. Emeritus Irving Gottesman, a pioneer in the genetic study of mental illness.
  • 28. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • We know that there are many causal variants. But can we get more precise? • Even if a GWAS dataset has too little power to yield many “hits,” it still contains substantial information about the trait’s genetic architecture. Naomi Wray and Peter Visscher introduced a method to estimate parameters of genetic architectures in their 2009 study of schizophrenia.
  • 29. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • We have seen how GCTA exploits this information in the estimation of heritability. • It is possible to get out more than just h2. • Approximate Bayesian polygenic analysis (ABPA) estimates the total number of genotyped SNPs that are associated with the trait (Stahl et al., 2012). Naomi Wray and Peter Visscher introduced a method to estimate parameters of genetic architectures in their 2009 study of schizophrenia.
  • 30. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • Suppose that we estimate SNP regression coefficients in a GWAS and use them to predict the phenotypes of individuals in a new sample. • The cross-validation R2 is the predictive power of the estimated coefficients in the new sample. Eli Stahl introduced ABPA in 2012, extending a method devised by Visscher and colleagues.
  • 31. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • Suppose that we bin the SNP effects estimated in the GWAS (“training sample”) by p-value. • If the GWAS results in every p-value bin—even in the bins corresponding to large p-values— show at least a small cross-validation R2, then the trait must be highly polygenic. Eli Stahl introduced ABPA in 2012, extending a method devised by Visscher and colleagues.
  • 32. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • What if the heritability were due to just a few variants of large effect? These variants would be in a bin with low p-values, and all other bins would show no cross-validation. • A failure to observe this pattern implies polygenicity. Eli Stahl introduced ABPA in 2012, extending a method devised by Visscher and colleagues.
  • 33. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • This logic extends to larger sample sizes. • What if the bins corresponding to p ≥ .05 no longer cross-validate? Then all trait-associated SNPs must have p .05! • The number of SNPs meeting the cutoff p .05 is then an upper bound on the total number of SNPs with nonzero regression coefficients. Eli Stahl introduced ABPA in 2012, extending a method devised by Visscher and colleagues.
  • 34. THE NUMBER OF CAUSAL VARIANTS: THE “POLY” IN POLYGENIC • Simulations can be used to determine what values of summary statistics (e.g., cross-validation R2 values of different p-value bins) are likely given the parameters (e.g., number of trait-associated SNPs). • Working backward from the simulation results leads to Bayesian posterior distributions. Eli Stahl introduced ABPA in 2012, extending a method devised by Visscher and colleagues.
  • 35. THE POLYGENIC ARCHITECTURE OF SCHIZOPHRENIA Application of ABPA to schizophrenia GWAS data has yielded an estimate of 8,300 common variants (Ripke et al., 2013).
  • 36. A FOURTH LAW OF BEHAVIOR GENETICS • Results from GWAS of mental illness, education, and intelligence justify an additional “law.” • Fourth Law. Genetic variation is caused by thousands of sites across the genome, all of which are individually responsible for a minuscule fraction of the variance (Chabris, Lee, Cesarini, Benjamin, Laibson, in press). My colleague Christopher Chabris, the coiner of the Fourth Law.
  • 37. A FOURTH LAW OF BEHAVIOR GENETICS • The coiner of the original Three Laws has already commented on some of the evidence supporting our proposed Fourth Law (Turkheimer, 2012). • Turkheimer suggests that this evidence points toward deemphasizing GWAS. Eric Turkheimer, the coiner of the Three Laws of Behavior Genetics.
  • 38. A FOURTH LAW OF BEHAVIOR GENETICS • Turkheimer’s arguments are important. They are related to recently expressed concerns regarding the trustworthiness of the scientific enterprise (Pashler Wagenmakers, 2012). • Close scrutiny, however, shows that these arguments do not apply to GWAS. Eric Turkheimer, the coiner of the Three Laws of Behavior Genetics.
  • 39. ISSUE #1: REPLICABILITY OF GWAS FINDINGS • Some have argued that GWAS findings show a poor track record of replication. • Kernel of truth. The small effects described by the Fourth Law are difficult to distinguish from noise in poorly powered studies and require large samples to be replicated.
  • 40. ISSUE #1: REPLICABILITY Given adequate sample sizes, however, the degree of quantitative replication in GWAS is nothing short of astounding.
  • 41. ISSUE #1: REPLICABILITY The best-fitting straight line is close to the line of zero intercept and unit slope (Marigorta Navarro, 2013).
  • 42. WILL REPLICABILITY EXTEND TO PSYCHOLOGICAL TRAITS? • There have been few GWAS of behavioral traits in distinct populations. • It is possible, however, to use GCTA to estimate the genetic correlation between populations with respect to a certain phenotype.
  • 43. WILL REPLICABILITY EXTEND TO PSYCHOLOGICAL TRAITS? YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} European breeding value ! • YEUR : European individual’s SCZ liability • Xj : number of SCZ + genes (0, 1, or 2) at the jth locus • αj : average effect of gene substitution on SCZ liability at the jth locus • E : individual’s “residual” with respect to SCZ liability—a composite of environmental effects, nonlinear (non-additive) interactions, etc. +E YAFR = 0 + |W11 + · ·{·z+WKK} African breeding value +E
  • 44. WILL REPLICABILITY EXTEND TO PSYCHOLOGICAL TRAITS? YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} European breeding value ! • YAFR : African individual’s SCZ liability • Wj : number of SCZ + genes (0, 1, or 2) at the jth locus • βj : average effect of gene substitution on SCZ liability at the jth locus • E : individual’s “residual” with respect to SCZ liability—a composite of environmental effects, nonlinear (non-additive) interactions, etc. +E YAFR = 0 + |W11 + · ·{·z+WKK} African breeding value +E
  • 45. WILL REPLICABILITY EXTEND TO PSYCHOLOGICAL TRAITS? YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} European breeding value +E YAFR = 0 + |W11 + · ·{·z+WKK} African breeding value +E The genetic correlation between two phenotypes is simply the correlation between their respective breeding values.
  • 46. WILL REPLICABILITY EXTEND TO PSYCHOLOGICAL TRAITS? YEUR = ↵0 + |X1↵1 + ·{·z· + XL↵L} European breeding value +E YAFR = 0 + |W11 + · ·{·z+WKK} African breeding value +E de Candia et al. (2013) used GCTA to estimate that the correlation between European and African breeding values with respect to schizophrenia is greater than 0.60.
  • 47. WILL REPLICABILITY EXTEND TO PSYCHOLOGICAL TRAITS? • The latest GWAS meta-analysis of schizophrenia included a number of East Asian samples (Ripke et al., 2014). • The concordance between Europeans and East Asians is strong.
  • 48. ISSUE #2: CORRELATION VS. CAUSATION • GWAS of unrelated individuals can only tell us that a given SNP is correlated with the phenotype. • But we want to know whether variation at the genomic site causes variation in the phenotype. Sir Ronald Fisher, the founder of both population genetics and modern statistics.
  • 49. ISSUE #2: CORRELATION VS. CAUSATION • Since a given SNP is correlated with many other variants in its genomic region, picking out the causal variant (if any) is a challenge. • Here I address the problem of whether a GWAS signal might be attributable to confounding with an environmental variable. Sir Ronald Fisher, the founder of both population genetics and modern statistics.
  • 50. ISSUE #2: CORRELATION VS. CAUSATION • The simplest means of addressing confounding is the family-based design. • By Mendel’s Law of Segregation, a parent passes on a random gene from each homologous pair to a given offspring. father’s genome offspring’s genome mother’s genome
  • 51. ISSUE #2: CORRELATION VS. CAUSATION • Whether a heterozygous parent (“+−”) passes on the “+” or “−” gene to its offspring is equivalent to randomized treatment status in experimental design. • If there is no selection bias, a within-family correlation between “+” transmission and the phenotype means that the marker must be linked and associated with a causal variant. father’s genome offspring’s genome mother’s genome
  • 52. ISSUE #2: CORRELATION VS. CAUSATION • Within-family designs are not statistically powerful, but they can be used to check that studies of unrelated individuals are not unduly contaminated by confounding. • So far, family-based studies have affirmed the results of standard GWAS (Rietveld et al., 2013). father’s genome offspring’s genome mother’s genome
  • 53. BUT WHY IS CAUSAL INFERENCE SO SIMPLE HERE? SNP 1 SNP 2 SNP 3 SNP 4 SNP 5 SNP 6 SNP 7 SNP 8 SNP 9 phenotype This is the simplest possible causal system (directed acyclic graph). If there is no confounding, every partial regression coefficient is equal to its corresponding average effect.
  • 54. BUT WHY IS CAUSAL INFERENCE SO SIMPLE HERE? • Why are genetic and environmental causes not confounded more severely? • Anthropomorphic answer. When Nature pushes up the frequencies of some alleles and pushes down others, she can only tell which alleles are correlated with fitness. She cannot tell which alleles cause higher fitness. The Papilio caterpillar, which has evolved to look like a snake.
  • 55. BUT WHY IS CAUSAL INFERENCE SO SIMPLE HERE? • Nevertheless, Nature seems to adjust allele frequencies in the correct way more often than not. • She can only do this if gene-trait correlation is a robust guide to gene-trait causation. Be thankful that we live in such a universe! The Papilio caterpillar, which has evolved to look like a snake.
  • 56. ISSUE #3: THE SCIENTIFIC WORTH OF SMALL EFFECTS • One might object that only large effect sizes are scientifically significant (as opposed to statistically significant in a large enough sample). • On this view the Fourth Law automatically discredits further inquiry into the genetic causes of behavior. The clinical psychologist Paul Meehl, a vocal critic of significance testing.
  • 57. ISSUE #3: THE SCIENTIFIC WORTH OF SMALL EFFECTS • This critique draws on the penetrating writings of Meehl (1978, 1990). • Meehl thought that the null hypothesis is often a strawman because of ubiquitous biases and an abundance of alternative explanations. The clinical psychologist Paul Meehl, a vocal critic of significance testing.
  • 58. ISSUE #3: THE SCIENTIFIC WORTH OF SMALL EFFECTS • In such cases the rejection of the null hypothesis is not scientifically valuable. • In GWAS, however, we have every reason to believe that the null hypothesis is true more often than not. The clinical psychologist Paul Meehl, a vocal critic of significance testing.
  • 59. THE POLYGENIC ARCHITECTURE OF SCHIZOPHRENIA ~8,300 common variants seems to be a lot—but there are ~8 million common variants in the entire genome!
  • 60. ISSUE #3: THE SCIENTIFIC WORTH OF SMALL EFFECTS • Against a large background of null effects, accepting the alternative hypothesis of a small effect is an inherently meaningful step toward the underlying biology. • Perhaps to the surprise of some, the latest GWAS meta-analysis of schizophrenia implicates acquired immunity (Ripke et al., 2014). The clinical psychologist Paul Meehl, a vocal critic of significance testing.
  • 61. WHAT KINDS OF ENHANCERS HARBOR SCHIZOPHRENIA VARIANTS?
  • 62. COMPRESSED SENSING: ADDRESSING THE N ≪ P PROBLEM • Point 1. Heritability is not missing; it is hiding in plain sight among thousands of variants (many of them common). • Point 2. Replicability crisis? Distinguishing causation from correlation? The Lykken-Meehl crud factor? Unlike much of behavioral science, GWAS is remarkably free from these problems. Over a million people attend the Minnesota State Fair each year.
  • 63. COMPRESSED SENSING: ADDRESSING THE N ≪ P PROBLEM • But it is one thing to say that there is scientific gold buried somewhere. It is quite another to dig it up! • Can we identify enough variants to make meaningful scientific inferences without n greater than the number of protons in the Universe? Over a million people attend the Minnesota State Fair each year.
  • 64. COMPRESSED SENSING: ADDRESSING THE N ≪ P PROBLEM • In Statistics 101, many of us learned that the sample size (n) must exceed the number of RHS variables (p) for the partial regression coefficients to be identified. • Recent work in the theory of compressed sensing (CS) has shown that coefficient recovery is possible in the n ≪ p case (Candes, Romberg, Tao, 2006). Terence Tao is the most distinguished SMPY participant and perhaps the most famous mathematician in the world.
  • 65. COMPRESSED SENSING: ADDRESSING THE N ≪ P PROBLEM Consider the noisy linear system y = Ax+e, where A 2 Rn⇥p is the design matrix and x 2 Rp has s nonzero elements. If n Cslog p for some constant C, then the solution of the LASSO problem min ˆx ! ky − Aˆxk2 L2 + !kˆxkL1 with a suitable choice of ! obeys kˆx − xk2 L2  2E n s polylog p, where 2E is the variance of the residuals in e.
  • 66. COMPRESSED 0.2 SENSING: 0.2 ADDRESSING THE N ≪ P PROBLEM • Simply statable CS theorems assume that the RHS variables (e.g., genetic variants) are uncorrelated. But in reality a genetic variant is in LD with nearby genetic variants. So do CS ideas apply here? • If you squint at the GWAS covariance matrix from a distance, it looks diagonal. So it might be reasonable to expect that LASSO will still perform well (up to GWAS precision). 0 0.5 1 b 0 0 frequency SNP index i SNP index j C 2000 4000 6000 8000 2000 4000 6000 8000 1 0.8 0.6 0.4 0.2 0 A color-coded correlation matrix of SNPs on chromosome 22.
  • 67. COMPRESSED SENSING: ADDRESSING THE N ≪ P PROBLEM Vattikuti, Lee, Chang, Hsu, Chow (2014)
  • 68. GIANT SNP L1 SNP, proxy L1 SNP, not proxy MR SNP
  • 69. COMPRESSED SENSING: ADDRESSING THE N ≪ P PROBLEM • Can we tell when a GWAS has crossed n C s log p? • Yes! Certain observable quantities (e.g., the typical p-value of called nonzeros) begin to decline sharply. • Applying this method to real GWAS data indicates that for a trait with h2≈0.50, n 30s triggers the phase transition to good performance. The theoretical physicist Stephen Hsu entertains a visitor to Michigan State University.
  • 70. IMPORTANT SCIENTIFIC QUESTIONS: WHY TAKE THE ROAD TO 30S? “Man may be excused for feeling some pride at having risen … to the very summit of the organic scale; and the fact of his having thus risen, instead of having been aboriginally placed there, may give him hope for a still higher destiny in the distant future.
  • 71. IMPORTANT SCIENTIFIC QUESTIONS: WHY TAKE THE ROAD TO 30S? “[But] man with all his noble qualities, with sympathy which feels for the most debased, with benevolence, which extends not only to other men but to the humblest living creature, with his god-like intellect which has penetrated into the movements and constitution of the solar system …
  • 72. IMPORTANT SCIENTIFIC QUESTIONS: WHY TAKE THE ROAD TO 30S? “… with all these exalted powers, Man still bears in his bodily frame the indelible stamp of his lowly origin.”—CHARLES DARWIN, THE DESCENT OF MAN
  • 73. IMPORTANT SCIENTIFIC QUESTIONS: WHY TAKE THE ROAD TO 30S? • Darwin knew no genes; we do. Can we trace the genetic basis of the evolutionary change that Darwin described? • Recent spectacular advances in the sequencing of ancient hominin DNA suggest that the answer may be yes.
  • 74. THE HUMAN FAMILY TREE 1.8 mya? 500 kya 380 kya Prüfer et al. (2014)
  • 75. THE GENETICS OF ANCIENT HOMININS • Usable DNA was recently recovered from a Denisovan-like hominin who died more than 300 kya (Meyer et al., 2014). • I will now show you a comparison of sequences from Neanderthals and modern humans. An artist’s reconstruction of a human-Neanderthal hybrid child.
  • 76. THE GENETICS OF ANCIENT HOMININS This is the modern human sequence encompassing rs1487441, one of the “IQ hits” identified by Rietveld et al. (2014). A is the “plus” allele; G is the “minus” allele. TTCTTCCACTCACTCATCACCATAAA The ancestors of Neanderthals and Denisovans split from our lineage ~500 kya. Neanderthals probably did a lot of evolving since then … but it is still fun to ask: What allele did Neanderthals carry at this site? TTCTTCCACTCACTCG TCACCATAAA
  • 77. IMPORTANT SCIENTIFIC QUESTIONS: WHY TAKE THE ROAD TO 30S?
  • 78. PLEASE CITE THESE PAPERS! • Vattikuti S, Lee JJ, Chang CC, Hsu SDH, Chow CC (2014). Applying compressed sensing to genome-wide association studies. GigaScience, 3, 10. • Lee JJ, Chow CC (2014). Conditions for the validity of SNP-based heritability estimation. Human Genetics, 133, 1011-1022. • Rietveld CA, Esko T, Davies G, Pers TH, Benyamin B, Chabris CF, Emilsson V, Johnson AD, Lee JJ, de Leeuw C, et al. (2014). Common genetic variants associated with cognitive performance identified using the proxy-phenotype method. Proceedings of the National Academy of Sciences USA, 111, 13790-13794. • Chabris CF, Lee JJ, Cesarini D, Benjamin DJ, Laibson DI (in press). The fourth law of behavior genetics. Current Directions in Psychological Science.