SlideShare una empresa de Scribd logo
1 de 18
Advance Statistics: Introduction
to Regression
and Analysis of Variance
Fixed and Random Effects
Today’s Report
   Random effects.
   One-way random effects ANOVA.
   Two-way mixed & random effects
    ANOVA.
   Sattherwaite’s procedure.
Two-way ANOVA
   Second generalization: more than one grouping variable.
   Two-way ANOVA model: observations:
    (Yijk), 1 < i < r , 1 < j < m, 1 < k < nij : r groups in first grouping variable,
    m groups ins second and nij samples in
    (i; j)-“cell”:
    Yijk = µ + α i + βj + (α β)ij + Eijk, Eijk~N(0, σ ² ):
   n Constraints:
    r∑i=1 αi = 0
    m∑j=1 βj = 0
    m∑j=1 (αβ)ij = 0, 1 < i < r
    r∑i=1 (αβ)ij = 0, 1 < j < m
Random and fixed effects
   In ANOVA examples we have seen so far, the categorical
    variables are well-defined categories: below average fitness,
    long duration, etc.
   In some designs, the categorical variable is “subject”.
   Simplest example: repeated measures, where more than
    one (identical) measurement is taken on the same individual.
   In this case, the “group” effect αi is best thought of as
    random because we only sample a subset of the entire
    population of subjects.
When to use random effects?

   A “group” effect is random if we can think of the levels we
    observe in that group to be samples from a larger population.

   Example: if collecting data from different medical centers,
    “center” might be thought of as random.

   Example: if surveying students on different campuses,
    “campus” may be a random effect.
Example: sodium content in beer
   How much sodium is there in North American beer? How?
    much does this vary by brand?

   Observations: for 6 brands of beer, researchers recorded the
    sodium content of 8 12 ounce bottles.

   Questions of interest: what is the “grand mean” sodium
    content? How much variability is there from brand to brand?

   “Individuals” in this case are brands, repeated measures are
    the 8 bottles.
One-way random effects model
   Suppose we take n identical measurements from r subjects

   Yij ~ µ + αi + Eij, 1 < I < r, 1 < j < n

   Eij ~ N(0, σ²), 1 < i < r, 1 < j <n

   αi ~ N(0,σ²µ), 1 < i < r

   We might be interested in the population mean, µ. : CIs, is it zero? etc.

   Alternatively, we might be interested in the variability across
    subjects,σ²µ : CIs, is it zero?
One-way random ANOVA table
    Source                      SS                           df        E(MS)

    Treatments        SSTR = r ∑i=1 n(Y1 – Y..) ²            r–1       σ² + n σ²µ
    Error             SSE = r ∑i=1 (Yij- Yi.) ²             (n - 1)r      σ²




     Only change here is the expectation of SSTR which reflects
      randomness of αi’s
     ANOVA table is still useful to setup tests: the same F statistics for
      fixed or random will work here.
     Under Ho : σ²µ = 0, it is to see that

                                       MSTR         ~Fr-1, (n-1) r’
                                       MSE
Inference for µ
   We know that E(Y..) = µ. , and can show that
        Var(Y..) = nσ²µ + σ²
                           rn
   Therefore,
             Y.. - µ
    square SSTR ~tr - 1
           (r-1)rn
   Why r – 1 Degrees of freedom? Imagine we could record an infinite
    number of observations for each individual, so that Yi ---- µi

   To learn anything about µ. We still only have r observations (µ1,…., µr).

   Sampling more within an individual cannot narrow the CI for µ,.
Estimating ²σµ

    From the ANOVA table
      σ²µ = E(SSTR/(r-1))-E(SSE/((n-1)r))
                             n
    Natural estimate
      S²µ = SSTR/(r-1)-SSE/((n-1)r))
                        n
    Problem: this estimate can be negative! One of the difficulties in random
     effect model
Example: productivity study
   Imagine a study on the productivity of employees in a large
    manufacturing company.

   Company wants to get an idea of daily productivity, and how it depends
    on which machine on employee uses.

   Study: take m employee and r machines, having each employee work on
    each machine for a total of n days.

   As these employees are not all employees, and these machines are not
    all machines it take sense to think of both the effect of machine and
    employees (and interactions) as random
Two-way random effects model
   Yijk ~ µ.. + αi + βj + (αβ)ij + Eij, 1< i < r, 1 < j < m, 1 < k < n

   Eijk ~ N(0, σ²), 1 < i < r, 1 < j < m, 1 < k < n

   αi ~ N(0, σ²α), 1 < i < r.

   βj ~ N(0, σ²β), 1 < j < m.

   (αβ)ij ~ N(0, ²σαβ), 1 < j < m, 1< i < r.

   Cov (Yijk,Yi’j’k’) = &ii’ ²σα + &jj’ ²σβ + &ii’ &jj’ ²σαβ + &ii’ &jj’ &kk’ σ².
ANOVA table: Two-way (random)
SS                                               df          E(SS)
SSA = nm r∑i=1 (Yi.. – Y…) ²                    r -1          σ² + nm σ²α + nσ²αβ

SSB = nr m∑j=1 (Yj.. – Y…) ²                    m -1         σ² +nr σ²β + nσ²αβ

SSAB = n r∑i=1 (Yij. – Yi.. –Yj.. + Y…) ²   (m – 1)(r – 1)   σ² + nσ²αβ

SSE = r∑i=1 m∑j=1 n ∑k=1 (Yijk – Y ij.)²     (n – 1)ab       σ²




   To test Ho : σ²α = 0 use SSA and SSAB.

   To test H0 : σ²αβ = 0 use SSAB and SSE.
Mixed effects model
   In some studies, some factors can be thought of as fixed, others
    random.

   For instance, we might have a study of the effect of a standard
    part of the brewing process on sodium levels in the beer
    example

   Then, we might think of a model in which we have a fixed effect
    for “brewing technique” and a random effect for beer
Two-way mixed effects model
   Yijk ~ µ.. + αi + βj + (αβ)ij + Eij, 1< i < r, 1 < j < m, 1 < k < n

   Eijk ~ N(0, σ²), 1 < i < r, 1 < j < m, 1 < k < n

   αi ~ N(0, σ²α), 1 < i < r.

   βj ~ N(0, σ²β), 1 < j < m. are constant!

   (αβ)ij ~ N (0,(m-1) σ²αβ/m), 1 < j < m, 1< i < r.

   Constraints:
    m∑j=1 βj = 0
    m∑j=1 (αβ)ij = 0, 1 < i < r
    Cov ((αβ)ij, (αβ) i’j’ ) = - σ²αβ/m
   Cov (Yijk,Yi’j’k’) = &jj’ (σ²β + &ii’ m-1 σ²β – (1-&ii’) 1 σ²αβ + &ii’ &kk’ σ²).
                                          m                m
ANOVA table: Two-way (mixed)
SS                                               df                 E(MS)
SSA                                             r -1            σ² + nm σ²α

SSB                                             m -1        σ² + m∑j=1 β²i+ nσ²αβ
                                                                     m -1
SSAB                                       (m – 1)(r – 1)      σ² + nσ²αβ

SSE = r∑i=1 m∑j=1 n ∑k=1 (Yijk – Y ij.)²     (n – 1)ab         σ²




     To test Ho : σ²α = 0 use SSA and SSAB.

     To test Ho : β1 = … = βm = 0 0 use SSB and SSAB.

     To test Ho : σ²αβ use SSAB and SSE.
Confidence intervals for variances

   Consider estimating σ²β in two-way random effects ANOVA. A natural
    estimate is
                  σ²β = nr (MSB – MSAB)
   What about CI?

   A linear combination of x² - but not x².

   To form a confidence interval for σ²β we need to know distribution of a
    linear combination of MS.’s at least approximately.
Satterwaite’s procedure
   Given K independent MS.’s
                 L ~ k∑i=1 ciMSi
   Then
                 dfTL “~” x²dfT
                 E(L)
    where
                 dfT = (k∑i=1 ciMSi)²
                      k∑i=1 ciMS²i/dfi

    Where dfi the the degrees of freedom of the i-th MS.
   (1-α) . 100% CI for E(L):

        LL = dfT x L ,               LU =   dfT x L
            x² dfT; 1-α/2                x² dfT; 1-α/2

Más contenido relacionado

La actualidad más candente

9. basic concepts_of_one_way_analysis_of_variance_(anova)
9. basic concepts_of_one_way_analysis_of_variance_(anova)9. basic concepts_of_one_way_analysis_of_variance_(anova)
9. basic concepts_of_one_way_analysis_of_variance_(anova)Irfan Hussain
 
Analysis of Variance-ANOVA
Analysis of Variance-ANOVAAnalysis of Variance-ANOVA
Analysis of Variance-ANOVARabin BK
 
ANOVA TEST by shafeek
ANOVA TEST by shafeekANOVA TEST by shafeek
ANOVA TEST by shafeekShafeek S
 
Application of ANOVA
Application of ANOVAApplication of ANOVA
Application of ANOVARohit Patidar
 
Anova (Analysis of variation)
Anova (Analysis of variation)Anova (Analysis of variation)
Anova (Analysis of variation)Shakeel Rehman
 
Anova one way sem 1 20142015 dk
Anova one way sem 1 20142015 dkAnova one way sem 1 20142015 dk
Anova one way sem 1 20142015 dkSyifa' Humaira
 
Anova by Hazilah Mohd Amin
Anova by Hazilah Mohd AminAnova by Hazilah Mohd Amin
Anova by Hazilah Mohd AminHazilahMohd
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Sneh Kumari
 
Nonparametric statistics
Nonparametric statisticsNonparametric statistics
Nonparametric statisticsTarun Gehlot
 
Anova ancova manova_mancova
Anova  ancova manova_mancovaAnova  ancova manova_mancova
Anova ancova manova_mancovaCarlo Magno
 
Introduction to ANOVAs
Introduction to ANOVAsIntroduction to ANOVAs
Introduction to ANOVAsflorentinoz
 
Reporting an independent sample t test
Reporting an independent sample t testReporting an independent sample t test
Reporting an independent sample t testKen Plummer
 
One Way Anova
One Way AnovaOne Way Anova
One Way Anovashoffma5
 
Non parametrics tests
Non parametrics testsNon parametrics tests
Non parametrics testsrodrick koome
 

La actualidad más candente (18)

9. basic concepts_of_one_way_analysis_of_variance_(anova)
9. basic concepts_of_one_way_analysis_of_variance_(anova)9. basic concepts_of_one_way_analysis_of_variance_(anova)
9. basic concepts_of_one_way_analysis_of_variance_(anova)
 
Analysis of Variance-ANOVA
Analysis of Variance-ANOVAAnalysis of Variance-ANOVA
Analysis of Variance-ANOVA
 
ANOVA TEST by shafeek
ANOVA TEST by shafeekANOVA TEST by shafeek
ANOVA TEST by shafeek
 
Application of ANOVA
Application of ANOVAApplication of ANOVA
Application of ANOVA
 
Anova (Analysis of variation)
Anova (Analysis of variation)Anova (Analysis of variation)
Anova (Analysis of variation)
 
Anova one way sem 1 20142015 dk
Anova one way sem 1 20142015 dkAnova one way sem 1 20142015 dk
Anova one way sem 1 20142015 dk
 
Chi square test
Chi square testChi square test
Chi square test
 
Anova by Hazilah Mohd Amin
Anova by Hazilah Mohd AminAnova by Hazilah Mohd Amin
Anova by Hazilah Mohd Amin
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
 
Nonparametric statistics
Nonparametric statisticsNonparametric statistics
Nonparametric statistics
 
Anova ancova manova_mancova
Anova  ancova manova_mancovaAnova  ancova manova_mancova
Anova ancova manova_mancova
 
Introduction to ANOVAs
Introduction to ANOVAsIntroduction to ANOVAs
Introduction to ANOVAs
 
Reporting an independent sample t test
Reporting an independent sample t testReporting an independent sample t test
Reporting an independent sample t test
 
One Way Anova
One Way AnovaOne Way Anova
One Way Anova
 
Chi square using excel
Chi square using excelChi square using excel
Chi square using excel
 
Crd tutorial
Crd tutorialCrd tutorial
Crd tutorial
 
T test statistic
T test statisticT test statistic
T test statistic
 
Non parametrics tests
Non parametrics testsNon parametrics tests
Non parametrics tests
 

Destacado (18)

T test and ANOVA
T test and ANOVAT test and ANOVA
T test and ANOVA
 
Unit 7 lesson 1
Unit 7 lesson 1Unit 7 lesson 1
Unit 7 lesson 1
 
t-test vs ANOVA
t-test vs ANOVAt-test vs ANOVA
t-test vs ANOVA
 
non parametric statistics
non parametric statisticsnon parametric statistics
non parametric statistics
 
F Distribution
F  DistributionF  Distribution
F Distribution
 
Regression vs ANOVA
Regression vs ANOVARegression vs ANOVA
Regression vs ANOVA
 
Presentation non parametric
Presentation non parametricPresentation non parametric
Presentation non parametric
 
One way anova final ppt.
One way anova final ppt.One way anova final ppt.
One way anova final ppt.
 
Ppt of statistics for teachers
Ppt of statistics for teachersPpt of statistics for teachers
Ppt of statistics for teachers
 
Applied statistics lecture_8
Applied statistics lecture_8Applied statistics lecture_8
Applied statistics lecture_8
 
F-Distribution
F-DistributionF-Distribution
F-Distribution
 
Anova (f test) and mean differentiation
Anova (f test) and mean differentiationAnova (f test) and mean differentiation
Anova (f test) and mean differentiation
 
Parametric vs Non-Parametric
Parametric vs Non-ParametricParametric vs Non-Parametric
Parametric vs Non-Parametric
 
Nonparametric tests
Nonparametric testsNonparametric tests
Nonparametric tests
 
Non-Parametric Tests
Non-Parametric TestsNon-Parametric Tests
Non-Parametric Tests
 
discriminant analysis
discriminant analysisdiscriminant analysis
discriminant analysis
 
Hypothesis testing ppt final
Hypothesis testing ppt finalHypothesis testing ppt final
Hypothesis testing ppt final
 
Non parametric tests
Non parametric testsNon parametric tests
Non parametric tests
 

Similar a Advance statistics 2

An introduction to digital health surveillance from online user-generated con...
An introduction to digital health surveillance from online user-generated con...An introduction to digital health surveillance from online user-generated con...
An introduction to digital health surveillance from online user-generated con...Vasileios Lampos
 
M. Dimitrijević, Noncommutative models of gauge and gravity theories
M. Dimitrijević, Noncommutative models of gauge and gravity theoriesM. Dimitrijević, Noncommutative models of gauge and gravity theories
M. Dimitrijević, Noncommutative models of gauge and gravity theoriesSEENET-MTP
 
regression analysis .ppt
regression analysis .pptregression analysis .ppt
regression analysis .pptTapanKumarDash3
 
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxA study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxjibinjohn140
 
BlUP and BLUE- REML of linear mixed model
BlUP and BLUE- REML of linear mixed modelBlUP and BLUE- REML of linear mixed model
BlUP and BLUE- REML of linear mixed modelKyusonLim
 
Some Examples of Scaling Sets
Some Examples of Scaling SetsSome Examples of Scaling Sets
Some Examples of Scaling SetsVjekoslavKovac1
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft
 
U1.4-RVDistributions.ppt
U1.4-RVDistributions.pptU1.4-RVDistributions.ppt
U1.4-RVDistributions.pptSameeraasif2
 
Descriptive Statistics Formula Sheet Sample Populatio.docx
Descriptive Statistics Formula Sheet    Sample Populatio.docxDescriptive Statistics Formula Sheet    Sample Populatio.docx
Descriptive Statistics Formula Sheet Sample Populatio.docxsimonithomas47935
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysisMahak Vijayvargiya
 
Variance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeriVariance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeriParavayya Pujeri
 
Reading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsReading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsKeisuke OTAKI
 

Similar a Advance statistics 2 (20)

One Way ANOVA.pdf
One Way ANOVA.pdfOne Way ANOVA.pdf
One Way ANOVA.pdf
 
An introduction to digital health surveillance from online user-generated con...
An introduction to digital health surveillance from online user-generated con...An introduction to digital health surveillance from online user-generated con...
An introduction to digital health surveillance from online user-generated con...
 
Anov af03
Anov af03Anov af03
Anov af03
 
M. Dimitrijević, Noncommutative models of gauge and gravity theories
M. Dimitrijević, Noncommutative models of gauge and gravity theoriesM. Dimitrijević, Noncommutative models of gauge and gravity theories
M. Dimitrijević, Noncommutative models of gauge and gravity theories
 
Review of probability calculus
Review of probability calculusReview of probability calculus
Review of probability calculus
 
NODDEA2012_VANKOVA
NODDEA2012_VANKOVANODDEA2012_VANKOVA
NODDEA2012_VANKOVA
 
{ANOVA} PPT-1.pptx
{ANOVA} PPT-1.pptx{ANOVA} PPT-1.pptx
{ANOVA} PPT-1.pptx
 
regression analysis .ppt
regression analysis .pptregression analysis .ppt
regression analysis .ppt
 
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptxA study on the ANOVA ANALYSIS OF VARIANCE.pptx
A study on the ANOVA ANALYSIS OF VARIANCE.pptx
 
BlUP and BLUE- REML of linear mixed model
BlUP and BLUE- REML of linear mixed modelBlUP and BLUE- REML of linear mixed model
BlUP and BLUE- REML of linear mixed model
 
Correlation
CorrelationCorrelation
Correlation
 
Some Examples of Scaling Sets
Some Examples of Scaling SetsSome Examples of Scaling Sets
Some Examples of Scaling Sets
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas help
 
U1.4-RVDistributions.ppt
U1.4-RVDistributions.pptU1.4-RVDistributions.ppt
U1.4-RVDistributions.ppt
 
Descriptive Statistics Formula Sheet Sample Populatio.docx
Descriptive Statistics Formula Sheet    Sample Populatio.docxDescriptive Statistics Formula Sheet    Sample Populatio.docx
Descriptive Statistics Formula Sheet Sample Populatio.docx
 
Basics of Regression analysis
 Basics of Regression analysis Basics of Regression analysis
Basics of Regression analysis
 
Variance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeriVariance component analysis by paravayya c pujeri
Variance component analysis by paravayya c pujeri
 
MITx_14310_CLT
MITx_14310_CLTMITx_14310_CLT
MITx_14310_CLT
 
L20_D.pdf
L20_D.pdfL20_D.pdf
L20_D.pdf
 
Reading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsReading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGs
 

Más de Tim Arroyo

Individual behavior Ch 12
Individual behavior Ch 12Individual behavior Ch 12
Individual behavior Ch 12Tim Arroyo
 
Forecasting Problem Set
Forecasting Problem SetForecasting Problem Set
Forecasting Problem SetTim Arroyo
 
Es 08 pert case 1
Es 08 pert case 1Es 08 pert case 1
Es 08 pert case 1Tim Arroyo
 
Es 08 pert case 2
Es 08 pert case 2Es 08 pert case 2
Es 08 pert case 2Tim Arroyo
 
Es 08 forecasting topic final
Es 08 forecasting topic finalEs 08 forecasting topic final
Es 08 forecasting topic finalTim Arroyo
 
Es 08 pert final
Es 08 pert finalEs 08 pert final
Es 08 pert finalTim Arroyo
 
Lab 4 handout 043012
Lab 4 handout 043012Lab 4 handout 043012
Lab 4 handout 043012Tim Arroyo
 
Adstat upload 1
Adstat upload 1Adstat upload 1
Adstat upload 1Tim Arroyo
 
Porters comp forces
Porters comp forcesPorters comp forces
Porters comp forcesTim Arroyo
 
Management 5 - 7
Management 5 - 7Management 5 - 7
Management 5 - 7Tim Arroyo
 
Management 1 - 4
Management 1 - 4Management 1 - 4
Management 1 - 4Tim Arroyo
 
Situation of the youth presentation (as of june 18 2010)[1]
Situation of the youth presentation (as of june 18 2010)[1]Situation of the youth presentation (as of june 18 2010)[1]
Situation of the youth presentation (as of june 18 2010)[1]Tim Arroyo
 
Peace education communication.volunteers 2011
Peace education communication.volunteers 2011Peace education communication.volunteers 2011
Peace education communication.volunteers 2011Tim Arroyo
 

Más de Tim Arroyo (18)

Individual behavior Ch 12
Individual behavior Ch 12Individual behavior Ch 12
Individual behavior Ch 12
 
11
1111
11
 
10
1010
10
 
5 7
5 75 7
5 7
 
13
1313
13
 
8 9
8 98 9
8 9
 
Forecasting Problem Set
Forecasting Problem SetForecasting Problem Set
Forecasting Problem Set
 
Es 08 pert case 1
Es 08 pert case 1Es 08 pert case 1
Es 08 pert case 1
 
Es 08 pert case 2
Es 08 pert case 2Es 08 pert case 2
Es 08 pert case 2
 
Es 08 forecasting topic final
Es 08 forecasting topic finalEs 08 forecasting topic final
Es 08 forecasting topic final
 
Es 08 pert final
Es 08 pert finalEs 08 pert final
Es 08 pert final
 
Lab 4 handout 043012
Lab 4 handout 043012Lab 4 handout 043012
Lab 4 handout 043012
 
Adstat upload 1
Adstat upload 1Adstat upload 1
Adstat upload 1
 
Porters comp forces
Porters comp forcesPorters comp forces
Porters comp forces
 
Management 5 - 7
Management 5 - 7Management 5 - 7
Management 5 - 7
 
Management 1 - 4
Management 1 - 4Management 1 - 4
Management 1 - 4
 
Situation of the youth presentation (as of june 18 2010)[1]
Situation of the youth presentation (as of june 18 2010)[1]Situation of the youth presentation (as of june 18 2010)[1]
Situation of the youth presentation (as of june 18 2010)[1]
 
Peace education communication.volunteers 2011
Peace education communication.volunteers 2011Peace education communication.volunteers 2011
Peace education communication.volunteers 2011
 

Último

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Último (20)

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

Advance statistics 2

  • 1. Advance Statistics: Introduction to Regression and Analysis of Variance Fixed and Random Effects
  • 2. Today’s Report  Random effects.  One-way random effects ANOVA.  Two-way mixed & random effects ANOVA.  Sattherwaite’s procedure.
  • 3. Two-way ANOVA  Second generalization: more than one grouping variable.  Two-way ANOVA model: observations: (Yijk), 1 < i < r , 1 < j < m, 1 < k < nij : r groups in first grouping variable, m groups ins second and nij samples in (i; j)-“cell”: Yijk = µ + α i + βj + (α β)ij + Eijk, Eijk~N(0, σ ² ):  n Constraints: r∑i=1 αi = 0 m∑j=1 βj = 0 m∑j=1 (αβ)ij = 0, 1 < i < r r∑i=1 (αβ)ij = 0, 1 < j < m
  • 4. Random and fixed effects  In ANOVA examples we have seen so far, the categorical variables are well-defined categories: below average fitness, long duration, etc.  In some designs, the categorical variable is “subject”.  Simplest example: repeated measures, where more than one (identical) measurement is taken on the same individual.  In this case, the “group” effect αi is best thought of as random because we only sample a subset of the entire population of subjects.
  • 5. When to use random effects?  A “group” effect is random if we can think of the levels we observe in that group to be samples from a larger population.  Example: if collecting data from different medical centers, “center” might be thought of as random.  Example: if surveying students on different campuses, “campus” may be a random effect.
  • 6. Example: sodium content in beer  How much sodium is there in North American beer? How? much does this vary by brand?  Observations: for 6 brands of beer, researchers recorded the sodium content of 8 12 ounce bottles.  Questions of interest: what is the “grand mean” sodium content? How much variability is there from brand to brand?  “Individuals” in this case are brands, repeated measures are the 8 bottles.
  • 7. One-way random effects model  Suppose we take n identical measurements from r subjects  Yij ~ µ + αi + Eij, 1 < I < r, 1 < j < n  Eij ~ N(0, σ²), 1 < i < r, 1 < j <n  αi ~ N(0,σ²µ), 1 < i < r  We might be interested in the population mean, µ. : CIs, is it zero? etc.  Alternatively, we might be interested in the variability across subjects,σ²µ : CIs, is it zero?
  • 8. One-way random ANOVA table Source SS df E(MS) Treatments SSTR = r ∑i=1 n(Y1 – Y..) ² r–1 σ² + n σ²µ Error SSE = r ∑i=1 (Yij- Yi.) ² (n - 1)r σ²  Only change here is the expectation of SSTR which reflects randomness of αi’s  ANOVA table is still useful to setup tests: the same F statistics for fixed or random will work here.  Under Ho : σ²µ = 0, it is to see that MSTR ~Fr-1, (n-1) r’ MSE
  • 9. Inference for µ  We know that E(Y..) = µ. , and can show that Var(Y..) = nσ²µ + σ² rn  Therefore, Y.. - µ square SSTR ~tr - 1 (r-1)rn  Why r – 1 Degrees of freedom? Imagine we could record an infinite number of observations for each individual, so that Yi ---- µi  To learn anything about µ. We still only have r observations (µ1,…., µr).  Sampling more within an individual cannot narrow the CI for µ,.
  • 10. Estimating ²σµ  From the ANOVA table σ²µ = E(SSTR/(r-1))-E(SSE/((n-1)r)) n  Natural estimate S²µ = SSTR/(r-1)-SSE/((n-1)r)) n  Problem: this estimate can be negative! One of the difficulties in random effect model
  • 11. Example: productivity study  Imagine a study on the productivity of employees in a large manufacturing company.  Company wants to get an idea of daily productivity, and how it depends on which machine on employee uses.  Study: take m employee and r machines, having each employee work on each machine for a total of n days.  As these employees are not all employees, and these machines are not all machines it take sense to think of both the effect of machine and employees (and interactions) as random
  • 12. Two-way random effects model  Yijk ~ µ.. + αi + βj + (αβ)ij + Eij, 1< i < r, 1 < j < m, 1 < k < n  Eijk ~ N(0, σ²), 1 < i < r, 1 < j < m, 1 < k < n  αi ~ N(0, σ²α), 1 < i < r.  βj ~ N(0, σ²β), 1 < j < m.  (αβ)ij ~ N(0, ²σαβ), 1 < j < m, 1< i < r.  Cov (Yijk,Yi’j’k’) = &ii’ ²σα + &jj’ ²σβ + &ii’ &jj’ ²σαβ + &ii’ &jj’ &kk’ σ².
  • 13. ANOVA table: Two-way (random) SS df E(SS) SSA = nm r∑i=1 (Yi.. – Y…) ² r -1 σ² + nm σ²α + nσ²αβ SSB = nr m∑j=1 (Yj.. – Y…) ² m -1 σ² +nr σ²β + nσ²αβ SSAB = n r∑i=1 (Yij. – Yi.. –Yj.. + Y…) ² (m – 1)(r – 1) σ² + nσ²αβ SSE = r∑i=1 m∑j=1 n ∑k=1 (Yijk – Y ij.)² (n – 1)ab σ²  To test Ho : σ²α = 0 use SSA and SSAB.  To test H0 : σ²αβ = 0 use SSAB and SSE.
  • 14. Mixed effects model  In some studies, some factors can be thought of as fixed, others random.  For instance, we might have a study of the effect of a standard part of the brewing process on sodium levels in the beer example  Then, we might think of a model in which we have a fixed effect for “brewing technique” and a random effect for beer
  • 15. Two-way mixed effects model  Yijk ~ µ.. + αi + βj + (αβ)ij + Eij, 1< i < r, 1 < j < m, 1 < k < n  Eijk ~ N(0, σ²), 1 < i < r, 1 < j < m, 1 < k < n  αi ~ N(0, σ²α), 1 < i < r.  βj ~ N(0, σ²β), 1 < j < m. are constant!  (αβ)ij ~ N (0,(m-1) σ²αβ/m), 1 < j < m, 1< i < r.  Constraints: m∑j=1 βj = 0 m∑j=1 (αβ)ij = 0, 1 < i < r Cov ((αβ)ij, (αβ) i’j’ ) = - σ²αβ/m  Cov (Yijk,Yi’j’k’) = &jj’ (σ²β + &ii’ m-1 σ²β – (1-&ii’) 1 σ²αβ + &ii’ &kk’ σ²). m m
  • 16. ANOVA table: Two-way (mixed) SS df E(MS) SSA r -1 σ² + nm σ²α SSB m -1 σ² + m∑j=1 β²i+ nσ²αβ m -1 SSAB (m – 1)(r – 1) σ² + nσ²αβ SSE = r∑i=1 m∑j=1 n ∑k=1 (Yijk – Y ij.)² (n – 1)ab σ²  To test Ho : σ²α = 0 use SSA and SSAB.  To test Ho : β1 = … = βm = 0 0 use SSB and SSAB.  To test Ho : σ²αβ use SSAB and SSE.
  • 17. Confidence intervals for variances  Consider estimating σ²β in two-way random effects ANOVA. A natural estimate is σ²β = nr (MSB – MSAB)  What about CI?  A linear combination of x² - but not x².  To form a confidence interval for σ²β we need to know distribution of a linear combination of MS.’s at least approximately.
  • 18. Satterwaite’s procedure  Given K independent MS.’s L ~ k∑i=1 ciMSi  Then dfTL “~” x²dfT E(L) where dfT = (k∑i=1 ciMSi)² k∑i=1 ciMS²i/dfi Where dfi the the degrees of freedom of the i-th MS.  (1-α) . 100% CI for E(L): LL = dfT x L , LU = dfT x L x² dfT; 1-α/2 x² dfT; 1-α/2