SlideShare una empresa de Scribd logo
1 de 80
Descargar para leer sin conexión
Bayesian Networks
                      Unit 2
                Statistics Review
              Wang, Yuan-Kai, 王元凱
                     ykwang@mails.fju.edu.tw
                      http://www.ykwang.tw

    Department of Electrical Engineering, Fu Jen Univ.
                  輔仁大學電機工程系

                                2006~2011
                        Reference this document as:
    Wang, Yuan-Kai, “Statistics Review," Lecture Notes of Wang, Yuan-Kai,
                      Fu Jen University, Taiwan, 2011.
Fu Jen University   Department of Electrical Engineering   Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 2



                    Goal of this Unit
         Review basic concepts of
          statistics in terms of
           Image processing
           Pattern recognition




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 3



                      Related Units
         Previous unit(s)
           Probability Review
         Next units
           Uncertainty Inference (Discrete)
           Uncertainty Inference (Continuous)




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 4



                          Self-Study
         Artificial Intelligence: a modern
          approach
           Russell & Norvig, 2nd, Prentice Hall,
            2003. pp.462~474,
           Chapter 13, Sec. 13.1~13.3
         統計學的世界
           墨爾著,鄭惟厚譯, 天下文化,2002
         深入淺出統計學
           D. Grifiths, 楊仁和譯,2009, O’ Reilly


Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                Unit - Statistics Review                       p. 5




                             Contents
      1. Introduction ……………………………                                        6
      2. Histogram ………………………………                                         12
      3. Central Tendency ..............................                18
      4. Variance .............................................         26
      5. Frequency Distribution …………......                              34
      6. Covariance .........................................           52
      7. Covariance Matrix ……………………                                     57
      8. Correlation ..........................................         64
      9. Chart and Graph ……………………..                                     68
      10.References ……………………………                                         79
Fu Jen University    Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 6




                    1. Introduction
         Probability and statistics are
          about uncertainty
           The world is full of uncertainty
           Our hardware/software
            implementation needs to consider
            uncertainty



Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 7



          Uncertainty by Probability
         It summarizes the uncertainty
          that arises from laziness and
          ignorance
         An example
           P(your toothache is caused by a
            cavity) = 0.8
           20% represents your laziness
            and ignorance – all other
            possible causes

Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 8



            Uncertainty by Statistics
         It derives probabilistic facts
          from a set of data
           Derive actual probability
            number
             P(your toothache is caused by
              a cavity) = 0.8
           Describe characteristics of data
             Mean, variance, moment, ...
           Build the statistic model of data
             Gaussian, Gaussian Mixture
           Reason new facts from the data
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                Unit - Statistics Review                       p. 9



                    What Is Statistics
         Given a set of data from a random
          variable
         A statistic is a number that
          provides information about the
          data
           Descriptive statistics
         Two way to describe data
           Measures of central tendency
           Measures of dispersion

Fu Jen University    Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                 Unit - Statistics Review                       p. 10



                    Measures of Central
                        Tendency
         Mean
           Average, expected value of the
            random variable
         Median
           Middle value of the R.V.
         Mode
           The variable value at the peak of the
            pmf/pdf

Fu Jen University     Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 11



             Measures of Dispersion
         Dispersion
           Variance
           Covariance
         Correlation
         Moment
         Others: range, percentiles,
          95% percentile,



Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 12



                       2. Histogram
         This course has 15 students
         Every student has a score with
          values: 0, 10, 20, ... 100
         Random variable
          X = Student's score
         Scores of the 15 student
           {20, 90, 90, 100, 50, 60, 70, 60 ,80,
             70, 80, 90, 80, 70, 70}
           20x1; 50x1; 60x2; 70x4; 80x3; 90x3;
            100x1
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                        p. 13



                    The Histogram
         20x1; 50x1; 60x2; 70x4; 80x3;
          90x3; 100x1     No. of X
                                                                              X

                                             10 20 30 40 50 60 70 80 90 100

         20x1/15; 50x1/15; 60x2/15;
          70x4/15; 80x3/15; 90x3/15;
          100x1/15        P(X)
                                                                              X
      Histogram is P.D.F.
                                             10 20 30 40 50 60 70 80 90 100
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 14



                          Definitions
         For a random variable X
           X has n possible values
            {x1, x2, ..., xn}
         Now there are N random data
          of X
           x1, x2, .., xN
         Histogram & Distribution
           The number of xi : N(xi)
           The probabilities of xi :
            p(xi)= N(xi)/N
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                  Unit - Statistics Review                       p. 15



                    Histogram v.s. P.D.F.




Fu Jen University      Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 16



                       2D Gaussian




             Histogram                                       pdf


Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 17



               Histogram of an Image




   • Random variable X (Gray level) has n possible
     values {x1, x2, ..., xn}, n=256
   • N random data x1, x2, .., xN of X, N=Width*Height
   • Histogram: N(xi)
   • Distribution: P(xi) = N(xi) / N
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                 Unit - Statistics Review                       p. 18



                    3. Central Tendency
       Random variable
        X = Student's score
       Scores of the 15 student
         {20, 90, 90, 100, 50, 60, 70, 60 ,80, 70,
           80, 90, 80, 70, 70}
         20x1; 50x1; 60x2; 70x4; 80x3; 90x3;
          100x1
                                 P(X)
       Histogram
                                                                   X
                                 10 20 30 40 50 60 70 80 90 100
       Mean ?
Fu Jen University     Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                  Unit - Statistics Review                       p. 19



                                     Mean
         Mean from the set of data
                                 1              N
                    E[ X ]  x 
                                 N
                                               x
                                                i 1
                                                          i



         Mean from the p.d.f
                                          n
                    E [ X ]  x   xi p( xi )
                                        i 1




Fu Jen University      Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                Unit - Statistics Review                       p. 20



                    Mean of an Image




                 1          N                                    n
    E[ X ]  x             xi             E [ X ]  x   xi p( xi )
                 N          i 1                                i 1

Fu Jen University    Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                      Unit - Statistics Review                        p. 21



               Disadvantage of Mean
         Mean is easily influenced by
          outlier (extreme values)
         Mean may not be the real value
                                                    P(X)
              1     N
           x
              N
                     x i  72
                    i 1
                                                                                     X
                                                    10 20 30 40 50 60 70 80 90 100




Fu Jen University          Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                           p. 22



                    Median & Mode
        Median and mode are another
         measures of central tendency
    Median: (1) Sort the scores, (2) Select the middle
    {20, 50, 60, 60, 70, 70, 70, 70, 80 ,80, 80, 90, 90, 90, 100}

   Mode: select the score with the maximum N(X) or P(X)
   20x1; 50x1; 60x2; 70x4; 80x3; 90x3; 100x1
                    P(X)
                                                             X
                    10 20 30 40 50 60 70 80 90 100

Fu Jen University   Department of Electrical Engineering         Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 23



      Advantage of Median & Mode
         Median and mode is not
          influenced by outlier
         Median and mode will be the real
          value




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                    Unit - Statistics Review                           p. 24



                     Expected Value
         E[X] : mean
                    n
            E[ X ]  x   xi p ( xi ) E[ X ]  x 
                                                                            

                               i 1
                                                     xp( x)dx              

         E[Xr] – rth moment of X
                    n                       
           E[ X ]   xi p ( xi ) E[ X ]   x r p ( x)dx
               r       r
                                                                       
                         i 1
     E[(X-µ)r]           – rthn central moment of X
              E[( X   ) ]   ( xi   ) p( xi )
                           r                                r

                                      i 1
                                             
             E[( X   ) ]   ( x   ) p ( x)dx
                               r                                  r
                                         
Fu Jen University     Department of Electrical Engineering            Wang, Yuan-Kai Copyright
王元凱                                 Unit - Statistics Review                       p. 25



           Deviation about the Mean

                            x xi

         It indicates how far a value is
          from the center
         It is a very important number to
          measure the dispersion of how a
          distribution spreads out


Fu Jen University   Department of Electrical Engineering       Wang, Yuan-Kai Copyright
王元凱                                   Unit - Statistics Review                            p. 26



                             4. Variance
         Variance and standard deviation
          come from the “deviation”
         Average Deviation
           Calculate all of the deviations and
            find their average
           It is a measure of the typical amount
            any given data point might vary
                                                            N

                                                           ( xi  x )
                    x x
                    i
                                            AD            i 1
                                                                  N
Fu Jen University       Department of Electrical Engineering          Wang, Yuan-Kai Copyright
王元凱                                      Unit - Statistics Review                          p. 27



       We need Absolute Deviation

    xi              x x
                    i
                                              N                        xi     | xi  x |
    1          1-3=-2           xi  x                                1     |1-3|=2
    2          2-3=-1 AAD  i 1                                       2     |2-3|=1
    3          3-3=0             N                                     3     |3-3|=0
    4          4-3=1                                                   4     |4-3|=1
    5          5-3=2                                                   5     |5-3|=2
   =15         =?        N                                         =15       =6
 x  15/5                                 (x       i
                                                         x)
                                                                    x  15/5 ABD=6/
                               AD        i 1
   =3.0                                           N                  =3.0 5 =1.2
Fu Jen University          Department of Electrical Engineering        Wang, Yuan-Kai Copyright
王元凱                                          Unit - Statistics Review                          p. 28



         Or Square of the Deviation
                                                       N
    Square the
    deviations                                    ( x i  x ) 2 Take the
                                                                 square root
    to remove                      Variance  i 1
                                                                 to return to the
    minus signs                                       N
                                                                 original scale
                    N

                          x xi                                        N

                                                                         (x    i
                                                                                     x)   2

  AAD              i 1
                           N                                          i 1
                                                                               N
                    N

                (x        i
                                x)
      AD       i 1
                           N
Fu Jen University              Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                 Unit - Statistics Review                               p. 29



                       Sample Mean
                    and Sample Variance
         We can approximate the expected
          value by the sample mean
                                            N
                               x     1
                                      N   x
                                           i 1
                                                    i

                                                                   N
       Sample variance s 
                                       i
                                                   2           1
                                                               N    (x  x)
                                                                          i
                                                                                  2


           But, strangely enough, if you1want a good
           approximation of the true variance, you
           should use 2      N 2        1 N i
                        
                        ˆ
                            N 1
                                 s         (x  x)
                                      N  1 i 1
                                                     2



Fu Jen University     Department of Electrical Engineering             Wang, Yuan-Kai Copyright
王元凱                                     Unit - Statistics Review                              p. 30



                    Variance of an Image




                                                           n 1
           1                                    2   ( xi  x ) 2 p ( xi ), n  256
                      N
       
       ˆ2
               
          N  1 i 1
                     ( xi  x )2                ˆ
                                                           i 0
                                                    1               n
                                                 x
                                                    N
                                                                    x p( x ) Moments
                                                                   i 1
                                                                          i      i

Fu Jen University         Department of Electrical Engineering            Wang, Yuan-Kai Copyright
王元凱                                   Unit - Statistics Review                         p. 31



             An Example of Variance
         Variance of the scores of 15
          students in this course = ?
           {20, 90, 90, 100, 50, 60, 70, 60 ,80, 70,
             80, 90, 80, 70, 70}
                                                      P(X)
               1           N

         x   x  72
                          i
                                                                                       X
              N i 1                                  10 20 30 40 50 60 70 80 90 100
             1 N i
       2 
       ˆ         
            N  1 i 1
                       ( x  x )2

                    = 388.6
Fu Jen University       Department of Electrical Engineering      Wang, Yuan-Kai Copyright
王元凱                                  Unit - Statistics Review                       p. 32



           Ex. of Standard Deviation
         Standard deviation (SD): 
           = (Var)1/2
                         1           N
              
              ˆ               ( x  x ) = 19.7
                        N  1 i 1
                                   i    2


          P(X)
                                                        1       N

                                                                | x
                                           X
          10 20 30 40 50 60 70 80 90 100                               i
                                                                            x|
                                                        N       i 1

                    52.3 72 91.7
Fu Jen University      Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                               p. 33



      Formal Definition of Variance

      Var ( X )   x  E[( X  E[ X ]) 2 ]   ( xi  x ) 2 p( xi )
                    2

                                                                 i


    Var ( X )    E[( X  E[ X ]) ]   ( xi  x ) p( xi )dx
                    2
                    x
                                                     2                       2

                                                             x




Fu Jen University   Department of Electrical Engineering             Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 34



         5. Frequency Distributions
      • A graph or chart that shows the
        number of observations of a
        given value, or class interval




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                  Unit - Statistics Review                       p. 35



                    Shape of Distribution
         Modality
           The number of peaks in the curve
         Skewness
           An asymmetry in a distribution
            where values are shifted to one
            extreme or the other.
         Kurtosis
           The degree of Peakedness/flatness
            in the curve

Fu Jen University      Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 36



                             Modality
             Unimodal


             Bimodal


             Multimodal


Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 37



                    Skewness (1/3)
         The third moment about the
          Mean
           =0: symmetry distribution
                (Normal distribution )
           >0 : Right Skew (Positive Skew)
           <0 : Left Skew (Negative Skew)

        Right                                                         Left
        skew                       Symmetry                           skew


Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                                       p. 38



                    Skewness (2/3)

                                                              x                   
         Skewness formula                                       N
                                                                                      3
          from data
                                                                           i
                                                                               x
                                                             i 1
                                                                     N  1
         Skewness formula
          from p.d.f                                         n

                                                        x  x 
                                                                                  3
                                                                      i               p ( xi )
                                                        i 1




Fu Jen University   Department of Electrical Engineering                  Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                              p. 39



                    Skewness (3/3)
              Coefficient of Skewness
              (Normalized skewness)

                                x  x 
                                 N
         Skewness formula           i     3
          from data             i 1
                                        3
                                 N  1
                                                              n

                                                              x  x 
         Skewness formula                                                   3
                                                                                 p ( xi )
          from p.d.f                                         i 1
                                                                    i

                                                                        3
                                                                        
Fu Jen University   Department of Electrical Engineering            Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 40



                      Kurtosis (1/3)
         The normalized fourth
          moment about the Mean
           K=3: normal peak (Gaussian)
           K>3: sharp peak
           K<3: flat peak
                                               K=3



Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                                p. 41



                      Kurtosis (2/3)
                                  x                    
                                   N
                                                          4
         From data
                                            i
                                                    x
                                  i 1
                                                     4
                                   N  1
                                    n

                                   x  x 
                                                              4
         From p.d.f                            i                 p ( xi )
                                   i 1
                                                     4
                                                    

Fu Jen University   Department of Electrical Engineering              Wang, Yuan-Kai Copyright
王元凱                                    Unit - Statistics Review                                p. 42



                             Kurtosis (3/3)
         Another definition of kurtosis
           K=0: normal peak (Gaussian)
           K>0: sharp peak
           K<0: flat peak

             x             
             N                                   n

                                               x  x 
                             4
                    i
                        x                                i
                                                                       4
                                                                           p ( xi )
                         4 3
            i 1
             N  1
                                               i 1
                                                                  4                  3
                                                                  

Fu Jen University        Department of Electrical Engineering              Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 43



                                 Demo
         A demo of shape of frequency
          distribution




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                  Unit - Statistics Review                       p. 44



                    Shape of Distribution
                        of an Image




Fu Jen University      Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 45



           Frequency Distributions -
                   Types
         The Normal
         The Uniform
         The Log-normal
         The Exponential
         Statistical Distributions
           t
           -Square
           F

Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 46



          Frequency Distributions –
                Types (cont.)
         Hyper-geometric
         Poisson
         Binomial
         Gamma
         Weibull
         Cauchy
         Beta

Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 47



        Gaussian Dist. Of Data (1/5)




 Observe: Mean, Principal axes,
 implication of off-diagonal               Common convention: show contour
 covariance term, max gradient                corresponding to 2 standard
 zone of p(x)                                    deviations from mean

Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 48



        Gaussian Dist. Of Data (2/5)




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 49



        Gaussian Dist. Of Data (3/5)




      In this example, x and y are almost
      independent
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 50



        Gaussian Dist. Of Data (4/5)




   In this example, x and “x+y” are clearly
   not independent
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 51



        Gaussian Dist. Of Data (5/5)




      In this example, x and “20x+y” are
      clearly not independent
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 52



                     6. Covariance
       In addition to know the mean &
        variance of random variables X & Y
       We usually want to know
         If X increase,
         Will Y probably increase or
          decrease?
       We want to know the probable
        relationship between X and Y
         We will use covariance to identify the
          relationship
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                  Unit - Statistics Review                       p. 53



                         Mean Vector
         Random variable X: score of
          student of this course
         Random variable Y: score of
          student of another course
         Z=(X,Y) is called a random vector
           Mean vector ( x , y )is the mean of Z
                                                                Ny
                       1      Nx
                                                       1
                    x
                       Nx
                             x        i
                                                    y
                                                       Ny
                                                                y
                                                                i 1
                                                                       i

                              i 1


Fu Jen University     Department of Electrical Engineering      Wang, Yuan-Kai Copyright
王元凱                                    Unit - Statistics Review                       p. 54



                        Variance Vector ?
         Variance of X and Y: x2, y2
                                                1         Nx
                 VAR[ X ] 
                    2
                    x
                                          N x  1 i 1
                                                          ( x i  x )2
                                                          Ny
                           1
            VAR[Y ] 
                    2
                    y           
                        N y  1 i 1
                                     ( y i  y )2

      Variance vector (x2, y2)? No. Not Enough.
      •We want to know
        •If X increase,
        •Will Y probably increase or decrease?
Fu Jen University        Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 55



                         Covariance
         For two random variables X and Y
         Covariance indicates the
          tendency of the two random
          variables X & Y to vary together
           i.e., to co-vary
      Cov ( X , Y )   XY  E[( X  E[ X ])(Y  E[Y ])]
                      ( xi  x )( yi  y ) p( xi , yi )
                          i
                       1 N i
                          
                      N  1 i 1
                                 ( x  x )( y i  y )

Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 56



              Property of Covariance
         If X and Y tend to increase
          together, then XY>0
         If X tends to decrease when Y
          increases, then XY<0
         If X and Y are uncorrelated, then
           XY=0
         |XY|≤ XY, where X is the
          standard deviation of X
         XX = 2X = VAR(X)
         XY = YX
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                 Unit - Statistics Review                       p. 57



                    7. Covariance Matrix
         For two random variables X, Y
         There are two variances XX, YY
          and two covariance XY ,YX
         We usually write the four
          variances into a covariance
          matrix   E[(x  μ)(x  μ)T ]
                               E[( x μx ) ]
                                     2
                                                  E[( x μx )( y μy )]
                                                                     
                           E[( x μx )( y μy )]     E[( x μx ) ] 
                                                                  2
                                                                      
                            x  xy 
                              2

                                  2 
                           xy  y 
                                     
Fu Jen University     Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 58



                                   Quiz




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 59



                                   Quiz
         For 3 random variables: x,y,z
         The random vector v=(x,y,z)
         The covariance matrix of v?




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                Unit - Statistics Review                       p. 60



                    Independence and
                       Covariance




Fu Jen University    Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 61



               Formal Definition (1/2)
         For a random vector
          X = (X1, X2, ..., Xm) with n random
          variables
           The probability of x =(x1, x2, ..., xn),
            P(x), is a joint probability
            P(x) = P(x1, x2, ..., xn)
         Its expected values are
          a mean vector  = (1, 2, ..., m)
           E[X] = (E[X1], E[X2], ..., E[Xm])
                 = (1, 2, ..., m) = 
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                  Unit - Statistics Review                       p. 62



               Formal Definition (2/2)
            For the random vector
             X = (X1, X2, ..., Xm)
            All of its covariance values are
             a covariance matrix
            Cov (X)  E[(X   )(X   )T ]
               E[( X 1  1 )( X 1  1 )]  E[( X 1  1 )( X m   m )] 
                                                                      
                                                                            
               E[( X m   m )( X 1  1 )]  E[( X m   m )( X m   m )]
                                                                           
                12  12   1m  •  is a square and
                                        
               21  2   2 m 
                         2
                                              symmetric matrix
                          • Its diagonal elements
                                      2 
               m1  m 2   m 
                                             are 2i
Fu Jen University      Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 63



    Property of Covariance Matrix

         Symmetric matrix
         Positive definition matrix
         Eigenvalue




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 64



                      8. Correlation
         Correlation is another measure of
          the relationship of two random
          variables Cov( X , Y )         XY
              XY                    
                     Var ( X )Var (Y )  X  Y
        • -1  XY  1
        • XY = 0 means that the variables are
          uncorrelated
        • XY = 0    E[XY] = E[X] E[Y]
           • Independence means uncorrelated
           • But not vice versa!
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 65



           Correlation & Covariance
         |XY|≤ XY
           -XY ≤ XY ≤ XY
           XY = XYXY
             -1  XY  1
             XY is called the correlation
              coefficient 
               -1   xy 
                              xy
                                    +1
                                          x  y
Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                      Unit - Statistics Review                           p. 66



          Covariance v.s. Correlation
                                                        Y
                    Y



                                          X                                    X
                        XY = XY                      XY = 1/2XY
                          XY = +1                              XY = 1/2
      Y                              Y                                Y



                            X                                   X                          X
          XY = -XY                XY = -1/2XY                         XY = 0
              XY = -1                      XY = -1/2                        XY = 0
Fu Jen University         Department of Electrical Engineering          Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 67



                               Demos
         Flash demos of correlation with
          scatter plot




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                 Unit - Statistics Review                       p. 68



                    9. Charts and Graphs
         Descriptive Graphs
           Bar Chart
           Pie Graph
           Line Graph
         Distributions
           Histogram
           Box Plot
           Steam and Leaf

Fu Jen University     Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 69



                          Bar Charts
         Best for displaying actual values.
         Can handle moderate # of cases
          (bars)
         Excel calls it a column chart




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                             Unit - Statistics Review                       p. 70



             Bar Chart – An Example
                                  Vote for President in 2000

                         60
             Millions


                         50
                         40
                         30
                         20
                         10
                          0
                                           e




                                                                      r
                                                     sh
                              an




                                                                     e
                                         or




                                                                  ad
                                                  Bu
                             an



                                        G




                                                                N
                           ch
                        Bu




Fu Jen University                 Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 71



                          Pie Charts
         Best used with small number of
          categories or cases to display
         Good for showing relative
          distribution
           Percentages, proportions
         Use only one column of data
           Plus one column of labels


Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                         Unit - Statistics Review                                 p. 72



                    Pie Chart Examples
                                                                Vote for President in 2004




                                                                                             Bush
                                                                                             Nader
                                                                                             Kerry




            Vote for President in 2000



                                         Buchanan
                                         Gore
                                         Bush
                                         Nader




Fu Jen University         Department of Electrical Engineering             Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 73



                        Line graphs
         Best for showing data across
          time
         Always give dates
         Label X axis
         Indicate units on Y axis
         Use legend for multiple lines


Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                                     Unit - Statistics Review                               p. 74



                  Line Graphs - Example
                                   Defense Spending - 1940-2002

                          400000

                          300000
             Millions $




                                                                                               Defense
                          200000
                                                                                               Spending
                          100000

                              0
                                   1940
                                          1948
                                                 1956
                                                        1964
                                                               1972
                                                                      1980
                                                                             1988
                                                                                    1996
                                                           Year


Fu Jen University                  Department of Electrical Engineering                    Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 75



                             Box Plot
         Quick picture of a distribution
         Parts of box give distribution
          characteristics
         Your Text is not quite accurate!




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                Unit - Statistics Review                       p. 76



                    Stem and Leaf Plot
         Good for showing distribution
          while preserving data
         Figuring out stems can be tricky




Fu Jen University    Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                 Unit - Statistics Review                       p. 77



                    Review for Next Unit
         We learn
           Histogram, mean, variance,
            covariance, correlation
           Frequency distribution, Gaussian
         We will learn in next unit
           Moment, Gaussian Mixture, Linear
            Gaussian
           Moment  mean, variance

Fu Jen University     Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                                Unit - Statistics Review                              p. 78



      Mean & Variance of an Image




                                                 1 n
      2 
      ˆ
            1 N i
                     ( x  x )2
                                           2 
                                           ˆ         
                                                N  1 i 1
                                                           ( xi  x ) 2 p ( xi )
           N  1 i 1
                                               1               n
                                            x
                                               N
                                                               x p( x ) Moments
                                                              i 1
                                                                     i      i

Fu Jen University    Department of Electrical Engineering            Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 79



                    10. References
         An excellent textbook
          統計學的世界,鄭惟厚譯,天下文化2002
           Statistics: concepts and controversies,
            5th, D. S. Moore, 2001
           Chart & graph: Chap. 10,11
           Central tendency & dispersion: Chap. 12
           Gaussian distribution: Chap. 13
           Correlation: Chap. 14, 15




Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright
王元凱                               Unit - Statistics Review                       p. 80



                    透過Excel學統計
       統計學,劉明德等著,全華,93
         Chart & graph : Chap. 2
         Central tendency & dispersion: Chap. 3
         Frequency distribution: Chap. 6,7
       統計學與Excel,王文中著,博碩,93
         Central tendency (mean, mode, median) :
          Chap. 2
         Dispersion (variance, standard deviation) :
          Chap. 3
         Gaussian distribution : Chap. 4
         Correlation: Chap. 11


Fu Jen University   Department of Electrical Engineering     Wang, Yuan-Kai Copyright

Más contenido relacionado

Similar a 02 Statistics review

Theory Communication
Theory CommunicationTheory Communication
Theory CommunicationHikari Riten
 
Convergence Problems Of Contingency Analysis In Electrical Power Transmission...
Convergence Problems Of Contingency Analysis In Electrical Power Transmission...Convergence Problems Of Contingency Analysis In Electrical Power Transmission...
Convergence Problems Of Contingency Analysis In Electrical Power Transmission...CSCJournals
 
Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...
Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...
Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...IJMER
 
Probabilistic Performance Index based Contingency Screening for Composite Pow...
Probabilistic Performance Index based Contingency Screening for Composite Pow...Probabilistic Performance Index based Contingency Screening for Composite Pow...
Probabilistic Performance Index based Contingency Screening for Composite Pow...IJECEIAES
 
Resume-Sangbin Jeon
Resume-Sangbin JeonResume-Sangbin Jeon
Resume-Sangbin JeonSangbin Jeon
 
signals_and_systems.pptx
signals_and_systems.pptxsignals_and_systems.pptx
signals_and_systems.pptxMTiti1
 

Similar a 02 Statistics review (17)

03 Uncertainty inference(discrete)
03 Uncertainty inference(discrete)03 Uncertainty inference(discrete)
03 Uncertainty inference(discrete)
 
04 Uncertainty inference(continuous)
04 Uncertainty inference(continuous)04 Uncertainty inference(continuous)
04 Uncertainty inference(continuous)
 
01 Probability review
01 Probability review01 Probability review
01 Probability review
 
07 approximate inference in bn
07 approximate inference in bn07 approximate inference in bn
07 approximate inference in bn
 
Theory Communication
Theory CommunicationTheory Communication
Theory Communication
 
06 exact inference in bn
06 exact inference in bn06 exact inference in bn
06 exact inference in bn
 
Convergence Problems Of Contingency Analysis In Electrical Power Transmission...
Convergence Problems Of Contingency Analysis In Electrical Power Transmission...Convergence Problems Of Contingency Analysis In Electrical Power Transmission...
Convergence Problems Of Contingency Analysis In Electrical Power Transmission...
 
08 probabilistic inference over time
08 probabilistic inference over time08 probabilistic inference over time
08 probabilistic inference over time
 
Prob - Syllabus.pdf
Prob - Syllabus.pdfProb - Syllabus.pdf
Prob - Syllabus.pdf
 
05 probabilistic graphical models
05 probabilistic graphical models05 probabilistic graphical models
05 probabilistic graphical models
 
Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...
Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...
Composite Analysis of Phase Resolved Partial Discharge Patterns using Statist...
 
Towards Embedded Computer Vision - New @ 2013
Towards Embedded Computer Vision - New @ 2013Towards Embedded Computer Vision - New @ 2013
Towards Embedded Computer Vision - New @ 2013
 
Probabilistic Performance Index based Contingency Screening for Composite Pow...
Probabilistic Performance Index based Contingency Screening for Composite Pow...Probabilistic Performance Index based Contingency Screening for Composite Pow...
Probabilistic Performance Index based Contingency Screening for Composite Pow...
 
PhD Qualifying Exam Slides
PhD Qualifying Exam SlidesPhD Qualifying Exam Slides
PhD Qualifying Exam Slides
 
Estimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample SetsEstimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample Sets
 
Resume-Sangbin Jeon
Resume-Sangbin JeonResume-Sangbin Jeon
Resume-Sangbin Jeon
 
signals_and_systems.pptx
signals_and_systems.pptxsignals_and_systems.pptx
signals_and_systems.pptx
 

Más de IEEE International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Más de IEEE International Conference on Intelligent Information Hiding and Multimedia Signal Processing (9)

Computer Vision in the Age of IoT
Computer Vision in the Age of IoTComputer Vision in the Age of IoT
Computer Vision in the Age of IoT
 
2014/07/17 Parallelize computer vision by GPGPU computing
2014/07/17 Parallelize computer vision by GPGPU computing2014/07/17 Parallelize computer vision by GPGPU computing
2014/07/17 Parallelize computer vision by GPGPU computing
 
老師與教學助理的互動經驗分享 1010217
老師與教學助理的互動經驗分享 1010217老師與教學助理的互動經驗分享 1010217
老師與教學助理的互動經驗分享 1010217
 
Parallel Vision by GPGPU/CUDA
Parallel Vision by GPGPU/CUDAParallel Vision by GPGPU/CUDA
Parallel Vision by GPGPU/CUDA
 
Markov Random Field (MRF)
Markov Random Field (MRF)Markov Random Field (MRF)
Markov Random Field (MRF)
 
Monocular Human Pose Estimation with Bayesian Networks
Monocular Human Pose Estimation with Bayesian NetworksMonocular Human Pose Estimation with Bayesian Networks
Monocular Human Pose Estimation with Bayesian Networks
 
Towards Embedded Computer Vision邁向嵌入式電腦視覺
Towards Embedded Computer Vision邁向嵌入式電腦視覺Towards Embedded Computer Vision邁向嵌入式電腦視覺
Towards Embedded Computer Vision邁向嵌入式電腦視覺
 
Intelligent Video Surveillance with Cloud Computing
Intelligent Video Surveillance with Cloud ComputingIntelligent Video Surveillance with Cloud Computing
Intelligent Video Surveillance with Cloud Computing
 
Intelligent Video Surveillance and Sousveillance
Intelligent Video Surveillance and SousveillanceIntelligent Video Surveillance and Sousveillance
Intelligent Video Surveillance and Sousveillance
 

Último

SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...KokoStevan
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.MateoGardella
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 

Último (20)

SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 

02 Statistics review

  • 1. Bayesian Networks Unit 2 Statistics Review Wang, Yuan-Kai, 王元凱 ykwang@mails.fju.edu.tw http://www.ykwang.tw Department of Electrical Engineering, Fu Jen Univ. 輔仁大學電機工程系 2006~2011 Reference this document as: Wang, Yuan-Kai, “Statistics Review," Lecture Notes of Wang, Yuan-Kai, Fu Jen University, Taiwan, 2011. Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 2. 王元凱 Unit - Statistics Review p. 2 Goal of this Unit  Review basic concepts of statistics in terms of  Image processing  Pattern recognition Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 3. 王元凱 Unit - Statistics Review p. 3 Related Units  Previous unit(s)  Probability Review  Next units  Uncertainty Inference (Discrete)  Uncertainty Inference (Continuous) Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 4. 王元凱 Unit - Statistics Review p. 4 Self-Study  Artificial Intelligence: a modern approach  Russell & Norvig, 2nd, Prentice Hall, 2003. pp.462~474,  Chapter 13, Sec. 13.1~13.3  統計學的世界  墨爾著,鄭惟厚譯, 天下文化,2002  深入淺出統計學  D. Grifiths, 楊仁和譯,2009, O’ Reilly Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 5. 王元凱 Unit - Statistics Review p. 5 Contents 1. Introduction …………………………… 6 2. Histogram ……………………………… 12 3. Central Tendency .............................. 18 4. Variance ............................................. 26 5. Frequency Distribution …………...... 34 6. Covariance ......................................... 52 7. Covariance Matrix …………………… 57 8. Correlation .......................................... 64 9. Chart and Graph …………………….. 68 10.References …………………………… 79 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 6. 王元凱 Unit - Statistics Review p. 6 1. Introduction  Probability and statistics are about uncertainty  The world is full of uncertainty  Our hardware/software implementation needs to consider uncertainty Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 7. 王元凱 Unit - Statistics Review p. 7 Uncertainty by Probability  It summarizes the uncertainty that arises from laziness and ignorance  An example  P(your toothache is caused by a cavity) = 0.8  20% represents your laziness and ignorance – all other possible causes Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 8. 王元凱 Unit - Statistics Review p. 8 Uncertainty by Statistics  It derives probabilistic facts from a set of data  Derive actual probability number  P(your toothache is caused by a cavity) = 0.8  Describe characteristics of data  Mean, variance, moment, ...  Build the statistic model of data  Gaussian, Gaussian Mixture  Reason new facts from the data Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 9. 王元凱 Unit - Statistics Review p. 9 What Is Statistics  Given a set of data from a random variable  A statistic is a number that provides information about the data  Descriptive statistics  Two way to describe data  Measures of central tendency  Measures of dispersion Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 10. 王元凱 Unit - Statistics Review p. 10 Measures of Central Tendency  Mean  Average, expected value of the random variable  Median  Middle value of the R.V.  Mode  The variable value at the peak of the pmf/pdf Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 11. 王元凱 Unit - Statistics Review p. 11 Measures of Dispersion  Dispersion  Variance  Covariance  Correlation  Moment  Others: range, percentiles, 95% percentile, Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 12. 王元凱 Unit - Statistics Review p. 12 2. Histogram  This course has 15 students  Every student has a score with values: 0, 10, 20, ... 100  Random variable X = Student's score  Scores of the 15 student  {20, 90, 90, 100, 50, 60, 70, 60 ,80, 70, 80, 90, 80, 70, 70}  20x1; 50x1; 60x2; 70x4; 80x3; 90x3; 100x1 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 13. 王元凱 Unit - Statistics Review p. 13 The Histogram  20x1; 50x1; 60x2; 70x4; 80x3; 90x3; 100x1 No. of X X 10 20 30 40 50 60 70 80 90 100  20x1/15; 50x1/15; 60x2/15; 70x4/15; 80x3/15; 90x3/15; 100x1/15 P(X) X Histogram is P.D.F. 10 20 30 40 50 60 70 80 90 100 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 14. 王元凱 Unit - Statistics Review p. 14 Definitions  For a random variable X  X has n possible values {x1, x2, ..., xn}  Now there are N random data of X  x1, x2, .., xN  Histogram & Distribution  The number of xi : N(xi)  The probabilities of xi : p(xi)= N(xi)/N Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 15. 王元凱 Unit - Statistics Review p. 15 Histogram v.s. P.D.F. Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 16. 王元凱 Unit - Statistics Review p. 16 2D Gaussian Histogram pdf Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 17. 王元凱 Unit - Statistics Review p. 17 Histogram of an Image • Random variable X (Gray level) has n possible values {x1, x2, ..., xn}, n=256 • N random data x1, x2, .., xN of X, N=Width*Height • Histogram: N(xi) • Distribution: P(xi) = N(xi) / N Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 18. 王元凱 Unit - Statistics Review p. 18 3. Central Tendency  Random variable X = Student's score  Scores of the 15 student  {20, 90, 90, 100, 50, 60, 70, 60 ,80, 70, 80, 90, 80, 70, 70}  20x1; 50x1; 60x2; 70x4; 80x3; 90x3; 100x1 P(X)  Histogram X 10 20 30 40 50 60 70 80 90 100  Mean ? Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 19. 王元凱 Unit - Statistics Review p. 19 Mean  Mean from the set of data 1 N E[ X ]  x  N x i 1 i  Mean from the p.d.f n E [ X ]  x   xi p( xi ) i 1 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 20. 王元凱 Unit - Statistics Review p. 20 Mean of an Image 1 N n E[ X ]  x   xi E [ X ]  x   xi p( xi ) N i 1 i 1 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 21. 王元凱 Unit - Statistics Review p. 21 Disadvantage of Mean  Mean is easily influenced by outlier (extreme values)  Mean may not be the real value P(X) 1 N x N  x i  72 i 1 X 10 20 30 40 50 60 70 80 90 100 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 22. 王元凱 Unit - Statistics Review p. 22 Median & Mode  Median and mode are another measures of central tendency Median: (1) Sort the scores, (2) Select the middle {20, 50, 60, 60, 70, 70, 70, 70, 80 ,80, 80, 90, 90, 90, 100} Mode: select the score with the maximum N(X) or P(X) 20x1; 50x1; 60x2; 70x4; 80x3; 90x3; 100x1 P(X) X 10 20 30 40 50 60 70 80 90 100 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 23. 王元凱 Unit - Statistics Review p. 23 Advantage of Median & Mode  Median and mode is not influenced by outlier  Median and mode will be the real value Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 24. 王元凱 Unit - Statistics Review p. 24 Expected Value  E[X] : mean n E[ X ]  x   xi p ( xi ) E[ X ]  x   i 1  xp( x)dx   E[Xr] – rth moment of X n  E[ X ]   xi p ( xi ) E[ X ]   x r p ( x)dx r r  i 1  E[(X-µ)r] – rthn central moment of X E[( X   ) ]   ( xi   ) p( xi ) r r i 1  E[( X   ) ]   ( x   ) p ( x)dx r r  Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 25. 王元凱 Unit - Statistics Review p. 25 Deviation about the Mean x xi  It indicates how far a value is from the center  It is a very important number to measure the dispersion of how a distribution spreads out Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 26. 王元凱 Unit - Statistics Review p. 26 4. Variance  Variance and standard deviation come from the “deviation”  Average Deviation  Calculate all of the deviations and find their average  It is a measure of the typical amount any given data point might vary N  ( xi  x ) x x i AD  i 1 N Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 27. 王元凱 Unit - Statistics Review p. 27 We need Absolute Deviation xi x x i N xi | xi  x | 1 1-3=-2  xi  x 1 |1-3|=2 2 2-3=-1 AAD  i 1 2 |2-3|=1 3 3-3=0 N 3 |3-3|=0 4 4-3=1 4 |4-3|=1 5 5-3=2 5 |5-3|=2 =15 =? N =15 =6 x  15/5  (x i  x) x  15/5 ABD=6/ AD  i 1 =3.0 N =3.0 5 =1.2 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 28. 王元凱 Unit - Statistics Review p. 28 Or Square of the Deviation N Square the deviations  ( x i  x ) 2 Take the square root to remove Variance  i 1 to return to the minus signs N original scale N  x xi N  (x i  x) 2 AAD  i 1 N  i 1 N N  (x i  x) AD  i 1 N Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 29. 王元凱 Unit - Statistics Review p. 29 Sample Mean and Sample Variance  We can approximate the expected value by the sample mean N x 1 N x i 1 i N  Sample variance s  i 2 1 N  (x  x) i 2 But, strangely enough, if you1want a good approximation of the true variance, you should use 2 N 2 1 N i   ˆ N 1 s   (x  x) N  1 i 1 2 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 30. 王元凱 Unit - Statistics Review p. 30 Variance of an Image n 1 1  2   ( xi  x ) 2 p ( xi ), n  256 N   ˆ2  N  1 i 1 ( xi  x )2 ˆ i 0 1 n x N  x p( x ) Moments i 1 i i Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 31. 王元凱 Unit - Statistics Review p. 31 An Example of Variance  Variance of the scores of 15 students in this course = ?  {20, 90, 90, 100, 50, 60, 70, 60 ,80, 70, 80, 90, 80, 70, 70} P(X) 1 N  x   x  72 i X N i 1 10 20 30 40 50 60 70 80 90 100 1 N i 2  ˆ  N  1 i 1 ( x  x )2 = 388.6 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 32. 王元凱 Unit - Statistics Review p. 32 Ex. of Standard Deviation  Standard deviation (SD):   = (Var)1/2 1 N  ˆ  ( x  x ) = 19.7 N  1 i 1 i 2 P(X) 1 N | x X 10 20 30 40 50 60 70 80 90 100  i x| N i 1 52.3 72 91.7 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 33. 王元凱 Unit - Statistics Review p. 33 Formal Definition of Variance Var ( X )   x  E[( X  E[ X ]) 2 ]   ( xi  x ) 2 p( xi ) 2 i Var ( X )    E[( X  E[ X ]) ]   ( xi  x ) p( xi )dx 2 x 2 2 x Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 34. 王元凱 Unit - Statistics Review p. 34 5. Frequency Distributions • A graph or chart that shows the number of observations of a given value, or class interval Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 35. 王元凱 Unit - Statistics Review p. 35 Shape of Distribution  Modality  The number of peaks in the curve  Skewness  An asymmetry in a distribution where values are shifted to one extreme or the other.  Kurtosis  The degree of Peakedness/flatness in the curve Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 36. 王元凱 Unit - Statistics Review p. 36 Modality  Unimodal  Bimodal  Multimodal Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 37. 王元凱 Unit - Statistics Review p. 37 Skewness (1/3)  The third moment about the Mean  =0: symmetry distribution (Normal distribution )  >0 : Right Skew (Positive Skew)  <0 : Left Skew (Negative Skew) Right Left skew Symmetry skew Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 38. 王元凱 Unit - Statistics Review p. 38 Skewness (2/3)  x   Skewness formula N 3 from data i x i 1 N  1  Skewness formula from p.d.f n  x  x  3 i p ( xi ) i 1 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 39. 王元凱 Unit - Statistics Review p. 39 Skewness (3/3) Coefficient of Skewness (Normalized skewness)  x  x  N  Skewness formula i 3 from data i 1 3 N  1 n  x  x   Skewness formula 3 p ( xi ) from p.d.f i 1 i 3  Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 40. 王元凱 Unit - Statistics Review p. 40 Kurtosis (1/3)  The normalized fourth moment about the Mean  K=3: normal peak (Gaussian)  K>3: sharp peak  K<3: flat peak K=3 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 41. 王元凱 Unit - Statistics Review p. 41 Kurtosis (2/3)  x  N 4  From data i x i 1 4 N  1 n  x  x  4  From p.d.f i p ( xi ) i 1 4  Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 42. 王元凱 Unit - Statistics Review p. 42 Kurtosis (3/3)  Another definition of kurtosis  K=0: normal peak (Gaussian)  K>0: sharp peak  K<0: flat peak  x  N n  x  x  4 i x i 4 p ( xi ) 4 3 i 1 N  1 i 1 4 3  Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 43. 王元凱 Unit - Statistics Review p. 43 Demo  A demo of shape of frequency distribution Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 44. 王元凱 Unit - Statistics Review p. 44 Shape of Distribution of an Image Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 45. 王元凱 Unit - Statistics Review p. 45 Frequency Distributions - Types  The Normal  The Uniform  The Log-normal  The Exponential  Statistical Distributions  t  -Square  F Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 46. 王元凱 Unit - Statistics Review p. 46 Frequency Distributions – Types (cont.)  Hyper-geometric  Poisson  Binomial  Gamma  Weibull  Cauchy  Beta Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 47. 王元凱 Unit - Statistics Review p. 47 Gaussian Dist. Of Data (1/5) Observe: Mean, Principal axes, implication of off-diagonal Common convention: show contour covariance term, max gradient corresponding to 2 standard zone of p(x) deviations from mean Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 48. 王元凱 Unit - Statistics Review p. 48 Gaussian Dist. Of Data (2/5) Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 49. 王元凱 Unit - Statistics Review p. 49 Gaussian Dist. Of Data (3/5) In this example, x and y are almost independent Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 50. 王元凱 Unit - Statistics Review p. 50 Gaussian Dist. Of Data (4/5) In this example, x and “x+y” are clearly not independent Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 51. 王元凱 Unit - Statistics Review p. 51 Gaussian Dist. Of Data (5/5) In this example, x and “20x+y” are clearly not independent Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 52. 王元凱 Unit - Statistics Review p. 52 6. Covariance  In addition to know the mean & variance of random variables X & Y  We usually want to know  If X increase,  Will Y probably increase or decrease?  We want to know the probable relationship between X and Y  We will use covariance to identify the relationship Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 53. 王元凱 Unit - Statistics Review p. 53 Mean Vector  Random variable X: score of student of this course  Random variable Y: score of student of another course  Z=(X,Y) is called a random vector  Mean vector ( x , y )is the mean of Z Ny 1 Nx 1 x Nx x i y Ny y i 1 i i 1 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 54. 王元凱 Unit - Statistics Review p. 54 Variance Vector ?  Variance of X and Y: x2, y2 1 Nx   VAR[ X ]  2 x N x  1 i 1  ( x i  x )2 Ny 1   VAR[Y ]  2 y  N y  1 i 1 ( y i  y )2 Variance vector (x2, y2)? No. Not Enough. •We want to know •If X increase, •Will Y probably increase or decrease? Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 55. 王元凱 Unit - Statistics Review p. 55 Covariance  For two random variables X and Y  Covariance indicates the tendency of the two random variables X & Y to vary together  i.e., to co-vary Cov ( X , Y )   XY  E[( X  E[ X ])(Y  E[Y ])]   ( xi  x )( yi  y ) p( xi , yi ) i 1 N i   N  1 i 1 ( x  x )( y i  y ) Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 56. 王元凱 Unit - Statistics Review p. 56 Property of Covariance  If X and Y tend to increase together, then XY>0  If X tends to decrease when Y increases, then XY<0  If X and Y are uncorrelated, then  XY=0  |XY|≤ XY, where X is the standard deviation of X  XX = 2X = VAR(X)  XY = YX Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 57. 王元凱 Unit - Statistics Review p. 57 7. Covariance Matrix  For two random variables X, Y  There are two variances XX, YY and two covariance XY ,YX  We usually write the four variances into a covariance matrix   E[(x  μ)(x  μ)T ]  E[( x μx ) ] 2 E[( x μx )( y μy )]    E[( x μx )( y μy )] E[( x μx ) ]  2     x  xy  2  2   xy  y    Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 58. 王元凱 Unit - Statistics Review p. 58 Quiz Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 59. 王元凱 Unit - Statistics Review p. 59 Quiz  For 3 random variables: x,y,z  The random vector v=(x,y,z)  The covariance matrix of v? Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 60. 王元凱 Unit - Statistics Review p. 60 Independence and Covariance Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 61. 王元凱 Unit - Statistics Review p. 61 Formal Definition (1/2)  For a random vector X = (X1, X2, ..., Xm) with n random variables  The probability of x =(x1, x2, ..., xn), P(x), is a joint probability P(x) = P(x1, x2, ..., xn)  Its expected values are a mean vector  = (1, 2, ..., m)  E[X] = (E[X1], E[X2], ..., E[Xm]) = (1, 2, ..., m) =  Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 62. 王元凱 Unit - Statistics Review p. 62 Formal Definition (2/2)  For the random vector X = (X1, X2, ..., Xm)  All of its covariance values are a covariance matrix Cov (X)  E[(X   )(X   )T ]  E[( X 1  1 )( X 1  1 )]  E[( X 1  1 )( X m   m )]         E[( X m   m )( X 1  1 )]  E[( X m   m )( X m   m )]     12  12   1m  •  is a square and    21  2   2 m  2  symmetric matrix       • Its diagonal elements  2   m1  m 2   m    are 2i Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 63. 王元凱 Unit - Statistics Review p. 63 Property of Covariance Matrix  Symmetric matrix  Positive definition matrix  Eigenvalue Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 64. 王元凱 Unit - Statistics Review p. 64 8. Correlation  Correlation is another measure of the relationship of two random variables Cov( X , Y )  XY  XY   Var ( X )Var (Y )  X  Y • -1  XY  1 • XY = 0 means that the variables are uncorrelated • XY = 0 E[XY] = E[X] E[Y] • Independence means uncorrelated • But not vice versa! Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 65. 王元凱 Unit - Statistics Review p. 65 Correlation & Covariance  |XY|≤ XY  -XY ≤ XY ≤ XY  XY = XYXY  -1  XY  1  XY is called the correlation coefficient  -1   xy  xy  +1  x  y Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 66. 王元凱 Unit - Statistics Review p. 66 Covariance v.s. Correlation Y Y X X XY = XY XY = 1/2XY XY = +1 XY = 1/2 Y Y Y X X X XY = -XY XY = -1/2XY XY = 0 XY = -1 XY = -1/2 XY = 0 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 67. 王元凱 Unit - Statistics Review p. 67 Demos  Flash demos of correlation with scatter plot Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 68. 王元凱 Unit - Statistics Review p. 68 9. Charts and Graphs  Descriptive Graphs  Bar Chart  Pie Graph  Line Graph  Distributions  Histogram  Box Plot  Steam and Leaf Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 69. 王元凱 Unit - Statistics Review p. 69 Bar Charts  Best for displaying actual values.  Can handle moderate # of cases (bars)  Excel calls it a column chart Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 70. 王元凱 Unit - Statistics Review p. 70 Bar Chart – An Example Vote for President in 2000 60 Millions 50 40 30 20 10 0 e r sh an e or ad Bu an G N ch Bu Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 71. 王元凱 Unit - Statistics Review p. 71 Pie Charts  Best used with small number of categories or cases to display  Good for showing relative distribution  Percentages, proportions  Use only one column of data  Plus one column of labels Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 72. 王元凱 Unit - Statistics Review p. 72 Pie Chart Examples Vote for President in 2004 Bush Nader Kerry Vote for President in 2000 Buchanan Gore Bush Nader Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 73. 王元凱 Unit - Statistics Review p. 73 Line graphs  Best for showing data across time  Always give dates  Label X axis  Indicate units on Y axis  Use legend for multiple lines Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 74. 王元凱 Unit - Statistics Review p. 74 Line Graphs - Example Defense Spending - 1940-2002 400000 300000 Millions $ Defense 200000 Spending 100000 0 1940 1948 1956 1964 1972 1980 1988 1996 Year Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 75. 王元凱 Unit - Statistics Review p. 75 Box Plot  Quick picture of a distribution  Parts of box give distribution characteristics  Your Text is not quite accurate! Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 76. 王元凱 Unit - Statistics Review p. 76 Stem and Leaf Plot  Good for showing distribution while preserving data  Figuring out stems can be tricky Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 77. 王元凱 Unit - Statistics Review p. 77 Review for Next Unit  We learn  Histogram, mean, variance, covariance, correlation  Frequency distribution, Gaussian  We will learn in next unit  Moment, Gaussian Mixture, Linear Gaussian  Moment  mean, variance Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 78. 王元凱 Unit - Statistics Review p. 78 Mean & Variance of an Image 1 n 2  ˆ 1 N i  ( x  x )2 2  ˆ  N  1 i 1 ( xi  x ) 2 p ( xi ) N  1 i 1 1 n x N  x p( x ) Moments i 1 i i Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 79. 王元凱 Unit - Statistics Review p. 79 10. References  An excellent textbook 統計學的世界,鄭惟厚譯,天下文化2002  Statistics: concepts and controversies, 5th, D. S. Moore, 2001  Chart & graph: Chap. 10,11  Central tendency & dispersion: Chap. 12  Gaussian distribution: Chap. 13  Correlation: Chap. 14, 15 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright
  • 80. 王元凱 Unit - Statistics Review p. 80 透過Excel學統計  統計學,劉明德等著,全華,93  Chart & graph : Chap. 2  Central tendency & dispersion: Chap. 3  Frequency distribution: Chap. 6,7  統計學與Excel,王文中著,博碩,93  Central tendency (mean, mode, median) : Chap. 2  Dispersion (variance, standard deviation) : Chap. 3  Gaussian distribution : Chap. 4  Correlation: Chap. 11 Fu Jen University Department of Electrical Engineering Wang, Yuan-Kai Copyright