SlideShare una empresa de Scribd logo
1 de 18
Guessing the Unknown
The Quest
                                  What can we
                                  say about this
                                    black box?
                                           E.g., What is the
                                          probability that it
                                         generates a number
               5   12 3   9 28
                                            bigger than 5?




Observations
Distributions

What if we had many many
      observations?

                           Value   Frequency
                           -1      0.3
                           0       0.2
                                                  Sum of
                           1       0.1         frequencies
  This table is the        2       0.1              is 1
    distribution           3       0.1
  associated with
                           4       0.1
   this black box
                           5       0.1
Distributions Graphically




                             -1 0 1 2 3 4 5




            Area under the
              curve is 1
The Challenge
We do not have many many
      observations


                So we cannot infer the
                 distribution from the
                     observations



                                 What can we do then?
What can we do with few
             observations?
Assume distribution is known
                                                 E.g., Normal, Binomial
  (from prior knowledge or
                                                           etc
        other means)


           I.e., model approximately using
                 a canonical distribution


                               But the parameters are not
                                         known


                                          Can these parameters be
                                            determined from the
                                               observations?
Why Canonical Distributions

 Value   Frequency
 -1      0.3
 0       0.2
 1       0.1                        Too verbose a
                                  description for the
 2       0.1                         distribution
 3       0.1
 4       0.1
 5       0.1


                   Can the entire distribution be
               described (even approximately) by just
               a few parameters, while modeling the
                           data accurately
Example: Binomial Distribution
       A coin that yields 1 with                           Observations
       probability p and 0 with
       probability 1- p, tossed n           1 0 1 1 1 ….
         times, independently

                                    Value    Frequency
            Number of 1’s?          0

                                    1

    Distribution,                   2
  μ=np,σ2=np(1-p)

                                    n-1
Can one determine p
   from the (few)                   n
   observations?
Other Canonical Distributions
               Normal μ, σ2




                         Poisson μ =r,σ2=r



                                  Negative Binomial μ =rp/(1-p),
                                          σ2= rp/(1-p)2
  What are
these? Later
    talk
                                                 Gamma μ=kθ, σ2 =kθ2
Back to the Quest

We have few observations

     Assume these are from a
     known distribution family

                 But with unknown
                    parameters

                     How do we determine the
                          parameters?

                             How do we determine μ,
                                      σ2?
Estimating Mean


          μ, σ2




Estimate for the
 mean; a good
  estimate??
μ, σ2




               What is the mean and variance
Normal!! For       of this distribution?
 modest n.
μ, σ2




 Unbiased




 Tight as n
grows larger
Estimating Variance


            μ, σ2




Estimate for the
variance; a good
   estimate??
μ, σ2
           μ, σ2




Bias
Estimating Variance Correctly


         μ, σ2




Unbiased!!
A Mind Reading Game
• Your friend chooses a number (one of 1,3,5) in his/her
  mind
   – Call this i

• He/She then rolls a 6-faced die 30 times, privately
   – For each roll, he/she declares Heads if the number on the
     die is <=i, and Tails otherwise

• Your goal is to guess i solely from this sequence of n
  Heads and Tails.

• Can you read your friend’s mind?
Thank You

Más contenido relacionado

Similar a Introduction to statistics

Binomail distribution 23 jan 21
Binomail distribution 23 jan 21Binomail distribution 23 jan 21
Binomail distribution 23 jan 21Arun Mishra
 
Classics 2011
Classics 2011Classics 2011
Classics 2011goodbeem
 
The renyi entropy and the uncertainty relations in quantum mechanics
The renyi entropy and the uncertainty relations in quantum mechanicsThe renyi entropy and the uncertainty relations in quantum mechanics
The renyi entropy and the uncertainty relations in quantum mechanicswtyru1989
 
Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.
Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.
Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.Peter Coles
 
SPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSSPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSLiemNguyenDuy
 
lecture 8
lecture 8lecture 8
lecture 8sajinsc
 
Diffraction,unit 2
Diffraction,unit  2Diffraction,unit  2
Diffraction,unit 2Kumar
 
Standard Scores
Standard ScoresStandard Scores
Standard Scoresshoffma5
 
Chapter 2 Probabilty And Distribution
Chapter 2 Probabilty And DistributionChapter 2 Probabilty And Distribution
Chapter 2 Probabilty And Distributionghalan
 
Normal distribution and hypothesis testing
Normal distribution and hypothesis testingNormal distribution and hypothesis testing
Normal distribution and hypothesis testingLorelyn Turtosa-Dumaug
 
Probability distribution
Probability distributionProbability distribution
Probability distributionRanjan Kumar
 

Similar a Introduction to statistics (12)

6주차
6주차6주차
6주차
 
Binomail distribution 23 jan 21
Binomail distribution 23 jan 21Binomail distribution 23 jan 21
Binomail distribution 23 jan 21
 
Classics 2011
Classics 2011Classics 2011
Classics 2011
 
The renyi entropy and the uncertainty relations in quantum mechanics
The renyi entropy and the uncertainty relations in quantum mechanicsThe renyi entropy and the uncertainty relations in quantum mechanics
The renyi entropy and the uncertainty relations in quantum mechanics
 
Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.
Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.
Talk given at Kobayashi-Maskawa Institute, Nagoya University, Japan.
 
SPATIAL POINT PATTERNS
SPATIAL POINT PATTERNSSPATIAL POINT PATTERNS
SPATIAL POINT PATTERNS
 
lecture 8
lecture 8lecture 8
lecture 8
 
Diffraction,unit 2
Diffraction,unit  2Diffraction,unit  2
Diffraction,unit 2
 
Standard Scores
Standard ScoresStandard Scores
Standard Scores
 
Chapter 2 Probabilty And Distribution
Chapter 2 Probabilty And DistributionChapter 2 Probabilty And Distribution
Chapter 2 Probabilty And Distribution
 
Normal distribution and hypothesis testing
Normal distribution and hypothesis testingNormal distribution and hypothesis testing
Normal distribution and hypothesis testing
 
Probability distribution
Probability distributionProbability distribution
Probability distribution
 

Más de Strand Life Sciences Pvt Ltd (7)

Dynamic programming for simd
Dynamic programming for simdDynamic programming for simd
Dynamic programming for simd
 
Complex numbers polynomial multiplication
Complex numbers polynomial multiplicationComplex numbers polynomial multiplication
Complex numbers polynomial multiplication
 
Converting High Dimensional Problems to Low Dimensional Ones
Converting High Dimensional Problems to Low Dimensional OnesConverting High Dimensional Problems to Low Dimensional Ones
Converting High Dimensional Problems to Low Dimensional Ones
 
Searching using Quantum Rules
Searching using Quantum RulesSearching using Quantum Rules
Searching using Quantum Rules
 
Randomized algorithms
Randomized algorithmsRandomized algorithms
Randomized algorithms
 
Suffix arrays
Suffix arraysSuffix arrays
Suffix arrays
 
Alignment of raw reads in Avadis NGS
Alignment of raw reads in Avadis NGSAlignment of raw reads in Avadis NGS
Alignment of raw reads in Avadis NGS
 

Último

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 

Último (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

Introduction to statistics

  • 2. The Quest What can we say about this black box? E.g., What is the probability that it generates a number 5 12 3 9 28 bigger than 5? Observations
  • 3. Distributions What if we had many many observations? Value Frequency -1 0.3 0 0.2 Sum of 1 0.1 frequencies This table is the 2 0.1 is 1 distribution 3 0.1 associated with 4 0.1 this black box 5 0.1
  • 4. Distributions Graphically -1 0 1 2 3 4 5 Area under the curve is 1
  • 5. The Challenge We do not have many many observations So we cannot infer the distribution from the observations What can we do then?
  • 6. What can we do with few observations? Assume distribution is known E.g., Normal, Binomial (from prior knowledge or etc other means) I.e., model approximately using a canonical distribution But the parameters are not known Can these parameters be determined from the observations?
  • 7. Why Canonical Distributions Value Frequency -1 0.3 0 0.2 1 0.1 Too verbose a description for the 2 0.1 distribution 3 0.1 4 0.1 5 0.1 Can the entire distribution be described (even approximately) by just a few parameters, while modeling the data accurately
  • 8. Example: Binomial Distribution A coin that yields 1 with Observations probability p and 0 with probability 1- p, tossed n 1 0 1 1 1 …. times, independently Value Frequency Number of 1’s? 0 1 Distribution, 2 μ=np,σ2=np(1-p) n-1 Can one determine p from the (few) n observations?
  • 9. Other Canonical Distributions Normal μ, σ2 Poisson μ =r,σ2=r Negative Binomial μ =rp/(1-p), σ2= rp/(1-p)2 What are these? Later talk Gamma μ=kθ, σ2 =kθ2
  • 10. Back to the Quest We have few observations Assume these are from a known distribution family But with unknown parameters How do we determine the parameters? How do we determine μ, σ2?
  • 11. Estimating Mean μ, σ2 Estimate for the mean; a good estimate??
  • 12. μ, σ2 What is the mean and variance Normal!! For of this distribution? modest n.
  • 13. μ, σ2 Unbiased Tight as n grows larger
  • 14. Estimating Variance μ, σ2 Estimate for the variance; a good estimate??
  • 15. μ, σ2 μ, σ2 Bias
  • 16. Estimating Variance Correctly μ, σ2 Unbiased!!
  • 17. A Mind Reading Game • Your friend chooses a number (one of 1,3,5) in his/her mind – Call this i • He/She then rolls a 6-faced die 30 times, privately – For each roll, he/she declares Heads if the number on the die is <=i, and Tails otherwise • Your goal is to guess i solely from this sequence of n Heads and Tails. • Can you read your friend’s mind?