SlideShare a Scribd company logo
1 of 24
SAS tutorial 0308:
   Correlation
PROC CORR
BY, VAR, WEIGHT
           FREQ    – 指定各觀察值的出現次數。
                   – 指定變項用以描述資料。
              ID
                   – 指定要以那個變項為基準計算淨
         PARTIAL     相關。

          WITH     – 指定變項,和用VAR指定的變項作
                     相關。
data Fitness;
      input Age Weight Oxygen RunTime @@;
      datalines;
   44 89.47 44.609 11.37    40 75.07 45.313 10.07
   44 85.84 54.297 8.65     42 68.15 59.571 8.17
   38 89.02 49.874   .      47 77.45 44.811 11.63
   40 75.98 45.681 11.95    43 81.19 49.091 10.85
   44 81.42 39.442 13.08    38 81.87 60.055 8.63
                       ......
;

ods graphics on;
proc corr data=Fitness plots=matrix(histogram);
run;
ods graphics off;
proc corr;
var x1 x2;
with y1 y2 y3;
run;
data Setosa;
input SepalLength SepalWidth PetalLength PetalWidth @@;
label sepallength='Sepal Length in mm.'
       sepalwidth='Sepal Width in mm.'
       petallength='Petal Length in mm.'
       petalwidth='Petal Width in mm.';
datalines;
   50 33 14 02 46 34 14 03 46 36 . 02
   51 33 17 05 55 35 13 02 48 31 16 02
   52 34 14 02 49 36 14 01 44 32 13 02
   48 30 14 01 45 23 13 03 57 38 17 03
   51 38 15 03 54 34 17 02 51 37 15 04
                              ......
;

ods graphics on;
title 'Fisher (1936) Iris Setosa Data';
proc corr data=Setosa sscp cov plots=scatter;
var sepallength sepalwidth;
with petallength petalwidth;
run;
ods graphics off;
PROC CORR statements-1
PROC CORR statements-2
Check
Display scatter plots.
Compute
  Pearson’s correlation coefficient.
  Fisher’s z transformation.
  Kendall’s tau.
  Spearman rank-order correlation.
  Partial correlation.
  Cronbach’s alpha.
Handle missing values.
Variance/covariance matrix.
Review
• PROC FREQ
• PROC STANDARD
Chi-square distribution
Example




• Q: 男性跟女性同意婚前性行為的傾向有差
  別嗎?
Agree No opinion Disagree Total
Male      279         73      225 577
Female    175         47      201 423
Total     454        120      426 1000          Why?


Expected Agree No opinion Disagree
Male     261.96      69.24   245.80
Female 192.04        50.76   180.20

   Why?
                                         Why?
SAS Code
DATA ex1;
DO i=1 to 2;           PROC PRINT DATA=ex1;
  DO j=1 to 3;
     INPUT x @@;       PROC FREQ DATA=ex1;
     OUTPUT;           TABLES i*j/CHISQ;
  END;                 WEIGHT x;
END;                   RUN;
DATALINES;
279 73 225
175 47 201
;
Results




這些指標各代表什麼意義?
想一下
• Q1: 為什麼 Fisher’s exact test 是 “Exact”?
• Q2: Binomial counts 可以跑Chi-square test嗎?
• Q3: 找一組2*2的資料,比較以下兩者
  output的差別。
 PROC FREQ          PROC FREQ
 DATA=AAA;          DATA=AAA;
 EXACT FISHER;      EXACT CHISQ;
 ...;               ...;

             怎麼算出來的?
PROC STANDARD
      • 把分數調成平均80,
        標準差10的常態分配。
SAS code
PROC IMPORT OUT=EX2
DATAFILE="C:UsersuserDesktopQUIZ.txt"
DBMS=dlm REPLACE;
GETNAMES=YES;          從文字檔讀資料
DELIMITER='09'x;

PROC STANDARD DATA=ex2 MEAN=80 STD=10 OUT=ex3
     PRINT;               標準化
VAR Score;

PROC PRINT DATA= ex2;
PROC PRINT DATA= ex3;
RUN;
Results

More Related Content

Similar to Sas tutorial 0308

Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
cookie1969
 
Sas code for examples from a first course in statistics
Sas code for examples from a first course in statisticsSas code for examples from a first course in statistics
Sas code for examples from a first course in statistics
Dr P Deepak
 
Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...
Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...
Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...
djkucera
 

Similar to Sas tutorial 0308 (20)

ATE Pattern Structure Basics
ATE Pattern Structure BasicsATE Pattern Structure Basics
ATE Pattern Structure Basics
 
visualisasi data praktik pakai excel, py
visualisasi data praktik pakai excel, pyvisualisasi data praktik pakai excel, py
visualisasi data praktik pakai excel, py
 
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdfHailey_Database_Performance_Made_Easy_through_Graphics.pdf
Hailey_Database_Performance_Made_Easy_through_Graphics.pdf
 
Implementing qrcode
Implementing qrcodeImplementing qrcode
Implementing qrcode
 
Sas code for examples from a first course in statistics
Sas code for examples from a first course in statisticsSas code for examples from a first course in statistics
Sas code for examples from a first course in statistics
 
Lecture 2: arrays and pointers
Lecture 2: arrays and pointersLecture 2: arrays and pointers
Lecture 2: arrays and pointers
 
Simple Nested Sets and some other DB optimizations
Simple Nested Sets and some other DB optimizationsSimple Nested Sets and some other DB optimizations
Simple Nested Sets and some other DB optimizations
 
.NET Fest 2019. Łukasz Pyrzyk. Daily Performance Fuckups
.NET Fest 2019. Łukasz Pyrzyk. Daily Performance Fuckups.NET Fest 2019. Łukasz Pyrzyk. Daily Performance Fuckups
.NET Fest 2019. Łukasz Pyrzyk. Daily Performance Fuckups
 
Slides ensae 11bis
Slides ensae 11bisSlides ensae 11bis
Slides ensae 11bis
 
Lecture03 displaying data
Lecture03 displaying dataLecture03 displaying data
Lecture03 displaying data
 
1027實習
1027實習1027實習
1027實習
 
C++ and Assembly: Debugging and Reverse Engineering
C++ and Assembly: Debugging and Reverse EngineeringC++ and Assembly: Debugging and Reverse Engineering
C++ and Assembly: Debugging and Reverse Engineering
 
Java bytecode Malware Analysis
Java bytecode Malware AnalysisJava bytecode Malware Analysis
Java bytecode Malware Analysis
 
Data Wars: The Bloody Enterprise strikes back
Data Wars: The Bloody Enterprise strikes backData Wars: The Bloody Enterprise strikes back
Data Wars: The Bloody Enterprise strikes back
 
Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...
Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...
Oracle OpenWorld 2010 - Consolidating Microsoft SQL Server Databases into an ...
 
Sas practice programs
Sas practice programsSas practice programs
Sas practice programs
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with R
 
Conf soat tests_unitaires_Mockito_jUnit_170113
Conf soat tests_unitaires_Mockito_jUnit_170113Conf soat tests_unitaires_Mockito_jUnit_170113
Conf soat tests_unitaires_Mockito_jUnit_170113
 
Sas tutorial 1018
Sas tutorial 1018Sas tutorial 1018
Sas tutorial 1018
 
Cryptographic algorithms
Cryptographic algorithmsCryptographic algorithms
Cryptographic algorithms
 

More from WenSheng Chang (14)

futureNo52
futureNo52futureNo52
futureNo52
 
Sas plot
Sas plotSas plot
Sas plot
 
Sas reg multi
Sas reg multiSas reg multi
Sas reg multi
 
Sas reg simple
Sas reg simpleSas reg simple
Sas reg simple
 
Sas corr
Sas corrSas corr
Sas corr
 
1129 sas實習課
1129 sas實習課1129 sas實習課
1129 sas實習課
 
Sas tutorial 0524
Sas tutorial 0524Sas tutorial 0524
Sas tutorial 0524
 
Sas tutorial glm2
Sas tutorial glm2Sas tutorial glm2
Sas tutorial glm2
 
Sas tutorial glm1
Sas tutorial glm1Sas tutorial glm1
Sas tutorial glm1
 
Regression ci
Regression ciRegression ci
Regression ci
 
SAS 0412
SAS 0412SAS 0412
SAS 0412
 
Sas tutorial 0308
Sas tutorial 0308Sas tutorial 0308
Sas tutorial 0308
 
0925 sas實習課
0925 sas實習課0925 sas實習課
0925 sas實習課
 
Science p79 -- polished
Science p79 -- polishedScience p79 -- polished
Science p79 -- polished
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 

Sas tutorial 0308

  • 1. SAS tutorial 0308: Correlation
  • 2. PROC CORR BY, VAR, WEIGHT FREQ – 指定各觀察值的出現次數。 – 指定變項用以描述資料。 ID – 指定要以那個變項為基準計算淨 PARTIAL 相關。 WITH – 指定變項,和用VAR指定的變項作 相關。
  • 3. data Fitness; input Age Weight Oxygen RunTime @@; datalines; 44 89.47 44.609 11.37 40 75.07 45.313 10.07 44 85.84 54.297 8.65 42 68.15 59.571 8.17 38 89.02 49.874 . 47 77.45 44.811 11.63 40 75.98 45.681 11.95 43 81.19 49.091 10.85 44 81.42 39.442 13.08 38 81.87 60.055 8.63 ...... ; ods graphics on; proc corr data=Fitness plots=matrix(histogram); run; ods graphics off;
  • 4.
  • 5.
  • 6. proc corr; var x1 x2; with y1 y2 y3; run;
  • 7. data Setosa; input SepalLength SepalWidth PetalLength PetalWidth @@; label sepallength='Sepal Length in mm.' sepalwidth='Sepal Width in mm.' petallength='Petal Length in mm.' petalwidth='Petal Width in mm.'; datalines; 50 33 14 02 46 34 14 03 46 36 . 02 51 33 17 05 55 35 13 02 48 31 16 02 52 34 14 02 49 36 14 01 44 32 13 02 48 30 14 01 45 23 13 03 57 38 17 03 51 38 15 03 54 34 17 02 51 37 15 04 ...... ; ods graphics on; title 'Fisher (1936) Iris Setosa Data'; proc corr data=Setosa sscp cov plots=scatter; var sepallength sepalwidth; with petallength petalwidth; run; ods graphics off;
  • 8.
  • 9.
  • 10.
  • 11.
  • 14. Check Display scatter plots. Compute Pearson’s correlation coefficient. Fisher’s z transformation. Kendall’s tau. Spearman rank-order correlation. Partial correlation. Cronbach’s alpha. Handle missing values. Variance/covariance matrix.
  • 15. Review • PROC FREQ • PROC STANDARD
  • 18. Agree No opinion Disagree Total Male 279 73 225 577 Female 175 47 201 423 Total 454 120 426 1000 Why? Expected Agree No opinion Disagree Male 261.96 69.24 245.80 Female 192.04 50.76 180.20 Why? Why?
  • 19. SAS Code DATA ex1; DO i=1 to 2; PROC PRINT DATA=ex1; DO j=1 to 3; INPUT x @@; PROC FREQ DATA=ex1; OUTPUT; TABLES i*j/CHISQ; END; WEIGHT x; END; RUN; DATALINES; 279 73 225 175 47 201 ;
  • 21. 想一下 • Q1: 為什麼 Fisher’s exact test 是 “Exact”? • Q2: Binomial counts 可以跑Chi-square test嗎? • Q3: 找一組2*2的資料,比較以下兩者 output的差別。 PROC FREQ PROC FREQ DATA=AAA; DATA=AAA; EXACT FISHER; EXACT CHISQ; ...; ...; 怎麼算出來的?
  • 22. PROC STANDARD • 把分數調成平均80, 標準差10的常態分配。
  • 23. SAS code PROC IMPORT OUT=EX2 DATAFILE="C:UsersuserDesktopQUIZ.txt" DBMS=dlm REPLACE; GETNAMES=YES; 從文字檔讀資料 DELIMITER='09'x; PROC STANDARD DATA=ex2 MEAN=80 STD=10 OUT=ex3 PRINT; 標準化 VAR Score; PROC PRINT DATA= ex2; PROC PRINT DATA= ex3; RUN;