Air Quality Analysis 2

Air Quality Analysis Part 2

We have air quality data (NO2, O3, PM25, and PM10) from 2021-03-25 to 2021-12-07 at three different locations (Hine Medical District Drive, Hine Record Crossing, and Inwood UTSW). We want to see if there is any statistical difference between average air quality between 7-10 AM and 4-7 PM.

Hines Medical District Drive

There is no statistical difference in NO2, PM25, and PM10 between 7-10 AM and 4-7 PM, while there is a statistical difference in O3 and temperature between 7-10 AM and 4-7 PM.


    Paired t-test

data:  data_processed$no2_pick[data_processed$SiteName == location] and data_processed$no2_mor[data_processed$SiteName == location]
t = 1.0047, df = 190, p-value = 0.3163
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.044974  6.290880
sample estimates:
mean of the differences 
               2.122953 




    Paired t-test

data:  data_processed$o3_pick[data_processed$SiteName == location] and data_processed$o3_mor[data_processed$SiteName == location]
t = 21.997, df = 190, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 18.43130 22.06248
sample estimates:
mean of the differences 
               20.24689 




    Paired t-test

data:  data_processed$pm25_pick[data_processed$SiteName == location] and data_processed$pm25_mor[data_processed$SiteName == location]
t = -0.37052, df = 190, p-value = 0.7114
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -1.924065  1.315531
sample estimates:
mean of the differences 
              -0.304267 




    Paired t-test

data:  data_processed$pm10_pick[data_processed$SiteName == location] and data_processed$pm10_mor[data_processed$SiteName == location]
t = 0.0082282, df = 190, p-value = 0.9934
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.008554  2.025381
sample estimates:
mean of the differences 
            0.008413613 




    Paired t-test

data:  data_processed$temp_pick[data_processed$SiteName == location] and data_processed$temp_mor[data_processed$SiteName == location]
t = 30.6, df = 190, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 6.673333 7.592971
sample estimates:
mean of the differences 
               7.133152 

Hines Record Crossing

There is no statistical difference in NO2, PM25, and PM10 between 7-10 AM and 4-7 PM, while there is a statistical difference in O3 and temperature between 7-10 AM and 4-7 PM.


    Paired t-test

data:  data_processed$no2_pick[data_processed$SiteName == location] and data_processed$no2_mor[data_processed$SiteName == location]
t = 1.0047, df = 190, p-value = 0.3163
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.044974  6.290880
sample estimates:
mean of the differences 
               2.122953 




    Paired t-test

data:  data_processed$o3_pick[data_processed$SiteName == location] and data_processed$o3_mor[data_processed$SiteName == location]
t = 21.997, df = 190, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 18.43130 22.06248
sample estimates:
mean of the differences 
               20.24689 




    Paired t-test

data:  data_processed$pm25_pick[data_processed$SiteName == location] and data_processed$pm25_mor[data_processed$SiteName == location]
t = -0.37052, df = 190, p-value = 0.7114
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -1.924065  1.315531
sample estimates:
mean of the differences 
              -0.304267 




    Paired t-test

data:  data_processed$pm10_pick[data_processed$SiteName == location] and data_processed$pm10_mor[data_processed$SiteName == location]
t = 0.0082282, df = 190, p-value = 0.9934
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.008554  2.025381
sample estimates:
mean of the differences 
            0.008413613 




    Paired t-test

data:  data_processed$temp_pick[data_processed$SiteName == location] and data_processed$temp_mor[data_processed$SiteName == location]
t = 30.6, df = 190, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 6.673333 7.592971
sample estimates:
mean of the differences 
               7.133152 

Inwood Crossing

There is no statistical difference in NO2, PM25, and PM10 between 7-10 AM and 4-7 PM, while there is a statistical difference in O3 and temperature between 7-10 AM and 4-7 PM.


    Paired t-test

data:  data_processed$no2_pick[data_processed$SiteName == location] and data_processed$no2_mor[data_processed$SiteName == location]
t = 1.0047, df = 190, p-value = 0.3163
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.044974  6.290880
sample estimates:
mean of the differences 
               2.122953 




    Paired t-test

data:  data_processed$o3_pick[data_processed$SiteName == location] and data_processed$o3_mor[data_processed$SiteName == location]
t = 21.997, df = 190, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 18.43130 22.06248
sample estimates:
mean of the differences 
               20.24689 




    Paired t-test

data:  data_processed$pm25_pick[data_processed$SiteName == location] and data_processed$pm25_mor[data_processed$SiteName == location]
t = -0.37052, df = 190, p-value = 0.7114
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -1.924065  1.315531
sample estimates:
mean of the differences 
              -0.304267 




    Paired t-test

data:  data_processed$pm10_pick[data_processed$SiteName == location] and data_processed$pm10_mor[data_processed$SiteName == location]
t = 0.0082282, df = 190, p-value = 0.9934
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.008554  2.025381
sample estimates:
mean of the differences 
            0.008413613 




    Paired t-test

data:  data_processed$temp_pick[data_processed$SiteName == location] and data_processed$temp_mor[data_processed$SiteName == location]
t = 30.6, df = 190, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 6.673333 7.592971
sample estimates:
mean of the differences 
               7.133152 

Below we want to see if there is any statistical difference between average air quality between 1-3 AM and 4-7 PM.

Hines Medical District Drive

There is no statistical difference in PM25 and PM10 between 1-3 AM and 4-7 PM, while there is a statistical difference in O3, NO2, and temperature between 1-3 AM and 4-7 PM.


    Paired t-test

data:  data_processed$no2_pick[data_processed$SiteName == location] and data_processed$no2_mor[data_processed$SiteName == location]
t = 2.9386, df = 195, p-value = 0.003695
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 0.8956649 4.5513963
sample estimates:
mean of the differences 
               2.723531 




    Paired t-test

data:  data_processed$o3_pick[data_processed$SiteName == location] and data_processed$o3_mor[data_processed$SiteName == location]
t = 13.845, df = 183, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 12.31064 16.40239
sample estimates:
mean of the differences 
               14.35652 




    Paired t-test

data:  data_processed$pm25_pick[data_processed$SiteName == location] and data_processed$pm25_mor[data_processed$SiteName == location]
t = 0.016599, df = 183, p-value = 0.9868
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.019638  2.053910
sample estimates:
mean of the differences 
             0.01713587 




    Paired t-test

data:  data_processed$pm10_pick[data_processed$SiteName == location] and data_processed$pm10_mor[data_processed$SiteName == location]
t = 0.47072, df = 183, p-value = 0.6384
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -1.747881  2.843229
sample estimates:
mean of the differences 
              0.5476739 




    Paired t-test

data:  data_processed$temp_pick[data_processed$SiteName == location] and data_processed$temp_mor[data_processed$SiteName == location]
t = 25.637, df = 183, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 6.509370 7.594804
sample estimates:
mean of the differences 
               7.052087 

Hines Record Crossing

There is no statistical difference in PM25 and PM10 between 1-3 AM and 4-7 PM, while there is a statistical difference in O3, NO2, and temperature between 1-3 AM and 4-7 PM.


    Paired t-test

data:  data_processed$no2_pick[data_processed$SiteName == location] and data_processed$no2_mor[data_processed$SiteName == location]
t = 2.9386, df = 195, p-value = 0.003695
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 0.8956649 4.5513963
sample estimates:
mean of the differences 
               2.723531 




    Paired t-test

data:  data_processed$o3_pick[data_processed$SiteName == location] and data_processed$o3_mor[data_processed$SiteName == location]
t = 13.845, df = 183, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 12.31064 16.40239
sample estimates:
mean of the differences 
               14.35652 




    Paired t-test

data:  data_processed$pm25_pick[data_processed$SiteName == location] and data_processed$pm25_mor[data_processed$SiteName == location]
t = 0.016599, df = 183, p-value = 0.9868
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.019638  2.053910
sample estimates:
mean of the differences 
             0.01713587 




    Paired t-test

data:  data_processed$pm10_pick[data_processed$SiteName == location] and data_processed$pm10_mor[data_processed$SiteName == location]
t = 0.47072, df = 183, p-value = 0.6384
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -1.747881  2.843229
sample estimates:
mean of the differences 
              0.5476739 




    Paired t-test

data:  data_processed$temp_pick[data_processed$SiteName == location] and data_processed$temp_mor[data_processed$SiteName == location]
t = 25.637, df = 183, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 6.509370 7.594804
sample estimates:
mean of the differences 
               7.052087 

Inwood Crossing

There is no statistical difference in PM25 and PM10 between 1-3 AM and 4-7 PM, while there is a statistical difference in O3, NO2, and temperature between 1-3 AM and 4-7 PM.


    Paired t-test

data:  data_processed$no2_pick[data_processed$SiteName == location] and data_processed$no2_mor[data_processed$SiteName == location]
t = 2.9386, df = 195, p-value = 0.003695
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 0.8956649 4.5513963
sample estimates:
mean of the differences 
               2.723531 




    Paired t-test

data:  data_processed$o3_pick[data_processed$SiteName == location] and data_processed$o3_mor[data_processed$SiteName == location]
t = 13.845, df = 183, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 12.31064 16.40239
sample estimates:
mean of the differences 
               14.35652 




    Paired t-test

data:  data_processed$pm25_pick[data_processed$SiteName == location] and data_processed$pm25_mor[data_processed$SiteName == location]
t = 0.016599, df = 183, p-value = 0.9868
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -2.019638  2.053910
sample estimates:
mean of the differences 
             0.01713587 




    Paired t-test

data:  data_processed$pm10_pick[data_processed$SiteName == location] and data_processed$pm10_mor[data_processed$SiteName == location]
t = 0.47072, df = 183, p-value = 0.6384
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 -1.747881  2.843229
sample estimates:
mean of the differences 
              0.5476739 




    Paired t-test

data:  data_processed$temp_pick[data_processed$SiteName == location] and data_processed$temp_mor[data_processed$SiteName == location]
t = 25.637, df = 183, p-value < 2.2e-16
alternative hypothesis: true difference in means is not equal to 0
95 percent confidence interval:
 6.509370 7.594804
sample estimates:
mean of the differences 
               7.052087