1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78
|
* T-TEST example pspp code
* Generate an example dataset for male and female humans
* with weight, height, beauty and iq data
* Weight and Height data are generated as normal distributions with
* different mean values. iq is generated with the same mean value (100).
* Beauty is only slightly different.
* Every run of the program will produce new data
input program.
* Females have gender 0
* Create 8 female cases
loop #i = 1 to 8.
compute weight = rv.normal (65, 10).
compute height = rv.normal(170.7,6.3).
compute beauty = rv.normal (10,4).
compute iq = rv.normal(100,15).
compute gender = 0.
end case.
end loop.
* Males have gender 1
loop #i = 1 to 8.
compute weight = rv.normal (83, 13).
compute height = rv.normal(183.8,7.1).
compute beauty = rv.normal(11,4).
compute iq = rv.normal(100,15).
compute gender = 1.
end case.
end loop.
end file.
end input program.
* Add a label to the gender values to have descriptive names
value labels
/gender 0 female 1 male.
* Plot the data as boxplot
examine
/variables=weight height beauty iq by gender
/plot=boxplot.
* Do a Scatterplot to check if weight and height
* might be correlated. As both the weight and the
* height for males is higher than for females
* the combination of male and female data is correlated.
* Weigth increases with height.
graph
/scatterplot = height with weight.
* Within the male and female groups there is no correlation between
* weight and height. This becomes visible by marking male and female
* datapoints with different colour.
graph
/scatterplot = height with weight by gender.
* The T-Test checks if male and female humans have
* different weight, height, beauty and iq. See that Significance for the
* weight and height variable tends to 0, while the Significance
* for iq should not go to 0.
* Significance in T-Test means the probablity for the assumption that the
* height (weight, beauty,iq) of the two groups (male,female) have the same
* mean value. As the data for the iq values is generated as normal distribution
* with the same mean value, the significance should not go down to 0.
t-test groups=gender(0,1)
/variables=weight height beauty iq.
* Run the Code several times to see the effect that different data
* is generated. Every run is like a new sample from the population.
* Change the number of samples (cases) by changing the
* loop range to see the effect on significance!
* With increasing number of cases the sample size increases and
* the estimation of mean values and standard deviation becomes better.
* The difference in beauty becomes visible only with larger sample sizes.
|