#+ fig.width=12, fig.height=8
#### Chapter 3: Descriptive univariate statistics
# (1) Create the following numeric vector -- 1, 2, 4, 8, 10, 12 -- and compute summary/descriptive statistics of its raw version, its centered version, and its z-standardized version.
# (2) Based on their performance in a test, 37 students were awarded the following grades (1=best, 6=worst):
grades.in.test<-rep(1:6, c(2, 15, 10, 4, 4, 2))
# (a) Compute the most appropriate measure of central tendency.
# (b) Compute the most appropriate measure of dispersion.
# (c) Represent the frequency distribution of the grades graphically in a dot chart.
# (d) Represent the frequency distribution of the grades graphically in a bar plot.
# (3) Load the file <104_03_uh(m).csv> into a dataframe UHM
# (4) Sort the data frame according to the factor SEX (ascending) and, within SEX, according to the disfluencies (descending) and, within disfluencies, according to the lengths of the disfluencies (ascending).
# (5) Compute the 95% confidence intervals for the proportions of the three disfluencies and discuss briefly what the confidence intervals suggest concerning the different frequencies of the disfluency markers.
# (6) Determine how many disfluencies occurred in each genre.
# (7) Represent in a graph how many disfluencies occurred in each genre.
# (8) Was the 990th disfluency produced by a man or a woman?
# (9) Determine how many disfluencies are longer than average.