
Summary How to do linguistics with R, Natalia Levshina (LCX046B05)

Pages: 12
Uploaded on: 25-10-2021
Written in: 2021/2022

A summary of chapters 1, 2, 3, and 5 of 'How to do linguistics with R' written by Natalia Levshina.

Natalia Levshina (2015). How to do Linguistics with R. Data exploration and
statistical analysis. John Benjamins:
https://benjamins.com/#catalog/books/z.195/main (EUR 45).

Chapter 1 – what is statistics?
Main statistical notions and principles

1.1 Statistics and statistics
Statistics, as a singular noun (like mathematics), is a set of techniques and tools for describing and
analyzing data. Statistics, as a plural noun, are measures obtained from samples.

A population is the group of all objects of interest. The values that describe a population
are called parameters. If the population is too big to study in full, one works with samples, which are
meant to be representative of the population. The difference between a sample statistic and the
corresponding population parameter is called the sampling error: the smaller the error, the more
representative the sample.
• The most reliable method is random sampling, where every member of the population
has an equal chance of being selected. Other methods are representative sampling, where the
researcher draws a sample in such a way that it matches the population on certain
characteristics, and convenience sampling, where the researcher samples whatever is most readily available.
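
The idea of sampling error can be tried out with a small sketch. The population below (10,000 word lengths) is invented purely for illustration, and `sampling_error` is a hypothetical helper name. Larger random samples tend to give a smaller error, although any single draw can deviate from that trend.

```python
import random
import statistics

# Hypothetical population: 10,000 word lengths in characters (invented data).
rng = random.Random(42)  # fixed seed so the run is repeatable
population = [rng.randint(1, 12) for _ in range(10_000)]

# Population parameter: the mean over every member of the population.
parameter = statistics.mean(population)


def sampling_error(sample_size):
    """Draw a simple random sample and return |sample statistic - parameter|."""
    # rng.sample gives every member the same chance of being selected.
    sample = rng.sample(population, sample_size)
    return abs(statistics.mean(sample) - parameter)


for n in (10, 100, 1000):
    print(n, round(sampling_error(n), 3))
```

Sampling the whole population gives an error of zero, which is the limiting case of a perfectly representative sample.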

Statistics can be subdivided into descriptive statistics, describing the characteristics of a sample, and
inferential statistics, allowing the researcher to use the characteristics of a sample in order to make
conclusions about the population in general (e.g., a statistically significant difference).

1.2 How to formulate and test your hypotheses
1.2.1 Null and alternative hypotheses
Before beginning statistical analysis, the hypotheses need to be formulated: the research
hypothesis, i.e. the alternative hypothesis (H1), states the outcome you expect, and
the null hypothesis (H0) states that there is no difference between, e.g., the groups being compared.

The alternative hypothesis can be directional, where an assumed direction is expressed (e.g., X is more
than Y), or non-directional, where a difference is assumed but its direction is left open
(e.g., X is not equal to Y).

1.2.2 Those mysterious p-values…
A distribution is a collection of scores, or values, on a variable. When a distribution is normal, it has
a bell-shaped curve. Knowing the shape of a distribution, one can compute the exact probability for a
range of values of x.
• The entire area under the curve corresponds to a probability of 1, i.e. 100%.
• In a symmetric distribution, the middle value, e.g. 110 cm, corresponds to a
probability of 0.5 or 50%: 50% of the values lie below 110 cm and 50% lie above it.
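
As a rough sketch of how such probabilities are computed, the example below uses `NormalDist` from Python's standard library. The mean of 110 cm comes from the example above; the standard deviation of 5 cm is an assumption made only so that the numbers can be calculated.

```python
from statistics import NormalDist

# Mean of 110 cm as in the text; the standard deviation of 5 cm is assumed.
heights = NormalDist(mu=110, sigma=5)

# Area under the curve to the left of the middle value.
p_below_mean = heights.cdf(110)

# Area to the right of the middle value (the two halves add up to 1).
p_above_mean = 1 - p_below_mean

# Probability of a value in the range 105 to 115 cm (one sd around the mean).
p_between = heights.cdf(115) - heights.cdf(105)

print(p_below_mean, p_above_mean, round(p_between, 4))
```

The probability for any range of x is the area under the curve between its two end points, which is why it is obtained as a difference of two cumulative probabilities.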





The p-value is the probability of obtaining a given test statistic value or more extreme values if
the null hypothesis is true. If the p-value is smaller than some conventional level (usually 0.05 or
0.01), the null hypothesis is rejected and the result is considered unlikely to be due to chance.
• p < 0.05: H0 is rejected, and one concludes that there is a difference between, e.g., the groups.
• p > 0.05: H0 is not rejected, because there is insufficient evidence that, e.g., the groups are different.

The conventional level against which the p-value is compared, e.g. 0.05, is the significance level: the
degree of risk you are willing to take of rejecting a null hypothesis that is actually true. It needs to be
decided on before the statistical analysis.
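
One way to make the definition of the p-value concrete is an exact permutation test, sketched below. This is not necessarily the test used in the book; it is chosen here because it counts "as extreme or more extreme" outcomes directly. The scores are made up, and `mean_difference` and `permutation_p_value` are hypothetical helper names.

```python
from itertools import combinations

# Made-up scores for two groups, chosen to keep the arithmetic easy to follow.
group_a = [1, 2, 3, 4]
group_b = [5, 6, 7, 8]


def mean_difference(a, b):
    """Test statistic: difference between the two group means."""
    return sum(a) / len(a) - sum(b) / len(b)


def permutation_p_value(a, b):
    """Share of all regroupings at least as extreme as the observed grouping."""
    observed = abs(mean_difference(a, b))
    pooled = a + b
    indices = range(len(pooled))
    extreme = total = 0
    # If H0 is true, every way of splitting the pooled scores is equally likely.
    for chosen in combinations(indices, len(a)):
        chosen_set = set(chosen)
        new_a = [pooled[i] for i in chosen]
        new_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        if abs(mean_difference(new_a, new_b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total


p = permutation_p_value(group_a, group_b)
for alpha in (0.05, 0.01):  # significance levels fixed in advance
    decision = "reject H0" if p < alpha else "do not reject H0"
    print(f"p = {p:.4f}, alpha = {alpha}: {decision}")
```

With these scores only 2 of the 70 possible regroupings are as extreme as the observed one, so the same data lead to rejecting H0 at the 0.05 level but not at the 0.01 level.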

In order to compute the p-value, one has to know the number of degrees of freedom (df): the
number of values that are free to vary, which is often the sample size minus one.
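
A small sketch of what "free to vary" means, using a made-up sample of five values: once the mean is known, the last value is fixed by the other four, which is also why the sample variance divides by n - 1 rather than n.

```python
import statistics

# Made-up sample of five observations.
sample = [4, 7, 6, 3, 5]
n = len(sample)
df = n - 1  # once the mean is fixed, only n - 1 values are free to vary

mean = statistics.mean(sample)
sum_of_squares = sum((x - mean) ** 2 for x in sample)

# The sample variance divides the sum of squares by df, not by n.
variance_by_hand = sum_of_squares / df

# Knowing the mean and any n - 1 values determines the remaining value.
last_value = mean * n - sum(sample[:-1])

print(df, variance_by_hand, statistics.variance(sample), last_value)
```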

1.2.3 Type I and Type II errors
If H0 is rejected when it is in fact true, meaning there is no true difference between the groups,
this is a Type I error, also called a ‘false alarm’ or ‘false positive’. If the significance level is 0.05,
there is a 5% chance of rejecting H0 when it is in fact true.

If H0 is not rejected while it is in fact false, meaning there is a true difference between the groups,
this is a Type II error, also called a ‘false negative’.

Decreasing the significance level will decrease the chance of a Type I error and increase the chance
of a Type II error.
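
The trade-off between the two error types can be illustrated with a simulation. This is only a sketch under stated assumptions: the data are random draws from a normal distribution with a known standard deviation of 1, the test is a simple z-test, and the effect size (0.5), sample size (20) and number of runs (2000) are arbitrary choices. The function names are hypothetical.

```python
import random
from statistics import NormalDist, mean

STANDARD_NORMAL = NormalDist()
rng = random.Random(1)  # fixed seed so the run is repeatable


def two_tailed_p(sample, mu0=0.0, sigma=1.0):
    """p-value of a z-test for the sample mean (population sd assumed known)."""
    z = (mean(sample) - mu0) / (sigma / len(sample) ** 0.5)
    return 2 * (1 - STANDARD_NORMAL.cdf(abs(z)))


def simulate_p_values(true_mean, runs=2000, n=20):
    """Repeat the same experiment many times and collect the p-values."""
    return [
        two_tailed_p([rng.gauss(true_mean, 1.0) for _ in range(n)])
        for _ in range(runs)
    ]


p_when_h0_true = simulate_p_values(true_mean=0.0)   # no real effect
p_when_h0_false = simulate_p_values(true_mean=0.5)  # a real effect exists


def error_rates(alpha):
    """Return (Type I rate, Type II rate) for a given significance level."""
    type_1 = sum(p < alpha for p in p_when_h0_true) / len(p_when_h0_true)
    type_2 = sum(p >= alpha for p in p_when_h0_false) / len(p_when_h0_false)
    return type_1, type_2


for alpha in (0.05, 0.01):
    type_1, type_2 = error_rates(alpha)
    print(f"alpha={alpha}: Type I rate={type_1:.3f}, Type II rate={type_2:.3f}")
```

Because the same simulated experiments are judged at both levels, every rejection at 0.01 is also a rejection at 0.05, so the stricter level can only lower the Type I rate and raise the Type II rate.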

1.2.4 One-tailed and two-tailed statistical tests
The distinction between a directional and a non-directional H1 is important when one chooses an
appropriate statistical test. Most tests come in two flavors: one-tailed, if H1 is directional, and
two-tailed, if H1 is non-directional.

If H1 is ‘X is greater than Y’, the test statistic should fall in the right tail of the distribution
(the area shaded blue in the original figure). If H1 is ‘X is smaller than Y’, the test statistic
should be located in the left tail.
If H1 is ‘X is different from Y’, an extreme result can be observed in either the left or the right tail.




It is crucial that you formulate your alternative hypothesis and make your choice between one- and
two-tailed tests before you compute any test statistic.
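
The sketch below shows why this choice matters. It assumes a test statistic that follows the standard normal distribution (a z statistic); the value 1.8 is hypothetical, and `p_value` with its `alternative` argument is an illustrative helper, not a function from the book.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist()


def p_value(z, alternative):
    """p-value for a z statistic under three possible forms of H1."""
    if alternative == "greater":    # H1: X is greater than Y (right tail)
        return 1 - STANDARD_NORMAL.cdf(z)
    if alternative == "less":       # H1: X is smaller than Y (left tail)
        return STANDARD_NORMAL.cdf(z)
    if alternative == "two-sided":  # H1: X is different from Y (both tails)
        return 2 * (1 - STANDARD_NORMAL.cdf(abs(z)))
    raise ValueError(f"unknown alternative: {alternative}")


z = 1.8  # hypothetical test statistic
for alternative in ("greater", "less", "two-sided"):
    print(alternative, round(p_value(z, alternative), 4))
```

For this value the one-tailed p-value (about 0.036) is below 0.05 while the two-tailed p-value (about 0.072) is not, so choosing the type of test after seeing the result would let the researcher pick whichever conclusion they prefer.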
