WG assignment 1A
1A.1
1. Big data refers to the large, diverse sets of information that grow at ever-increasing
rates. It encompasses the volume of information, the velocity or speed at which it is
created and collected, and the variety or scope of the data points being covered.
https://www.investopedia.com/terms/b/big-data.asp
The ability of society to harness information in novel ways to produce useful insights
or goods and services of significant value.
https://www.forbes.com/sites/gilpress/2014/09/03/12-big-data-definitions-whats-yours/#5ea88a4813ae
Data of a very large size, typically to the extent that its manipulation and management
present significant logistical challenges.
https://www.oed.com/view/Entry/18833?redirectedFrom=Big+data#eid301162177
2. The three V’s are volume, variety, and velocity. The volume of a data set refers to
how much data it holds. The variety of a data set refers to the different types of data
within one data set. The velocity of a data set refers to the speed at which its data is
created, collected, and processed.
The first definition I found includes all three V’s. The second definition I found
includes none of the three V’s. The last definition I found mentions only the volume
of big data.
1A.2
1. Yes, there is a large amount of data.
2. Yes, there is a large variety of different types of data.
3. Yes, a lot of data is added very fast.
4. The word summer is searched most often in the summer months, just as the word winter
is searched most often in the winter months. The word ski is searched in the same months
as the word winter, so the two seem to be related.
5. The word stress is most searched in October and April.
6. The word depression peaks at the same times as the word stress but is searched more
often than the word stress at all times.
7. The peak occurs in August 2016. Multiple natural disasters occurred and there was a
suicide bombing in Turkey.
8. It has peaks at the same points in time.
Mandy Roosendaal 2663488
9. Searches for the word depression have dropped worldwide but have risen in the
Netherlands.
1A.3
This type of data gathering is big data because there is a large amount of data (volume), a
large variety of data from different types of research (variety), and a lot of data processed
in a short amount of time (velocity).