Exam Review
Stemming - ✔✔-reducing words to their root (stem) form so that related word forms can be found and compared
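A minimal sketch of the stemming idea, assuming a tiny hand-picked suffix list (hypothetical, for illustration only). Real stemmers such as the Porter stemmer apply far more careful rules; this only shows how different word forms can be reduced to one root and then compared.

```python
# Hypothetical suffix list chosen for illustration; not a real stemming algorithm.
SUFFIXES = ("ing", "ed", "es", "s")


def simple_stem(word: str) -> str:
    """Strip the first matching suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


words = ["jumping", "jumped", "jumps", "jump"]
stems = [simple_stem(w) for w in words]

# Words that share a stem are treated as forms of the same root.
same_root = len(set(stems)) == 1
print(stems, same_root)
```

Naive suffix stripping like this will mis-stem many words (irregular forms, short words), which is why established stemmers use rule sets with extra conditions.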
Data types - ✔✔-unstructured and structured
structured Dataset - ✔✔-fixed dimensions (fixed number of rows and columns), well organised, tabular
data, key-value pairs
unstructured Dataset - ✔✔-no fixed dimensions, no predefined structure, can take any form; audio, video,
images and text can all be unstructured data. Information held in unstructured data is not directly
available for analysis; it has to be processed first.
key-value pairs (KVP) - ✔✔-means there is one key, which is a unique identifier, and a value, which is
either the data or a pointer to the location of that data
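A short sketch of key-value pairs using a Python dictionary. The keys, records and file path below are hypothetical; the point is that each key is unique and its value is either the data itself or a pointer to where the data lives.

```python
# Hypothetical store: every key is a unique identifier.
store = {
    "user:1001": {"name": "Ada", "country": "UK"},  # value holds the data itself
    "user:1002": "/data/users/1002.json",           # value points to the data's location
}


def lookup(key: str):
    """Return the value stored under key, or None if the key is absent."""
    return store.get(key)


print(lookup("user:1001"))
print(lookup("user:1002"))
print(lookup("user:9999"))
```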
text data - ✔✔-example of unstructured data, such as Social Media: tweets, posts, comments;
Conversations: messages, emails, chats; Articles: news and blogs, transcripts. Contains words arranged
in a meaningful manner; is written in the form of a language and is defined by grammar and other structure
natural language processing (NLP) - ✔✔-a branch of computer science and artificial intelligence that
deals with human languages in order to gain information and insights.
Applied NLP - ✔✔-The use of NLP for designing and developing applications or systems in which
machines interact with natural languages
Text mining - ✔✔-deriving useful information and insights from text.
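One simple way to derive information from text is to count which terms occur most often. The sketch below assumes a tiny hypothetical stopword list and made-up example documents; it illustrates the idea and is not a full text-mining pipeline.

```python
import re
from collections import Counter

# Hypothetical stopword list; real lists are much longer.
STOPWORDS = frozenset({"the", "a", "is", "and", "to", "of"})


def top_terms(documents, n=3):
    """Count non-stopword tokens across all documents and return the n most common."""
    counts = Counter()
    for doc in documents:
        tokens = re.findall(r"[a-z']+", doc.lower())
        counts.update(t for t in tokens if t not in STOPWORDS)
    return counts.most_common(n)


# Made-up example documents.
docs = [
    "The battery life is great and the screen is great",
    "Battery drains fast",
    "Great screen, poor battery",
]
result = top_terms(docs)
print(result)
```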
sentiment analysis - ✔✔-a technique that identifies the opinion expressed in text (for example positive,
negative or neutral); it allows marketers to analyze consumer comments collected from social media sites
about companies and their products
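A minimal lexicon-based sketch of sentiment analysis, assuming two small hypothetical word lists. Production systems typically use much larger lexicons or trained models; this only shows the basic idea of scoring a comment as positive, negative or neutral.

```python
import re

# Hypothetical word lists chosen for illustration.
POSITIVE = {"good", "great", "love", "excellent", "happy"}
NEGATIVE = {"bad", "poor", "hate", "terrible", "slow"}


def sentiment(text: str) -> str:
    """Label text by comparing counts of positive and negative words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


# Made-up consumer comments.
comments = [
    "I love this phone, the camera is great",
    "Terrible support and slow delivery",
    "It arrived on Tuesday",
]
labels = [sentiment(c) for c in comments]
print(labels)
```

Counting words this way ignores negation ("not good") and sarcasm, so it is a starting point for understanding the concept, not a reliable classifier.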