Intro to data analytics D491 questions and answers latest top score.
Data Transformation - correct answers. Data mapping: converting data from one format to another. Data deduplication: eliminating repeated or redundant data. Derived variables: creating new variables from existing ones. Data sorting or ordering: arranging data in a specific sequence.

Data Transformation occurs in the ______ _____________ phase; the role this applies to is the _____ A___ - correct answers. Data Preparation phase; Data Analyst

____________ is great at combining unstructured data feeds from multiple sources. - correct answers. Hadoop

Examples of when to use _____: stream processing, fraud detection and prevention, content management, risk management. - correct answers. Hadoop

Setting up a sandbox, extracting and transforming data, conditioning data, and exploring it visually occur in the ______ ____________ phase. - correct answers. Data Preparation

When you convert a Microsoft Word file to a PDF, for example, you are ________ data. - correct answers. Transforming

Running a Linux virtual machine on a Windows operating system is an example of ---------- - correct answers. Sandboxing

Some key features of an Analytical Sandbox may include: tools and features for c---------- and s---ing work with colleagues; flexibility to allow analysts to try out different analytical approaches and techniques; clear documentation and support resources to help analysts get up to speed quickly. - correct answers. Collaboration, sharing

Why is it important to collect data in a certain time frame? - correct answers. It yields more precise findings than working with an open-ended time frame.

___ testing works by randomly showing two versions of the same asset (ad, website, pop-up, offer, etc.) to different users. - correct answers. A/B

What does it mean for a dependent variable to be binary? (This applies to binary logistic regression.) - correct answers. A binary variable is a categorical variable that can take only one of two values. It is usually represented as a Boolean (True or False) or as an integer (0 or 1), for example yes or no, sick or not sick, obese or not obese.

______ Analysis is used when you're looking to segment or categorize a dataset into groups based on similarities, but aren't sure what those groups should be. - correct answers. Cluster Analysis

Preprocessing (of data) - correct answers. The process of transforming raw data into an understandable format.

Bounce Rate - correct answers. The percentage of visitors to a particular website who navigate away from the site after viewing only one page.

Logistic Regression - correct answers. A statistical analysis that estimates an individual's risk (probability) of an outcome as a function of one or more risk factors. The outcome of interest has two categories (yes or no, obese or not obese, at risk of cancer or not at risk of cancer, happens or does not happen, etc.).

K-means clustering - correct answers. Informally, the goal is to find groups of points that are close to each other but far from points in other groups. Each cluster is defined entirely and only by its centre, or mean value μk.

Random Forest - correct answers. An algorithm used for regression or classification that builds a collection of decision trees; the trees "vote" on the prediction (majority vote for classification, average for regression).

Examples of when to use Random Forest - correct answers. In healthcare: to identify the correct combination of components in a medicine, and to analyze a patient's medical history to identify diseases. For example, using symptoms to predict whether a patient more likely has malaria or a simple fever, or a cold or a sinus infection.
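The four data transformation operations named in the first card (mapping, deduplication, derived variables, sorting) can be sketched in a few lines of Python. This is only an illustrative sketch, not part of the course material: the sample records, the field names, and the helper functions `deduplicate`, `add_bmi`, and `to_csv_line` are all hypothetical, and BMI is used simply as a convenient derived variable.

```python
# Hypothetical sample records; field names are assumptions for illustration.
records = [
    {"name": "Ana", "height_cm": 170, "weight_kg": 65},
    {"name": "Ben", "height_cm": 180, "weight_kg": 90},
    {"name": "Ana", "height_cm": 170, "weight_kg": 65},  # exact duplicate
]


def deduplicate(rows):
    """Data deduplication: drop repeated rows, keeping the first occurrence."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(sorted(row.items()))  # hashable fingerprint of the row
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def add_bmi(row):
    """Derived variable: compute BMI from the existing height and weight."""
    height_m = row["height_cm"] / 100
    return {**row, "bmi": round(row["weight_kg"] / height_m ** 2, 1)}


def to_csv_line(row):
    """Data mapping: convert a dict record into a comma-separated text line."""
    fields = ("name", "height_cm", "weight_kg", "bmi")
    return ",".join(str(row[field]) for field in fields)


unique_rows = deduplicate(records)
with_bmi = [add_bmi(row) for row in unique_rows]
# Data sorting: order the records by the derived variable, highest first.
sorted_rows = sorted(with_bmi, key=lambda row: row["bmi"], reverse=True)
csv_lines = [to_csv_line(row) for row in sorted_rows]

for line in csv_lines:
    print(line)
```

In practice an analyst would more likely use a dataframe library for these steps; plain dictionaries are used here only to keep each operation visible.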
Document information
Course: Intro to data analytics D491
Uploaded on: January 5, 2024
Number of pages: 25
Written in: 2023/2024
Type: Exam (elaborations)
Contains: Questions & answers