Exam (elaborations)

WGU D204: The Data Analytics Journey (Answered) Verified Solution 2023 100%

Pages
30
Grade
A+
Uploaded on
29-11-2023
Written in
2023/2024

Data scientists are able to find __________, __________, and __________ in unstructured data.
Answer: order, meaning, and value

What is involved in the Planning phase?
Answer: 1. Define goals 2. Organize resources 3. Coordinate people 4. Schedule project

What is involved in the Wrangling phase?
Answer: 5. Get data 6. Clean data 7. Explore data 8. Refine data

What is involved in the Modeling phase?
Answer: 9. Create model 10. Validate model 11. Evaluate model 12. Refine model

What is involved in the Applying phase?
Answer: 13. Present model 14. Deploy model 15. Revisit model 16. Archive assets

__________ are programming languages that are very frequently used for data manipulation and modeling.
Answer: Python and R

__________ are general-purpose languages that are used for the back end, the foundational elements of data science, and they provide maximum speed.
Answer: C, C++, and Java

__________ is a language for working with relational databases to do queries and data manipulation.
Answer: SQL

What does SQL stand for?
Answer: Structured Query Language

This is where you actually create the statistical model: you do the linear regression, the decision tree, or the deep learning neural network.
Answer: Modeling

These are the developers and the system architects, the people who focus on the hardware and the software that make data science possible.
Answer: Data engineers

This is the phase of collecting data.
Answer: Data acquisition

Which phase involves working with stakeholders to help them ask better questions so that both they and you understand the outcome?
Answer: Discovery

What are the 4 parts of the data analytics cycle?
Answer: Planning, Wrangling, Modeling, and Applying

This phase is also known as the discovery phase. During this phase, an analyst defines the major questions of interest that need to be answered, understands the needs of the stakeholders, and assesses the resource constraints of the project.
Answer: Business understanding

__________ is the person who champions the vision of the project and has the authority to allocate resources.
Answer: The project sponsor

__________ is responsible for making sure things get done on time and within budget, and removes roadblocks.
Answer: Project manager

__________ is when new requirements are added to the project that increase the time or resources needed to complete it.
Answer: Scope creep

What are the 3 types of analysis?
Answer: Descriptive, Predictive, Prescriptive

__________ describes the data that is present: mean, median, mode, counting things. How many of each size and color of shirt were sold in the last month? Do we sell more shirts in the summer than in the winter?
Answer: Descriptive analysis

__________ makes predictions about the future state of the business, for example forecasting volumes. Based on last summer and winter, what will we sell next year?
Answer: Predictive analytics

__________ is analysis with an end goal of making a recommendation. What colors and sizes of shirts should we sell to maximize profits?
Answer: Prescriptive analytics

__________ is just looking at any variable over time.
Answer: Time series analysis

__________ is a programming language that is specific to statistics. It also has capabilities to visualize data.
Answer: R

__________ is a multipurpose programming language that has libraries that extend its capabilities to do statistical analysis.
Answer: Python

__________ are platforms that specialize in visualization. This is where you can make graphs and charts for presentations and data storytelling to executive leaders.
Answer: Tableau and Power BI

__________ are instant messaging platforms that facilitate communication in a faster, but less formal, way than email.
Answer: Teams, Slack

A European Union law under which citizens must give informed consent and be able to request or delete the data that you collect about them.
Answer: GDPR

When the researching organization consciously ignores data that calls its results into question, or only presents the side of the results that puts it in a positive light.
Answer: Conflict of interest

Sometimes data might not be available and the analyst will use tools such as web scraping or surveys to acquire it. During which phase?
Answer: Data acquisition

The __________ states that the sampling distribution of the sample means approaches a normal distribution as the sample size gets larger. For example, if you were to take 50 people out of a population and get their mean age, then take another 50 random people and get their mean age, and so forth, all of those means would follow the normal distribution (bell curve).
Answer: Central Limit Theorem

In this phase, the analyst begins to understand the basic nature of the data and the relationships within it. This phase often relies on the use of data visualization tools and numerical summaries, such as measures of central tendency and variability.
Answer: Data Exploration

__________ enables an analyst to move beyond describing the data to creating models that enable predicting outcomes of interest.
Answer: Predictive Modeling

Tools such as __________ play an important role in automating the training and use of models.
Answer: Python and R

In this phase, an analyst tells the story of the data and uses graphs or interactive dashboards to inform others of the findings from the analyses.
Answer: Reporting and Visualization

Even if you have a wide spread of a variable, let's say age in a population, and you take lots of sample groups, the mean age of those sample groups would tend to have a normal distribution.
Answer: Central Limit Theorem

This is the phase of collecting data. Frequently, data will be retrieved from a database, perhaps a component of a data warehouse, by using a language like SQL.
Answer: Data Acquisition

"Collect the data" is synonymous with __________.
Answer: data acquisition

Exploring the data could be seen either in "__________" or "__________".
Answer: Prepare the data, Create a model

Predictive or data mining models could be considered in the "__________" grouping.
Answer: Create a model

__________ examines the distances between each point and the closest point to it, and then compares these to expected values for a random sample of points from a CSR (complete spatial randomness) pattern.
Answer: Nearest Neighbor

__________ is a simple mathematical formula used for calculating conditional probabilities.
Answer: Bayes' Theorem

Interactive dashboard tools, such as __________, give even the novice user the ability to interact with the data and spot trends and patterns.
Answer: Tableau

Data Acquisition (Step 5), Data Cleaning (Step 6), and Data Exploration (Step 7) in this framework all fall under the "__________" domain.
Answer: "Wrangling" domain

The __________ section would contain the ideas of predictive modeling as well as data mining/machine learning.
Answer: "Modeling"

These are people who have extensive experience in computer science and in mathematics. They work in deep learning and in artificial intelligence. They are the ones who have an intimate understanding of the algorithms and understand exactly how they work with the data to produce the results that you're looking for.
Answer: Machine learning specialists

They focus on domain-specific research; physics and genetics are common, and so are astrophysics, medicine, and psychology. These researchers connect with data science, but they are usually better versed in the design of research within their particular field and in doing common statistical analyses. They are trying to find the answers to some of the big-picture questions that data scientists can also contribute to.
Answer: Topical researchers

These are people who do the day-to-day data tasks that are necessary for any business to run efficiently. Those include things like web analytics, SQL (Structured Query Language), data visualizations, and the reports that go into business intelligence. These allow people to make decisions.
Answer: Analysts

They need to frame the business-relevant questions and solutions. Then, they need to keep people on track and moving towards them.
Answer: Managers

They don't necessarily need to know how to do a neural network or make the data visualization, but they need to speak data so they can understand how the data relates to the question they're trying to answer, and they can help take the information that the other people are getting and put it together into a cohesive whole.
Answer: Managers

They often need all of the skills, including the business acumen, to make the business run well. They also need great creativity in planning projects and in the execution that gets them towards their entrepreneurial goals.
Answer: Entrepreneurs

This is a full-stack data scientist who can do it all, and do it at absolute peak performance.
Answer: Unicorn, also known as the rock star or the ninja

True or False: You can get a unicorn by building a team whose members together have all the necessary skills.
Answer: True

__________ means algorithms that learn from data.
Answer: Artificial intelligence

__________ are sets of algorithms intended to recognize patterns and interpret data through clustering or labeling.
Answer: Neural networks

True or False: Data science can be done without machine learning.
Answer: True

If a person feels that they have been harmed by a decision made by a neural network, such as a refused loan application, they can sue the organization.
Answer: Right to explanation

__________ is data that is characterized by any or all of three characteristics: unusual volume, unusual velocity, and unusual variety.
Answer: Big data

__________ analytics is about causation.
Answer: Prescriptive

The gold standard for establishing cause and effect is what's called a __________.
Answer: Randomized controlled trial (RCT)

__________ are a whole host of research designs that let you use correlational data to try to estimate the size of the causal relationship between two variables.
Answer: Quasi-experiments

You can do a very good __________ without needing everything that goes into data science.
Answer: prescriptive analysis

__________ may be, at least in theory, impossible. But __________ can get you close enough for any practical purposes and help put you and your organization on the right path to maximizing the outcomes that are most important to you.
Answer: Causality, prescriptive analytics

__________ is all about getting the insight to do something better in your business.
Answer: Business intelligence

You can get the analytics and see how well something is performing, who's watching it, and when. That's a __________ of a form.
Answer: business intelligence dashboard

Two of the most important things you can do in business intelligence are to __________, to predict what's likely to happen next, and to __________.
Answer: find trends, flag anomalies

__________ is what makes business intelligence possible.
Answer: Data science

__________ really shows to the best extent how data science can be used to make practical decisions that make organizations function more effectively and more efficiently.
Answer: Business intelligence

We have open-source programming languages like __________ that make more rigorous data analysis inexpensive and relatively easy as well.
Answer: R and Python

We can convey key performance indicators of our business to __________, __________, and __________ using dashboards.
Answer: executives, management, and employees

We can convey complex information about our business to a wider audience using __________ that allow users to rapidly consume and digest data.
Answer: infographics

__________ analytics answers what has happened in the past.
Answer: Descriptive

__________ data is information that is gathered in non-numerical form that is typically __________ and may be recoded to try to quantify its meaning.
Answer: Qualitative, descriptive

__________ data includes things such as summaries of written comments on customer cards collected from suggestion boxes at stores, results from interviews of store managers by an outside consultant, and a paragraph taken from an employee's self-evaluation on a performance review.
Answer: Qualitative

Data is made up of a set of __________, the individual units being measured.
Answer: Observations

The __________ is the middle number in a series that is arranged from smallest to largest.
Answer: median

The __________ is the most commonly occurring number in the dataset.
Answer: mode

The fact that the __________ is not close to the __________ or the __________ tells us the distribution of scores is skewed. The scores are not evenly distributed around the __________.
Answer: average, median, mode, mean

The __________ and the __________ inform a user of the central points in skew of the data.
Answer: mean, median

If the mean and the median are fairly close to each other, we likely have some type of __________
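The mean, median, and mode cards above can be tried out with a few lines of Python. This is only a minimal sketch using the standard `statistics` module; the `scores` list and the `describe` helper are made-up illustrations, not part of the course material.

```python
from statistics import mean, median, mode

# Hypothetical exam scores; the single low score pulls the mean down.
scores = [35, 80, 85, 85, 90, 95]


def describe(values):
    """Return the three measures of central tendency named in the cards."""
    return {
        "mean": mean(values),      # arithmetic average
        "median": median(values),  # middle value of the sorted data
        "mode": mode(values),      # most commonly occurring value
    }


summary = describe(scores)

# A mean well below the median hints at a left (negative) skew;
# a mean well above it hints at a right (positive) skew.
skew_hint = "left" if summary["mean"] < summary["median"] else "right or none"
print(summary, skew_hint)
```

Here the mean (about 78.3) sits below the median and mode (both 85), which is the kind of gap the skew cards describe.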
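The Central Limit Theorem cards above describe repeatedly taking groups of 50 and recording each group's mean. The simulation below is a rough sketch of that idea, assuming a made-up, strongly skewed population generated with `random.expovariate`; the seed, population size, and number of samples are arbitrary choices for illustration.

```python
import random
from statistics import mean, pstdev, stdev

random.seed(204)  # fixed seed so the sketch is repeatable

# Hypothetical, strongly right-skewed population (not bell-shaped at all).
population = [random.expovariate(1.0) for _ in range(10_000)]

SAMPLE_SIZE = 50     # group size used in the flashcard example
NUM_SAMPLES = 2_000  # how many groups we draw

# Draw many random samples of 50 and record each sample's mean.
sample_means = [
    mean(random.sample(population, SAMPLE_SIZE)) for _ in range(NUM_SAMPLES)
]

pop_mean = mean(population)
# The theorem predicts the sample means cluster around the population mean
# with spread close to (population standard deviation) / sqrt(sample size).
predicted_spread = pstdev(population) / SAMPLE_SIZE ** 0.5
observed_spread = stdev(sample_means)

print(f"population mean      : {pop_mean:.3f}")
print(f"mean of sample means : {mean(sample_means):.3f}")
print(f"predicted spread     : {predicted_spread:.3f}")
print(f"observed spread      : {observed_spread:.3f}")
```

Plotting a histogram of `sample_means` (for example in R, Python, or Tableau) should show a roughly bell-shaped curve even though the population itself is skewed.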
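The Bayes' Theorem card above calls it a simple formula for conditional probabilities: P(A|B) = P(B|A) * P(A) / P(B). The sketch below works one example; the function name `bayes_posterior` and the churn numbers are hypothetical and chosen only for illustration.

```python
def bayes_posterior(prior, likelihood, false_positive_rate):
    """Return P(A|B) = P(B|A) * P(A) / P(B).

    P(B) comes from the law of total probability:
    P(B) = P(B|A) * P(A) + P(B|not A) * P(not A).
    """
    evidence = likelihood * prior + false_positive_rate * (1 - prior)
    return likelihood * prior / evidence


# Hypothetical numbers: 1% of customers churn, a model flags 90% of the
# churners, and it wrongly flags 5% of the customers who do not churn.
p_churn_given_flag = bayes_posterior(
    prior=0.01, likelihood=0.90, false_positive_rate=0.05
)
print(f"P(churn | flagged) = {p_churn_given_flag:.3f}")
```

With these assumed inputs, only about 15% of flagged customers actually churn, because churners are rare to begin with.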

Institution
WGU D204: The Data Analytics Journey Ve
Module
WGU D204: The Data Analytics Journey Ve











Document information

Uploaded on
November 29, 2023
Number of pages
30
Written in
2023/2024
Type
Exam (elaborations)
Contains
Questions & answers

Topscorer1 South University