IBM Data Science Quiz Questions and answers 2024 latest update
Data Science Landscape Quiz As a data Journalist, which of the following tasks are most germane to your role? Communication skills Brainpower Read More Previous Play Next Rewind 10 seconds Move forward 10 seconds Unmute 0:07 / 0:15 Full screen Which of the following is one of the most fundamental characteristics of a data scientist? Having a sense of curiosity about all things Which of the following are examples of unstructured data? Select all that applies. Facebook images Twitter feeds The Venn diagram that depicts the intersection of Science, Technology and Data has highlighted a cross section known as the 'danger zone.' Which of the following is an accurate depiction of this overlap in the Venn diagram? Has technology and data experience but no science (analytics) background. Data Science Methodology Quiz The eight data science methodology approaches can be viewed as two larger groupings, the second grouping comprises: train, validate, deploy models and the feedback environment. How is this second grouping different in overall approach from the first grouping (business understanding, exploration, transformation and visualization of data)? The second grouping addresses predictive and prescriptive analytics, whereas the first grouping addresses descriptive analytics. Which of the following is a true statement? Data scientists transform data into knowledge to solve business problems. Data journalists capture domain knowledge for successful business alignment. Data engineer architect how data is organized and ensure operability. All of the above are true Business understanding is the first part of your analytics journey. Which of the following come to mind when you are planning your business approach? Select one or more. Perform demand planning and supply chain optimization for your offerings across different segments Reduce costs If you had to choose one overarching difference between these methodologies in Question 19, which of the following would best depict that difference in approach? Unlike KDD and SEMMA, CRIPS-DM considers business understanding. Descriptive tables share which of the following characteristics? Measures of Central Tendency Measures of Dispersion Measures of Distribution All of the above answers are correct The data science methodology includes the following stages: (fill in the missing stage) business understanding, data exploration and preparation, data representation and transformation, ________________, validate data models, ______________, and environment feedback. Train data models, deploy data models Data Science on the Cloud Quiz Which of the following is an example of open source visualization and plotting tool or tools? Matplotlib Pixiedust OpenCV All of the above are correct. The Profile view, under the Refinery tab of Watson Studio is designed to present you with which of the following pieces of information? Frequency and statistics When working with Data Refinery in Watson Studio, you are presented with three tabs: Data, Profile and Visualization. What is the purpose of the Profile view? In the Profile view, the user can validate the data to see if any features may need further Data Refinery. The Communities tab of Watson Studio provides which of the following artifacts? Tutorials Data Sets Articles All of the above are correct. There are many ideas as to why some data scientists prefer Python over RStudio. Which of the following seems to be the prevailing argument that favors Python over R? Python is a more generalized language versus R which is more statistics focused. When using Jupyter Notebooks, inevitably, you will need to import libraries such as NumPy and SciPy. Which of the following integration layers best describes this kind of an activity? Scientific computing and statistics packages Explore and Prepare Data Quiz Hadley Wickham is known for saying "Tidy datasets are all alike, but every messy dataset is messy in its own way." Which of the following statements supports this assertion? Select all that apply. Avoid redundancy, logical errors, or issues with updates. Complement programming languages' ability to perform vectorized operations. Ensure Boolean values are encoded appropriately. When transforming messy data to tidy data, which of the following is a good practice? Multiple variables are stored in one column. Variables are stored in both rows and columns. Multiple types of observational units are stored in the same table. All of the above are correct. Data scientist and data engineers often access RDBMS databases to retrieve data. Which of the following specific tasks is an example of such tasks? Data scientists access the data via SQL or language-specific libraries. Data engineers perform a task called ETL (Extract, Transform, Load) where they take data from one source and move it to another. Use of NoSQL, since it is best for high latency and JSON based storage All of the above are correct. You can flag missing observations using machine learning (ML) model. Not all models address missing data equally. Which of the following statements is true regarding using ML models to flag missing data? Regression models handle summary statistics better. Tree based models handle outliers better. Represent and Transform Data Quiz ... With ____________ data, you have categorical variables that can be described by groups rather than numbers. Structured When would you use a histogram? To understand the distribution of a variable When would you use a bar chart? When I want to explore in time When I have categorized data
Written for
- Institution
- IBM Data Science
- Module
- IBM Data Science
Document information
- Uploaded on
- February 17, 2024
- Number of pages
- 9
- Written in
- 2023/2024
- Type
- Exam (elaborations)
- Contains
- Questions & answers