DATA MINING EXAM
QUESTIONS AND ANSWERS
A healthcare provider is building a patient readmission prediction model. Which
dataset should be used to tune the model's hyperparameters and select the
best-performing algorithm?
- Training set
- Validation set
- Test set
Answer: Validation set
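A minimal sketch of the role the validation set plays in the question above: candidate models are fitted on the training set, compared on the validation set, and only the chosen one is scored on the test set. The synthetic data, the 60/20/20 split, and the helper names (`fit_mean`, `fit_line`, `mse`) are assumptions made for illustration, not part of the exam.

```python
import random

random.seed(0)

# Synthetic data (an assumption for illustration): y = 2x + 1 plus noise.
points = [(x, 2 * x + 1 + random.gauss(0, 1)) for x in range(100)]
random.shuffle(points)
train, valid, test = points[:60], points[60:80], points[80:]

def fit_mean(rows):
    """Baseline candidate: always predict the mean of the training targets."""
    mean_y = sum(y for _, y in rows) / len(rows)
    return lambda x: mean_y

def fit_line(rows):
    """Second candidate: ordinary least-squares line through the training rows."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    sxx = sum((x - mean_x) ** 2 for x, _ in rows)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept

def mse(model, rows):
    """Mean squared error of a fitted model on the given rows."""
    return sum((model(x) - y) ** 2 for x, y in rows) / len(rows)

# Fit every candidate on the training set only.
fitted = {"mean": fit_mean(train), "linear": fit_line(train)}

# Compare candidates on the validation set and keep the best one.
valid_scores = {name: mse(model, valid) for name, model in fitted.items()}
best_name = min(valid_scores, key=valid_scores.get)

# Touch the test set once, for the final estimate of the selected model.
test_mse = mse(fitted[best_name], test)
print(best_name, round(test_mse, 3))
```

The test set is used only after the choice is made, so its score is not influenced by the selection step.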
An insurance company is building a model to predict customer churn. Which dataset
would be utilized to experiment with different features and model architectures?
- Training set
- Validation set
- Test set
Answer: Training set
A manufacturing company is developing a predictive maintenance model for its
equipment. Which dataset should be primarily used to analyze historical
maintenance data and build the model?
- Training set
- Validation set
- Test set
Answer: Training set
What values should be assigned to X and Y to split the data into train (40%),
validation (20%), and test (40%) sets, respectively? Enter the values as a pair
separated by a comma, rounded to two decimal places. For example, (0.40, 0.60).
train, temp = train_test_split(data, test_size=X)
valid, test = train_test_split(temp, test_size=Y)
Answer: (0.60, 0.67), since X = 0.20 + 0.40 = 0.60 and Y = 0.40 / 0.60 ≈ 0.67
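The arithmetic behind this answer can be sketched as a small helper. The function name `split_sizes` is hypothetical. It assumes the two-stage pattern shown in the question: the first call holds out validation plus test, and the second call divides that holdout.

```python
def split_sizes(train, valid, test):
    """Return (X, Y) for a two-stage train/validation/test split."""
    if abs(train + valid + test - 1.0) > 1e-9:
        raise ValueError("fractions must sum to 1")
    x = valid + test            # share of all data held out by the first split
    y = test / (valid + test)   # share of that holdout that becomes the test set
    return round(x, 2), round(y, 2)

print(split_sizes(0.40, 0.20, 0.40))  # prints (0.6, 0.67)
```

Y is a fraction of the temporary holdout, not of the full dataset, which is why it is 0.67 rather than 0.40.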
What values should be assigned to X and Y to split the data into train (50%),
validation (25%), and test (25%) sets, respectively? Enter the values as a pair
separated by a comma, rounded to two decimal places. For example, (0.40, 0.60).
train, temp = train_test_split(data, test_size=X)
valid, test = train_test_split(temp, test_size=Y)
Answer: (0.50, 0.50), since X = 0.25 + 0.25 = 0.50 and Y = 0.25 / 0.50 = 0.50
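One way to check an answer like this is to run the two-stage split and count the rows in each part. The sketch below uses a hypothetical pure-Python stand-in, `two_way_split`, rather than scikit-learn itself. It assumes the holdout size is the ceiling of `test_size` times the number of rows, and the 200-row dataset is made up for the check.

```python
import math
import random

def two_way_split(rows, test_size, seed=0):
    """Hypothetical stand-in for train_test_split: shuffle, then hold out a share."""
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_holdout = math.ceil(test_size * len(rows))  # assumed rounding rule
    cut = len(rows) - n_holdout
    return rows[:cut], rows[cut:]

data = list(range(200))                     # made-up dataset of 200 rows
train, temp = two_way_split(data, 0.50)     # X = 0.50
valid, test = two_way_split(temp, 0.50)     # Y = 0.50

print(len(train), len(valid), len(test))    # prints 100 50 50
```

The counts 100, 50, and 50 out of 200 correspond to the 50%, 25%, and 25% targets.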
What values should be assigned to X and Y to split the data into train (30%),
validation (35%), and test (35%) sets, respectively? Enter the values as a pair
separated by a comma, rounded to two decimal places. For example, (0.40, 0.60).