Microsoft Analyzing Big Data with Microsoft R - 070-773 Exam Practice Test

You have one-class support vector machines (SVMs).
You have a large dataset, but you do not have enough training time to fully test the model.
What is an alternative method to validate the model?
Correct Answer: A Vote an answer
You need to use the ScaleR distributed processing in an Apache Hadoop environment.
Which data source should you use?
Correct Answer: C Vote an answer
Explanation: Only visible for PassTestking members. You can sign-up / login (it's free).
You have a dataset.
You need to repeatedly split randomly the dataset so that 80 percent of the data is used as a training set and the remaining 20 percent is used as a test set.
Which method should you use?
Correct Answer: E Vote an answer
You are planning the compute contexts for your environment.
You need to execute rx-function calls in parallel.
What are three possible compute contexts that you can use to achieve this goal? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.
Correct Answer: A,C,E Vote an answer
Explanation: Only visible for PassTestking members. You can sign-up / login (it's free).