070-773 by Microsoft Actual Free Exam Q&As

Question 1

You have one-class support vector machines (SVMs).
You have a large dataset, but you do not have enough training time to fully test the model.
What is an alternative method to validate the model?

A. Use Principal Components Analysis (PCA)-Based Anomaly Detection. B. Perform feature selection. C. Replace the SVMs with two-class SVMs. D. Use outlier detection.

Discussion 0

Correct Answer: A Vote an answer

Question 2

You need to use the ScaleR distributed processing in an Apache Hadoop environment.
Which data source should you use?

A. Microsoft SQL Server database B. ODBC data C. XDF data files D. Teradata database

Discussion 0

Correct Answer: C Vote an answer

Explanation: Only visible for PassTestking members. You can sign-up / login (it's free).

Question 3

You have a dataset.
You need to repeatedly split randomly the dataset so that 80 percent of the data is used as a training set and the remaining 20 percent is used as a test set.
Which method should you use?

A. imputation B. pruning C. threshold D. binary classification E. cross validation

Discussion 0

Correct Answer: E Vote an answer

Question 4

You are planning the compute contexts for your environment.
You need to execute rx-function calls in parallel.
What are three possible compute contexts that you can use to achieve this goal? Each correct answer presents a complete solution.
NOTE: Each correct selection is worth one point.

A. Map Reduce B. local sequential C. local parallel D. SQL E. Spark

Discussion 0

Correct Answer: A,C,E Vote an answer

Explanation: Only visible for PassTestking members. You can sign-up / login (it's free).

Microsoft Analyzing Big Data with Microsoft R - 070-773 Exam Practice Test