

MCQOPTIONS
Saved Bookmarks
This section includes 93 Mcqs, each offering curated multiple-choice questions to sharpen your Mongodb knowledge and support exam preparation. Choose a topic below to get started.
1. |
Which is used to find the factor congruence coefficients? |
A. | factor.mosaicplot |
B. | factor.xyplot |
C. | factor.congruence |
D. | factor.cumsum |
Answer» D. factor.cumsum | |
2. |
What is true about Data Visualization? |
A. | Data Visualization is used to communicate information clearly and efficiently to users by the usage of information graphics such as tables and charts. |
B. | Data Visualization helps users in analyzing a large amount of data in a simpler way. |
C. | Data Visualization makes complex data more accessible, understandable, and usable. |
D. | All of the above |
Answer» E. | |
3. |
The new source of big data that will trigger a Big Data revolution in theyears to come is? |
A. | Business transactions |
B. | Social media |
C. | Transactional data and sensor data |
D. | RDBMS |
Answer» D. RDBMS | |
4. |
What is a sentence parser typically used for? |
A. | It is used to parse sentences to check if they are utf-8 compliant. |
B. | It is used to parse sentences to derive their most likely syntax tree structures. |
C. | It is used to parse sentences to assign POS tags to all tokens. |
D. | It is used to check if sentences can be parsed into meaningful tokens. |
Answer» C. It is used to parse sentences to assign POS tags to all tokens. | |
5. |
In descriptive statistics, data from the entire population or a sample issummarized with ? |
A. | integer descriptors |
B. | floating descriptors |
C. | numerical descriptors |
D. | decimal descriptors |
Answer» D. decimal descriptors | |
6. |
Which of the following can be used to impute data sets based only on informationin the training set? |
A. | postprocess |
B. | preProcess |
C. | process |
D. | All of the Mentioned |
Answer» C. process | |
7. |
Which of the following is a reasonable way to select the number of principal components "k"? |
A. | Choose k to be the smallest value so that at least 99% of the varinace is retained. |
B. | Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer). |
C. | Choose k to be the largest value so that 99% of the variance is retained. |
D. | Use the elbow method. |
Answer» B. Choose k to be 99% of m (k = 0.99*m, rounded to the nearest integer). | |
8. |
The branch of statistics which deals with development of particularstatistical methods is classified as |
A. | industry statistics |
B. | economic statistics |
C. | applied statistics |
D. | applied statistics |
Answer» E. | |
9. |
Numbers ,text, image, audio and video data is ____ |
A. | Volume |
B. | Value |
C. | Varity |
D. | Variety |
Answer» E. | |
10. |
Which of the following is tool for checking normality? |
A. | qqline() |
B. | qline() |
C. | anova() |
D. | lm() |
Answer» B. qline() | |
11. |
File containing R scripts end with extension _______. |
A. | .R |
B. | .S |
C. | .bigdata |
D. | All of the above |
Answer» B. .S | |
12. |
Common use cases for data visualization include? |
A. | Politics |
B. | Sales and marketing |
C. | Healthcare |
D. | All of the above |
Answer» E. | |
13. |
According to analysts, for what can traditional IT systems provide afoundation when they’re integrated with big data technologies like Hadoop? |
A. | Big data management and data mining |
B. | Data warehousing and business intelligence |
C. | Management of Hadoop clusters |
D. | Collecting and storing unstructured data |
Answer» B. Data warehousing and business intelligence | |
14. |
The examination of large amounts of data to see what patterns or otheruseful information can be found is known as |
A. | Data examination |
B. | Information analysis |
C. | Big data analytics |
D. | Data analysis |
Answer» D. Data analysis | |
15. |
______ is the term that is used to describe data that is high volume , highvelocity and /or high variety. |
A. | Analytics |
B. | Bigdata |
C. | Hadoop Data |
D. | Bigdata analytics |
Answer» C. Hadoop Data | |
16. |
Raw data should be processed only one time. |
A. | True |
B. | False |
C. | Can be true or false |
D. | Can not say |
Answer» C. Can be true or false | |
17. |
Which of the following are ML methods? |
A. | based on human supervision |
B. | supervised Learning |
C. | semi-reinforcement Learning |
D. | All of the above |
Answer» B. supervised Learning | |
18. |
Which of the following are correct component for data science? |
A. | Data Engineering |
B. | Advanced Computing |
C. | Domain expertise |
D. | All of the above |
Answer» E. | |
19. |
Which of the following techniques can not be used for normalization intext mining? |
A. | Stemming |
B. | Lemmatization |
C. | Stop Word Removal |
D. | None of the above |
Answer» D. None of the above | |
20. |
In Model based learning methods, an iterative process takes place on theML models that are built based on various model parameters, called ? |
A. | mini-batches |
B. | optimizedparameters |
C. | hyperparameters |
D. | superparameters |
Answer» D. superparameters | |
21. |
What is true about Machine Learning? |
A. | Machine Learning (ML) is that field of computer science |
B. | ML is a type of artificial intelligence that extract patterns out of raw data by using an algorithm or method. |
C. | The main focus of ML is to allow computer systems learn from experience without being explicitly programmed or human intervention. |
D. | All of the above |
Answer» E. | |
22. |
Data Analysis is defined by the statistician? |
A. | William S. |
B. | Hans Peter Luhn |
C. | Gregory Piatetsky-Shapiro |
D. | John Tukey |
Answer» E. | |
23. |
A__________ begins by hypothesizing a sentence (the symbol S) and successively predicting lower level constituents until individual preterminal symbols are written. |
A. | bottow-up parser |
B. | top parser |
C. | top-down parser |
D. | bottom parser |
Answer» D. bottom parser | |
24. |
Which of the following step is performed by data scientist after acquiringthe data? |
A. | Data Cleaning |
B. | Data Integration |
C. | Data Replication |
D. | All of the above |
Answer» B. Data Integration | |
25. |
The modern conception of data science as an independent discipline issometimes attributed to? |
A. | William S. |
B. | John McCarthy |
C. | Arthur Samuel |
D. | Satoshi Nakamoto |
Answer» B. John McCarthy | |
26. |
Which of the following is true about regression analysis? |
A. | answering yes/no questions about the data |
B. | estimating numerical characteristics of the data |
C. | modeling relationships within the data |
D. | describing associations within the data |
Answer» D. describing associations within the data | |
27. |
How many layers Deep learning algorithms are constructed? |
A. | 2 |
B. | 3 |
C. | 4 |
D. | 5 |
Answer» C. 4 | |
28. |
Which of the following is a subset of machine learning? |
A. | Numpy |
B. | SciPy |
C. | Deep Learning |
D. | All of the above |
Answer» D. All of the above | |
29. |
Who popularized bigdata term? |
A. | John deere |
B. | John Mashey |
C. | johny Mashe |
D. | Jhon Mash |
Answer» C. johny Mashe | |
30. |
___________ is general-purpose computing model and runtime system fordistributed data analytics. |
A. | Mapreduce |
B. | Drill |
C. | Oozie |
D. | None of the above |
Answer» B. Drill | |
31. |
In which of the following cases will K-means clustering fail to give goodresults? 1) Data points with outliers 2) Data points with different densities 3) Data points with nonconvex shapes |
A. | 1 and 2 |
B. | 2 and 3 |
C. | 1 and 3 |
D. | All of the above |
Answer» E. | |
32. |
Data science is the process of diverse set of data through ? |
A. | organizing data |
B. | processing data |
C. | analysing data |
D. | All of the above |
Answer» E. | |
33. |
To find the minimum or the maximum of a function, we set the gradient to zero because: |
A. | The value of the gradient at extrema of a function is always zero |
B. | Depends on the type of problem |
C. | Both A and B |
D. | None of the above |
Answer» B. Depends on the type of problem | |
34. |
To find the minimum or the maximum of a function, we set the gradient tozero because: |
A. | The value of the gradient at extrema of a function is always zero |
B. | Depends on the type of problem |
C. | Both A and B |
D. | None of the above |
Answer» B. Depends on the type of problem | |
35. |
Which of the following are the Data Sources in data science? |
A. | Structured |
B. | Unstructured |
C. | Both A and B |
D. | None Of the above |
Answer» D. None Of the above | |
36. |
The model will be trained with data in one single batch is known as ? |
A. | Batch learning |
B. | Offline learning |
C. | Both A and B |
D. | None of the above |
Answer» D. None of the above | |
37. |
Data can be visualized using? |
A. | graphs |
B. | charts |
C. | maps |
D. | All of the above |
Answer» E. | |
38. |
The goal of business intelligence is to allow easy interpretation of largevolumes of data to identify new opportunities. |
A. | TRUE |
B. | FALSE |
C. | Can be true or false |
D. | Can not say |
Answer» B. FALSE | |
39. |
Which of the following model model include a backwards elimination featureselection routine? |
A. | MCV |
B. | MARS |
C. | MCRS |
D. | All of the Mentioned |
Answer» C. MCRS | |
40. |
Which method shows hierarchical data in a nested format? |
A. | Treemaps |
B. | Scatter plots |
C. | Population pyramids |
D. | Area charts |
Answer» B. Scatter plots | |
41. |
Which is used to inference for 1 proportion using normal approx? |
A. | fisher.test() |
B. | chisq.test() |
C. | Lm.test() |
D. | prop.test() |
Answer» E. | |
42. |
Which of the following is the common goal of statistical modelling? |
A. | Inference |
B. | Summarizing |
C. | Subsetting |
D. | None of the above |
Answer» B. Summarizing | |
43. |
When performing regression or classification, which of the following is thecorrect way to preprocess the data? |
A. | Normalize the data -> PCA -> training |
B. | PCA -> normalize PCA output -> training |
C. | Normalize the data -> PCA -> normalize PCA output -> training |
D. | None of the above |
Answer» B. PCA -> normalize PCA output -> training | |
44. |
Which of the following is one of the key data science skills? |
A. | Statistics |
B. | Machine Learning |
C. | Data Visualization |
D. | All of the above |
Answer» E. | |
45. |
Text Analytics, also referred to as Text Mining? |
A. | TRUE |
B. | FALSE |
C. | Can be true or false |
D. | Can not say |
Answer» B. FALSE | |
46. |
Which of the following techniques can not be used for normalization in text mining? |
A. | Stemming |
B. | Lemmatization |
C. | Stop Word Removal |
D. | None of the above |
Answer» D. None of the above | |
47. |
Real time data is ______. |
A. | Field |
B. | Primary Key |
C. | unique |
D. | record |
Answer» D. record | |
48. |
Which of the following model is usually a gold standard for data analysis? |
A. | Inferential |
B. | Descriptive |
C. | Causal |
D. | All of the above |
Answer» D. All of the above | |
49. |
Which of the following is a key characteristic of a hacker? |
A. | Afraid to say they don't know the answer |
B. | Willing to find answers on their own |
C. | Not Willing to find answers on their own |
D. | All of the above |
Answer» C. Not Willing to find answers on their own | |
50. |
In which of the following cases will K-means clustering fail to give good results?1) Data points with outliers 2) Data points with different densities 3) Data points with nonconvex shapes |
A. | 1 and 2 |
B. | 2 and 3 |
C. | 1 and 3 |
D. | All of the above |
Answer» E. | |