

MCQOPTIONS
This section includes 230 Mcqs, each offering curated multiple-choice questions to sharpen your Technical Programming knowledge and support exam preparation. Choose a topic below to get started.
151. |
This gathering of data can be? |
A. | scheduled |
B. | event-driven |
C. | user-defined |
D. | both a and b |
Answer» E. | |
152. |
Which of the following is the slowest compression technique? |
A. | LZO |
B. | Bzip2 |
C. | Gzip |
D. | All of the mentioned |
Answer» C. Gzip | |
153. |
Which of the following supports splittable compression? |
A. | LZO |
B. | Bzip2 |
C. | Gzip |
D. | All of the mentioned |
Answer» B. Bzip2 | |
154. |
__________ typically compresses files to within 10% to 15% of the best available techniques. |
A. | LZO |
B. | Bzip2 |
C. | Gzip |
D. | All of the mentioned |
Answer» C. Gzip | |
155. |
The LZO compression format is composed of approximately __________ blocks of compressed data. |
A. | 128k |
B. | 256k |
C. | 24k |
D. | 36k |
Answer» C. 24k | |
156. |
The ________ method in the ModelCountReducer class “reduces” the values the mapper collects into a derived value. |
A. | count |
B. | add |
C. | reduce |
D. | all of the mentioned |
Answer» D. all of the mentioned | |
157. |
Which of the following is based on the DEFLATE algorithm? |
A. | LZO |
B. | Bzip2 |
C. | Gzip |
D. | All of the mentioned |
Answer» D. All of the mentioned | |
158. |
Avro schemas describe the format of the message and are defined using ______________ |
A. | JSON |
B. | XML |
C. | JS |
D. | All of the mentioned |
Answer» B. XML | |
159. |
Which of the following works well with Avro? |
A. | Lucene |
B. | kafka |
C. | MapReduce |
D. | None of the mentioned |
Answer» D. None of the mentioned | |
160. |
Apache _______ is a serialization framework that produces data in a compact binary format. |
A. | Oozie |
B. | Impala |
C. | kafka |
D. | Avro |
Answer» E. | |
161. |
When using reflection to automatically build our schemas without code generation, we need to configure Avro using? |
A. | AvroJob.Reflect(jConf); |
B. | AvroJob.setReflect(jConf); |
C. | Job.setReflect(jConf); |
D. | None of the mentioned |
Answer» D. None of the mentioned | |
162. |
With ______ we can store data and read it easily with various programming languages. |
A. | Thrift |
B. | Protocol Buffers |
C. | Avro |
D. | None of the mentioned |
Answer» D. None of the mentioned | |
163. |
Which of the following is a primitive data type in Avro? |
A. | |
B. | boolean |
C. | float |
D. | all of the mentioned |
Answer» E. | |
164. |
________ are encoded as a series of blocks. |
A. | Arrays |
B. | Enum |
C. | Unions |
D. | Maps |
Answer» B. Enum | |
165. |
The __________ codec uses Google’s Snappy compression library. |
A. | |
B. | snappy |
C. | deflate |
D. | none of the mentioned |
Answer» C. deflate | |
166. |
BatchEE projects aims to provide an _________ implementation. (aka JSR352) |
A. | JBat |
B. | JBatch |
C. | JBash |
D. | None of the mentioned |
Answer» C. JBash | |
167. |
_________ allows database-like access, and in particular a SQL interface. |
A. | JBatch |
B. | Calcite |
C. | Blur |
D. | All of the mentioned |
Answer» C. Blur | |
168. |
___________ is a toolkit/application for converting between and editing common office file formats. |
A. | Droids |
B. | DataFu |
C. | Corinthia |
D. | Ignite |
Answer» D. Ignite | |
169. |
Spark runs on top of ___________ a cluster manager system which provides efficient resource isolation across distributed applications. |
A. | Mesjs |
B. | Mesos |
C. | Mesus |
D. | All of the mentioned |
Answer» C. Mesus | |
170. |
Users can easily run Spark on top of Amazon’s __________ |
A. | Infosphere |
B. | EC2 |
C. | EMR |
D. | None of the mentioned |
Answer» C. EMR | |
171. |
Spark architecture is ___________ times as fast as Hadoop disk-based Apache Mahout and even scales better than Vowpal Wabbit. |
A. | 10 |
B. | 20 |
C. | 50 |
D. | 100 |
Answer» B. 20 | |
172. |
____________ is a distributed machine learning framework on top of Spark. |
A. | MLlib |
B. | Spark Streaming |
C. | GraphX |
D. | RDDs |
Answer» B. Spark Streaming | |
173. |
GraphX provides an API for expressing graph computation that can model the __________ abstraction. |
A. | GaAdt |
B. | Spark Core |
C. | Pregel |
D. | None of the mentioned |
Answer» D. None of the mentioned | |
174. |
____________ data file takes is based on avro serialization framework which was primarily created for hadoop. |
A. | Oozie |
B. | Avro |
C. | cTakes |
D. | Lucene |
Answer» C. cTakes | |
175. |
Which of the following compression is similar to Snappy compression? |
A. | LZO |
B. | Bzip2 |
C. | Gzip |
D. | All of the mentioned |
Answer» B. Bzip2 | |
176. |
How many formats of SequenceFile are present in Hadoop I/O? |
A. | 2 |
B. | 3 |
C. | 4 |
D. | 5 |
Answer» C. 4 | |
177. |
Which of the following format is more compression-aggressive? |
A. | Partition Compressed |
B. | Record Compressed |
C. | Block-Compressed |
D. | Uncompressed |
Answer» D. Uncompressed | |
178. |
Point out the correct statement : |
A. | All hadoop commands are invoked by the bin/hadoop script |
B. | Hadoop has an option parsing framework that employs only parsing generic options |
C. | archive command creates a hadoop archive |
D. | All of the mentioned |
Answer» B. Hadoop has an option parsing framework that employs only parsing generic options | |
179. |
__________ Manager’s Service feature monitors dozens of service health and performance metrics about the services and role instances running on your cluster. |
A. | Microsoft |
B. | Cloudera |
C. | Amazon |
D. | None of the mentioned |
Answer» C. Amazon | |
180. |
Which of the following is a common reason to restart hadoop process ? |
A. | Upgrade Hadoop |
B. | React to incidents |
C. | Remove worker nodes |
D. | All of the mentioned |
Answer» E. | |
181. |
__________ mode is a Namenode state in which it does not accept changes to the name space. |
A. | Recover |
B. | Safe |
C. | Rollback |
D. | None of the mentioned |
Answer» D. None of the mentioned | |
182. |
_________ command is used to copy file or directories recursively. |
A. | dtcp |
B. | distcp |
C. | dcp |
D. | distc |
Answer» C. dcp | |
183. |
Point out the wrong statement : |
A. | classNAME displays the class name needed to get the Hadoop jar |
B. | Balancer Runs a cluster balancing utility |
C. | An administrator can simply press Ctrl-C to stop the rebalancing process |
D. | None of the mentioned |
Answer» B. Balancer Runs a cluster balancing utility | |
184. |
The updated queue configuration should be a valid one i.e. queue-capacity at each level should be equal to : |
A. | 50% |
B. | 75% |
C. | 100% |
D. | 0% |
Answer» D. 0% | |
185. |
Apache Hadoop YARN stands for : |
A. | Yet Another Reserve Negotiator |
B. | Yet Another Resource Network |
C. | Yet Another Resource Negotiator |
D. | All of the mentioned |
Answer» D. All of the mentioned | |
186. |
Although the Hadoop framework is implemented in Java , MapReduce applications need not be written in : |
A. | Java |
B. | C |
C. | C# |
D. | None of the mentioned |
Answer» B. C | |
187. |
The __________ provides a proxy between the web applications exported by an application and an end user. |
A. | ProxyServer |
B. | WebAppProxy |
C. | WebProxy |
D. | None of the mentioned |
Answer» C. WebProxy | |
188. |
In order to turn on RPC authentication in hadoop, set the value of hadoop.security.authentication property to : |
A. | zero |
B. | kerberos |
C. | false |
D. | None of the mentioned |
Answer» C. false | |
189. |
__________ get events indicating completion (success/failure) of component tasks. |
A. | getJobName() |
B. | getJobState() |
C. | getPriority() |
D. | getTaskCompletionEvents(int startFrom) |
Answer» E. | |
190. |
Which of the following method is used to get user-specified job name ? |
A. | getJobName() |
B. | getJobState() |
C. | getPriority() |
D. | All of the mentioned |
Answer» B. getJobState() | |
191. |
Which of the following is only for storage with limited compute ? |
A. | Hot |
B. | Cold |
C. | Warm |
D. | All_SSD |
Answer» C. Warm | |
192. |
Which of the following storage policy is used for both storage and compute ? |
A. | Hot |
B. | Cold |
C. | Warm |
D. | All_SSD |
Answer» B. Cold | |
193. |
_________ method clears all keys from the configuration. |
A. | clear |
B. | addResource |
C. | getClass |
D. | None of the mentioned |
Answer» B. addResource | |
194. |
______________ is another implementation of the MapRunnable interface that runs mappers concurrently in a configurable number of threads. |
A. | MultithreadedRunner |
B. | MultithreadedMap |
C. | MultithreadedMapRunner |
D. | SinglethreadedMapRunner |
Answer» D. SinglethreadedMapRunner | |
195. |
An ___________ is responsible for creating the input splits, and dividing them into records. |
A. | TextOutputFormat |
B. | TextInputFormat |
C. | OutputInputFormat |
D. | InputFormat |
Answer» E. | |
196. |
__________ class allows you to specify the InputFormat and Mapper to use on a per-path basis. |
A. | MultipleOutputs |
B. | MultipleInputs |
C. | SingleInputs |
D. | None of the mentioned |
Answer» C. SingleInputs | |
197. |
__________ is a variant of SequenceFileInputFormat that converts the sequence file’s keys and values to Text objects |
A. | SequenceFile |
B. | SequenceFileAsTextInputFormat |
C. | SequenceAsTextInputFormat |
D. | All of the mentioned |
Answer» C. SequenceAsTextInputFormat | |
198. |
______________ class allows the Map/Reduce framework to partition the map outputs based on certain key fields, not the whole keys. |
A. | KeyFieldPartitioner |
B. | KeyFieldBasedPartitioner |
C. | KeyFieldBased |
D. | None of the mentioned |
Answer» C. KeyFieldBased | |
199. |
The ________ option allows you to copy jars locally to the current working directory of tasks and automatically unjar the files. |
A. | archives |
B. | files |
C. | task |
D. | None of the mentioned |
Answer» B. files | |
200. |
Which is the most popular NoSQL database for scalable big data store with Hadoop ? |
A. | Hbase |
B. | MongoDB |
C. | Cassandra |
D. | None of the mentioned |
Answer» B. MongoDB | |