Explore topic-wise MCQs in Apache Hadoop.

This section includes 20 Mcqs, each offering curated multiple-choice questions to sharpen your Apache Hadoop knowledge and support exam preparation. Choose a topic below to get started.

1.

Pig Latin statements are generally organized in one of the following ways :

A. A LOAD statement to read data from the file system
B. A series of “transformation” statements to process the data
C. A DUMP statement to view results or a STORE statement to save the results
D. All of the mentioned
Answer» E.
2.

You can run Pig in batch mode using __________ .

A. Pig shell command
B. Pig scripts
C. Pig options
D. All of the mentioned
Answer» C. Pig options
3.

You can run Pig in interactive mode using the ______ shell.

A. Grunt
B. FS
C. HDFS
D. None of the mentioned
Answer» B. FS
4.

Which of the following function is used to read data in PIG ?

A. WRITE
B. READ
C. LOAD
D. None of the mentioned
Answer» D. None of the mentioned
5.

Pig operates in mainly how many nodes ?

A. Two
B. Three
C. Four
D. Five
Answer» B. Three
6.

Which of the following is the default mode ?

A. Mapreduce
B. Tez
C. Local
D. All of the mentioned
Answer» B. Tez
7.

Which of the following will run pig in local mode ?

A. $ pig -x local …
B. $ pig -x tez_local …
C. $ pig …
D. None of the mentioned
Answer» B. $ pig -x tez_local …
8.

$ pig -x tez_local … will enable ________ mode in Pig.

A. Mapreduce
B. Tez
C. Local
D. None of the mentioned
Answer» E.
9.

____________ method will be called by Pig both in the front end and back end to pass a unique signature to the Loader.

A. relativeToAbsolutePath()
B. setUdfContextSignature()
C. getCacheFiles()
D. getShipFiles()
Answer» C. getCacheFiles()
10.

__________ abstract class has three main methods for loading data and for most use cases it would suffice to extend it.

A. Load
B. LoadFunc
C. FuncLoad
D. None of the mentioned
Answer» C. FuncLoad
11.

The loader should use ______ method to communicate the load information to the underlying InputFormat.

A. relativeToAbsolutePath()
B. setUdfContextSignature()
C. getCacheFiles()
D. setLocation()
Answer» E.
12.

A loader implementation should implement __________ if casts (implicit or explicit) from DataByteArray fields to other types need to be supported.

A. LoadPushDown
B. LoadMetadata
C. LoadCaster
D. All of the mentioned
Answer» D. All of the mentioned
13.

___________ return a list of hdfs files to ship to distributed cache.

A. relativeToAbsolutePath()
B. setUdfContextSignature()
C. getCacheFiles()
D. getShipFiles()
Answer» E.
14.

____________ method enables the RecordReader associated with the InputFormat provided by the LoadFunc is passed to the LoadFunc.

A. getNext()
B. relativeToAbsolutePath()
C. prepareToRead()
D. All of the mentioned
Answer» D. All of the mentioned
15.

__________ method tells LoadFunc which fields are required in the Pig script.

A. pushProjection()
B. relativeToAbsolutePath()
C. prepareToRead()
D. None of the mentioned
Answer» B. relativeToAbsolutePath()
16.

_________ are scanned in the order they are specified on the command line.

A. Command line parameters
B. Parameter files
C. Declare and default preprocessors
D. Both parameter files and command line parameters
Answer» E.
17.

In comparison to SQL, Pig uses :

A. Lazy evaluation
B. ETL
C. Supports pipeline splits
D. All of the mentioned
Answer» E.
18.

Pig Latin is _______ and fits very naturally in the pipeline paradigm while SQL is instead declarative.

A. functional
B. procedural
C. declarative
D. all of the mentioned
Answer» C. declarative
19.

You can specify parameter names and parameter values in one of the ways:

A. As part of a command line.
B. In parameter file, as part of a command line
C. With the declare statement, as part of Pig script
D. All of the mentioned
Answer» E.
20.

Which of the following is an entry in jobconf ?

A. pig.job
B. pig.input.dirs
C. pig.feature
D. none of the mentioned
Answer» C. pig.feature