Explore topic-wise MCQs in Apache Hadoop.

This section includes 51 Mcqs, each offering curated multiple-choice questions to sharpen your Apache Hadoop knowledge and support exam preparation. Choose a topic below to get started.

1.

Internal authentication stores usernames and bcrypt-hashed passwords in the ____________ table.

A. system_auth.creds
B. system_auth.credentials
C. system.credentials
D. sys_auth.credentials
Answer» C. system.credentials
2.

Cassandra creates a ___________ for each table, which allows you to symlink a table to a chosen physical drive or data volume.

A. directory
B. subdirectory
C. domain
D. path
Answer» C. domain
3.

When ___________ contents exceed a configurable threshold, the memtable data, which includes indexes, is put in a queue to be flushed to disk

A. subtable
B. memtable
C. intable
D. memorytable
Answer» C. intable
4.

Data in the commit log is purged after its corresponding data in the memtable is flushed to an _________ .

A. SSHables
B. SSTable
C. Memtables
D. None of the mentioned
Answer» C. Memtables
5.

A _________ grants initial permissions, and subsequently a user may or may not be given the permission to grant/revoke permissions.

A. keyspace
B. superuser
C. sudouser
D. none of the mentioned
Answer» C. sudouser
6.

__________ is one of many possible IAuthorizer implementations, and the one that stores permissions in the system_auth.permissions table to support all authorization-related CQL statements.

A. CassandraAuth
B. CassandraAuthorizer
C. CassAuthorizer
D. All of the mentioned
Answer» C. CassAuthorizer
7.

For each SSTable, Cassandra creates _________ index .

A. memory
B. partition
C. in memory
D. all of the mentioned
Answer» C. in memory
8.

Zookeeper essentially mirrors the _______ functionality exposed in the Linux kernel.

A. iread
B. inotify
C. iwrite
D. icount
Answer» C. iwrite
9.

ZooKeeper is used for configuration, leader election in Cloud edition of :

A. Solr
B. Solur
C. Solar101
D. All of the mentioned
Answer» B. Solur
10.

_________ serves distributed Lucene indexes in a grid environment.

A. Katta
B. Helprace
C. Neo4j
D. 101tec
Answer» B. Helprace
11.

The Email & Apps team of ___________ uses ZooKeeper to coordinate sharding and responsibility changes in a distributed email client

A. Katta
B. Helprace
C. Rackspace
D. None of the mentioned
Answer» D. None of the mentioned
12.

You need to have _________ installed before running ZooKeeper.

A. Java
B. C
C. C++
D. SQLGUI
Answer» B. C
13.

ZooKeeper allows distributed processes to coordinate with each other through registers, known as :

A. znodes
B. hnodes
C. vnodes
D. rnodes
Answer» B. hnodes
14.

The ZooKeeper Data Directory contains files which are _________ copy of the znodes stored by a particular serving ensemble.

A. transient
B. read only
C. persistent
D. none of the mentioned
Answer» D. none of the mentioned
15.

Helprace is using ZooKeeper on a _______ cluster in conjunction with Hadoop and HBase.

A. 3-node
B. 4-node
C. 5-node
D. 6-node
Answer» B. 4-node
16.

A ___________ server is a machine that keeps a copy of the state of the entire system and persists this information in local log files.

A. Master
B. Region
C. Zookeeper
D. All of the mentioned
Answer» D. All of the mentioned
17.

A number of constants used in the client ZooKeeper API were renamed in order to reduce ________ collision

A. value
B. namespace
C. counter
D. none of the mentioned
Answer» C. counter
18.

The ________ master will register its own address in this znode at startup, making this znode the source of truth for identifying which server is the Master.

A. active
B. passive
C. region
D. all of the mentioned
Answer» B. passive
19.

The ______ znode is used for synchronizing the changes made to the _acl_ table by the grant/revoke commands.

A. zcl
B. acl
C. scl
D. bnl
Answer» C. scl
20.

___________ is used to decommission more than one RegionServer at a time by creating sub-znodes.

A. /hbase/master
B. /hbase/draining
C. /hbase/passive
D. None of the mentioned
Answer» C. /hbase/passive
21.

How many types of special znodes are present in Zookeeper ?

A. 1
B. 2
C. 3
D. All of the mentioned
Answer» C. 3
22.

Zookeeper keep track of the cluster state such as the ______ table location.

A. DOMAIN
B. NODE
C. ROOT
D. All of the mentioned
Answer» D. All of the mentioned
23.

_______ has a design policy of using ZooKeeper only for transient data

A. Hive
B. Imphala
C. Hbase
D. Oozie
Answer» D. Oozie
24.

Sqoop has been tested with Oracle ______ Express Edition.

A. 11.2.0
B. 10.2.0
C. 12.2.0
D. 10.3.0
Answer» C. 12.2.0
25.

Sqoop is an open source tool written at ________

A. Cloudera
B. IBM
C. Microsoft
D. All of the mentioned
Answer» D. All of the mentioned
26.

Sqoop direct mode does not support imports of ______ columns

A. BLOB
B. LONGVARBINARY
C. CLOB
D. All of the mentioned
Answer» E.
27.

The fields parsed by ____________ are backed by an internal buffer.

A. LargeObjectLoader
B. ProcessingException
C. RecordParser
D. None of the Mentioned
Answer» D. None of the Mentioned
28.

Microsoft uses a Sqoop-based connector to help transfer data from _________ databases to Hadoop

A. PostreSQL
B. SQL Server
C. Oracle
D. MySQL
Answer» C. Oracle
29.

__________ provides a Couchbase Server-Hadoop connector by means of Sqoop.

A. MemCache
B. Couchbase
C. Hbase
D. All of the mentioned
Answer» B. Couchbase
30.

Which of the following interface is implemented by Sqoop for recording ?

A. SqoopWrite
B. SqoopRecord
C. SqoopRead
D. None of the mentioned
Answer» C. SqoopRead
31.

Records are terminated by a __________ character.

A. RECORD_DELIMITER
B. FIELD_DELIMITER
C. FIELD_LIMITER
D. None of the mentioned
Answer» B. FIELD_DELIMITER
32.

Sqoop uses _________ to fetch data from RDBMS and stores that on HDFS.

A. Hive
B. Map reduce
C. Imphala
D. BigTOP
Answer» C. Imphala
33.

_________ allows users to specify the target location inside of Hadoop.

A. Imphala
B. Oozie
C. Sqoop
D. Hive
Answer» D. Hive
34.

_________ tool can list all the available database schemas.

A. sqoop-list-tables
B. sqoop-list-databases
C. sqoop-list-schema
D. sqoop-list-columns
Answer» C. sqoop-list-schema
35.

________ does not support the notion of enclosing characters that may include field delimiters in the enclosed string.

A. Imphala
B. Oozie
C. Sqoop
D. Hive
Answer» E.
36.

Data can be imported in maximum ______ file formats.

A. 1
B. 2
C. 3
D. All of the mentioned
Answer» C. 3
37.

The __________ tool imports a set of tables from an RDBMS to HDFS.

A. export-all-tables
B. import-all-tables
C. import-tables
D. none of the mentioned
Answer» D. none of the mentioned
38.

Which of the following argument is not supported by import-all-tables tool ?

A. –class-name
B. –package-name
C. –database-name
D. –table-name
Answer» B. –package-name
39.

Apache Cassandra™ is a massively scalable open source _______ database.

A. SQL
B. NoSQL
C. NewSQL
D. All of the mentioned
Answer» C. NewSQL
40.

________ text is appropriate for most non-binary data types.

A. Character
B. Binary
C. Delimited
D. None of the mentioned
Answer» D. None of the mentioned
41.

If you set the inline LOB limit to ________ all large objects will be placed in external storage.

A. 0
B. 1
C. 2
D. 3
Answer» B. 1
42.

Cassandra marks data to be deleted using :

A. tombstone
B. combstone
C. tenstone
D. none of the mentioned
Answer» B. combstone
43.

You configure sample frequency by changing the ________ property in the table definition.

A. index_time
B. index_interval
C. index_secs
D. none of the mentioned
Answer» C. index_secs
44.

If the table has been configured with the __________ property, the coordinator node for the read request will retry the request with another replica node .

A. rapid_retry
B. speculative_retry
C. speculative_rapid
D. none of the mentioned
Answer» C. speculative_rapid
45.

The compression offset map grows to ____ GB per terabyte compressed.

A. 1-3
B. 10-16
C. 20-22
D. 0-1
Answer» B. 10-16
46.

The type of __________ strategy Cassandra performs on your data is configurable and can significantly affect read performance.

A. compression
B. collection
C. compaction
D. decompression
Answer» D. decompression
47.

Cassandra searches the __________ to determine the approximate location on disk of the index entry.

A. partition record
B. partition summary
C. partition search
D. all of the mentioned
Answer» C. partition search
48.

There are _________ types of read requests that a coordinator can send to a replica

A. two
B. three
C. four
D. all of the mentioned
Answer» C. four
49.

Tombstones exist for a configured time period defined by the _______ value set on the table.

A. gc_grace_minutes
B. gc_grace_time
C. gc_grace_seconds
D. gc_grace_hours
Answer» D. gc_grace_hours
50.

_________ is a Cassandra feature that optimizes the cluster consistency process

A. Hinted handon
B. Hinted handoff
C. Tombstone
D. Hinted tomb
Answer» C. Tombstone