Online CCA-500 Practice TestMore Cloudera Products >

Free Cloudera CCA-500 Exam Dumps Questions

Cloudera CCA-500: Cloudera Certified Administrator for Apache Hadoop (CCAH)

- Get instant access to CCA-500 practice exam questions

- Get ready to pass the Cloudera Certified Administrator for Apache Hadoop (CCAH) exam right now using our Cloudera CCA-500 exam package, which includes Cloudera CCA-500 practice test plus an Cloudera CCA-500 Exam Simulator.

- The best online CCA-500 exam study material and preparation tool is here.

4.5 
(4110 ratings)

Question 1

You have a cluster running with a FIFO scheduler enabled. You submit a large job A to the cluster, which you expect to run for one hour. Then, you submit job B to the cluster, which you expect to run a couple of minutes only.
You submit both jobs with the same priority.
Which two best describes how FIFO Scheduler arbitrates the cluster resources for job and its tasks?(Choose two)

Correct Answer:AD

Question 2

Your company stores user profile records in an OLTP databases. You want to join these records with web server logs you have already ingested into the Hadoop file system. What is the best way to obtain and ingest these user records?

Correct Answer:C

Question 3

Which three basic configuration parameters must you set to migrate your cluster from MapReduce 1 (MRv1) to MapReduce V2 (MRv2)?(Choose three)

Correct Answer:AEF

Question 4

You need to analyze 60,000,000 images stored in JPEG format, each of which is approximately 25 KB. Because you Hadoop cluster isn’t optimized for storing and processing many small files, you decide to do the following actions:
1. Group the individual images into a set of larger files
2. Use the set of larger files as input for a MapReduce job that processes them directly with python using Hadoop streaming.
Which data serialization system gives the flexibility to do this?

Correct Answer:E
Sequence files are block-compressed and provide direct serialization and deserialization of several arbitrary data types (not just text). Sequence files can be generated as the output of other MapReduce tasks and are an efficient intermediate representation for data that is passing from one MapReduce job to anther.

Question 5

You decide to create a cluster which runs HDFS in High Availability mode with automatic failover, using Quorum Storage. What is the purpose of ZooKeeper in such a configuration?

Correct Answer:A
Reference: Reference:http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/latest/PDF/CDH4-High-Availability-Guide.pdf(page 15)

Question 6

Identify two features/issues that YARN is designated to address:(Choose two)

Correct Answer:DE
Reference:http://www.revelytix.com/?q=content/hadoop-ecosystem(YARN, first para)

START CCA-500 EXAM