You want to sink data from a Kafka topic to S3 using Kafka Connect. There are 10 brokers in the cluster, the topic has 2 partitions with replication factor of 3. How many tasks will you configure for the S3 connector?
Correct Answer:D
You cannot have more sink tasks (= consumers) than the number of partitions, so 2.
What's a Kafka partition made of?
Correct Answer:C
Kafka partitions are made of segments (usually each segment is 1GB), and each segment has two corresponding indexes (offset index and time index)
Which actions will trigger partition rebalance for a consumer group? (select three)
Correct Answer:ACD
Rebalance occurs when a new consumer is added, removed or consumer dies or paritions increased.
Consumer failed to process record # 10 and succeeded in processing record # 11. Select the course of action that you should choose to guarantee at least once processing
Correct Answer:C
Here, you shouldn't commit offsets 11 or 10 as it would indicate that the message #10 has been processed successfully.
To prevent network-induced duplicates when producing to Kafka, I should use
Correct Answer:B
Producer idempotence helps prevent the network introduced duplicates. More details herehttps://cwiki.apache.org/confluence/display/KAFKA/Idempotent+Producer
What data format isn't natively available with the Confluent REST Proxy?
Correct Answer:C
Protocol buffers isn't a natively supported type for the Confluent REST Proxy, but you may use the binary format instead