From the course: Kafka Essentials: Quick Start for Building Effective Data Pipelines by Pearson
Unlock this course with a free trial
Join today to access over 25,200 courses taught by industry experts.
Installing Zookeeper and Kafka - Kafka Tutorial
From the course: Kafka Essentials: Quick Start for Building Effective Data Pipelines by Pearson
Installing Zookeeper and Kafka
In prior lessons, I mentioned the Zookeeper package that was needed for Kafka, and I also mentioned it was used to keep global information. I'd like to define Zookeeper a little better. Zookeeper is an independent tool that is used to keep track of cluster-wide information. Think of it like a global directory. It wasn't so important in our single LHM version because we had one server, basically, so it wasn't all that important to talk about Kafka at that point. An application can connect to any ZooKeeper server and get information that has been written or registered by a local or remote application. This means all servers in the cluster can write things to ZooKeeper and then it becomes accessible to every other server that has access to Zookeeper. It is implemented as a quorum, a minimum number of nodes to be up and running since the quorum can vote to agree on actual state. And it's usually an odd number. So when Zookeeper is communicating, it tries to keep current all its…