Kafka is a publishing and subscription-based messaging system. It is often referred to as a "distributed commit log" or "distributed streaming platform". File system or database commit logs are used to provide a persistent record of all transactions, and the state of the system can be reconstructed by rebuilding these logs. Similarly, Kafka's data is persisted and stored in a certain order and can be read on demand. 1. Kafka topology 2. Characteristics of Kafka It also provides high throughput for distribution and subscription. It is understood that Kafka can produce about 250,000 messages per second (50MB) and process 550,000 messages per second (110MB). The number of messages mentioned here may not be particularly accurate because the size of the messages may not be consistent; Persistence operations can be performed to persist messages to disk and store them in the form of logs, so they can be used for batch consumption, such as ETL, and real-time applications. Data loss can be prevented by persisting data to hard disk and replication. The state of message processing is maintained on the consumer side, not on the server side, and can be automatically balanced in the event of a failure. 3. Kafka's core concepts Glossary 3.1 The concept of Producers Message and data generators. The process of publishing messages to a Kafka topic is called producers. 3.2 The concept of broker: Broker does not have a replica mechanism. Once a broker goes down, the messages of that broker will be unavailable. 3.3 Message Composition Message: It is the basic unit of communication. Each producer can publish a message to a topic. 3.4 The concept of consumers Message and data consumers, the process of subscribing to a topic and processing the messages it publishes is called consumers. In Kafka, we can think of a group as a "subscriber". Each partition in a topic will only be consumed by a consumer in a "subscriber", but a consumer can consume messages from multiple partitions. Note: Kafka's design principle determines that for a topic, the same group cannot have more consumers than the number of partitions consuming at the same time, otherwise it will mean that some consumers cannot get the message |
<<: How to smoothly go online after MySQL table sharding?
>>: Experience the Serverless application programming model in cloud native scenarios
Recently, with the "Xinzhou Public Trading P...
Hello everyone, I am Brother Ming. I sorted out s...
OpLink is a foreign hosting company founded in 19...
Tencent Cloud launched a Spring Festival shopping...
The Internet is facing a technological split, and...
LOCVPS has started the 10th anniversary event war...
[51CTO.com original article] Bitcoin, as a "...
LOCVPS (Global Cloud) is a Chinese VPS service pr...
[[429757]] Will edge computing and 5G drive a pos...
27 years ago, anyone who mentioned the word "...
File descriptor limits System-level limit: The op...
Adhering to the service-first concept, Senhua Yit...
During the Global Mobile Broadband Forum (MBBF202...
The rollout of 5G networks has been alarmingly sl...
According to the Ethernet Switch and Router Quart...