A few days ago, a friend of mine went to an interview. He came back and told me that the interviewer asked him how the IP address was stored in the database. He didn’t think much about it at the time and directly answered that it was stored in a string (thinking: Such a simple question, I’m afraid he is looking down on me) The previous paragraph is just for reference. After all, an IP address is originally a string, and it is understandable to store it in the database as a character string. However, I am a person who likes to think from the perspective of others. From the perspective of the interviewer, do you think I would ask such a low-level question? Then of course the answer is no. Therefore, what the interviewer wants to know is whether you have a deep thought about this question, and from this, to a certain extent judge whether you are just a simple "brick-moving" coder in your daily development, or a coder with soul.
Preface Regarding this question, first of all, let me state that it is completely fine to store IP addresses in the database as strings. Then you may have a question? If there is no problem, then why are you comparing it here? Although this is a contradictory topic, we have other storage methods besides storing strings. For example, the more common method is to store IP addresses as int data. Although this storage method is not complicated to implement, the fact that you can think of this method also shows that you are a good thinker and have a good grasp of the underlying data foundation. Because an int type data occupies 4 bytes, each byte is 8 bits, and its range is 0~(2^8-1), and the IPv4 address can be divided into 4 segments, each segment ranges from 0 to 255, which is just enough to be stored, so by slightly converting it, the IP address is cleverly stored in the database with the smallest space (unless otherwise specified in the following description, it refers to the IPv4 address). You may think that this small change does not matter, but when the amount of data increases, the difference between 15 bytes and 4 bytes will surprise you. So when designing a database, use the appropriate field type, just enough, and save as much as possible. As in Section 4.1.7 of High Performance MySQL 3rd Edition, the author recommends that when storing IPv4 addresses, 32-bit unsigned integers (UNSIGNED INT) should be used to store IP addresses instead of using strings. Compared with string storage, using unsigned integers for storage has the following advantages:
Usually, when saving an IPv4 address, an IPv4 address requires at least 7 characters and at most 15 characters, so VARCHAR(15) is sufficient. When MySQL saves a variable-length string, it also requires an extra byte to save the length of the string. However, if an unsigned integer is used for storage, only 4 bytes are required. In addition, 4 fields can be used to store each part of the IPv4 address separately, but usually this will not be very efficient in terms of storage space and query efficiency (although some scenarios are suitable for this storage method). However, using unsigned integers for storage also has the disadvantages of being difficult to read and requiring manual conversion. Tool class to implement conversion To store the IP address as an int type in the database, one method is to use the shift operation and & calculation in the Java code to get the corresponding value:
In fact, this is a kind of binary thinking, and it is also a number system widely used in computing technology. Although it is not used much in daily life, once you master it, it will help us strengthen our understanding of machine language and improve our coding level, especially when facing scenarios with tight resources (memory), it will help us analyze and optimize problems. Database function implementation conversion Another way is to use the database's built-in functions INET_ATON and INET_NTOA for conversion:
If it is an IPv6 address, use the functions INET6_ATON and INET6_NTOA to convert it:
Then the database is defined as varbinary type and 128 bits of space is allocated (because IPv6 uses 128 bits, 16 bytes); or defined as char type and 32 bits of space is allocated. If you use the database function, you only need to do a slight conversion when entering the IP address into the database, which is convenient and quick; and at this point, don’t you think that converting the IP address into digital storage is a good choice? After all, the database has taken this into consideration for us, and it also indirectly proves that this is indeed a feasible and good storage method. |
<<: How does 5G achieve beamforming?
>>: How can edge computing win in the context of 5G "new infrastructure"?
[51CTO.com original article] Today I want to shar...
1. Background At 10:04 on July 2, 2024, the publi...
New 5G networks are increasing connectivity betwe...
According to the latest report data provided to G...
DigitalVirt recently offered a 50% discount coupo...
The average lifespan of an American car is about ...
BudgetVM is a foreign hosting company that has be...
Hostodo is a foreign VPS hosting company founded ...
Since 2018, some Western countries, led by the Un...
A student fan sent a question: From the title, yo...
[[383115]] According to Gartner's latest fore...
During the Dragon Boat Festival holiday, it is ne...
Recently, the Ministry of Industry and Informatio...
In the first month of 2024, Sharktech sent a new ...
1. HTTP1.1 and HTTP2 1. HTTP1.1 flaws High Latenc...