How to store IP addresses in MySQL efficiently?

How to store IP addresses in MySQL efficiently?

A few days ago, a friend of mine went to an interview. He came back and told me that the interviewer asked him how the IP address was stored in the database. He didn’t think much about it at the time and directly answered that it was stored in a string (thinking: Such a simple question, I’m afraid he is looking down on me)

The previous paragraph is just for reference. After all, an IP address is originally a string, and it is understandable to store it in the database as a character string. However, I am a person who likes to think from the perspective of others. From the perspective of the interviewer, do you think I would ask such a low-level question? Then of course the answer is no. Therefore, what the interviewer wants to know is whether you have a deep thought about this question, and from this, to a certain extent judge whether you are just a simple "brick-moving" coder in your daily development, or a coder with soul.

[[328314]]

Preface

Regarding this question, first of all, let me state that it is completely fine to store IP addresses in the database as strings. Then you may have a question? If there is no problem, then why are you comparing it here?

Although this is a contradictory topic, we have other storage methods besides storing strings. For example, the more common method is to store IP addresses as int data. Although this storage method is not complicated to implement, the fact that you can think of this method also shows that you are a good thinker and have a good grasp of the underlying data foundation.

Because an int type data occupies 4 bytes, each byte is 8 bits, and its range is 0~(2^8-1), and the IPv4 address can be divided into 4 segments, each segment ranges from 0 to 255, which is just enough to be stored, so by slightly converting it, the IP address is cleverly stored in the database with the smallest space (unless otherwise specified in the following description, it refers to the IPv4 address). You may think that this small change does not matter, but when the amount of data increases, the difference between 15 bytes and 4 bytes will surprise you. So when designing a database, use the appropriate field type, just enough, and save as much as possible.

As in Section 4.1.7 of High Performance MySQL 3rd Edition, the author recommends that when storing IPv4 addresses, 32-bit unsigned integers (UNSIGNED INT) should be used to store IP addresses instead of using strings.

Compared with string storage, using unsigned integers for storage has the following advantages:

  • Save space, both data storage space and index storage space
  • It is convenient to use range query (BETWEEN...AND) and more efficient

Usually, when saving an IPv4 address, an IPv4 address requires at least 7 characters and at most 15 characters, so VARCHAR(15) is sufficient. When MySQL saves a variable-length string, it also requires an extra byte to save the length of the string. However, if an unsigned integer is used for storage, only 4 bytes are required. In addition, 4 fields can be used to store each part of the IPv4 address separately, but usually this will not be very efficient in terms of storage space and query efficiency (although some scenarios are suitable for this storage method). However, using unsigned integers for storage also has the disadvantages of being difficult to read and requiring manual conversion.

Tool class to implement conversion

To store the IP address as an int type in the database, one method is to use the shift operation and & calculation in the Java code to get the corresponding value:

  1. package com.java.mmzsit;
  2.  
  3. /**
  4. * @author :mmzsblog
  5. * @description: IPv4 address conversion
  6. * @date :2020/5/27 22:43
  7. */
  8. public class Ipv4Covert {
  9. public static void main(String[] args) {
  10. String ip = "10.108.149.219" ;
  11.  
  12. // step1: decompose the IP string and write the corresponding byte array
  13. byte[] ip1 = ipToBytes (ip);
  14.  
  15. // step2: Shift each byte in the byte array to the left, corresponding to the 4 bytes of the integer variable
  16. int ip2 = bytesToInt (ip1);
  17. System.out.println("integer ip ---- > " + ip2);
  18.  
  19. // step3: right shift the integer variable to restore the IP string
  20. String ip3 = intToIp (ip2);
  21. System.out.println("string ip---- > " + ip3);
  22.  
  23. }
  24.  
  25.      
  26. /**
  27. * Convert IP address to int
  28. * @param ipAddr
  29. * @return int
  30. */
  31. public static byte[] ipToBytesByReg(String ipAddr) {
  32. byte[] ret = new byte[4];
  33. try {
  34. String[] ipArr = ipAddr .split("\\.");
  35. ret[0] = (byte) (Integer.parseInt(ipArr[0]) & 0xFF);
  36. ret[1] = (byte) (Integer.parseInt(ipArr[1]) & 0xFF);
  37. ret[2] = (byte) (Integer.parseInt(ipArr[2]) & 0xFF);
  38. ret[3] = (byte) (Integer.parseInt(ipArr[3]) & 0xFF);
  39. return ret;
  40. } catch (Exception e) {
  41. throw new IllegalArgumentException(ipAddr + " is invalid IP");
  42. }
  43.  
  44. }
  45.  
  46.  
  47.  
  48. /**
  49. * The first step is to decompose the IP address into a bittype array
  50. */
  51. public static byte[] ipToBytes(String ipAddr) {
  52. // Initialize the byte array and define the length as 4
  53. byte[] ret = new byte[4];
  54. try {
  55. String[] ipArr = ipAddr .split("\\.");
  56. // Write the string array into the byte array one by one
  57. ret[0] = (byte) (Integer.parseInt(ipArr[0]));
  58. ret[1] = (byte) (Integer.parseInt(ipArr[1]));
  59. ret[2] = (byte) (Integer.parseInt(ipArr[2]));
  60. ret[3] = (byte) (Integer.parseInt(ipArr[3]));
  61. return ret;
  62. } catch (Exception e) {
  63. throw new IllegalArgumentException("invalid IP : " + ipAddr);
  64. }
  65. }
  66.  
  67. /**
  68. * Convert byte[] - > int based on bitwise operation
  69. * Principle: Each byte is converted into 8-bit binary code, and then shifted left by 8 bits in sequence, corresponding to the 4 bytes of the Int variable
  70. */
  71. public static int bytesToInt(byte[] bytes) {
  72. // Shift first and then force conversion directly while specifying the number of digits
  73. int addr = bytes [3] & 0xFF;
  74. addr |= ((bytes[2] < <   8 ) & 0xFF00);
  75. addr |= ((bytes[1] < <   16 ) & 0xFF0000);
  76. addr |= ((bytes[0] < <   24 ) & 0xFF000000);
  77. return addr;
  78. }
  79.  
  80. /**
  81. * Convert int- > string address
  82. *
  83. * @param ipInt
  84. * @return String
  85. */
  86. public static String intToIp(int ipInt) {
  87. //First force binary conversion, then shift processing
  88. return new StringBuilder()
  89. // Shift right 3 bytes (24 bits) to get the first segment of the IP address, which is byte[0]. In order to prevent the sign bit from being 1, which is a negative number, & 0xFF again at the end
  90. .append(((ipInt & 0xFF000000) > > 24) & 0xFF).append('.')
  91. .append((ipInt & 0xFF0000) > > 16).append('.')
  92. .append((ipInt & 0xFF00) > > 8).append('.')
  93. .append((ipInt & 0xFF))
  94. .toString();
  95. }
  96. }

In fact, this is a kind of binary thinking, and it is also a number system widely used in computing technology. Although it is not used much in daily life, once you master it, it will help us strengthen our understanding of machine language and improve our coding level, especially when facing scenarios with tight resources (memory), it will help us analyze and optimize problems.

Database function implementation conversion

Another way is to use the database's built-in functions INET_ATON and INET_NTOA for conversion:

  1. mysql > SELECT INET_ATON('192.168.0.1');
  2. +--------------------------+
  3. | INET_ATON('192.168.0.1') |
  4. +--------------------------+
  5. | 3232235521 |
  6. +--------------------------+
  7. 1 row in set
  8.  
  9. mysql > SELECT INET_NTOA(3232235521);
  10. +-----------------------+
  11. | INET_NTOA(3232235521) |
  12. +-----------------------+
  13. | 192.168.0.1 |
  14. +-----------------------+
  15. 1 row in set

If it is an IPv6 address, use the functions INET6_ATON and INET6_NTOA to convert it:

  1. mysql > SELECT HEX(INET6_ATON('1030::C9B4:FF12:48AA:1A2B'));
  2.  
  3. +------------------------------------------------+
  4. | HEX(INET6_ATON('1030::C9B4:FF12:48AA:1A2B')) |
  5. +------------------------------------------------+
  6. | 1030000000000000C9B4FF1248AA1A2B |
  7. +------------------------------------------------+
  8. 1 row in set
  9.  
  10. mysql > SELECT INET6_NTOA(UNHEX('1030000000000000C9B4FF1248AA1A2B'));
  11. +-------------------------------------------------------------+
  12. | INET6_NTOA(UNHEX('1030000000000000C9B4FF1248AA1A2B')) |
  13. +-------------------------------------------------------------+
  14. | 1030::c9b4:ff12:48aa:1a2b |
  15. +-------------------------------------------------------------+
  16. 1 row in set

Then the database is defined as varbinary type and 128 bits of space is allocated (because IPv6 uses 128 bits, 16 bytes); or defined as char type and 32 bits of space is allocated.

If you use the database function, you only need to do a slight conversion when entering the IP address into the database, which is convenient and quick; and at this point, don’t you think that converting the IP address into digital storage is a good choice? After all, the database has taken this into consideration for us, and it also indirectly proves that this is indeed a feasible and good storage method.

<<:  How does 5G achieve beamforming?

>>:  How can edge computing win in the context of 5G "new infrastructure"?

Blog    

Recommend

Wu Zhongjie: How to become an excellent network engineer

[51CTO.com original article] Today I want to shar...

7.2 Our computer room is disconnected from the Internet! What should I do?

1. Background At 10:04 on July 2, 2024, the publi...

How 5G will impact data centers and how to prepare

New 5G networks are increasing connectivity betwe...

Omdia: Global 5G subscriptions doubled in the second quarter of 2020

According to the latest report data provided to G...

DigitalVirt: 95 yuan/year-1GB/10GB NVMe/1TB@200Mbps/Hong Kong International Line

DigitalVirt recently offered a 50% discount coupo...

Age and technology determine building control lifespan

The average lifespan of an American car is about ...

Hostodo: $17.99/year KVM-1GB/12GB/4TB/Las Vegas

Hostodo is a foreign VPS hosting company founded ...

Will Huawei's 5G industry see a turnaround in 2021?

Since 2018, some Western countries, led by the Un...

Design and implement a TCP protocol semi-connected port scanner

A student fan sent a question: From the title, yo...

WiFi optimization has tricks to surf the Internet without fighting

During the Dragon Boat Festival holiday, it is ne...

Understanding HTTP/1, HTTP/2, and HTTP/3 in one article

1. HTTP1.1 and HTTP2 1. HTTP1.1 flaws High Latenc...