Network security programming: C language reverse loop structure analysis

Network security programming: C language reverse loop structure analysis

[[392807]]

The loop structures of C language include for loop, while loop, do loop and goto loop. This article introduces the first three loop methods.

1. for loop structure

The for loop can also be called a step loop. Its characteristic is that it is often used to define the scope of the loop. Let's look at a simple C language code, as follows:

  1. #include < stdio.h >    
  2. int main()  
  3. {  
  4. int nNum = 0 , nSum = 0 ;  
  5. for ( nNum = 1 ; nNum < = 100; nNum ++ )  
  6. {  
  7. nSum += nNum;  
  8. }  
  9. printf(" nSum = %d \r\n", nSum);
  10. return 0;  
  11. }

This is a typical program for finding the cumulative sum of 1 to 100. Through this program, we can understand the disassembly code of the for loop structure.

  1. .text:00401028 mov [ebp+nNum], 0  
  2. .text:0040102F mov [ebp+nSum], 0  
  3. .text:00401036 mov [ebp+nNum], 1  
  4. .text:0040103D jmp short LOC_CMP  
  5. .text:0040103F ; ---------------------------------------------------------------  
  6. .text:0040103F  
  7. .text:0040103F LOC_STEP: ; CODE XREF: _main+47j  
  8. .text:0040103F mov eax, [ebp+nNum]  
  9. .text:00401042 add eax, 1  
  10. .text:00401045 mov [ebp+nNum], eax  
  11. .text:00401048
  12. .text:00401048 LOC_CMP: ; CODE XREF: _main+2Dj  
  13. .text:00401048 cmp [ebp+nNum], 64h  
  14. .text:0040104C jg short LOC_ENDFOR  
  15. .text:0040104E mov ecx, [ebp+nSum]  
  16. .text:00401051 add ecx, [ebp+nNum]  
  17. .text:00401054 mov [ebp+nSum], ecx  
  18. .text:00401057 jmp short LOC_STEP  
  19. .text:00401059 ; ---------------------------------------------------------------  
  20. .text:00401059  
  21. .text:00401059 LOC_ENDFOR: ; CODE XREF: _main+3Cj  
  22. .text:00401059 mov edx, [ebp+nSum]  
  23. .text:0040105C push edx  
  24. .text:0040105D push offset Format; " nSum = %d \r\n"  
  25. .text:00401062 call _printf  
  26. .text:00401067 add esp, 8  
  27. .text:0040106A xor eax, eax

This time, the disassembled code has modified the variables and labels, which looks more intuitive. From the modified labels, the for structure can be divided into three parts: the part above LOC_STEP is the initialization part, the part below LOC_STEP is the part that modifies the loop variable, and the part below LOC_CMP and above LOC_ENDFOR is the part that compares the loop condition and the loop body.

The disassembled structure of the for loop is as follows:

  1. ; Initialize loop variables  
  2. jmp LOC_CMP  
  3. LOC_STEP:  
  4. ; Modify loop variable  
  5. LOC_CMP:  
  6. ; Loop variable determination  
  7. jxx LOC_ENDFOR  
  8. ; Loop body  
  9. jmp LOC_STEP  
  10. LOC_ENDOF:

Let’s use IDA to look at the generated process structure diagram, as shown in Figure 1.

Figure 1 Flowchart of the for structure

2. do…while loop structure

The body of the do loop is always executed once, which is the difference between the do loop and the while loop. Here is the code for adding 1 to 100. Let's take a look at its disassembly structure. First, let's look at the C language code, as follows:

  1. #include < stdio.h >    
  2. int main()  
  3. {  
  4. int nNum = 1 , nSum = 0 ;  
  5. do  
  6. {  
  7. nSum += nNum;  
  8. nNum++;  
  9. } while ( nNum < = 100 );  
  10. printf(" nSum = %d \r\n", nSum);  
  11. return 0;  
  12. }

The structure of the do loop is much simpler than that of the for loop, and the disassembled code is much less. Let's first look at the flowchart generated by IDA, as shown in Figure 2.

Figure 2 do loop flow chart

The disassembled code is as follows:

  1. .text:00401028 mov [ebp+nNum], 1  
  2. .text:0040102F mov [ebp+nSum], 0  
  3. .text:00401036  
  4. .text:00401036 LOC_DO: ; CODE XREF: _main+3Cj  
  5. .text:00401036 mov eax, [ebp+nSum]
  6. .text:00401039 add eax, [ebp+nNum]  
  7. .text:0040103C mov [ebp+nSum], eax  
  8. .text:0040103F mov ecx, [ebp+nNum]  
  9. .text:00401042 add ecx, 1  
  10. .text:00401045 mov [ebp+nNum], ecx  
  11. .text:00401048 cmp [ebp+nNum], 64h  
  12. .text:0040104C jle short LOC_DO  
  13. .text:0040104E mov edx, [ebp+nSum]  
  14. .text:00401051 push edx  
  15. .text:00401052 push offset Format; " nSum = %d \r\n"  
  16. .text:00401057 call _printf  
  17. .text:0040105C add esp, 8  
  18. .text:0040105F xor eax, eax

The body of the do loop is between LOC_DO and the jle of 0040104C. Its structure is as follows:

  1. ; Initialize loop variables  
  2. C_DO:  
  3. ; Execute loop body  
  4. ; Modify loop variable  
  5. ; Comparison of loop variables  
  6. Jxx LOC_DO

3. While loop structure

The difference between a while loop and a do loop is that a conditional check must be performed before entering the loop body. The loop body may not be executed once because the loop condition is not met.

  1. #include < stdio.h >    
  2. int main()  
  3. {  
  4. int nNum = 1 , nSum = 0 ;  
  5. while ( nNum < = 100 )  
  6. {  
  7. nSum += nNum;  
  8. nNum++;  
  9. }  
  10. printf(" nSum = %d \r\n", nSum);  
  11. return 0;
  12. }

Let's take a look at its disassembled code. The while loop has one more condition than the do loop, so there is one more branch. The disassembled code is as follows:

  1. .text:00401028 mov [ebp+nNum], 1  
  2. .text:0040102F mov [ebp+nSum], 0  
  3. .text:00401036  
  4. .text:00401036 LOC_WHILE: ; CODE XREF: _main+3Ej  
  5. .text:00401036 cmp [ebp+nNum], 64h  
  6. .text:0040103A jg short LOC_WHILEEND  
  7. .text:0040103C mov eax, [ebp+nSum]  
  8. .text:0040103F add eax, [ebp+nNum]  
  9. .text:00401042 mov [ebp+nSum], eax  
  10. .text:00401045 mov ecx, [ebp+nNum]  
  11. .text:00401048 add ecx, 1  
  12. .text:0040104B mov [ebp+nNum], ecx  
  13. .text:0040104E jmp short LOC_WHILE  
  14. .text:00401050 ; ---------------------------------------------------------------  
  15. .text:00401050  
  16. .text:00401050 LOC_WHILEEND: ; CODE XREF: _main+2Aj  
  17. .text:00401050 mov edx, [ebp+nSum]  
  18. .text:00401053 push edx  
  19. .text:00401054 push offset Format; " nSum = %d \r\n"  
  20. .text:00401059 call _printf  
  21. .text:0040105E add esp, 8  
  22. .text:00401061 xor eax, eax

The main part of the while loop is between LOC_WHILE and LOC_WHILEEND. The two sentences below LOC_WHILE are cmp and jxx instructions, and the jmp instruction is above LOC_WHILEEND. These two parts are in a fixed format, and their structure is organized as follows:

  1. ; Initialize loop variables, etc.  
  2. LOC_WHILE:  
  3. cmp xxx, xxx  
  4. jxx LOC_WHILEEND  
  5. ; Loop body  
  6. jmp LOC_WHILE  
  7. LOC_WHILEEND:

Let’s take a look at the flowchart generated by IDA, as shown in Figure 3.

Figure 3 while loop flow chart

Among the three types of loops, for loop, do loop and while loop, the do loop is obviously more efficient, and the while loop is relatively more efficient than the for loop.

<<:  In-depth: Two major challenges need to be solved to achieve mature 5G applications in the power industry

>>:  How wireless technology is changing the world

Recommend

What happens when SDN meets 5G?

SDN is a profound change to traditional IP networ...

How to replace the Query field in the URL?

[[420519]] When we write a crawler, we may need t...

Wi-Fi Alliance launches next-generation WPA3 security certification program

[51CTO.com original article] On June 26, the Wi-F...

CloudCone: $1.99/month KVM-768MB/15GB/3TB/Los Angeles MC Data Center

CloudCone's 2021 flash sale has started again...