Fatal question: How many HTTP requests can be sent through a TCP connection?

There was once such an interview question: What happens from the time the URL is entered into the browser to the time the page is displayed?

I believe that most of the students who have prepared can answer this question, but if you continue to ask: If the received HTML contains dozens of image tags, how are these images downloaded, in what order, how many connections are established, and what protocol is used?

[[266708]]

To understand this problem, we need to solve the following five problems:

After a modern browser establishes a TCP connection with a server, will it disconnect after an HTTP request is completed? Under what circumstances will it disconnect?
How many HTTP requests can a TCP connection correspond to?
Can HTTP requests be sent together in one TCP connection (for example, three requests are sent together and three responses are received together)?
Why sometimes refreshing a page does not require re-establishing an SSL connection?
Is there any limit on the number of TCP connections that a browser can establish to the same host?

*** Questions

After a modern browser establishes a TCP connection with a server, will it disconnect after an HTTP request is completed? Under what circumstances will it disconnect?

In HTTP/1.0, a server will disconnect the TCP connection after sending an HTTP response. However, each request will re-establish and disconnect the TCP connection, which is too costly. Therefore, although it is not set in the standard, some servers support the Connection: keep-alive Header. This means that after completing the HTTP request, do not disconnect the TCP connection used by the HTTP request. The advantage of this is that the connection can be reused, and there is no need to re-establish the TCP connection when sending HTTP requests later. If the connection is maintained, the overhead of SSL can also be avoided. The two pictures are the time statistics of my two visits to https://www.github.com in a short period of time:

For the first access, there is initial connection and SSL overhead.

The initial connection and SSL overhead disappears, indicating that the same TCP connection is being used.

Persistent connection: Since maintaining a TCP connection has so many benefits, HTTP/1.1 includes the Connection header in the standard and enables persistent connections by default. Unless the request specifies Connection: close, the TCP connection between the browser and the server will be maintained for a period of time and will not be disconnected when a request is completed.

So the answer to the first question is: by default, an established TCP connection will not be disconnected. The connection will only be closed after the request is completed if Connection: close is declared in the request header.

Second question

How many HTTP requests can a TCP connection correspond to?

After understanding the first question, in fact, this question already has an answer. If the connection is maintained, one TCP connection can send multiple HTTP requests.

The third question

Can HTTP requests be sent together in one TCP connection (for example, three requests are sent together and three responses are received together)?

There is a problem with HTTP/1.1. A single TCP connection can only process one request at a time. This means that the lifecycles of two requests cannot overlap, and the start and end time of any two HTTP requests cannot overlap in the same TCP connection.

Although Pipelining is specified in the HTTP/1.1 specification to try to solve this problem, this feature is turned off by default in browsers.

Let's first take a look at what Pipelining is. RFC 2616 stipulates:

A client that supports persistent connections MAY "pipeline" its requests (ie, send multiple requests without waiting for each response). A server MUST send its responses to those requests in the same order that the requests were received.

As for why the standard is set this way, we can roughly speculate one reason: HTTP/1.1 is a text protocol, and the returned content cannot distinguish which request it corresponds to, so the order must be consistent. For example, if you send two requests to the server, GET/query?q=A and GET/query?q=B, and the server returns two results, the browser has no way to determine which request the response corresponds to based on the response results.

Pipelining is a good idea, but there are many problems in practice:
Some proxy servers do not handle HTTP Pipelining correctly.
Correct pipelining implementation is complex.

Head-of-line Blocking: After a TCP connection is established, suppose the client sends several requests to the server in succession. According to the standard, the server should return the results in the order in which the requests were received. Suppose the server spends a lot of time processing the first request, then all subsequent requests need to wait for the first request to be completed before responding.

Therefore, modern browsers do not enable HTTP Pipelining by default.

However, HTTP2 provides the Multiplexing feature, which can complete multiple HTTP requests simultaneously in one TCP connection. As for how Multiplexing is implemented, that is another question. Let's take a look at the effect of using HTTP2.

Green is the waiting time from initiating the request to the request returning, and blue is the download time of the response. You can see that they are all completed in parallel on the same Connection.

So this question has an answer: In HTTP/1.1, there is Pipelining technology that can complete the sending of multiple requests at the same time, but since it is disabled by default in browsers, it can be considered infeasible. In HTTP2, due to the Multiplexing feature, multiple HTTP requests can be performed in parallel in the same TCP connection.

So how do browsers improve page loading efficiency in the HTTP/1.1 era? There are two main reasons:

Maintain the established TCP connection with the server and process multiple requests sequentially on the same connection.
Establish multiple TCP connections with the server.

The fourth question

Why sometimes refreshing a page does not require re-establishing an SSL connection?

The answer has been given in the discussion of the first question. Sometimes the TCP connection will be maintained for a period of time by the browser and the server. TCP does not need to be re-established, and SSL will naturally use the previous one.

The fifth question

Is there any limit on the number of TCP connections that a browser can establish to the same host?

Assuming we are still in the HTTP/1.1 era, when there was no multi-channel transmission, what should the browser do when it gets a web page with dozens of pictures? It certainly cannot just open a TCP connection to download them sequentially, as that would make the user wait uncomfortably. However, if a TCP connection is opened for each picture to send an HTTP request, the computer or server may not be able to handle it. If there are 1,000 pictures, you cannot open 1,000 TCP connections, and your computer may not agree even if NAT is used.

So the answer is: Yes. Chrome allows up to six TCP connections to the same host. There are some differences between different browsers.

https://developers.google.com/web/tools/chrome-devtools/network/issues#queued-or-stalled-requestsdevelopers.google.com

So back to the original question, if the received HTML contains dozens of image tags, how are these images downloaded, in what order, how many connections are established, and what protocol is used?

If all images are HTTPS connections and under the same domain name, then after the SSL handshake, the browser will negotiate with the server whether HTTP2 can be used. If it can, it will use the Multiplexing function to multiplex the connection. However, it is not necessarily the case that all resources on this domain name will be obtained using a TCP connection, but it is certain that Multiplexing will most likely be used.

What if you find that you cannot use HTTP2? Or you cannot use HTTPS (in reality, HTTP2 is implemented on HTTPS, so you can only use HTTP/1.1). Then the browser will establish multiple TCP connections on a HOST. The maximum number of connections depends on the browser settings. These connections will be used by the browser to send new requests when they are idle. What if all connections are sending requests? Then other requests can only wait.

<<: Ten ways for Vue.js parent-child component communication

>>: 5G is coming: 3 ways it will benefit your business

Experience sharing: key points, difficulties and treatment measures of integrated wiring construction

Cisco released the IT Operations Readiness Index report, and Chinese enterprises' IT operations provide more value to their businesses

Blog

ShockHosting: $4.99/month-2G/30GB/2TB/13 data centers including Los Angeles, Seattle, Japan, etc.

Fatal question: How many HTTP requests can be sent through a TCP connection?

Experience sharing: key points, difficulties and treatment measures of integrated wiring construction

WiFi 7 is coming, but is it really reliable?

Japanese telecom operator NTT DoCoMo suffers massive communications outage, affecting 2 million users

Cisco released the IT Operations Readiness Index report, and Chinese enterprises' IT operations provide more value to their businesses

ShockHosting: $4.99/month-2G/30GB/2TB/13 data centers including Los Angeles, Seattle, Japan, etc.

What should you know about 5G technology? What will happen in the future?

United States: Suspend 5G deployment!

What does it feel like to immerse yourself in 50 industry scenes in a 7,000-square-meter exhibition hall?

7 key SD-WAN trends to watch in 2021

New infrastructure is included in the government work report, and "5G+" becomes the future focus

Recommend

Huawei focuses on intelligent scenarios in five major industries and releases new products in the F5G-A series

Protecting corporate intranet data security in just seven steps

IBM releases 'quantum-safe' tool with end-to-end encryption to protect against quantum computing attacks

Why can't I access my home computer from work?

TCP state transition and production problem practice

It’s time to promote 5G applications

What benefits will 5G technology bring to smart fire protection construction?

With so many mobile payment options available, which one will dominate the market?

Gartner: China's IT spending is expected to grow 7.7% in 2021

GSA: 140 operators in 59 countries and regions around the world have launched commercial 5G networks

Interviewer: How do you understand the TCP/IP protocol?

China Mobile's 5G package customers increased by 15.593 million in March, reaching a total of 188.761 million

What is edge computing and how will it impact businesses?

Farewell to the NYSE! The NYSE maintains the delisting decision of the three major operators