I was chatting with my boss that day and we mentioned Meteor inadvertently, then talked about WebSocket, and then the following conversation took place. I have to say that if you look at problems in different ways, you will see very different things.
A: Meteor is a very new development framework, and I think it is very cleverly designed. B: How is it clever? A: Its front-end and back-end all use JS, achieving true front-end and back-end unification; a copy of the database opened from the back-end is stored in the front-end browser, which is fast; the WebSocket protocol is used as the data transmission protocol to synchronize the front-end and back-end databases, achieving true real-time synchronization. B: Oh? What is WebSocket? When is it real? Is it still a polling method at the bottom? How is it different from HTTP's persistent connection? A: (starting to feel guilty) It is a new application layer protocol based on TCP. It only requires one connection. Subsequent data does not need to re-establish a connection and can be sent directly. It is based on TCP and has the same status as HTTP (uh, starting to make up stories). The underlying layer is not polling, and the difference with long connections... I'm not sure about this. B: What is the transmission process like? A: First, there is a handshake connection (another nonsense). It seems that the connection can be established based on HTTP (I have used Socket.io before, just making it up). After the connection is established, data can be transmitted, and it also includes mechanisms such as reconnection after disconnection. B: It looks similar to what HTTP long connection does. It seems to be a protocol based on HTTP and Socket. A: Uh... (I'd better go back and read a book) Sometimes we look at things too superficially. We understand the general outline of each thing, but we don't seek to understand it in depth. When we talk about it with friends, few people will ask us about it in depth, which leads to a lot of weak basic knowledge. So I came back and roughly read the RFC documents of HTTP and WebSocket protocols (RFC2616 and RFC6455). I happened to be a little vague about the transmission process of HTTP, so here I summarize the similarities and differences between the two protocols. Protocol Basics If you look at these two protocols carefully, they are actually very simple, but anything you want to do well will slowly become extremely complicated with all kinds of details. Here I will only briefly describe the structure of the two protocols, and will not go into deep details, which is enough for understanding HTTP. HTTP The HTTP address format is as follows: http_URL = "http:" "//" host [ ":" port ] [ abs_path [ "?" query ]] The protocol and host are not case sensitive. HTTP Messages An HTTP message may be a request or a response message. Both types of messages consist of a start-line, zero or more header fields, a blank line indicating the end of the header field (that is, a blank line prefixed with CRLF), and a possibly empty message body. A conforming HTTP client should not add extra CRLF to the message header or end, and the server will ignore these characters. The header value does not include any leading or trailing LWS (linear whitespace), which may appear before or after the first non-whitespace character of the field value. Leading or trailing LWS may be removed without changing the semantics of the field value. Any LWS appearing between filed-content may be replaced by a SP (space). The order of header fields is not important, but it is recommended to put commonly used headers first (so the protocol says). Request message RFC2616 defines the HTTP Request message as follows:
An HTTP request message starts with a request line. The second line contains the header, followed by a blank line, indicating the end of the header, and finally the message body. The request line is defined as follows:
The header used in the Request message can be a general-header or a request-header, request-header (to be explained later). One of the special ones is the Host, which is used together with the request Uri as the receiver of the Request message to determine the conditions for requesting resources, as follows:
Response message The response message is almost identical to the request message and is defined as follows:
As you can see, except that the header does not use the request-header, the only difference is the first line. The first line of the response message is the status line, which contains the famous return code. The content of Status-Line is first the version number of the protocol, followed by the return code, and finally the interpretation content, each separated by a space, and the end of the line is terminated by a carriage return line feed character. The definition is as follows:
Return Code The return code is a 3-digit number. The first digit defines the category of the return code. There are 5 categories in total, which are:
RFC2616 then gives a series of return code extensions, which we usually use, but those are just examples. HTTP1.1 does not force the communicating parties to comply with these extended return codes. The communicating parties only need to comply with the definitions of the five categories defined above in the implementation of return codes. That is to say, the first digit of the return code must be strictly in accordance with what is described in the document, and the others can be defined at will. Anyone who receives an unknown return code xyz can treat it as x00. Response messages with unknown return codes cannot be cached. Header RFC2616 defines four types of headers. If all parties agree, the request header can be extended (trusted extensions can only be made when the protocol version is updated). If the receiver receives an unknown request header, it will be treated as an entity header. The four types of headers are as follows: 1. General Header Fields: can be used as request or response headers, but cannot be used as entity headers, only as message headers.
2. Request Header Fields: Headers used by the request initiator to change the request behavior.
3. Response Header Fields: Used by the server to further describe the resource.
4. Entity Header Fields: If the message has a message body, the entity header is used as meta-information; if there is no message body, it is used to describe the information of the requested resource.
Message Body and Entity Body If there is a Transfer-Encoding header, then the message body is the entity body after decoding. If there is no Transfer-Encoding header, the message body is the entity body.
In the request message, the message header contains Content-Length or Transfer-Encoding, indicating that there will be a message body following. If the request method should not contain a message body (such as OPTION), then the request message must not contain a message body. Even if the client sends it, the server will not read the message body. In the response message, whether there is a message body is determined by the request method and the return code. For example, 1xx, 204, and 304 will not have a message body. Length of the message body The message body length is determined by the following rules, which are executed in order:
Get the entity body from the message body. Its type is defined by two headers, Content-Type and Content-Encoding (usually used for compression). If there is an entity body, there must be a Content-Type. If not, the receiver needs to guess. If it can't guess, use application/octet-stream. HTTP Connection HTTP1.1 connections use persistent connections by default. Persistent connections mean that sometimes the client needs to request a large number of related resources from the server in a short period of time. If it is not a persistent connection, a new connection must be established for each resource. HTTP uses TCP at the bottom layer, so a three-way handshake must be used to establish a TCP connection each time, which will cause a huge waste of resources. Constant connectivity can bring many benefits:
HTTP1.1 servers use TCP flow control to control HTTP traffic. When HTTP1.1 clients receive an error message from the server during a connection, they must immediately close the connection. There are many more details about HTTP connections, which will be discussed later. WebSocket Judging from the time of RFC release, WebSocket is much later, HTTP 1.1 was released in 1999, and WebSocket was released 12 years later. The opening of the WebSocket protocol states that the purpose of this protocol is to solve the problem that browser-based programs must initiate multiple HTTP requests and long polling when pulling resources... and it was created. to be continued I originally planned to sort out the general details of HTTP and WebSocket in one article and then compare them. But as I was writing, I realized that the article might be too long and not very user-friendly, so I decided to write a second article. In the second article, I will describe the general situation of WebSocket and compare it with the applicable scenarios of HTTP. |
After the 5G trial commercialization in 2018 and ...
DogYun is a Chinese hosting company established i...
As China's major cities gradually enter the e...
CentOS8 has been released for some time. I person...
In the era of the Internet of Everything, with th...
Under the tough order of "admit everyone who...
[[251967]] Recently, the Ministry of Industry and...
In 2019, my country built more than 130,000 5G ba...
After nearly 20 years of development, the integra...
Lisahost is a foreign hosting service provider re...
Many commercial security surveillance networks ar...
As the tech industry talks up 5G networks, Cisco ...
Network monitoring is one of the most important n...
LRU Introduction LRU is the abbreviation of Least...
iOVZ Cloud is a Chinese business. Although the do...