Why does the HTTP request return 304?

Why does the HTTP request return 304?

[[402402]]

I believe most web developers are familiar with the HTTP 304 status code. In this article, I will introduce how the HTTP 304 status code and the fresh function in the fresh module implement resource freshness detection based on the Koa cache example. If you don't know the browser's cache mechanism, it is recommended that you read the article "In-depth understanding of the browser's cache mechanism" first.

1. 304 Status Code

How is ETag generated in HTTP? This article introduces how ETag is generated. In the ETag practical section, Abaoge demonstrated how to use the ETag response header and If-None-Match request header to implement resource cache control in actual projects based on libraries such as koa, koa-conditional-get, koa-etag, and koa-static.

  1. // server.js
  2. const Koa = require( "koa" );
  3. const path = require( "path" );
  4. const serve = require( "koa-static" );
  5. const etag = require( "koa-etag" );
  6. const conditional = require( "koa-conditional-get" );
  7.  
  8. const app = new Koa();
  9.  
  10. app.use(conditional()); // Use conditional request middleware
  11. app.use(etag()); // Use etag middleware
  12. app.use( // Use static resource middleware
  13. serve( path.join (__dirname, "/public" ), {
  14. maxage: 10 * 1000, // Set the maximum period of cache storage in seconds
  15. });
  16. );
  17.  
  18. app.listen(3000, () => {
  19. console.log( "app starting at port 3000" );
  20. });

After starting the server, we open the Chrome developer tools and switch to the Network tab, then enter the http://localhost:3000/ address in the browser address bar, and then visit the address multiple times (press Enter multiple times in the address bar).

The above figure shows the result of Abaoge's multiple visits. In the figure, we can see the 200 and 304 status codes. The 304 status code indicates that the resource has not been modified since the version specified by the If-Modified-Since or If-None-Match parameter in the request header. In this case, the client still has a previously downloaded copy, so there is no need to retransmit the resource.

Let's take the index.js resource as an example to take a closer look at the 304 response message:

  1. HTTP/1.1 304 Not Modified
  2. Last -Modified: Sat, 29 May 2021 02:24:53 GMT
  3. Cache-Control: max -age=10
  4. ETag: W/ "29-179b5f04654"  
  5. Date : Sat, 29 May 2021 02:25:26 GMT
  6. Connection : keep-alive

For the above response message, the response header contains Last-Modified, Cache-Control and ETag, which are cache-related fields. If you are not familiar with the functions of these fields, you can read the two articles: In-depth understanding of browser cache mechanism and How is ETag generated in HTTP? Next, Abao will explore with you why the request for index.js resource returns 304 after 10 seconds?

2. Why return 304 status code?

In the previous example, we registered three middlewares by using the app.use method:

  1. app.use(conditional()); // Use conditional request middleware
  2. app.use(etag()); // Use etag middleware
  3. app.use( // Use static resource middleware
  4. serve( path.join (__dirname, "/public" ), {
  5. maxage: 10 * 1000, // Set the maximum period of cache storage in seconds
  6. })
  7. );

The first thing to register is the koa-conditional-get middleware, which is used to handle HTTP conditional requests. In this type of request, the result of the request, and even the status of the request success, will change with the comparison result of the validator and the affected resource. HTTP conditional requests can be used to verify the validity of the cache and eliminate unnecessary control measures.

In fact, the implementation of koa-conditional-get middleware is very simple, as shown below:

  1. // https://github.com/koajs/conditional-get/blob/master/ index .js
  2. module.exports = function conditional () {
  3. return async function (ctx, next ) {
  4. await next ()
  5. if (ctx.fresh) {
  6. ctx.status = 304
  7. ctx.body = null  
  8. }
  9. }
  10. }

From the above code, we can see that when the fresh property of the request context object is true, the response status code will be set to 304. Therefore, our next focus is to analyze the setting conditions of the ctx.fresh value.

By reading the source code of the koa/lib/context.js file, we know that when accessing the fresh property of the context object, we are actually accessing the fresh property of the request object.

  1. // Proxy request object
  2. delegate(proto, 'request' )
  3. // Omit other agents
  4. .getter( 'fresh' )
  5. .getter( 'ips' )
  6. .getter( 'ip' );

The fresh property on the request object is defined using the getter method, as shown below:

  1. // node_modules/koa/lib/request.js
  2. module.exports = {
  3. // Omit some code
  4. get fresh() {
  5. const method = this.method; // Get request method
  6. const s = this.ctx.status; // Get status code
  7.  
  8. if ( 'GET' !== method && 'HEAD' !== method) return   false ;
  9.  
  10. // 2xx or 304 as per rfc2616 14.26
  11. if ((s >= 200 && s < 300) || 304 === s) {
  12. return fresh(this.header, this.response.header);
  13. }
  14. return   false ;
  15. },
  16. }

method && 'HEAD' !== method) return false; // 2xx or 304 as per rfc2616 14.26 if ((s >= 200 && s < 300) || 304 === s) { return fresh(this.header, this.response.header); } return false; },}

In the fresh method, freshness detection is performed only when the request is a GET/HEAD request and the status code is 2xx or 304. The corresponding freshness detection logic is encapsulated in the fresh module, so let's analyze how this module detects freshness?

3. How to detect freshness

The fresh module provides a fresh function that supports two parameters: reqHeaders and resHeaders. Within this function, the logic of freshness detection can be divided into the following four parts:

3.1 Determine whether it is a conditional request

  1. // https://github.com/jshttp/fresh/blob/master/ index .js
  2. function fresh (reqHeaders, resHeaders) {
  3. var modifiedSince = reqHeaders[ 'if-modified-since' ]
  4. var noneMatch = reqHeaders[ 'if-none-match' ]
  5.  
  6. // Non-conditional request
  7. if (!modifiedSince && !noneMatch) {
  8. return   false  
  9. }
  10. }

If the request header does not contain the if-modified-since and if-none-match fields, false is returned directly.

3.2 Determine the cache-control request header

  1. // https://github.com/jshttp/fresh/blob/master/ index .js
  2. var CACHE_CONTROL_NO_CACHE_REGEXP = /(?:^|,)\s*? no -cache\s*?(?:,|$)/
  3.  
  4. function fresh (reqHeaders, resHeaders) {
  5. var modifiedSince = reqHeaders[ 'if-modified-since' ]
  6. var noneMatch = reqHeaders[ 'if-none-match' ]
  7.    
  8. // Always return stale when Cache-Control: no -cache
  9. // to support end - to - end reload requests
  10. // https://tools.ietf.org/html/rfc2616# section -14.9.4
  11. var cacheControl = reqHeaders[ 'cache-control' ]
  12. if (cacheControl && CACHE_CONTROL_NO_CACHE_REGEXP.test(cacheControl)) {
  13. return   false  
  14. }
  15. }

When the value of the cache-control request header is no-cache, false is returned to support end-to-end reload requests. It should be noted that no-cache does not mean no caching, but means that the resource is cached, but immediately invalidated, and a request will be made next time to verify whether the resource is expired. If you do not cache any responses, you need to set the cache-control value to no-store.

3.3 Check if ETag matches

  1. // https://github.com/jshttp/fresh/blob/master/ index .js
  2. function fresh (reqHeaders, resHeaders) {
  3. var modifiedSince = reqHeaders[ 'if-modified-since' ]
  4. var noneMatch = reqHeaders[ 'if-none-match' ]
  5.    
  6. // Omit some code
  7. if (noneMatch && noneMatch !== '*' ) {
  8. var etag = resHeaders[ 'etag' ] // Get the value of the etag field in the response header
  9.  
  10. if (!etag) { // If the response header does not set etag, return false directly  
  11. return   false  
  12. }
  13.  
  14. var etagStale = true // stale: not fresh
  15. var matches = parseTokenList(noneMatch) // Parse noneMatch
  16. for (var i = 0; i < matches.length; i++) { // Execute loop matching operation
  17. var match = matches[i]
  18. if (match === etag || match === 'W/' + etag || 'W/' + match === etag) {
  19. etagStale = false  
  20. break
  21. }
  22. }
  23.  
  24. if (etagStale) {
  25. return   false  
  26. }
  27. }
  28. return   true  
  29. }

In the above code, the parseTokenList function is used to handle the situation of 'if-none-match': ' "bar" , "foo"'. During the parsing process, extra spaces will be removed, and the etag values ​​separated by commas will be split. The purpose of performing loop matching is also to support the following test cases:

  1. // https://github.com/jshttp/fresh/blob/master/test/fresh.js
  2. describe( 'when at least one matches' , function () {
  3. it( 'should be fresh' , function () {
  4. var reqHeaders = { 'if-none-match' : ' "bar" , "foo"' }
  5. var resHeaders = { 'etag' : '"foo"' }
  6. assert.ok(fresh(reqHeaders, resHeaders))
  7. })
  8. })

In addition, the W/ (case-sensitive) in the above code indicates the use of a weak validator. Weak validators are easy to generate, but difficult to compare. If the etag does not contain W/, it indicates a strong validator, which is a more ideal choice, but difficult to generate effectively. Two weak etag values ​​for the same resource may be semantically equivalent, but not byte-for-byte identical.

3.4 Determine whether Last-Modified has expired

  1. // https://github.com/jshttp/fresh/blob/master/ index .js
  2. function fresh (reqHeaders, resHeaders) {
  3. var modifiedSince = reqHeaders[ 'if-modified-since' ] // Get the modification time in the request header
  4. var noneMatch = reqHeaders[ 'if-none-match' ]
  5.  
  6. // if-modified-since
  7. if (modifiedSince) {
  8. var lastModified = resHeaders[ 'last-modified' ] // Get the modification time in the response header
  9. var modifiedStale = !lastModified || !(parseHttpDate(lastModified) <= parseHttpDate(modifiedSince))
  10.  
  11. if (modifiedStale) {
  12. return   false  
  13. }
  14. }
  15.  
  16. return   true  
  17. }

The judgment logic of Last-Modified is very simple. When the response header does not set the last-modified field information or the last-modified value in the response header is greater than the modification time corresponding to the if-modified-since field in the request header, the freshness detection result is false, which means that the resource has been modified and is no longer fresh.

After understanding the specific implementation of the fresh function, let's review the difference between Last-Modified and ETag:

  • In terms of accuracy, Etag is better than Last-Modified. The time unit of Last-Modified is seconds. If a file is changed multiple times within 1 second, its Last-Modified will not reflect the change, but Etag will change every time, thus ensuring accuracy; in addition, if it is a load-balanced server, the Last-Modified generated by each server may also be inconsistent.
  • In terms of performance, Etag is inferior to Last-Modified. After all, Last-Modified only needs to record time, while ETag requires the server to calculate a hash value through a message digest algorithm.
  • In terms of priority, when checking the freshness of a resource, the server will give priority to Etag. That is, if the request header of a conditional request carries both the If-Modified-Since and If-None-Match fields, it will first determine whether the ETag value of the resource has changed.

I believe you have a general understanding of why the index.js resource request in the example returns 304. If you are interested in how the koa-etag middleware generates ETag, you can read the article How is ETag generated in HTTP?

4. Cache Mechanism

Strong cache takes precedence over negotiated cache. If strong cache (Expires and Cache-Control) is in effect, the cache is used directly. If it is not in effect, negotiated cache (Last-Modified/If-Modified-Since and Etag/If-None-Match) is performed. The server decides whether to use the negotiated cache. If the negotiated cache fails, the cache of the request is invalid, and 200 is returned. The resource and cache identifier are returned again and stored in the browser cache. If it is in effect, 304 is returned and the cache continues to be used.

The specific caching mechanism is shown in the following figure:

In order to help you better understand the cache mechanism, let's briefly analyze the Koa cache example introduced earlier:

  1. // server.js
  2. const Koa = require( "koa" );
  3. const path = require( "path" );
  4. const serve = require( "koa-static" );
  5. const etag = require( "koa-etag" );
  6. const conditional = require( "koa-conditional-get" );
  7.  
  8. const app = new Koa();
  9.  
  10. app.use(conditional()); // Use conditional request middleware
  11. app.use(etag()); // Use etag middleware
  12. app.use( // Use static resource middleware
  13. serve( path.join (__dirname, "/public" ), {
  14. maxage: 10 * 1000, // Set the maximum period of cache storage in seconds
  15. });
  16. );
  17.  
  18. app.listen(3000, () => {
  19. console.log( "app starting at port 3000" );
  20. });

The above example uses koa-conditional-get, koa-etag and koa-static middleware. Their specific definitions are as follows:

4.1 koa-conditional-get

  1. // https://github.com/koajs/conditional-get/blob/master/ index .js
  2. module.exports = function conditional () {
  3. return async function (ctx, next ) {
  4. await next ()
  5. if (ctx.fresh) { // If the resource is not updated, return 304 Not Modified
  6. ctx.status = 304
  7. ctx.body = null  
  8. }
  9. }
  10. }

The implementation of koa-conditional-get middleware is very simple. If the resource is fresh, it directly returns a 304 status code and sets the response body to null.

4.2 koa-etag

  1. // https://github.com/koajs/etag/blob/master/ index .js
  2. module.exports = function etag (options) {
  3. return async function etag (ctx, next ) {
  4. await next ()
  5. const entity = await getResponseEntity(ctx) // Get the response entity object
  6. setEtag(ctx, entity, options)
  7. }
  8. }

Inside the koa-etag middleware, after getting the response entity object, the setEtag function is called to set the ETag. The definition of the setEtag function is as follows:

  1. // https://github.com/koajs/etag/blob/master/ index .js
  2. const calculate = require( 'etag' )
  3.  
  4. function setEtag (ctx, entity, options) {
  5. if (!entity) return  
  6. ctx.response.etag = calculate(entity, options)
  7. }

Obviously, the koa-etag middleware uses the etag library to generate the corresponding etag for the response entity.

4.3 koa-static

  1. // https://github.com/koajs/ static /blob/master/ index .js
  2. function serve (root, opts) {
  3. opts = Object.assign(Object. create ( null ), opts)
  4. // Omit some code
  5. return async function serve (ctx, next ) {
  6. await next ()
  7. if (ctx.method !== 'HEAD' && ctx.method !== 'GET' ) return  
  8. // response is already handled
  9. if (ctx.body != null || ctx.status !== 404) return  
  10. try {
  11. await send(ctx, ctx.path, opts)
  12. } catch (err) {
  13. if (err.status !== 404) {
  14. throw err
  15. }
  16. }
  17. }
  18. }

For the koa-static middleware, when the request method is not a GET or HEAD request (which should not contain a response body), it will be returned directly. The processing capability of static resources is actually implemented by the send library.

Finally, in order to help you better understand the processing logic of the above middleware, Brother Abao will take you to briefly review the onion model:

In the above figure, each layer in the onion represents an independent middleware, which is used to implement different functions, such as exception handling, cache processing, etc. Each request will pass through each layer of middleware from the left layer, and when it enters the innermost layer of middleware, it will return layer by layer from the innermost layer of middleware. Therefore, for each layer of middleware, there are two opportunities to add different processing logic in a request and response cycle.

V. Conclusion

In this article, based on Koa's cache example, Abao Ge introduces how the HTTP 304 status code and the fresh function in the fresh module implement resource freshness detection. I hope that after reading this article, you will have a deeper understanding of HTTP and browser cache mechanisms. In addition, this article only briefly introduces Koa's onion model. If you are interested in the onion model, you can continue to read the article How to Better Understand Middleware and Onion Model.

6. Reference Resources

  • MDN - HTTP Conditional Requests
  • How is ETag generated in HTTP?

<<:  How to implement a custom serial communication protocol?

>>:  What exactly does edge computing mean?

Recommend

7 New Year's Resolutions for the Internet of Things

The beginning of a new year is often a time for p...

I don't know the router's address.

When we set up a wireless router, we need to ente...

IDC: Global edge computing market will reach $250.6 billion in 2024

Industry data: Gartner conducted a survey and int...

T-Mobile and Sprint to merge

Early morning news on February 11, 2020, accordin...

Even monkeys can penetrate the intranet!

Hello, everyone, I am amazing. I recently turned ...

...

Are you ready for network automation?

[[374510]] This article is reprinted from the WeC...

Principles of nine cross-domain implementation methods (full version)

[Original article from 51CTO.com] Cross-domain re...