I wrote that I would show you what various graphs of various types of simple models look like for deaths so far due to the current corona virus: linear, exponential, polynomial, and so on. I think that a fourth-degree (not third-degree, like I wrote earlier) seems to fit the data best so far, and that’s better than exponential growth.

First, let’s look at a straight-line best-fit model, superimposed by Excel on the data. (Note: deaths are on the Y, or vertical, axis; the X-axis represents days since the beginning of March. So today, the 6th of April, is day 37 (31 + 6). The dotted red line represents the line of best fit, and the blue dots are the CDC-announced numbers of deaths so far.

As you can see, the straight dotted line doesn’t fit the data very well at all. R-squared, known as the correlation coefficient, tells us numerically how well it fits. If R or R^2 equals 1.000, then you have absolutely perfect correlation of the data to your model. Which we do NOT have here. By the way, in that model, then by mid-June we would have about 22,000 dead from this disease.

OK, let’s look at an exponential curve-of-best fit next:

As you can see, this red curve fits the data a LOT better, and R-squared is a lot higher.

Unfortunately.

We do NOT WANT EXPONENTIAL GROWTH OF THIS OR ANY OTHER DISEASE, BECAUSE IT MEANS WE ALL GET IT! **In fact, if this model is accurate and isn’t slowed down, then by mid-June, just plugging in the numbers, we would have 3.3 BILLION (not million) people dead in the US alone.** Fortunately, that won’t happen.

BUT there are some parts of the data where the curve doesn’t fit perfectly — let me point them out:

At the upper right-hand end, the red dotted line is quite a bit higher than the blue dots. Fortunately. And near the middle of the graph, the blue dots of death are higher than the red line.

OK, let’s look at some polynomial models instead:

This is a fancy version of the simple y=x^2 parabolas you may have graphed in Algebra 1. Once again, this doesn’t do a terrific job of conforming to the actual data. At the right-hand end, the blue dots of death are higher than the curve. In addition, if we continued the red curve to the left, we would find that something like two thousand people had already died in the US, and presumably came back to life. Which is ridiculous.

However, if this model were to hold true until mid-June, we would have 127 thousand dead. Not good.

Let’s try a third-degree polynomial (a cubic):

That’s pretty remarkable agreement between the data and the equation! That’s the equation I was using in my earlier post. The R-squared correlation is amazing. Unfortunately, if this continues to hold, then we would have about 468 thousand dead in the US.

Let’s continue by looking at a fourth-degree polynomial curve fitted to the data:

That is an amazingly good fit to the data! Unfortunately, let’s hope that it won’t continue to fit the data, because if it does, then we are looking at a little over a MILLION dead.

Let’s hope we can get these totals to level off by physically distancing ourselves from other households, washing our hands, and getting proper protective garments and testing technology to our medical personnel.

=============

Here’s another model that unfortunately does NOT work: logarithmic growth. If it were the case, then we would have about 10,700 deaths by mid-June.