Mathematics 134 -- Intensive Calculus for Science

Lab 5: Population Growth Models

April 24 and 25, 2006

Background

One type of mathematical model of population growth for a species is a differential equation whose solutions exhibit some of the same behavior as the actual population. If we have a good model of this type, we might even be able to make predictions about the future real-world population. One such model is the logistic model we have studied in class. If our population is living in a fixed habitat with limited, but self-renewing, resources (for instance, plants as food for the individuals of an animal species to eat), then one reasonable assumption is that there is a limited sustainable maximum population M that the habitat can support. In biology, M would be called the carrying capacity. The logistic model was proposed by the Belgian scientist P.F. Verhulst in the 1830's, and has been intensively studied by biologists and demographers since. Verhulst's idea was to consider the following relationship:

(*) The rate of change of the natural logarithm of the population is proportional to the difference between M (the maximum sustainable population) and the population.

Verhulst's model can be rephrased as the differential equation

P'/P = k(M - P)

which is called the logistic equation.

As we saw previously, the general solution of this equation can be written in the form:

P(t) = M/(1 + c e-Mkt),

where c is an arbitrary constant.
We want to study these equations, their solutions, and see how an actual population problem might be attacked using this model.

In a "real-world" population study in biology or demographics, the idea would be to

If the fit is good, you might make predictions about what the population would be at times in the future.

We are going to do this, in two different ways using the following table of estimated data for the world human population starting from the beginning of the 20th century:

(year)19001910192019301940195019601970198019902000
P (pop. in millions)15501750186020702300255530393707445652836080

Lab Questions

If these points were exactly on some logistic equation solution graph, then for each of these t values, the points (P, P'/P) would lie on some line z = k(M - P) in the (P,z)-plane. To begin,

A) Using the table of values above, estimate the rate of change P' (in million people per year) at each of the given times, using a forward difference

(P(t + 10) - P(t))/10

and divide by P to get an approximate value for the ratio P'/P. For instance, to estimate P' at 1900, you will use (1750-1550)/10 = 20. The whole calculation can be done by hand, or using Maple as a calculator.

B) Make a list of your approximate data points (P,P'/P) in the following format. Your first data point (corresponding to the year 1900) should have P' = 20 as above and P = 1550 from the table. Then 20/1550 = .0129 (approx.) is your first P'/P value, and the first point in the list is as here:

dl:=[[1550,0.0129],[P2,P2'/P2],...,[P10,P10'/P10]];

(Since there's no obvious good way to estimate P'(2000) from the table, only include data points for 1900 through 1990 -- ten points in all.) You can plot your list of (P,P'/P) points to see what they look like, using a plot command like this

plot(dl,style=point);

Try this, and note the output. You should see what looks roughly like two different groups of points. The one to the left corresponds to the years 1900, 1910, 1920, 1930, 1940. The group to the right corresponds to the years 1950, 1960, 1970, 1980, 1990. From (*) above again, if our population values were all on the graph of a logistic function, then the points we have plotted here would all lie on a single straight line z = k(M - P) in the (P,z)-plane. That isn't true here, so we'll need to do some data analysis. In fact, what we will do will be to look at the trend from the first and second halves of the 20th century separately(!) You may have seen a method called linear regression or least-squares curve fitting for doing this in another course. We will do this now, in two different ways, using Maple's least-squares routine.

C) First we will find a logistic model that fits the data well for the first half of the 20th century. For technical reasons we will omit the 1910 data point. This is not entirely ``kosher'' but will be necessary to get reasonable results by our method.

  1. Note that our estimate of the growth rate for that data point uses the period 1910 - 1920. What happened then in the world? Why might that decade be ``unrepresentative'' of general trends?
  2. Enter the following command to load the stats package that contains this routine:

    with(stats):

    To put the data into the form Maple wants, we will need to separate out the P- and z=P'/P-coordinates of the data from dl, as follows:

    Plist1 := [dl[1][1], dl[2][1], dl[3][1], dl[4][1], dl[5][1]];
    zlist2 := [dl[1][2], dl[2][2], dl[3][2], dl[4][2], dl[5][2]];
  3. Now we're ready to compute a best-fit line. Enter

    fit[leastsquare[[P,z]]]([Plist1,zlist1]);

    If all your [, ]'s and (, )'s match, this should generate the equation of a line in the form

    z = (intercept) + (slope) P

  4. From the slope and intercept values, determine the best-fit k and M for the logistic model, for this data. (Recall, M is the maximum sustainable population in the logistic equation.)
  5. Using your k and M values, and the population in 1900, determine a formula for the solution of the logistic equation that should model the population in 1900 - 1950. Plot your solution curve as a function of time for the period 1900 - 2050. (Take t = 0 as the year 1900.)
D) Now repeat parts 2-5 of C for the data from the second half of the 20th century. Pick out the data points for a second regression using

Plist2 := [dl[6][1],dl[7][1],dl[8][1],dl[9][1],dl[10][1]];
zlist2 := [dl[6][2],dl[7][2],dl[8][2],dl[9][2],dl[10][2]];
Use

fit[leastsquare[[P,z]]]([Plist2,zlist2]);

and generate the equation of a second line in the form

z = (intercept) + (slope) P

From the slope and intercept values of this new line, determine the best-fit k and M for the logistic model, for this data. Determine a formula for the solution of the logistic equation that should model the population in 1950 - 2000. Plot your solution curve as a function of time for the period 1900 - 2050. (Take t = 0 as the year 1900.)

E) Discuss your results. What world population do your models predict for 2010,2020,2030,2040,2050? In what year is the maximum population growth rates experienced? What are the carrying capacities? Also, think about the following questions:

  1. Thinking about 20th century history, what factor(s) might account for the apparent difference in the period up to 1940 and the period from 1950 on?
  2. What are some trends that might affect population size that are not included in the logistic models we have been using?
  3. Is a fixed carrying capacity for human populations realistic? What might a more realistic model look like?

Assignment

Solutions will be due on Monday May 1. Because it is more extensive than some of the other labs we have done this term, this assignment will count as two of the parts of the lab/discussion grade for the course.